Skip to main content

Theme Settings

Layout Style

Color

Direction

Image

Archive Search using OCR- Optical Character Recognition

Body

Some archives and record offices are housed in your local museum or library; others have their own stand-alone building. Wherever they are, they are a treasure trove.
                                                                                                              Kate Williams, British Historian

 


Organisations especially those in the print and publishing industry generate a large volume of content - Notes, features, reports, adverts, editorials, pictures etc. As they age these are relegated to the archives only available to the occassional scribe who is interested in digging deeper about a topic or subject. 

At Sastra we understand how busy journalists and reporters can be especially in the Digital age when news breaks happen every second. To help we have developed a cloud based tool that take your existing print archives, digitises them and makes them serachanble.

The application can search PDF based text and scanned images of text. We have put up a demo here. You can try searching for strings like "India". Click the "All" or"Images" tab to view the results.

The tool is developed on the latest Open Source Technologies available for the Cloud. With this tool Jornalists can qucikly search the archives for topics and subjects in their area of research and provide reliable citations.

Leave a comment

Restricted HTML

  • Allowed HTML tags: <a href hreflang> <em> <strong> <cite> <blockquote cite> <code> <ul type> <ol start type> <li> <dl> <dt> <dd> <h2 id> <h3 id> <h4 id> <h5 id> <h6 id>
  • Lines and paragraphs break automatically.
  • Web page addresses and email addresses turn into links automatically.

Get the latest in Fintech

Subscribe to our online newspaper dedicated to Financial Technology