A discussion forum for people interested in digital humanities across the disciplines

Open source OCR tool for historic printed text

1 voice, 0 replies
Viewing 0 reply threads
  • Author
    Posts
    • #49748

      Antonia Karaisl
      Participant
      @arescribe

      Hello all!

      I am one half of a 2-person not-for-profit company called Rescribe, developing OCR solutions for historical printed works such as you would find on Internet Archive and Google Books. We usually provide digital humanities research projects and libraries with transcriptions of digitized corpora of historic texts; additionally to that, our aim is to make the bespoke software we develop for these projects free and accessible to all.

      Recently we’ve developed an open source desktop tool to perform high-quality OCR on modern and historic printed text. At the moment, this tool can only be run from the command line, but we’ve been encouraged by anecdotal feedback and a very positive independent review  to make this tool more accessible. We would like to achieve this by creating a simple Graphic User Interface (GUI) for Windows, Mac and Linux, potentially with additional functionality built in.

      Since this is a purely community-oriented project, we are planning a crowdfunding campaign in order to finance the development of this GUI and get interested parties directly involved in the actual development process. We’ve composed a 30-second, 3-question survey where potential participants can register interest and have their say on future features of the tool, see following link: https://forms.gle/AZcKpsbzQQajSE4j6

      If you are indeed an interested party, we’d be excited for you to fill this out, and forward it on to just about anyone else. In the meantime, any questions, comments and suggestions are all very welcome, either as a reply to me here or to info@rescribe.xyz!

Viewing 0 reply threads
  • You must be logged in to reply to this topic.