homepage Welcome to WebmasterWorld Guest from
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member

Visit PubCon.com
Home / Forums Index / WebmasterWorld / Accessibility and Usability
Forum Library, Charter, Moderators: ergophobe

Accessibility and Usability Forum

Google and Open Source OCR

 10:33 am on Jun 22, 2007 (gmt 0)

From TV Raman, Research Scientist, Google.

As someone who cannot see, I prefer to live in a mostly paperless world. This means ruthlessly turning every piece of paper that enters my life into a set of bits that I can process digitally. I scan in everything. Until now, I have relied on commercial OCR packages to convert these images into readable text.

The advent of our own open source OCR initiative, OCRopus (source code: Ocropus Sources) is a welcome change in this regard. I introduced support for OCRopus in Emacspeak recently, and the HTML output this produces compares favorably with output from commercial OCR engines, provided you place the page at the right orientation on the scanner. OCRopus' extensibility, and the ability to express the OCR as a structured HTML document makes it an ideal starting point for producing rich spoken output. The possibilities are enormous for people being able to collectively train, customize and improve an OCR engine.

Google and Open Source OCR [googleblog.blogspot.com]




 8:29 pm on Jun 22, 2007 (gmt 0)

Excellent find, engine! I wonder what current implementations there are out there ...

Global Options:
 top home search open messages active posts  

Home / Forums Index / WebmasterWorld / Accessibility and Usability
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved