Page is a not externally linkable
- Google
-- Google SEO News and Discussion
---- Google Now Using OCR on Scanned PDF Documents


tedster - 6:06 am on Nov 1, 2008 (gmt 0)


Even with this OCR technology trying to improve search, it's still a very good idea to pay attention to the embedded meta data in a PDF file. If you or the people who create onlive PDF documents for you do not know how to do this, it's to learn how to locate and modify the meta-data.

As the linked articles indicate (and your own experience can verify) accurcay in OCR is still a difficult problem. The Google results from this new adventure are most likely not going to be ideal for quite a while. If you don't want mismatched information, or coffeee stains being turned into text, then make sure you take some helpful steps.


Thread source:: http://www.webmasterworld.com/google/3777195.htm
Brought to you by WebmasterWorld: http://www.webmasterworld.com