tedster - 6:06 am on Nov 1, 2008 (gmt 0)
As the linked articles indicate (and your own experience can verify), accuracy in OCR is still a difficult problem. The Google results from this new venture are most likely not going to be ideal for quite a while. If you don't want mismatched information, or coffee stains being turned into text, then make sure you take some helpful steps.
Even with this OCR technology trying to improve search, it's still a very good idea to pay attention to the embedded metadata in a PDF file. If you, or the people who create online PDF documents for you, do not know how to do this, it's worth learning how to locate and modify the metadata.
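For anyone who prefers to script this rather than use a PDF editor's document properties dialog, here is a rough sketch in Python. It assumes the third-party pypdf library is installed; the function names `build_docinfo` and `rewrite_metadata` and the file paths are made up for illustration, and nothing here is specific to how Google reads the fields. The keys `/Title`, `/Author`, `/Subject` and `/Keywords` are the standard PDF document information entries.

```python
def build_docinfo(title, author, subject="", keywords=()):
    """Return a dict of standard PDF document-information entries."""
    info = {"/Title": title.strip(), "/Author": author.strip()}
    if subject.strip():
        info["/Subject"] = subject.strip()
    # Drop blank keywords, then join the rest into one comma-separated string.
    cleaned = [k.strip() for k in keywords if k.strip()]
    if cleaned:
        info["/Keywords"] = ", ".join(cleaned)
    return info


def rewrite_metadata(src_path, dst_path, info):
    """Copy src_path to dst_path with new metadata (assumes pypdf is installed)."""
    from pypdf import PdfReader, PdfWriter  # third-party, imported lazily

    reader = PdfReader(src_path)
    # Locate: show whatever metadata the file currently carries (may be None).
    print("existing metadata:", reader.metadata)

    writer = PdfWriter()
    for page in reader.pages:
        writer.add_page(page)
    # Modify: attach the new entries to the copy.
    writer.add_metadata(info)
    with open(dst_path, "wb") as fh:
        writer.write(fh)
```

Something like `rewrite_metadata("report.pdf", "report-tagged.pdf", build_docinfo("Annual Report", "Jane Doe"))` would then write a tagged copy and leave the original file alone, so you can compare the two before publishing.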