Welcome to WebmasterWorld Guest from 54.158.51.150

Message Too Old, No Replies

Phrase searching PDF content that crosses line boundaries

     
9:20 pm on Aug 11, 2010 (gmt 0)

10+ Year Member



I work on a content site where users upload PDF's that they naturally want to have found. We get frequent complaints when people clip a few sentences of text from one of their PDF's, drop it into Google as a phrase search, and find nothing.

For example (not from my site):
"Adobe is committed to providing solutions that improve the accessibility of Acrobat, Adobe Reader, and the content of PDF documents." (no pdf).

"Adobe is committed to providing solutions that improve the accessibility of Acrobat, Adobe Reader" (got it)

In contrast, Bing! apparently can't phrase search PDFs at all, so it falls back to word search right away. Anyone else with phrase search PDF issues have tips to share? Is there a solution?
4:55 am on Aug 12, 2010 (gmt 0)

WebmasterWorld Senior Member tedster is a WebmasterWorld Top Contributor of All Time 10+ Year Member



I don't even know of any trick to make an exact phrase search work all the time for html files - so I'm not surprised that it's a problem with PDF files, too. Not trying to be a wise guym here. It's one area where Google is kind of borked right now fro some pages, and it used to be a sure thing.
 

Featured Threads

Hot Threads This Week

Hot Threads This Month