Phrase searching PDF content that crosses line boundaries

     
9:20 pm on Aug 11, 2010 (gmt 0)

Junior Member

10+ Year Member

joined:Aug 9, 2003
posts: 118
votes: 0


I work on a content site where users upload PDFs that they naturally want to have found. We get frequent complaints when people clip a few sentences of text from one of their PDFs, drop it into Google as a phrase search, and find nothing.

For example (not from my site):
"Adobe is committed to providing solutions that improve the accessibility of Acrobat, Adobe Reader, and the content of PDF documents." (no PDF found)

"Adobe is committed to providing solutions that improve the accessibility of Acrobat, Adobe Reader" (PDF found)

In contrast, Bing apparently can't phrase search PDFs at all, so it falls back to word search right away. Does anyone else with PDF phrase search issues have tips to share? Is there a solution?
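To make the line-boundary problem in the title concrete, here is a minimal Python sketch. It assumes (this is a guess, not something confirmed about how any search engine indexes PDFs) that the text extracted from a PDF keeps the hard line breaks of the page layout, so a phrase that spans two lines no longer matches as one continuous string. The sample `extracted` text and its line breaks are hypothetical, as are the helper names `normalize` and `phrase_found`.

```python
import re

def normalize(text: str) -> str:
    """Collapse line breaks and runs of whitespace into single spaces."""
    # Rejoin words split by a hyphen at a line break, e.g. "accessi-\nbility".
    # Caveat: this also joins genuinely hyphenated words that happen to
    # break at a line end, so it is a heuristic, not a safe default.
    text = re.sub(r"(\w)-\s*\n\s*(\w)", r"\1\2", text)
    return re.sub(r"\s+", " ", text).strip()

def phrase_found(phrase: str, extracted: str) -> bool:
    """Case-insensitive phrase match after whitespace normalization."""
    return normalize(phrase).lower() in normalize(extracted).lower()

# Hypothetical extractor output: the sentence is wrapped over three lines.
extracted = (
    "Adobe is committed to providing solutions that improve the\n"
    "accessibility of Acrobat, Adobe Reader, and the content of PDF\n"
    "documents."
)

phrase = (
    "Adobe is committed to providing solutions that improve the "
    "accessibility of Acrobat, Adobe Reader, and the content of PDF documents."
)

raw_match = phrase in extracted                 # naive substring test
normalized_match = phrase_found(phrase, extracted)
print(raw_match, normalized_match)
```

If you run your own site search over uploaded PDFs, normalizing the extracted text like this before indexing is one way to make pasted phrases match. It does not change what an external search engine does with the file.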
4:55 am on Aug 12, 2010 (gmt 0)

Senior Member

tedster (WebmasterWorld Top Contributor of All Time, 10+ Year Member)

joined:May 26, 2000
posts:37301
votes: 0


I don't even know of any trick to make an exact phrase search work all the time for HTML files, so I'm not surprised that it's a problem with PDF files, too. Not trying to be a wise guy here. It's one area where Google is kind of borked right now for some pages, and it used to be a sure thing.