Forum Library, Charter, Moderators: Robert Charlton & aakk9999 & brotherhood of lan & goodroi

Google SEO News and Discussion Forum

Google WMT reports crawl errors on my PDFs

 8:46 pm on Oct 6, 2011 (gmt 0)

In the last 3 months I have repeatedly run into a problem where many of the PDFs on my site keep coming up in the Not Found area of Webmaster Tools. These are valid links, and I can load them every time I click them. Does Googlebot have a problem with PDFs? Is there a particular way I need to host or optimize them so they stop producing errors? This doesn't happen for all of my PDFs, but it does for a good number of them.




 8:57 pm on Oct 6, 2011 (gmt 0)

Do the URLs for those PDF errors contain any odd characters - things that need to be escaped (percent-encoded, e.g. a space becoming %20) before the browser can send the request?
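One rough way to check for such characters is to percent-encode each URL path and see whether it changes. This is only a sketch using Python's standard `urllib.parse`; the helper name `needs_escaping` and the sample file names are made up for illustration, and the check is deliberately conservative (it also flags characters such as parentheses that many servers accept unescaped).

```python
from urllib.parse import quote, urlsplit


def needs_escaping(url: str) -> bool:
    """Return True if the URL's path contains characters that would be percent-encoded."""
    path = urlsplit(url).path
    # safe="/%" leaves path separators and already-encoded sequences (e.g. %20) untouched
    return quote(path, safe="/%") != path


# Hypothetical examples: plain alphanumerics and dashes versus a space in the file name
print(needs_escaping("/pdf/file-name.pdf"))      # False
print(needs_escaping("/pdf/annual report.pdf"))  # True
```

If this returns False for every reported URL, escaping is probably not the cause and the link format is the next thing to look at.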


 9:02 pm on Oct 6, 2011 (gmt 0)

What format are the links pointing to those files?

Are they relative, root relative or absolute?

Are there any ../ constructs in those links?

 9:40 pm on Oct 6, 2011 (gmt 0)

Thanks for the quick responses. There are no special characters, just alphanumeric upper and lower case and dashes. I wouldn't think these would be a problem, right?

The links are relative (they don't include the domain name; I believe that means relative) and look like this example: <a href="/pdf/file-name.pdf">
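For what it's worth, an href that starts with a slash, like the example above, is usually called root-relative: it resolves against the site root no matter which page it appears on. A link with no leading slash, or one using ../, resolves differently depending on the page that contains it, which is one way a crawler can end up requesting a path that does not exist. The sketch below illustrates this with Python's `urllib.parse.urljoin`; the domain `www.example.com`, the page paths, and the helper name `resolve` are placeholders, not taken from the actual site.

```python
from urllib.parse import urljoin


def resolve(page_url: str, href: str) -> str:
    """Resolve an href the way a browser or crawler would, relative to the linking page."""
    return urljoin(page_url, href)


# Hypothetical pages that might contain the link
home = "http://www.example.com/"
deep = "http://www.example.com/docs/guides/index.html"

# Root-relative: same result from any page
print(resolve(home, "/pdf/file-name.pdf"))  # http://www.example.com/pdf/file-name.pdf
print(resolve(deep, "/pdf/file-name.pdf"))  # http://www.example.com/pdf/file-name.pdf

# Page-relative and ../ forms: the result depends on where the link lives
print(resolve(deep, "pdf/file-name.pdf"))     # http://www.example.com/docs/guides/pdf/file-name.pdf
print(resolve(deep, "../pdf/file-name.pdf"))  # http://www.example.com/docs/pdf/file-name.pdf
```

Comparing the resolved URLs against the exact URLs listed in the crawl error report should show whether the errors come from links written in one of the page-dependent forms somewhere on the site.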


All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved