
Google SEO News and Discussion Forum

    
Google WMT reports crawl errors on my PDFs
Joshmc




msg:4371662
 8:46 pm on Oct 6, 2011 (gmt 0)

In the last 3 months I have continually run into a problem where many of the PDFs on my site keep showing up in the Not Found section of Webmaster Tools. These are valid links, and they load every time I click them. Does Googlebot have a problem with PDFs? Is there a particular way I need to host or optimize them so they don't produce errors? This doesn't happen for all of my PDFs, but it does for a good number of them.

Thanks

 

tedster




msg:4371665
 8:57 pm on Oct 6, 2011 (gmt 0)

Do the URLs for those PDF errors contain any odd characters, such as spaces or other characters that need to be escaped (percent-encoded) before the browser can send the request?
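
One rough way to check this is to percent-encode each path and see whether it changes. The sketch below uses Python's standard `urllib.parse`; the function name `needs_escaping` and the sample paths are made up for illustration. It is deliberately strict and flags anything outside letters, digits, and `_.-~`, so some characters it reports may be harmless in practice.

```python
from urllib.parse import quote, urlsplit

def needs_escaping(url: str) -> bool:
    """Return True if the path of `url` changes when percent-encoded."""
    path = urlsplit(url).path
    # safe="/%" leaves path separators and existing %XX escapes untouched
    return quote(path, safe="/%") != path

# Hypothetical sample paths, not taken from any real site
print(needs_escaping("/pdf/file-name.pdf"))        # letters, digits, dashes only
print(needs_escaping("/pdf/file name (v2).pdf"))   # space and parentheses
```

Running it over the list of URLs reported as Not Found would show whether the failing PDFs share any characters that the working ones lack.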

g1smd




msg:4371666
 9:02 pm on Oct 6, 2011 (gmt 0)

What format are the links pointing to those files?

Are they relative, root relative or absolute?

Are there any ../ constructs in those links?
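
The reason the link format matters is that each form resolves differently depending on the page it appears on. As a small sketch, Python's `urllib.parse.urljoin` applies the same resolution rules a crawler would; the base page URL and the `resolve` helper here are assumptions for illustration only.

```python
from urllib.parse import urljoin

# Hypothetical page that contains the links
BASE = "http://www.example.com/docs/guides/index.html"

def resolve(href: str) -> str:
    """Resolve a link against the page it appears on."""
    return urljoin(BASE, href)

# relative: resolved against the current directory
print(resolve("file-name.pdf"))
# root-relative: resolved against the site root
print(resolve("/pdf/file-name.pdf"))
# relative with ../: climbs one directory first
print(resolve("../pdf/file-name.pdf"))
# absolute: used as-is
print(resolve("http://www.example.com/pdf/file-name.pdf"))
```

A relative or `../` link that works on one page can point to a non-existent URL when the same markup is reused on a page at a different directory depth, which is one way valid-looking links end up reported as not found.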
Joshmc




msg:4371681
 9:40 pm on Oct 6, 2011 (gmt 0)

Thanks for the quick responses. There are no special characters, just alphanumeric characters (upper and lower case) and dashes. I wouldn't think these would be a problem, right?

The links are relative (they don't include the domain name; I believe that means relative) and look like this example: <a href="/pdf/file-name.pdf">
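
For reference, a link that omits the domain but starts with a slash, like the example above, is usually called root-relative rather than relative. A minimal sketch of that distinction, using the three categories asked about earlier in the thread; the function name `classify_href` is made up, and treating protocol-relative links (`//host/path`) as absolute is a simplifying assumption.

```python
from urllib.parse import urlsplit

def classify_href(href: str) -> str:
    """Classify a link as absolute, root-relative, or relative."""
    parts = urlsplit(href)
    # Assumption: protocol-relative links (//host/path) count as absolute
    if parts.scheme or parts.netloc:
        return "absolute"
    if parts.path.startswith("/"):
        return "root-relative"
    return "relative"

print(classify_href("/pdf/file-name.pdf"))
print(classify_href("../pdf/file-name.pdf"))
print(classify_href("http://www.example.com/pdf/file-name.pdf"))
```

A root-relative link resolves to the same URL from every page on the site, so by itself it would not explain errors that affect only some PDFs.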

WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved