Forum Moderators: martinibuster

Message Too Old, No Replies

Crawler Errors

         

freelanceit

9:24 pm on May 20, 2011 (gmt 0)

10+ Year Member



Recently when visitng the Google Adsense site I have noticed the following message at the top of the page: "Your ads have recently appeared on websites you haven't authorised. To avoid lost revenue, make sure that you authorise any sites where you display ads by visiting your account settings."

On the odd occasion I have actually found a site that I needed to 'allow' (mainly google translate, bing, etc), but lately I have not been finding any sites that needed allowing; instead I have 'crawler errors' instead.

My problem is that all the crawler errors relate to my robots.txt file. This is the type of error I see (please note I have removed the website details and replaced the site name with with an *):

[tw.babelfish.yahoo.com...]

The contetns of my robots.txt file is s follows:

user-agent: mediapartners-google*
disallow:

That is it, nothing more.

Where are these errors coming from? They are not in the robots.txt so how is google finding them and reporting them as so called errors? Currently I have 17 such errors starting from the 15 May.

azlinda

9:55 pm on May 20, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I'm wondering the same thing. I've found all kinds of robots.txt errors when I don't even have those things in my robots.txt file.

freelanceit

10:39 pm on May 20, 2011 (gmt 0)

10+ Year Member



@azlinda, Could it be something to do with Panda? Probably not but this problem has only surfaced, in my case, anyway, from the 15th May and I believe Panda was rolled out here in the UK on the 10th May.

azlinda

10:59 pm on May 20, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I've wondered about that. My problems began on April 11 with the second Panda fiasco. I'm also wondering if we have links from other sites, perhaps someone else's robots.txt is blocking those pages.