Welcome to WebmasterWorld Guest from 50.16.31.61

Forum Moderators: goodroi

Googlebot ignoring robots.txt?

Or is my robots.txt file wrong...

   
4:03 pm on Apr 16, 2007 (gmt 0)

10+ Year Member



I noticed Googlebot trying to submit a form. This, by itself, might not be weird, however, it posts using the first value in a selectbox, resulting in an error (since the first value reads "select your item here").

Also, the form posts to the following URI:

www.domain.com/?step=11&sid=34r534ydf343434 (where sid is variable)

I've tried to block all bots from accessing those pages by adding the following rule to my robots.txt

User-agent: *
Disallow: /?step=11

How come G is not abiding my rules? The IP I logged (72.14.220.136) resolves to fg-out-f136.google.com

4:05 pm on Apr 16, 2007 (gmt 0)

5+ Year Member



You need to add wildcard:

User-agent: *
Disallow: /?step=11*

9:28 am on Apr 17, 2007 (gmt 0)

10+ Year Member



I dont think adding the wildcard is necessary. I use Google Webmaster tools for some analysis, and a few of those /?step=11 pages show up as "URL restricted by robots.txt". So the rule is correct I think.
 

Featured Threads

Hot Threads This Week

Hot Threads This Month