Welcome to WebmasterWorld Guest from 54.196.175.173

Forum Moderators: goodroi

Message Too Old, No Replies

Googlebot ignoring robots.txt?

Or is my robots.txt file wrong...

     

designaweb

4:03 pm on Apr 16, 2007 (gmt 0)

10+ Year Member



I noticed Googlebot trying to submit a form. This, by itself, might not be weird, however, it posts using the first value in a selectbox, resulting in an error (since the first value reads "select your item here").

Also, the form posts to the following URI:

www.domain.com/?step=11&sid=34r534ydf343434 (where sid is variable)

I've tried to block all bots from accessing those pages by adding the following rule to my robots.txt

User-agent: *
Disallow: /?step=11

How come G is not abiding my rules? The IP I logged (72.14.220.136) resolves to fg-out-f136.google.com

wrongdoze

4:05 pm on Apr 16, 2007 (gmt 0)

5+ Year Member



You need to add wildcard:

User-agent: *
Disallow: /?step=11*

designaweb

9:28 am on Apr 17, 2007 (gmt 0)

10+ Year Member



I dont think adding the wildcard is necessary. I use Google Webmaster tools for some analysis, and a few of those /?step=11 pages show up as "URL restricted by robots.txt". So the rule is correct I think.
 

Featured Threads

Hot Threads This Week

Hot Threads This Month