Forum Library, Charter, Moderators: goodroi

Sitemaps, Meta Data, and robots.txt Forum

Googlebot ignoring robots.txt?
Or is my robots.txt file wrong...

 4:03 pm on Apr 16, 2007 (gmt 0)

I noticed Googlebot trying to submit a form. That by itself might not be weird, but it posts using the first value in a select box, which results in an error (since the first value reads "select your item here").

Also, the form posts to the following URI:

www.domain.com/?step=11&sid=34r534ydf343434 (where sid is variable)

I've tried to block all bots from accessing those pages by adding the following rule to my robots.txt:

User-agent: *
Disallow: /?step=11

How come G is not abiding by my rules? The IP I logged resolves to fg-out-f136.google.com



 4:05 pm on Apr 16, 2007 (gmt 0)

You need to add a wildcard:

User-agent: *
Disallow: /?step=11*


 9:28 am on Apr 17, 2007 (gmt 0)

I don't think adding the wildcard is necessary. I use Google Webmaster Tools for some analysis, and a few of those /?step=11 pages show up as "URL restricted by robots.txt". So I think the rule is correct.
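
For anyone who wants to check this themselves, here is a minimal Python sketch of how a Disallow rule can be compared against a URL, assuming the prefix matching with optional * and $ that Google describes for robots.txt. The function name rule_matches is made up for illustration, and this is not Googlebot's actual code, just a way to see that the rule with and without the trailing wildcard should behave the same.

```python
import re


def rule_matches(rule: str, path: str) -> bool:
    """Return True if a robots.txt rule value matches a URL path plus query.

    Assumed semantics: a rule matches any path that starts with it,
    '*' stands for any run of characters, and a trailing '$' anchors
    the rule to the end of the path.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and join them with ".*" in place of each '*'.
    pattern = ".*".join(re.escape(part) for part in rule.split("*"))
    if anchored:
        pattern += "$"
    # re.match only anchors at the start, which gives the prefix behaviour.
    return re.match(pattern, path) is not None


if __name__ == "__main__":
    url = "/?step=11&sid=34r534ydf343434"
    for rule in ("/?step=11", "/?step=11*"):
        print(rule, rule_matches(rule, url))
```

Under these assumptions both rules cover the same URLs, so the trailing * is redundant. Note that either form would also match something like /?step=110, because the match is by prefix.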

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved