
Sitemaps, Meta Data, and robots.txt Forum

    
Googlebot ignoring robots.txt?
Or is my robots.txt file wrong...
designaweb

10+ Year Member



 
Msg#: 3312580 posted 4:03 pm on Apr 16, 2007 (gmt 0)

I noticed Googlebot trying to submit a form. That by itself might not be weird; however, it posts using the first value in a select box, which results in an error (since the first value reads "select your item here").

Also, the form posts to the following URI:

www.domain.com/?step=11&sid=34r534ydf343434 (where sid is variable)

I've tried to block all bots from accessing those pages by adding the following rule to my robots.txt:

User-agent: *
Disallow: /?step=11

How come G is not abiding by my rules? The IP I logged (72.14.220.136) resolves to fg-out-f136.google.com.

 

wrongdoze

5+ Year Member



 
Msg#: 3312580 posted 4:05 pm on Apr 16, 2007 (gmt 0)

You need to add a wildcard:

User-agent: *
Disallow: /?step=11*

designaweb

10+ Year Member



 
Msg#: 3312580 posted 9:28 am on Apr 17, 2007 (gmt 0)

I don't think adding the wildcard is necessary. I use Google Webmaster Tools for some analysis, and a few of those /?step=11 pages show up as "URL restricted by robots.txt", so I think the rule is correct.
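
For anyone who wants to check this kind of rule offline, here is a rough Python sketch of how a Disallow rule is commonly matched against a URL path plus query string: as a prefix match, with "*" standing for any run of characters and a trailing "$" anchoring the end. The function name rule_matches is made up for illustration, and this is only an approximation of the wildcard behaviour that crawlers such as Googlebot document, not their actual implementation.

```python
import re


def rule_matches(rule: str, path: str) -> bool:
    """Return True if a Disallow rule matches the URL path (query string included).

    Hypothetical helper: approximates prefix matching with the "*" and "$"
    extensions; it is not taken from any crawler's source code.
    """
    if not rule:
        # An empty Disallow value blocks nothing.
        return False
    # A trailing "$" anchors the rule to the end of the path.
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape literal characters; each "*" becomes "any run of characters".
    pattern = ".*".join(re.escape(part) for part in rule.split("*"))
    if anchored:
        pattern += "$"
    # re.match only anchors at the start, which gives prefix matching.
    return re.match(pattern, path) is not None


path = "/?step=11&sid=34r534ydf343434"
print(rule_matches("/?step=11", path))   # True
print(rule_matches("/?step=11*", path))  # True
```

Under these assumptions both rules match the same URLs, which fits the observation that the plain /?step=11 rule already shows up as restricting those pages: with prefix matching, a trailing "*" adds nothing.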


All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved