homepage Welcome to WebmasterWorld Guest from
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Visit PubCon.com
Home / Forums Index / Search Engines / Sitemaps, Meta Data, and robots.txt
Forum Library, Charter, Moderators: goodroi

Sitemaps, Meta Data, and robots.txt Forum

To use... or not to use? That is the question

 1:23 am on Oct 30, 2003 (gmt 0)

I allow all content on my site to be crawled. With that said, should I simply not use robots.txt, or have one with the following lines :

user-agent: *
allow: *

Would having the latter of the two improve the number of pages crawled by the spiders?




 1:28 am on Oct 30, 2003 (gmt 0)

Make it even easier and just through in an empty robots.txt. No muss, no fuss, no confusion, no 404s.


 1:40 am on Oct 30, 2003 (gmt 0)

The robots.txt protocol only allows for "disallow:" statements (not allow: statements) and wildcards don't belong in the disallow.

What you need if you want to allow all spiders to roam your site without restriction is:

user-agent: *


 2:13 am on Oct 30, 2003 (gmt 0)

The most logical approach is not to have one; the only function of that file is to disallow access (hence the syntax as explained by tedster).

The only downside is that you will get lots of 404s in your log files. Should you want to eliminate them then use an empty file as suggested by jimbeetle.

Robert Charlton

 7:10 am on Oct 30, 2003 (gmt 0)

Extended discussion here...

Google and having *no* robots.txt file
could this be hurting your site?


 1:29 am on Nov 5, 2003 (gmt 0)

I noticed more of my pages are getting indexed now that I'm using a robots.txt that allows everything to be crawled.


Global Options:
 top home search open messages active posts  

Home / Forums Index / Search Engines / Sitemaps, Meta Data, and robots.txt
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved