http://www.webmasterworld.com Welcome to WebmasterWorld Guest from 38.103.63.16
register, login, search, glossary, subscribe, help, library, PubCon, announcements , recent posts, unanswered posts
Subscribe to WebmasterWorld
Home / Forums Index / Glossary / Robots.txt

robots.txt


A file on a web site in the root directory of a website that is used to control which spiders have access to which pages within a website. When a spider or robot connects to a website, it checks for the presence of a robot.txt. Only spiders that adhere to the Robots Exclusion Standard will obey a robots.txt command file

There are several specific fields in a robots.txt such as User-agent specifies which User Agents are allowed to access the site and "Allow/Disallow" specifies which directories a spider may access.

For full specifications see Altavista.

Home / Forums Index / Glossary / Robots.txt
All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About
WebmasterWorld ® and PubCon ® are a Registered Trademarks of WebmasterWorld Inc.
© WebmasterWorld Inc. / SearchEngineWorld 1996-2008 all rights reserved