User-agent: googlebot [OR]
User-agent: yandex [OR]
User-agent: slurp [OR]
User-agent: scooter
Disallow: /admin/
Disallow: /banners/
Disallow: /images/
Disallow: /includes/
Disallow: /language/
Disallow: /main/
Disallow: /users/
User-agent: *
Disallow: /
What will it do? I don't understand the User-agent lines. Does it allow those bots in, and does it stop them from going through admin, banners, etc.?
And does adding this to the httpdocs dir prevent privacy invasion?
Robots.txt Tutorial [searchengineworld.com]
Also keep in mind that not all bots follow robots.txt. You may need to get into a little htaccess [webmasterworld.com] or bad bot trapping [webmasterworld.com].
[edited by: jatar_k at 8:34 pm (utc) on Nov. 18, 2003]
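For the bots that do honor robots.txt, here is a rough way to check what the posted rules would mean. This is only a sketch using Python's standard `urllib.robotparser`, not how any particular search engine reads the file. Two assumptions on my part: the `[OR]` suffixes are dropped (that is a mod_rewrite flag, not robots.txt syntax, and how a given crawler treats those malformed lines is not something this can tell you), and `example.com`, `allowed` and `SomeOtherBot` are made-up names for illustration.

```python
from urllib.robotparser import RobotFileParser

# The rules as posted, minus the "[OR]" suffixes (assumption: they were
# copied over from mod_rewrite habits and are not meant literally).
RULES = """\
User-agent: googlebot
User-agent: yandex
User-agent: slurp
User-agent: scooter
Disallow: /admin/
Disallow: /banners/
Disallow: /images/
Disallow: /includes/
Disallow: /language/
Disallow: /main/
Disallow: /users/

User-agent: *
Disallow: /
"""

SITE = "http://example.com"  # hypothetical host, for illustration only

parser = RobotFileParser()
parser.parse(RULES.splitlines())


def allowed(agent, path):
    """Return True if this parser would let `agent` fetch `path`."""
    return parser.can_fetch(agent, SITE + path)


if __name__ == "__main__":
    checks = [
        ("googlebot", "/index.html"),
        ("googlebot", "/admin/login.php"),
        ("scooter", "/images/logo.gif"),
        ("SomeOtherBot", "/index.html"),
    ]
    for agent, path in checks:
        verdict = "allowed" if allowed(agent, path) else "blocked"
        print(f"{agent:14} {path:20} {verdict}")
```

Read this way, the four named bots share one group: they may crawl everything except the listed directories, and every other bot is shut out of the whole site by the final `User-agent: *` / `Disallow: /` pair.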
Here's a good reference: [robotstxt.org...]
Start by creating and uploading a blank text file named robots.txt.
That will at least fulfill requests for it, and it's rumored that some spiders will not continue indexing if they do not find it.
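If you are wondering what a blank file tells a well-behaved bot, a quick check with Python's standard `urllib.robotparser` suggests it means "no restrictions". This is just a sketch of one parser's behavior; the helper name `parse_robots` and the `example.com` URLs are made up for the example.

```python
from urllib.robotparser import RobotFileParser


def parse_robots(text):
    """Build a parser from the raw text of a robots.txt file."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


# An empty file contains no groups and no Disallow rules.
blank = parse_robots("")

if __name__ == "__main__":
    for path in ("/", "/admin/", "/users/profile"):
        url = "http://example.com" + path  # hypothetical host
        print(path, blank.can_fetch("googlebot", url))
```

So the blank file answers the request without blocking anything; restrictions only start once you add Disallow lines.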