
What bots ignore robots.txt?

         

Jon12345

2:19 pm on Aug 16, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Ok, so there are various bots and crawlers parading around the internet. I am using this...

User-agent: *
Disallow: red.php

...in my robots.txt file. I presume this is correct if I don't want any bots to follow links to the red.php page, yes?

But what percentage of bots actually ignore such a request? Any idea?

Also, should I use a no-follow tag instead?

Thanks,

Jon

Dijkgraaf

4:09 am on Aug 17, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Not quite correct: it should have a leading slash before the page name, as below.

User-agent: *
Disallow: /red.php

This will tell bots not to do a GET /red.php from your domain.

Any bots that don't obey robots.txt are unlikely to obey the nofollow tags.

Jon12345

10:01 am on Aug 17, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Are you saying my version won't work at all?

Lord Majestic

1:57 pm on Aug 17, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



"Are you saying my version won't work at all?"

Yes, your version is incorrect. robots.txt does not do pattern or substring matching; a Disallow rule excludes URLs whose path starts with the value provided. Since every URL path starts with /, it is essential to include the leading /.
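
As a rough illustration of the prefix rule described above, here is a minimal Python sketch. The function name is_disallowed is made up for this example, and it only models the basic "path starts with the Disallow value" check, not a full robots.txt parser.

```python
def is_disallowed(path, disallow_rules):
    """Return True if the URL path starts with any non-empty Disallow value.

    Hypothetical helper for illustration: it models only the basic
    prefix comparison, with no wildcard or Allow handling.
    """
    return any(bool(rule) and path.startswith(rule) for rule in disallow_rules)


request_path = "/red.php"  # the path a bot would request with GET /red.php

# Without the leading slash the rule is never a prefix of the path.
print(is_disallowed(request_path, ["red.php"]))   # False

# With the leading slash the rule is a prefix of the path, so it matches.
print(is_disallowed(request_path, ["/red.php"]))  # True
```

Because the comparison is a prefix test, "/red.php" would also cover a request such as /red.php?id=1 in this sketch.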

Jon12345

4:42 pm on Aug 17, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Ok, thank you. I have changed it now.

Jioffri

6:14 am on Aug 18, 2005 (gmt 0)

10+ Year Member



Jon,

Would the following also work to block red.php?

User-agent: red.php
Disallow: /

Or is it more correct to use the example you gave to deny every bot, and then list whatever a person wants to block individually beneath it, on separate lines, as the need arises?

Dijkgraaf

9:28 pm on Aug 18, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Jioffri, your example is saying that a bot called red.php isn't allowed to fetch anything starting with / (i.e. the whole site).
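
For anyone who wants to check the difference between the two files, here is a small sketch using Python's standard urllib.robotparser module. The bot name SomeBot, the helper build_parser, and the example.com URL are placeholders invented for this example. A well-behaved crawler may implement the rules somewhat differently from this module.

```python
from urllib.robotparser import RobotFileParser


def build_parser(robots_txt):
    """Parse robots.txt text held in a string (hypothetical helper)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser


# Jioffri's version: the User-agent line names a bot, not a page.
by_agent = build_parser("User-agent: red.php\nDisallow: /\n")

# The corrected version from earlier in the thread.
by_path = build_parser("User-agent: *\nDisallow: /red.php\n")

url = "http://example.com/red.php"  # placeholder domain

# An ordinary bot is not named in the first file, so nothing is blocked for it.
print(by_agent.can_fetch("SomeBot", url))  # True

# Only a bot that actually calls itself "red.php" is shut out of the site.
print(by_agent.can_fetch("red.php", url))  # False

# The corrected file blocks the page for every bot.
print(by_path.can_fetch("SomeBot", url))   # False
```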

Jioffri

10:56 pm on Aug 18, 2005 (gmt 0)

10+ Year Member



Ok, thanks! I understand now.