Forum Moderators: goodroi


boitho.com bot violating robots.txt

Specifically requested only forbidden files

         

jazzguy

8:08 pm on May 5, 2005 (gmt 0)

10+ Year Member



"boitho.com-dc/0.75 ( http*//www.boitho.com/dcbot.html )" came from 129.241.104.168. It specifically targeted files disallowed in robots.txt, ignoring all other pages.

The info page says it's a distributed crawler, so just like my policy for the chronic robots.txt violator Grub, I banned the user agent and the entire IP block associated with the offending IP.
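For anyone who wants to check their own logs for the same pattern, here is a minimal sketch using Python's standard urllib.robotparser. The robots.txt rules and the request paths are hypothetical placeholders (the real file and URLs are not shown in this thread), and only_disallowed_agents is an illustrative helper name, not part of any library.

```python
from collections import defaultdict
from urllib.robotparser import RobotFileParser

# Hypothetical rules for illustration; the real robots.txt is not shown in the thread.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
Disallow: /trap/
"""


def only_disallowed_agents(robots_txt, requests):
    """Return user agents whose every logged request hit a disallowed path.

    `requests` is an iterable of (user_agent, path) pairs, e.g. extracted
    from a server access log.
    """
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())

    allowed = defaultdict(int)
    disallowed = defaultdict(int)
    for agent, path in requests:
        # Fetching robots.txt itself is always legitimate, so skip it.
        if path == "/robots.txt":
            continue
        if parser.can_fetch(agent, path):
            allowed[agent] += 1
        else:
            disallowed[agent] += 1

    # Keep agents that made at least one disallowed request and no allowed ones.
    return sorted(agent for agent in disallowed if allowed[agent] == 0)
```

Feeding it (user_agent, path) pairs pulled from an access log gives a list of agents that touched nothing but disallowed paths, which is the behavior described above. It does not prove intent; a crawler with a stale copy of robots.txt could show up the same way.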

bcolflesh

8:00 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Regardless of what the bot owner said it would be in the best interest for everyone to see the file.

Again, that just shows that you either haven't read the thread or you're trolling. Already asked and answered multiple times in this thread.

jazzguy

8:00 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



I can't add any value to that statement

That would be in keeping with your previous posts considering that you haven't added anything that hasn't been previously discussed.

bcolflesh

8:03 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



...you haven't added anything

I'm pretty sure he's added necessary information when prompted by forum members in previous threads.

jazzguy

8:05 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



One of us sure is in denial

Care to elaborate?

Now where is that robots.txt you so persistently denied to me?

Now you're trolling too, aren't you? Already asked and answered multiple times in this thread.

rj87uk

8:06 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



.. Already asked and answered multiple times in this thread.

Now can you truly answer? Give the best answer you have had the whole time you've been here and post the robots.txt file.

Lord Majestic

8:14 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Ohhhhhh, my bad, I just found the robots.txt he was referring to all the time at the beginning of the thread; reposting it here:

----- file starts -----
# robotz.txt (c) jazzman

Botz pleaze do not visit ma site or else u w1ll 3nter w0rld of p@1n

EOF
P.S. MJ13bot tr@p here
----- file ends -----

I am off to the pub and then will get back to the drawing board!

[edited by: Lord_Majestic at 8:15 pm (utc) on June 14, 2005]

jazzguy

8:15 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



I didn't miss it, it's still not the point

On the contrary--it's exactly the point.

you had problem

I didn't have a problem. I posted about a bot that violated robots.txt.

and posted information. The bot owner asked for more information, as what you had given wasn't enough to look into it

So I offered more information which he rejected.

and also asked for the robots.txt file and any URLs.

That was already covered on the very first page of this thread and multiple times since.

You didn't post the information.

That's because he rejected my multiple offers. The offers were later rescinded due to the bot owner's sarcasm and insults.

Regardless of what the bot owner said it would be in the best interest for everyone to see the file.

Already covered multiple times in this thread.

bcolflesh

8:17 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



----- file starts -----
# robotz.txt (c) jazzman

Botz pleaze do not visit ma site or else u w1ll 3nter w0rld of p@1n

EOF
P.S. MJ13bot tr@p here
----- file ends -----

Jizzguy, if this is indeed the contents of your robots.txt file, it is not valid - check Google for some validators that will help you fix this up.

jazzguy

8:17 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



post the robots.txt file

Already covered multiple times in this thread.

rj87uk

8:18 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Jizzguy, if this is indeed the contents of your robots.txt file, it is not valid - check Google for some validators that will help you fix this up.

I think he may be on to something... worth looking into I would say.

This 111-message thread spans 12 pages.