Forum Moderators: goodroi

Message Too Old, No Replies

boitho.com bot violating robots.txt

Specifically requested only forbidden files

         

jazzguy

8:08 pm on May 5, 2005 (gmt 0)

10+ Year Member



"boitho.com-dc/0.75 ( http*//www.boitho.com/dcbot.html )" came from 129.241.104.168. It specifically targetted disallowed files from robots.txt, ignoring all other pages.

The info page says it's a distributed crawler, so just like my policy for the cronic robots.txt violater Grub, I banned the user agent and the entire IP block associated with the offending IP.

bcolflesh

8:00 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Regardless of what the bot owner said it would be in the best interest for everyone to see the file.

Again, that just shows that you either haven't read the thread or you're trolling. Already asked and answered multiple times in this thread.

jazzguy

8:00 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



I cant add any value to that statement

That would be in keeping with your previous posts considering that you haven't added anything that hasn't been previously discussed.

bcolflesh

8:03 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



...you haven't added anything

I'm pretty sure he's added necessary information when prompted by forum members in previous threads.

jazzguy

8:05 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



One of us sure is in denial

Care to elaborate?

Now where is that robots.txt you so persistently denied to me?

Now you're trolling too aren't you? Already asked and answered multiple times in this thread.

rj87uk

8:06 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



.. Already asked and answered multiple times in this thread.

Now can you truly answer - Give the best answer you have the whole time being here and post the robots.txt file?

Lord Majestic

8:14 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Ohhhhhh, my bad, I just found robots.txt he was referring to all th etime in the beginning of the thread, reposting it here:

----- file starts -----
# robotz.txt (c) jazzman

Botz pleaze do not visit ma site or else u w1ll 3nter w0rld of p@1n

EOF
P.S. MJ13bot tr@p here
----- file ends -----

I am off to pub and then will get back to the drawing board!

[edited by: Lord_Majestic at 8:15 pm (utc) on June 14, 2005]

jazzguy

8:15 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



I didnt miss it, Its still not the point

On the contrary--it's exactly the point.

you had problem

I didn't have a problem. I posted about a bot the violated robots.txt.

and posted information, The bot owner asked for more information as what you had gave wasnt enough to look into it

So I offered more information which he rejected.

and also asked for the robots.txt file and any URLS.

That was already covered on the very first page of this thread and multiple times since.

You didnt post the information.

That's because he rejected my multiple offers. The offers were later rescinded due to the bot owner's sarcasm and insults.

Regardless of what the bot owner said it would be in the best interest for everyone to see the file.

Already covered multiple times in this thread.

bcolflesh

8:17 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



----- file starts -----
# robotz.txt (c) jazzman

Botz pleaze do not visit ma site or else u w1ll 3nter w0rld of p@1n

EOF
P.S. MJ13bot tr@p here
----- file ends -----

Jizzguy, if this is indeed the contents of your robots.txt file, it is not valid - check Google for some validators that will help you fix this up.

jazzguy

8:17 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



post the robots.txt file

Already covered multiple times in this thread.

rj87uk

8:18 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Jizzguy, if this is indeed the contents of your robots.txt file, it is not valid - check Google for some validators that will help you fix this up.

I think he may be on to something... worth looking into I would say.

jazzguy

8:21 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



Ohhhhhh, my bad, I just found robots.txt he was referring to all th etime in the beginning of the thread, reposting it here

<snip ridiculous bogus robots file>

A perfect example of the sarcasm that resulted in my rescinding the offer to help you.

Lord Majestic

8:23 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



A perfect example of the sarcasm

Thanks dude, you sure can make other people feel appreciated!

that resulted in my rescinding the offer to help you.

I was trying to get you to help me help you, however it is now clear that you need the kind of help that is beyond my field of expertise.

I suggest you refer to post #2 in this thread where I asked for robots.txt rather politely:

Just out of interest, can you post your robots.txt?

100 posts and counting -- and no robots.txt yet.

[edited by: Lord_Majestic at 8:25 pm (utc) on June 14, 2005]

rj87uk

8:23 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I will say sorry for him(is it even needed?) and also say that he didnt mean to reject your multiple offers.

bcolflesh

8:23 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



A perfect example of the sarcasm...

It's not sarcasm to hold out your hand to a friend in need - robots.txt mistakes can cause spiders to improperly interact with your site - there are tons of good validators that can clear up these issues - check 'em out!

jazzguy

8:25 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



if this is indeed the contents of your robots.txt file, it is not valid

Either you're a complete idiot or you're still trolling. If the latter, I guess you can carry on as you see fit. If the former, that's something you'll have to work out for yourself.

Oh and in case it was the former, of course it's not the contents of my robots.txt file.

jazzguy

8:26 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



I think he may be on to something... worth looking into I would say.

See above

bcolflesh

8:26 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



...of course it's not the contents of my robots.txt file

Doh! Sorry about the mixup friend - can you post the contents of your actual robots.txt file here?

jazzguy

8:27 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



Doh! Sorry about the mixup friend - can you post the contents of your actual robots.txt file here?

Troll,

Already covered multiple times in this thread.

Lord Majestic

8:28 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Oh and in case it was the former, of course it's not the contents of my robots.txt file.

Hey, it was a fair guess given the circumstances! :)

At least now when you send people away to check thread for that mystical robots.txt, something remotely relevant to it can be found.

If I made a few people smile today then posting here was not a complete waste of time.

rj87uk

8:29 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I thought Trolls only make one post then leave again? Hes been here for a while now asking for the 'robots.txt' file so we can all help you!

Ps. Do you really have one?

bcolflesh

8:30 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Already covered multiple times in this thread.

I keep searching the thread but I cannot find the contents of the robots.txt file you claim to have posted - what does "troll" mean in this context? If you have "troll" as a robots.txt directive, this may be the problem - please check one of the validators I mentioned earlier.

This 111 message thread spans 4 pages: 111