Forum Moderators: goodroi

Message Too Old, No Replies

boitho.com bot violating robots.txt

Specifically requested only forbidden files

         

jazzguy

8:08 pm on May 5, 2005 (gmt 0)

10+ Year Member



"boitho.com-dc/0.75 ( http*//www.boitho.com/dcbot.html )" came from 129.241.104.168. It specifically targetted disallowed files from robots.txt, ignoring all other pages.

The info page says it's a distributed crawler, so just like my policy for the cronic robots.txt violater Grub, I banned the user agent and the entire IP block associated with the offending IP.

bcolflesh

6:33 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Rejecting offers to supply more information...

So he rejected the robots.txt file you sent him?

Lord Majestic

6:33 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Rejecting offers to supply more information is another easy way to dismiss reported bugs.

The information that matters is the robots.txt file and URLs to check that you refused to provide.

Its like saying that a browser is not rendering page correctly and then refusing to provide HTML to verify bug report.

jazzguy

6:38 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



So you finally supplied him with the only info that would matter

<Sigh> As has been covered already in this thread, there's no way to know if the information I offered would have mattered because the offer itself was rejected -- no information was ever examined.

the guy never supplied the robots.txt

Already covered multiple times in this thread.

instead he offered him his advise as consultant :P

Your sarcastic remark highlights a big part of what caused this thread to degenerate so quickly, i.e., assuming that a report is inaccurate or that a reporter is a rookie.

Lord Majestic

6:41 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Already covered multiple times in this thread.

Its a rather safe guess that it would have taken you less time to re-type the whole of your robots.txt manually using small finger on the left hand than to post as much as you did in this thread.

jazzguy

6:43 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



So he rejected the robots.txt file you sent him?

Already covered multiple times in this thread. Since this is an old thread, I think some people might benefit from re-reading it before they post as most of the recent questions have already been covered.

fischermx

6:43 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



From my 20 years of my programming experience I can tell there's nothing more annoying than a user complaining about bugs without showing proof.

bcolflesh

6:47 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



...or that a reporter is a rookie

A typical rookie mistake is not providing the necessary data to fix a problem - ex: telling someone their spider is broken, then not providing their robots.txt and a sample URL.

jazzguy

6:49 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



Its a rather safe guess

More assumptions.

that it would have taken you less time to re-type the whole of your robots.txt manually using small finger on the left hand than to post as much as you did in this thread.

Okay, now you're just trolling. Supplying the robots.txt has already been covered multiple times in this thread. Do you really want to spend another three pages saying the same things over and over again?

Lord Majestic

6:51 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



A typical rookie mistake is not providing the necessary data to fix a problem

I always ask for specific information that helps track bug, and asking for robots.txt is pretty much given, so much that I never came across with someone actually refusing to show it -- jazzman is the first, and hopefully last.

Its as if the guy does not want his problem fixed, or the problem did not exist in the first place. Note that he started this thread accusing boitho spider of violating robots.txt, but then quickly switched to bash my bot, in both cases not providing any details that can reasonably help investigate the problem.

jazzguy

6:51 pm on Jun 14, 2005 (gmt 0)

10+ Year Member



A typical rookie mistake is not providing the necessary data to fix a problem - ex: telling someone their spider is broken, then not providing their robots.txt and a sample URL.

You're way behind -- try to keep up. That was covered way back at the beginning of the thread.

This 111 message thread spans 12 pages: 111