Forum Moderators: open

Message Too Old, No Replies

serpstatbot

         

lucy24

10:17 pm on Dec 22, 2019 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



IP: 144.76.68.abc
UA: serpstatbot/1.0 (advanced backlink tracking bot; curl/7.58.0; http://serpstatbot.com/; abuse@serpstatbot.com)
robots.txt: yes, compliant
Most robots walking in off the street would be summarily blocked, but this one had good enough headers that it waltzed right in--and proceeded to demonstrate robots.txt compliance by carefully avoiding a roboted-out directory that is linked from all pages. Goood robot.

Requests come in clumps, anywhere from a few seconds to several minutes apart, except that the root was requested immediately after robots.txt. On my personal site, 19 pages took a bit over an hour. (This is the only site they visited, so they probably really are following-up someone else's links.)

Barring information to the contrary, I’m going to put them in the “You scratch my back, I’ll scratch yours category”. For me this means: I do not personally use the service suchandsuch robot provides. But other humans do, and the service is improved by being able to crawl freely. Conversely, I use services that other humans may not use, but which again are more effective if they’re allowed to crawl. (Only consider how exasperating it is when you have to check a link manually because the w3 link checker wasn’t allowed in.)

lucy24

10:12 pm on Apr 5, 2020 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Seen again from what turns out to be another Hetzner range, 94.130.etcetera. As on the previous visit--which I'd forgotten all about--they began with an interior page and slowly worked their way outward.

If all Hetzner-based robots behaved so nicely, I would never need to supplement my header-based blocking with flat IP lockouts.