Forum Moderators: open

Message Too Old, No Replies

SISTRIX Crawler

sistrix.com/net/de SEO ignores robots.txt

         

Pfui

12:10 pm on Aug 9, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



46-4-122-90.crawler.sistrix.net
Mozilla/5.0 (compatible; SISTRIX Crawler; http://crawler.sistrix.net/)

robots.txt? Yes BUT -- immediately ignored and hit /

WHOIS for sistrix.net shows:

e-mail: sistrix.com
nameserver: sistrix.de

Project Honey Pot shows they run from these IPs (presumably partial listing; WHOIS also says they have about 232 domains):

46.4.122.74
46.4.122.75
46.4.122.76
46.4.122.77
46.4.122.78
46.4.122.79
46.4.122.80
46.4.122.81
46.4.122.82
46.4.122.83
46.4.122.84
46.4.122.85
46.4.122.86
46.4.122.87
46.4.122.88
46.4.122.89
46.4.122.90
46.4.122.91
46.4.122.92
46.4.122.93

dstiles

8:53 pm on Aug 9, 2011 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Those IPs belong to German Hetzner, which should be blocked with prejudice.

Ones I knw about are...

46.4.0.0 - 46.4.255.255
78.46.0.0 - 78.47.255.255
85.10.192.0 - 85.10.255.255
88.198.0.0 - 88.198.255.255
176.9.0.0 - 176.9.255.255
178.63.0.0 - 178.63.255.255
188.40.0.0 - 188.40.255.255
213.133.96.0 - 213.133.127.255
213.239.192.0 - 213.239.255.225