Forum Moderators: open

Message Too Old, No Replies

crawl.mp3realm.org

Another maggot.

         

jmccormac

10:32 am on Aug 24, 2010 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



One of my blogs on domain names and statistics (no mp3s at all) got hit by this maggot this morning:
crawl.mp3realm.org
Mp3Bot/0.7; +http://mp3realm.org/mp3bot/

The associated IPs seem to be 66-147-236-#*$!.hrwebservices.net

No robots.txt requests.

The bot page has some rather ominous text:

[mp3realm.org...]
"Q: Can you tell me IP's so i can block it?

Unforunately,, we can not. Mp3Bot is a distributed crawler meaning many hosts on different addresses run the indexer. Also due to DHCP, dynamic IP's are constantly renewed and refreshed with a new address periodically. You are better off using the Robots Exclusion Standard to block the bot."

Regards...jmcc

Pfui

5:51 pm on Aug 24, 2010 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Too bad this is back/still at it. I've blocked the bot by name and by its name-sake Host for years. Note the multiple bot names even from the same places (semi-obfuscated):

MOTHER SHIP: mp3realm.org
AGENT:
Mp3Bot/0.2 (http://mp3realm.org/mp3bot/)
NOTES: Hit root; robots.txt? Yes

FROM: aircr*ftpost.com
AGENT:
Mozilla/5.0 (compatible; Mp3Bot/0.4; +http://mp3realm.org/mp3bot/)
NOTES: Hit root; robots.txt? NO
AGENT:
Mp3Bot/0.1 (http://mp3realm.org/mp3bot/)
NOTES: Hit .html file; robots.txt? NO

FROM: .sagittariusg*llery.com
AGENT:
Mp3Bot/0.1 (http://mp3realm.org/mp3bot/)
NOTES: Hit .html file; robots.txt? NO
AGENT:
user_agent=Mp3Bot/0.1 (http://mp3realm.org/mp3bot/)
NOTES: Hit root; robots.txt? Yes -- x10 in 1 SECOND

FROM: .nycap.res.rr.com
AGENT:
user_agent=Mp3Bot/0.1 (http://mp3realm.org/mp3bot/)
NOTES: Hit root; robots.txt? Yes

Although the 'crawlees' -- a.k.a. resource-sucker-upper suckers -- sometimes ask for robots.txt, more often they don't.