UA: SiteSucker for macOS/2.10.4 Protocol: HTTP/1.1 Robots.txt: Yes Host: Various
disobeyed robots.txt where it is disallowed
not2easy
8:40 pm on Jul 12, 2017 (gmt 0)
That's a pretty old OS there... I wonder if they have a version for WIN98?
keyplyr
8:52 pm on Jul 12, 2017 (gmt 0)
SiteSucker is pretty old but I've not seen "SiteSucker for macOS" before. Since you use Apple stuff, you may have seen it previously.
lucy24
9:38 pm on Jul 12, 2017 (gmt 0)
That's a pretty old OS there.
You mean “macOS/2.10.4”? Haha, yeah, that wouldn't even be a real OS; they were called System suchandsuch up until, I think, 8.5. Or was it 7.6? In any case I suppose that's really just the robot's version number.
It would make me very, very uneasy if a robot visiting my website professed to have some knowledge of the operating system on my home computer. Fortunately I've never set eyes on 'em.
keyplyr
10:11 pm on Jul 12, 2017 (gmt 0)
...if a robot visiting my website professed to have some knowledge of the operating system on my home computer
I don't think that's what's going on, in fact I'm sure of it. It's just the name of the bot.
not2easy
10:53 pm on Jul 12, 2017 (gmt 0)
I think you're right, keyplyr - and no, I've never seen it. Pretty sure it was fabricated on the spot, in the space where you see that [Name Your Robot] thing. It was meant to be a funny (there never was an OS/2. anything). Sorry.
keyplyr
11:29 pm on Jul 12, 2017 (gmt 0)
Any User Agent can be renamed to anything. It doesn't have to be that "[Name Your Robot] thing." This includes browsers.
So many malicious bots name themselves something benign in order to not be blocked.
lucy24
12:22 am on Jul 13, 2017 (gmt 0)
Yes, and then they blow it all by barging in demanding 17 variations on /wpadmin/
keyplyr
12:26 am on Jul 13, 2017 (gmt 0)
Well, since this the first time I've seen this UA ever, I've not seen them request /wpadmin/. In fact, I've never seen *any* stand alone UAs request /wpadmin/, it's always done from scripts that pretend to be human.
lucy24
4:47 am on Jul 13, 2017 (gmt 0)
I don't remember the percentage now, but a fair number of /wpadmin/ and allied malign robots come in with no UA at all. One can only assume they're looking for sites that are utterly wide-open with no protections of any kind.
Incidentally I didn't mean to imply that SiteSucker, specifically, goes around requesting /wpadmin/. It was a general response to the “something benign in order to not be blocked”.