AccompanyBot

         

tangor

11:30 pm on Aug 15, 2020 (gmt 0)


That's the full UA: "AccompanyBot", with nothing else in the string.

robots.txt: NO
IP: various ranges, four different ISP hosts (all North American, primarily Kansas and Pennsylvania, with a one-off in New York state).

Rips HTML only: no images, no CSS, no .ico (I don't use JS, so I can't say about that).
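For anyone who wants to run the same checks on their own logs, here is a minimal sketch. It assumes the Apache combined log format; the function name `profile_ua` and the regex are made up for illustration, not taken from any tool. It reports whether a given UA ever asked for robots.txt, which hosts it came from, and which file types it requested.

```python
import re
from collections import Counter
from pathlib import PurePosixPath

# Assumption: Apache "combined" log format. Adjust if your server logs differently.
LOG_RE = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" '
    r'(?P<status>\d{3}) \S+ "(?P<referer>[^"]*)" "(?P<ua>[^"]*)"'
)

def profile_ua(lines, ua="AccompanyBot"):
    """Summarise what one exact user-agent string requested (hypothetical helper)."""
    kinds = Counter()       # file extension -> number of requests
    hosts = set()           # remote hosts the UA was seen from
    fetched_robots = False  # did it ever request /robots.txt?
    for line in lines:
        m = LOG_RE.match(line)
        if not m or m["ua"] != ua:
            continue
        hosts.add(m["host"])
        path = m["path"].split("?", 1)[0]  # ignore any query string
        if path == "/robots.txt":
            fetched_robots = True
        # Extensionless and directory requests are counted as "(none)".
        kinds[PurePosixPath(path).suffix.lower() or "(none)"] += 1
    return {"robots_txt": fetched_robots, "hosts": hosts, "kinds": kinds}
```

Feed it the lines of an access log (for example `profile_ua(open("access.log"))`, where the filename is whatever yours is called). A `kinds` counter with nothing but `.html` and `(none)` would match the HTML-only pattern described above.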

What little info I could find says this bot is involved in senior-citizen research leading to "care" of some kind, toward an eventual AI ...

Count me out!

lucy24

5:23 pm on Aug 16, 2020 (gmt 0)


:: detour to raw logs ::

Yup, there they are. Out of sight, out of mind.

104.243.197.abc - - [19/Jul/2020:11:52:10 -0700] "GET /ebooks/chapman/iliadintro.html HTTP/1.1" 403 7354 "-" "AccompanyBot"
207.188.146.abc - - [08/Aug/2020:01:32:04 -0700] "GET /ebooks/teatable HTTP/1.1" 403 7354 "-" "AccompanyBot"

Noteworthy in the second case is that the URL is actually /teatable/ (as in, a real physical directory), suggesting it’s one of “Those” bots, like Applebot, that wants everything to be extensionless. Also worth noting that both are deep interior pages, not linked from the root at the time of the request.

I cross-checked to see if there were any human requests for the same URL from a roughly similar IP in the same general time period, but no go.
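That kind of cross-check can be roughed out in a few lines. This is only a sketch under my own assumptions: Apache combined log format, "roughly similar IP" taken to mean the same first three octets, and "same general time period" taken to mean within an hour either way. The names `parse`, `prefix` and `nearby_requests` are invented for the example.

```python
import re
from datetime import datetime, timedelta

# Assumption: Apache "combined" log format.
LOG_RE = re.compile(
    r'(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" '
    r'(?P<status>\d{3}) \S+ "(?P<referer>[^"]*)" "(?P<ua>[^"]*)"'
)

def parse(line):
    """Turn one log line into a dict, or None if it does not match."""
    m = LOG_RE.match(line)
    if not m:
        return None
    return {
        "host": m["host"],
        # %b expects English month abbreviations (C/English locale).
        "time": datetime.strptime(m["time"], "%d/%b/%Y:%H:%M:%S %z"),
        "path": m["path"],
        "ua": m["ua"],
    }

def prefix(host, octets=3):
    """First three octets of the address: a crude stand-in for 'similar IP'."""
    return ".".join(host.split(".")[:octets])

def nearby_requests(lines, bot_ua="AccompanyBot", window=timedelta(hours=1)):
    """Pair each bot hit with non-bot hits on the same URL from a similar IP."""
    records = [r for r in map(parse, lines) if r]
    bot_hits = [r for r in records if r["ua"] == bot_ua]
    others = [r for r in records if r["ua"] != bot_ua]
    matches = []
    for b in bot_hits:
        for o in others:
            if (o["path"].rstrip("/") == b["path"].rstrip("/")  # /teatable vs /teatable/
                    and prefix(o["host"]) == prefix(b["host"])
                    and abs(o["time"] - b["time"]) <= window):
                matches.append((b, o))
    return matches
```

An empty list back is the "no go" result: nobody with a different UA asked for the same page from a neighbouring address around the same time. The nested loop is fine for a one-off look at a single bot; it would want indexing by path before being run over a big log.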

And who the heck would ever use /iliadintro as an entry page? (Answer, based on further log-searching: people sent by Google. I really wonder what they were searching for and why the search engine thought they would find it on this page.)