Internet Memory

keyplyr

7:59 am on Apr 1, 2016 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Anyone know how to get these guys to stop? Getting 2k to 3k hits to the same file day after day. Email requests to stop have so far been ignored.

Normally I would assume the server instance was poorly written (without the ending code), but this seems to happen all the time with these guys. Both the UA and the IP range are blocked, but it's still more trash to process that I don't need.

"GET /example.html HTTP/1.1" 403 1543 "-" "Python-urllib/2.7"


Internet Memory Research, France
internetmemory.org
37.16.72.0 - 37.16.72.255
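
For anyone wanting to test the same UA-plus-range rule against their own logs before putting it in a server config, here is a minimal Python sketch. The function name `should_block` and the substring list are illustrative assumptions, not anything keyplyr posted; the range is the one quoted above, written as a /24. Matching on "python-urllib" is broad and will also catch unrelated scripts.

```python
import ipaddress

# Range quoted above: 37.16.72.0 - 37.16.72.255, which is a /24.
BLOCKED_NET = ipaddress.ip_network("37.16.72.0/24")

# Hypothetical list of user-agent substrings, compared in lowercase.
BLOCKED_UA_PARTS = ("python-urllib",)


def should_block(remote_ip: str, user_agent: str) -> bool:
    """Return True if the IP is in the blocked range or the UA matches."""
    in_range = ipaddress.ip_address(remote_ip) in BLOCKED_NET
    ua = user_agent.lower()
    return in_range or any(part in ua for part in BLOCKED_UA_PARTS)
```

A request that trips either check would get the 403 shown in the log line above; everything else passes through.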

lucy24

8:10 pm on Apr 1, 2016 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I was going to say I'd never heard of the guys ... but it turns out they used a different UA with me.

37.16.72.213 - - [05/Feb/2015:06:06:11 -0800] "GET /robots.txt HTTP/1.1" 200 799 "-" "Mozilla/5.0 (compatible; memorybot/1.21.14 +http://mignify.com/bot.html)"

:: pause to scream in excitement upon discovering that TextWrangler has an April Fool's Day Easter egg ::

The name implies it's the same underlying organization, except this one appears to have been robots.txt compliant. (I found one day where they got hundreds of files, but didn't ask for anything in roboted-out directories that link from all over.) It varied between "memorybot" and "memoryBot", otherwise identical.
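
To pull visits like this out of an access log regardless of the "memorybot" / "memoryBot" spelling, something along these lines might work. It is only a sketch and assumes the combined log format shown in the line above; `parse_line` and `is_memorybot` are made-up helper names.

```python
import re

# Combined log format: ip ident user [time] "request" status size "referer" "ua"
LOG_RE = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referer>[^"]*)" "(?P<ua>[^"]*)"'
)


def parse_line(line: str):
    """Return the log fields as a dict, or None if the line does not match."""
    m = LOG_RE.match(line)
    return m.groupdict() if m else None


def is_memorybot(user_agent: str) -> bool:
    """Case-insensitive check, so both spellings of the bot name match."""
    return "memorybot" in user_agent.lower()
```

Running `parse_line` over each line of a log and filtering on `is_memorybot(fields["ua"])` gives the requests made under either spelling.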

Past tense because they haven't been around since April 2015. When did the new one show up?

keyplyr

12:44 am on Apr 2, 2016 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Thanks for the additional UAs.

If these runaway requests continue, I may need to employ a more *creative* response.

stan_i

9:17 am on Apr 28, 2016 (gmt 0)

10+ Year Member



Keyplyr, I work for Internet Memory and I noticed your post. I would like to apologize for the inconvenience, which was caused by a bug in one of our services that should be fixed by now. Also, can I ask which email address you used to report its rogue behavior? We don't take feedback from the sites we crawl lightly. I checked with our QA department, and they didn't receive any comments regarding this...

keyplyr

1:12 pm on Apr 28, 2016 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



@stan_i Thanks for the response.

On or about March 31 I sent the complaint to the admin email listed in your company's WHOIS registration. There was no reply.

It is a non-issue now.