Forum Moderators: open
168.1.128.* [24/Jul/2018:04:20:28 GET / HTTP/1.1 301 231 - Mozilla/5.0 (compatible; nsrbot/1.0; +http://netsystemsresearch.com)
196.52.43.* [24/Jul/2018:10:37:50 GET / HTTP/1.1 403 636 - Mozilla/5.0 (compatible; nsrbot/1.0; +http://netsystemsresearch.com) [edited by: keyplyr at 12:21 am (utc) on Aug 5, 2018]
[edit reason] Delinked URL [/edit]
Request Headers: did not trigger request headersThe second request--the one that got as far as a 403--should have triggered header logging. (You've got it on your 403 page, right?) It's always frustrating to see a 301 followed by a 403 for the same underlying request. It makes sense in this rare case: where they happened to use different IPs, and only the second one is blocked. (Tangent: Currently I'm most likely to see the 301-to-403 sequence if the initial request was for a directory without final / slash, and then the redirected request runs into a narrowly constrained rule intended only for page requests.)
Mozilla/5.0 (compatible; YandexBot/3.0; +http://yandex.com/bots)
Mozilla/5.0 (Linux; Android 6.0.1; ASUS_Z00UD Build/MMB29P; wv) AppleWebKit/537.36 (KHTML, like Gecko) Version/4.0 Chrome/55.0.2883.91 Mobile Safari/537.36 GSA/6.8.23.21.arm64
Mozilla/5.0 (compatible; YandexBot/3.0; +http://yandex.com/bots)
Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)
Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)
Mozilla/5.0 (compatible; YandexImages/3.0; +http://yandex.com/bots)
Mozilla/5.0 (compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm)
Mozilla/5.0 (compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm)
Mozilla/5.0 (compatible; nsrbot/1.0; +http://netsystemsresearch.com)
Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/51.0.2704.106 Safari/537.36
196.52.43.abc - - [24/Jul/2018:13:39:16 -0700] "GET / HTTP/1.1" 400 1785 "-" "Mozilla/5.0 (compatible; nsrbot/1.0; +http://netsystemsresearch.com)"
...
168.1.128.abc - - [24/Jul/2018:15:22:26 -0700] "GET / HTTP/1.1" 400 1785 "-" "Mozilla/5.0 (compatible; nsrbot/1.0; +http://netsystemsresearch.com)"
That’s on the HTTPS side. On the HTTP side they got a couple of 403s (both using the 196 address). Is it possible they blundered in some way when trying to make the HTTPS request? I don’t see a lot of 400s. Not that I'm complaining, mind you...
196.52.43.* [26/Jul/2018:13:59:00 GET / HTTP/1.1 403 636 - Mozilla/5.0 (compatible; nsrbot/1.0; +http://netsystemsresearch.com) 2018-07-26:13:59:00
URL: /
IP: 196.52.43.*
Host: www.example.com
User-Agent: Mozilla/5.0 (compatible; nsrbot/1.0; +http://netsystemsresearch.com)
If you are referring to the host I think you areYup, the host who returns 418 Teapot Error responses to requests blocked by mod_security. (It was originally 403, but I think they figured out that it's better to use a different code.)
In that case, an HTTPS header would not do anythingI made two concurrent changes: one, adding HTTPS information to all header logging, and two, designating an ErrorDocument for the 400 response. Since headers are all logged into the same directory, I would otherwise have to cross-check with access logs to see whether a given request came in on the HTTP side or the HTTPS side. (At a much earlier date, I added the requested filename to header logging because that, too, doesn't count as a header. Again, to prevent having to cross-check against access logs.) And then, since headers are logged as part of the error document preparation, the information should exist even if the unwanted visitor ended up being unable to receive the error document.