IP: aa.bb.cc.dd
Connection: close
X-Fowarded-For: aa.bb.cc.dd
User-Agent: compatible;Baiduspider/2.0; +http://www.baidu.com/search/spider.html
Host: example.com
Note the spelling. Unlike "referer", that is not how the word is customarily spelled. Or, if you prefer, spelt.
IP: aa.bb.cc.dd
Accept: */*
Accept-Language: zh-cn,zh-tw
Accept-Encoding: gzip
User-Agent: Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)
Connection: close
Host: example.com
but the fake one is currently more common. Not aware of it being used as a space anywhere.
They seem to be partial to URLs with "fck" in them somewhere (is this a WP thing?)* which should be an automatic block.
a great way to collect scanning IPs
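For anyone who wants to try the "fck" rule outside of server config, here is a minimal sketch in Python. It assumes you already have the request path and the client IP as plain strings. The names `should_block` and `scanner_ips` are made up for illustration, and the in-memory set stands in for whatever log or blocklist you actually keep.

```python
# Hypothetical helper: block any request whose path contains "fck"
# and remember the requesting IP as a likely scanner.
scanner_ips = set()

def should_block(path, ip):
    """Return True (and record the IP) if the path contains "fck"."""
    if "fck" in path.lower():  # case-insensitive substring match
        scanner_ips.add(ip)
        return True
    return False
```

The match is deliberately crude: any URL with "fck" anywhere in it is refused, so check first that none of your own URLs contain that string.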
Useragent:
Most of the time it's followed by a correctly spelled User-Agent header, always specifying a different UA, such as
Useragent: Mozilla/5.0 (compatible; MSIE 9.0; Windows NT 6.1; WOW64; Trident/5.0)
User-Agent: Mozilla/4.0 (compatible; Win32; WinHttp.WinHttpRequest.5)
but sometimes it isn't. Which is fine, because then the request reads as "no user-agent" and is blocked forthwith. And most of the duplicates are Chinese robots who are handily blocked on other grounds.
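The header checks discussed in this thread can be sketched roughly as follows, assuming the request headers arrive as a list of (name, value) pairs exactly as sent. The function name `request_flags` and the flag strings are invented for the example; the two misspelled names are the ones quoted above.

```python
# Misspelled header names seen in this thread, lowercased for comparison.
MISSPELLED_HEADER_NAMES = {"x-fowarded-for", "useragent"}

def request_flags(headers):
    """Return a set of flags for a list of (name, value) header pairs.

    "misspelled-header": a known misspelled header name is present.
    "no-user-agent": no non-empty, correctly spelled User-Agent was sent.
    """
    flags = set()
    has_user_agent = False
    for name, value in headers:
        key = name.strip().lower()
        if key in MISSPELLED_HEADER_NAMES:
            flags.add("misspelled-header")
        if key == "user-agent" and value.strip():
            has_user_agent = True
    if not has_user_agent:
        flags.add("no-user-agent")
    return flags
```

A request carrying only "Useragent:" comes back with both flags, which matches the "no user-agent" behavior described above. One that sends both spellings gets only the misspelled-header flag, so it is up to you whether that alone is grounds for a block.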