I was looking at my Apache server logs and noticed there are hundreds of very strange GET requests from all sorts of bots including Googlebot, Yahoo Slurp, Baidu spider, Bingbot, and others.
They appear to be randomly generated URLs for .html pages that do not exist and are totally unrelated to my website. (My site is for doors.)
Here are a few examples:
67.195.111.184 - - [18/Oct/2011:04:28:36 -0700] "GET www.mysite.com/ofsR-free-printable-probability-worksheets.html HTTP/1.0" 404 24074 "-" "Mozilla/5.0 (compatible; Yahoo! Slurp; [help.yahoo.com...]
157.55.18.23 - - [18/Oct/2011:02:15:01 -0700] "GET mysite.com/ofsR-map-us-printable-free.html HTTP/1.1" 404 24074 "-" "Mozilla/5.0 (compatible; bingbot/2.0; +http://www.bing.com/bingbot.htm)"
180.76.5.26 - - [18/Oct/2011:05:37:04 -0700] "GET www.mysite.com/ofsR-free-printable-birthday-templates.html HTTP/1.1" 404 24074 "-" "Mozilla/5.0 (compatible; Baiduspider/2.0; +http://www.baidu.com/search/spider.html)"
208.115.111.67 - - [18/Oct/2011:06:12:25 -0700] "GET www.mysite.com/ofsR-printable-coloring-disney-pages.html HTTP/1.1" 404 24074 "-" "Mozilla/5.0 (compatible; Ezooms/1.0; ezooms.bot@gmail.com)"
Here are two very odd requests where neither the requested page nor the referring page exists (the first IP traces to Amsterdam, the second to Beijing):
91.224.247.82 - - [18/Oct/2011:05:28:56 -0700] "GET www.mysite.com/images/indext.php HTTP/1.0" 404 24074 "http://www.mysite.com/images/indext.php?u=freelance-hairstylist-powered-by-phpbb" "Mozilla/4.0 (compatible; MSIE 6.0; MSIE 5.5; Windows NT 5.0) Opera 7.02 Bork-edition [en]"
222.33.62.9 - - [18/Oct/2011:08:52:42 -0700] "GET www.mysite.com/images/indext.php HTTP/1.0" 404 24074 "http://www.mysite.com/images/indext.php?u=freelance-hairstylist-powered-by-phpbb" "Mozilla/4.0 (compatible; MSIE 5.0; Windows 3.1)"
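In case it helps anyone look at the same kind of traffic, here is a minimal sketch of how entries like the ones above could be tallied. It assumes the log uses Apache's combined log format, as the samples appear to. The names `LOG_RE`, `parse_line`, and `summarize_404s` are made up for illustration, and lines that do not fit the pattern (such as one with a truncated user agent) are simply skipped.

```python
import re
from collections import Counter

# Assumed layout (combined log format):
# ip ident user [time] "METHOD target PROTO" status size "referer" "user-agent"
LOG_RE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<target>\S+) (?P<proto>[^"]+)" '
    r'(?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)


def parse_line(line):
    """Return a dict of fields for one log line, or None if it does not match."""
    match = LOG_RE.match(line)
    return match.groupdict() if match else None


def summarize_404s(lines):
    """Count 404 responses per user agent and per requested target."""
    by_agent = Counter()
    by_target = Counter()
    for line in lines:
        record = parse_line(line)
        if record is None or record["status"] != "404":
            continue  # skip unparseable lines and non-404 responses
        by_agent[record["agent"]] += 1
        by_target[record["target"]] += 1
    return by_agent, by_target
```

To use it, pass any iterable of log lines, for example an open file handle for the access log, and then look at `by_agent.most_common(10)` to see which crawlers account for most of the missing-page requests.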
Does anyone know why such strange requests are being made?
Does any of this appear to be potentially threatening?
The ones from Amsterdam and China concern me, but I don't know if it is something to worry about or not. If someone could add some insight I would appreciate it.