Forum Moderators: open
I just realized that most requests are direct and have no referrer. The referrer is blank. I used the following in my htaccess as of now and the spam seems to have completely stopped:
RewriteCond %{HTTP:Accept-Language} ^$ [OR]
RewriteCond %{HTTP_REFERER} ^$
RewriteRule .* - [F,L]
I know this will block quite a few legit users as well but is there any other solution to this? Also will this hinder search engine bots from crawling my site?
I also noticed that all these bots are requesting only one page on my site. This page is the largest page my site has. Is it possible to apply this htaccess rule to this single page alone and not the whole site?
Some of the most consistent UAs are as follows:
Mozilla/5.0 (compatible; MSIE 9.0; Windows NT 6.1; WOW64; Trident/5.0)
Mozilla/5.0 (compatible; MSIE 9.0; Windows NT 6.1; Trident/5.0; FunWebProducts)
Mozilla/5.0 (compatible; MSIE 9.0; Windows NT 6.1; Trident/5.0) w:PACBHO60
Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; WOW64; Trident/4.0; FunWebProducts; GTB7.0; SLCC2; .NET CLR 2.0.50727; .NET CL
Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0; SLCC2; .NET CLR 2.0.50727; .NET CLR 3.5.30729; .NET CLR 3.0.30
Mozilla/4.0 (compatible; MSIE 8.0; Windows NT 6.1; Trident/4.0; BTRS28059; SearchToolbar 1.2; GTB7.2; SLCC2; .NET CLR 2.0.50727;
I am not exactly sure if referring pages and requested pages are the same, but I think they are.
PS: the last three UAs are truncated. I am not able to copy the full text for some reason.
tangor,
Think your biting off more than your able to chew here.
Please see his initial thread in the Apache forum.
One thing I noticed is that this bot is requesting only one page on my site.
Perhaps: grep "GET /attacked_page.php" ?
Do you ref to spiritualseo problem, or are you generally speaking?
you should hide your site behind CloudFlare.com
Exactly why many of us block them. Stop hiding.
What good does it do to block a service which blocks incoming spammers? Your server isn't likely to be bothered by CloudFlare's talking to the web servers with CloudFlare clients' content. Unless one of their clients is stealing your images.
Maybe you should hide your site behind CloudFlare.com because they block spammers. You do have to tinker with log if you want to track IPs (they have a WordPress plugin).
The bot I had, I discovered it would action a 301 redirection, so redirected it to its own IP, which reduced the server load to practically zero.