Forum Moderators: phranque
Unknown robot (identified by 'spider')
Unknown robot (identified by 'crawl')
and I can't seem to stop them. I tried using my htaccess:
RewriteCond %{HTTP_USER_AGENT} ^spider [OR]
RewriteCond %{HTTP_USER_AGENT} ^crawl [OR]
RewriteCond %{HTTP_USER_AGENT} ^robot.*$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^.*WebCopier.*$
RewriteRule .\.([gG][iI][fF]¦[jJ][pP][gG])$ - [F]
but it was no good. The robots strikes everyday. Please advice, thank you.
BTW, that RewriteRule is seriously-bad. Try:
RewriteRule \.(gif¦jpe?g¦png)$ - [F,NC]
Jim
Search for the partial name that your stats gives you.
You can block whatever you want, but identifying the user-agents is the first step. To block all accesses, use
RewriteRule .* - [F]
Jim
Xaldon WebSpider 2.7.b6
So my htaccess should contain:
RewriteCond %{HTTP_USER_AGENT} ^.*Teleport.*$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^.*Wget.*$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^.*WebSpider.*$ [OR]
RewriteCond %{HTTP_USER_AGENT} ^.*Xaldon.*$[OR]
RewriteCond %{HTTP_USER_AGENT} ^.*WebCopier.*$
RewriteRule .* - [F]
Am I right? thanks.
Aparently this software is for off-line browsing. Their website had no info on blocking it. (I used Google's translation of the German so I could've missed something.)