
Forum Moderators: goodroi


How can I block Cyveillancebot (63.148.99.****)?

It doesn't look like a useful bot...

     

itisgene

3:36 am on Mar 25, 2004 (gmt 0)

10+ Year Member



I saw 63.148.99.**** coming to one of my sites and grabbing files. I looked it up on the web and found that it is Cyveillancebot. Since the site is new, it is not yet well listed in search engines and does not get much traffic. Is there any way I can block this? It doesn't have a UA name to block, though.

From the web, I got this:

******************
Cyveillancebot uses IP addresses in the range of 63.148.99.224 - 63.148.99.255, and may use others (but unconfirmed). Here's a list of other 'media enforcer' bots, servers et al.

Cyveillancebot ignores robots.txt, as far as anyone can tell. Cyveillancebot spoofs its identity, naming itself various flavors of Windows browsers:

63.148.99.232 - - [02/May/2003:13:01:37 -0700] "Mozilla/4.0 (compatible; MSIE 5.05; Windows NT 3.51)"
63.148.99.232 - - [02/May/2003:13:01:37 -0700] "Mozilla/4.0 (compatible; MSIE 5.05; Windows NT 5.0)"
63.148.99.232 - - [02/May/2003:13:01:58 -0700] "Mozilla/4.0 (compatible; MSIE 5.05; Windows NT 3.51)"
63.148.99.232 - - [02/May/2003:13:01:58 -0700] "Mozilla/4.0 (compatible; MSIE 5.05; Windows NT 3.51)"
63.148.99.232 - - [02/May/2003:13:02:57 -0700] "Mozilla/4.0 (compatible; MSIE 5.05; Windows NT 4.0)"

**************************

Any way to block it?

Thanks,

jbgilbert

4:19 am on Mar 25, 2004 (gmt 0)

10+ Year Member



See this thread for a solution:
[webmasterworld.com...]

jbgilbert

4:24 am on Mar 25, 2004 (gmt 0)

10+ Year Member



I also found this, but have not used it before:

<Limit GET>
order deny,allow
deny from 155.212. 199.171.167. .aol.com 207.51.72.139 grog.ric.edu
</Limit>

The claim is that it works for partial IP addresses and domain names.

jdMorgan

4:45 am on Mar 25, 2004 (gmt 0)

WebmasterWorld Senior Member jdmorgan is a WebmasterWorld Top Contributor of All Time 10+ Year Member



That code only denies access if the request is a GET or HEAD, and not if it is a POST, EDIT, SEARCH, DELETE, etc. I strongly suggest you use a <Files> container instead:
[code]
<Files *>
order deny,allow
deny from 155.212. 199.171.167. .aol.com 207.51.72.139 grog.ric.edu
</Files>
[/code]
Also, be aware that using a hostname such as .aol.com in code like this invokes a reverse-DNS lookup -- a request from your server to its DNS server -- for each HTTP request received by your server, and that is very slow. Therefore, it is preferable to use IP addresses only if at all possible.
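
As a rough sketch for the range quoted earlier in this thread (63.148.99.224 - 63.148.99.255), which works out to the CIDR block 63.148.99.224/27, something like the following might serve. It assumes Apache with mod_access, placed in .htaccess or the server config, and it only covers the one reported range; any other addresses the bot may use would have to be added as they are confirmed.

```apache
# Sketch only: block the reported range 63.148.99.224 - 63.148.99.255.
# Assumes Apache with mod_access; if used in .htaccess, the server
# must permit it with AllowOverride Limit.
<Files *>
Order Deny,Allow
# /27 covers the 32 addresses from .224 through .255
Deny from 63.148.99.224/27
</Files>
```

Because this uses an IP range rather than a hostname, it does not trigger a reverse-DNS lookup on each request.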

Jim

itisgene

4:04 pm on Mar 25, 2004 (gmt 0)

10+ Year Member



Thanks, guys.
I think the post jbgilbert mentioned is for Apache servers. I am using a Windows server with ASP.
So it is not that useful for me.

As for jdMorgan's suggestion,
************************
<Files *>
order deny,allow
deny from 155.212. 199.171.167. .aol.com 207.51.72.139 grog.ric.edu
</Files>
*************************
Do I put this in the robots.txt for my normal web pages?
Sorry, I haven't used robots.txt much.
I can include it as an SSI in my ASP files with the specific IP addresses, if needed.

Thanks,

jdMorgan

4:43 pm on Mar 25, 2004 (gmt 0)

WebmasterWorld Senior Member jdmorgan is a WebmasterWorld Top Contributor of All Time 10+ Year Member



No, sorry -- all of this is for Apache, and none of it has to do with robots.txt.

Robots.txt only works with cooperative robots. The bad ones either don't check it, or they do check it, but ignore it.

Jim

 
