Forum Moderators: open
I don't want this thing on my site. It's using my bandwidth with no return to me and if it's cruising for copyright violations, of which I have none, then it can stuff it's corporate clients where the sun never shines.
I'm hoping for some specific html code to ban that IP#. I've never put up a robot.txt file since the site went up, Sept 2002, because I wanted all the spiders I could get. It's PR6 now.
Any chance of some nice clean html code that will ban 63.148.99.247? (I searched a bit here and couldn't find specifics). It would be much appreciated.
Simply look it up in the arin database :
whois 63.148.99.247@whois.arin.net
[whois.arin.net]
Qwest Communications NET-QWEST-BLKS-2 (NET-63-144-0-0-1)
63.144.0.0 - 63.151.255.255
Cyveillance QWEST-63-148-99-224 (NET-63-148-99-224-1)
63.148.99.224 - 63.148.99.255
So yes, it is a Qwest IP block, but the subblock in question
is indeed owned/used by Cyveillance.
Create a bot which is advertised as one that searches for duplicated/pirated content....then actually use that bot to jack code from these unsuspecting sites.
Good bad or indifferent....it should be banned. Save your bandwidth for a real engine which might potentially send users.
The old approach of banning/unbanning bots by name cant handle the everincreasing number of new spiders. We simply need a standard to ban bots on purpose.
Like: Only allowed for free public searchengines not selling collected information to third partys.
Or: Free for educartional use etc.
Perhaps webmasters may even have a case based on copyright to enforce their TOS against professional collectors and resellers of information. But first we need something to communicate our TOS to bots.