Forum Moderators: open

Message Too Old, No Replies

Why is there no mention of BotBuster on WW?

Good or bad, there should be some mention, right?

         

MichaelBluejay

10:03 am on Oct 15, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I gave up on writing my own homegrown script to monitor and block bots because I don't really have the proper expertise, or the time. So I started looking for off-the-shelf solutions. I was surprised that I couldn't find much, and the one package I did find (BotBuster) hasn't been mentioned here on WW at all, and nothing else on any other website within the last three years. I'm wary of installing something on the server that hasn't been vetted....

So, does anyone have any experience with this package, good or bad? Or is there another package I should be looking at instead? Thanks for the help.

wilderness

12:47 am on Oct 16, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



What would be the benefit is discussing a commercial product that offers complete support (even installation) for their customers?

In addition, I believe that providing a URL to their web page would be a violation of the forum charter?

incrediBILL

1:25 am on Oct 16, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



My main concern is the website copyright is dated 2001 and many things have changed since then.

incrediBILL

1:32 am on Oct 16, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



My second concern is that they don't allow self installs of the product.

What happens if you move to a new server or you need to re-install one night after an OS fault, backup restore, recovery from a hack attack?

Or worse yet, they're out of business?

All of the above gives me pause.

MichaelBluejay

4:31 am on Oct 16, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Yes, I'd noticed the 2001 copyright date too. Worse, the copyright is credited to another site, and if you click that link, the page you get is a placeholder. Also, I tried to call the phone number and I got a fax beep. I hadn't noticed the bit about self-installs, but that alone is a deal-killer. I don't give anyone access to my server, much less some company I know little about. Of course, installing some shady product could inadvertently give someone access to the server (who knows what the code contains), but that's one reason I wanted to ask about this software here on WW.

Okay, so I guess I'm not going with BotBuster. There are still a couple of things which puzzle me -- one, why there's so little discussion (here or on the rest of the net) about the only product I could find which fights bots, and two, why there is no legitimate bot-fighting software available off the shelf from any other source. It seems that there is certainly a market for this kind of software, why has no one come up with it? I have money in hand, and no one to pay.

Quadrille

5:52 pm on Oct 16, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



If it requires installation by the company, as well as the other drawbacks, I'd surmise that it's neither user friendly nor a bargain.

Maybe if you go back to your own script, you could fill that niche.

Someone needs to; I don't have the expertise to even think about such an item, let alone start working on it ... and with kwells and others stomping all over the place, botstoppers (and even bot identifiers) would be a useful addition to the arsenal.

wilderness

6:03 pm on Oct 16, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Someone needs to; I don't have the expertise to even think about such an item, let alone start working on it ... and with kwells and others stomping all over the place, botstoppers (and even bot identifiers) would be a useful addition to the arsenal.

It's simply impossible to create an across-the-board implementation.

Different websites have different goals.
As a result allowed bot traffic must be determined by what is beneficial or detrimental to your own site (s).

Were there a such a product?
Constant monitoring and communication bewteen the software operator and their customers (thus requiring regular fees and a contract to enforce for time spent)woud be required for continued functionality.

The entire process (like the rest of the internet) is constantly being revised.

Who would locate all the Colo's that intiate crawls which are of no benefit to websmasters? I've had two new ones in the past two days.

incrediBILL

6:13 pm on Oct 16, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



It's simply impossible to create an across-the-board implementation

That's not entirely true.

You create all sorts of rules and let the individual webmaster pick and choose which rules to apply to their situation.

If it's an all-or-nothing situation, use all the rules or don't use it at all, then it's unlikely to be used.

Who would locate all the Colo's that intiate crawls which are of no benefit to websmasters?

You just block all data centers in the first place.

Plus, you don't have to look for them because crawls can be detected and stopped in real-time without knowing it's a bot in advance.

Detecting a single page access by a stealth bot is sometimes hard to do unless you block based on the source of the crawl or if there are tells in the user agent or HTTP headers.

However, if that bot accesses multiple pages automated detection and deflection become somewhat trivial with the right script.

Also, blocking all bots that plainly identify themselves using whitelisting is pretty trivial as well which can be accomplished already using robots.txt and .htaccess in combination.

Receptional Andy

6:17 pm on Oct 16, 2008 (gmt 0)



The other bot-blocking script I come across quite frequently is Bad Behaviour (GNU license). Haven't used it extensively, but seems OK.

Quadrille

6:47 pm on Oct 16, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Constant monitoring and communication bewteen the software operator and their customers (thus requiring regular fees and a contract to enforce for time spent)would be required for continued functionality.

Surely a regularly updated online database could be linked to the product, allowing users to keep in touch and select according to need and categorization of new bots.

Antiviruses typically have a subscription model that allows for updating for a year.

Failing that (or as well), a users forum could deputize.

Receptional Andy

6:59 pm on Oct 16, 2008 (gmt 0)



I don't think the lack of scripts is a technology problem, but a supply/demand one. The average site owner barely knows what a search engine spider is, never mind a malicious bot. It's likely to be a growing niche though IMO.

incrediBILL

7:04 pm on Oct 16, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Bad Behaviour

It's only anti-spam related for blogs and wikis if I'm not mistaken and some of the methods used to detect search engines aren't accurate and could come back to bite people eventually, especially if they don't keep it updated.

wilderness

7:18 pm on Oct 16, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



It's simply impossible to create an across-the-board implementation

That's not entirely true.

You create all sorts of rules and let the individual webmaster pick and choose which rules to apply to their situation.

If it's an all-or-nothing situation, use all the rules or don't use it at all, then it's unlikely to be used.

It appears to me that you have a business plan in place?
Perhaps you could sell it to the two inqiring parties in this thread ;)

Mokita

9:06 pm on Oct 16, 2008 (gmt 0)

10+ Year Member



A product called CrawlWall (google it) looks very promising ;) - but it has been under development since around 2006 and the beta still hasn't appeared.

Perhaps if we all chip in and buy the developer a slab of good beer, he could be persuaded to finish it! :)

Quadrille

12:26 am on Oct 17, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Looks like a lot of work - and a lot of planning has gone into it; I wonder what it would need to get it moving again?

Judging by the frequency of badbot / scraper / kwells threads, I've no doubt there's a few customers out there.