OpenHoseBot

         

keyplyr

8:11 am on Jun 17, 2015 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Welcome to Openhose. Openhose is a radical new real-time big data analytics platform that will be open-sourced soon. The SDKs and documentation are currently in Beta. Thanks for being a champ and checking it out!

UA: Mozilla/5.0 (compatible; OpenHoseBot/2.1; +http://www.openhose.org/bot.html)
Robots.txt: no
Host: AWS
54.144.0.0/12
54.144.0.0 - 54.159.255.255

Note: since AWS cloud hosting is dynamic, ranges will surely change
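
For anyone who wants to double-check the range arithmetic, here is a minimal sketch using Python's standard `ipaddress` module. It only confirms that the /12 quoted above spans the listed start and end addresses; the helper name `in_block` is made up for illustration, and the block itself may no longer match what AWS assigns.

```python
import ipaddress

# The CIDR block quoted in the post above.
block = ipaddress.ip_network("54.144.0.0/12")

# First and last addresses covered by the block.
first, last = block[0], block[-1]
print(first, "-", last)

def in_block(addr: str) -> bool:
    """Return True if addr falls inside the /12 block (hypothetical helper)."""
    return ipaddress.ip_address(addr) in block

print(in_block("54.150.1.2"))   # an address inside 54.144.0.0 - 54.159.255.255
print(in_block("54.160.0.1"))   # just past the end of the block
```

The same check works for any range you pull from your own deny list; swap in the CIDR string.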

lucy24

5:30 pm on Jun 17, 2015 (gmt 0)



"Openhose"

There's a pun lurking in there somewhere...

"Thanks for being a champ and checking it out!"

Investigating robot hits in your logs now qualifies as "being a champ"? Whee!

:: detour to confirm that I've currently got 54.144.0.0/12 blocked anyway ::

keyplyr

8:15 am on Jun 18, 2015 (gmt 0)



"There's a pun lurking in there somewhere..."

Almost went for it, then thought... nah.

keyplyr

12:30 am on Oct 5, 2015 (gmt 0)



Seems the web site has been taken down, but this bot continues to hit my server a dozen or more times a day.
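
A rough way to put a number on "a dozen or more times a day" is to tally log lines by user agent. This is only a sketch: it assumes the server writes the common "combined" access log format, the function name `daily_hits` is made up, and the sample lines are invented placeholders (using a documentation-only IP), not real log entries.

```python
import re
from collections import Counter

# Assumed layout (combined log format):
# ip ident user [day:time zone] "request" status size "referer" "user agent"
LINE_RE = re.compile(
    r'^(?P<ip>\S+) \S+ \S+ \[(?P<day>[^:]+):[^\]]+\] '
    r'"[^"]*" (?P<status>\d{3}) \S+ "[^"]*" "(?P<ua>[^"]*)"'
)

def daily_hits(lines, ua_token="OpenHoseBot"):
    """Count requests per day whose user agent contains ua_token."""
    counts = Counter()
    for line in lines:
        m = LINE_RE.match(line)
        if m and ua_token in m.group("ua"):
            counts[m.group("day")] += 1
    return counts

BOT_UA = "Mozilla/5.0 (compatible; OpenHoseBot/2.1; +http://www.openhose.org/bot.html)"

# Invented sample lines for illustration only.
sample = [
    '192.0.2.10 - - [04/Oct/2015:23:50:10 +0000] "GET / HTTP/1.1" 403 199 "-" "%s"' % BOT_UA,
    '192.0.2.10 - - [05/Oct/2015:00:12:01 +0000] "GET / HTTP/1.1" 403 199 "-" "%s"' % BOT_UA,
    '192.0.2.10 - - [05/Oct/2015:06:40:33 +0000] "GET /a.html HTTP/1.1" 403 199 "-" "%s"' % BOT_UA,
    '192.0.2.77 - - [05/Oct/2015:07:00:00 +0000] "GET / HTTP/1.1" 200 5120 "-" "Mozilla/5.0 (X11; Linux x86_64)"',
]

for day, hits in sorted(daily_hits(sample).items()):
    print(day, hits)
```

In practice you would pass an open log file instead of `sample`; lines that do not match the assumed format are simply skipped.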

tangor

2:54 am on Oct 5, 2015 (gmt 0)



The irony of the title bot offers many puns. I know I have one or two... but since it is a bot I'll not. :)

And that range is already in my deny, so only shows up as 403s...

Sadly my log filters are more of those these days than 200s.... sigh.

keyplyr

6:15 pm on Oct 5, 2015 (gmt 0)



"And that range is already in my deny, so only shows up as 403s... Sadly my log filters are more of those these days than 200s.... sigh."

A little over a year ago I switched my approach from blocking to seeing what I can allow. I want traffic :)

My personal site gets between 27k and 32k successful page loads a day. Before, I was serving an additional 6k 403s a day. Now I have that down to around 500 or 600 a day, and it is still dropping.
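
To put those figures in proportion, here is a quick back-of-the-envelope sketch. It assumes the 403s come on top of the successful page loads, as the wording suggests; the function name `share_403` is made up. On that reading, 403s went from roughly 16 to 18 percent of all responses down to about 2 percent.

```python
def share_403(successful: int, denied: int) -> float:
    """Fraction of all responses that were 403s."""
    return denied / (successful + denied)

# Figures from the post: 27k-32k successful loads per day,
# about 6k 403s before and roughly 500-600 now.
for label, denied in (("before", 6000), ("now (low)", 500), ("now (high)", 600)):
    for ok in (27_000, 32_000):
        share = share_403(ok, denied)
        print(f"{label:10s} ok={ok:6d} 403={denied:5d} share={share:.1%}")
```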

What I once thought of as bad and blockable, I have since found to be beneficial in some cases. I used to block all marketing and research type companies. After doing some research myself, I found that some of these companies furnish potential advertisers with data they use in choosing which AdSense publishers to add, which, as an AdSense and direct ad publisher myself, benefits me.

Then there are all the server farms, colos, VPNs and data centers I used to block with prejudice. Almost all of them now have cloud ranges that host mobile ISPs, apps, proxies and social media. Also, many companies now either furnish cell phones to their employees or allow employees to connect on the company Wi-Fi. I am constantly adding conditions to my block rules to allow these agents.
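
The idea of layering allow conditions on top of a range block can be sketched roughly like this. It is not the rule set described above, which the post does not spell out: the `BlockRule` class, the `decide` function, and the user-agent tokens are all hypothetical, and user agents are trivially spoofed, so a real setup would want more than a substring match.

```python
import ipaddress
from dataclasses import dataclass

@dataclass
class BlockRule:
    """A blocked range plus user-agent substrings allowed through anyway."""
    network: ipaddress.IPv4Network
    allow_ua_tokens: tuple = ()  # hypothetical exceptions, e.g. mobile browsers

def decide(rules, ip: str, user_agent: str) -> str:
    """Return "allow" or "deny" for a request; unlisted ranges are allowed."""
    addr = ipaddress.ip_address(ip)
    ua = user_agent.lower()
    for rule in rules:
        if addr in rule.network:
            # Inside a blocked range: let it through only if an exception matches.
            if any(token.lower() in ua for token in rule.allow_ua_tokens):
                return "allow"
            return "deny"
    return "allow"

# Example: block the range from this thread, but allow Android browsers (assumed token).
rules = [BlockRule(ipaddress.ip_network("54.144.0.0/12"), ("Android",))]

print(decide(rules, "54.150.0.1", "Mozilla/5.0 (Linux; Android 5.1) Mobile"))
print(decide(rules, "54.150.0.1", "Mozilla/5.0 (compatible; OpenHoseBot/2.1)"))
```

Keeping the exceptions next to the range they apply to makes it easier to see why a given request got through.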