Forum Moderators: open
CLIQZ.COM IP Address 35.187.22.222
Host cliqz.com
City Mountain View, CA 94043
Organization Merit Network
ISP Merit Network
AS Number AS15169 Google Inc.
If it were coming from its own addressThere are lots of distributed crawlers out there, not all of them undesirable. In my specific case, the fallback is always
BrowserMatch YourName !bad_range
meaning that the robot in question comes from one of the very few IP ranges I deny by default. If it were coming from it's own address I might consider letting it in. But with a different address each time, it's not getting in, sorryCliqzbot uses cloud computing with multiple nodes at AWS. The ranges may change, nothing abnormal about this. It's the way cloud computing works. Also nothing abnormal about
There are lots of distributed crawlers out thereAFAIK Cliqzbot is not distributed.
but what do they do for me?Sometimes we have to look beyond the initial appearance of things. Data gets supplied to many end sources, which may in turn develop products used for company intranet firewalls, web security, marketing info for Adsense advertisers, directory dumps used by smaller search upstarts, etc.