I was looking through the list of IPs rejected by my router's http/https blocking list over the past 2 days and found this:
144.217.135.152 crawl-144-217-135-152.dataproviderbot.com (OVH SAS)
Not unusual; I see (and block) tons of hits from OVH. But I was curious about "dataproviderbot.com", which I've seen turn up before, somewhat frequently. Today I tried to visit that domain with a browser, but I got one of those odd Google error pages:
-----------------
www.google.com/images/errors/robot.png
The requested URL was not found on this server. That’s all we know.
----------------
I'm resolving dataproviderbot.com to 216.239.32.31, which is a Google IP (any-in-201f.1e100.net). Because I've blocked the OVH IP, I don't know what the UA would have been in this case had I allowed the hit, but I will check my logs for hits from that subnet from before I blocked it.
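One way to check whether the hostname attached to a blocked IP is genuine is a forward-confirmed reverse DNS lookup: take the PTR name for the IP, resolve that name forward again, and see whether the original IP comes back. The following is only a minimal Python sketch of that idea. The function name forward_confirmed_rdns and its injectable reverse/forward parameters are hypothetical, chosen for illustration; by default it uses the standard library's socket.gethostbyaddr and socket.gethostbyname_ex.

```python
import socket


def forward_confirmed_rdns(ip,
                           reverse=socket.gethostbyaddr,
                           forward=socket.gethostbyname_ex):
    """Return (hostname, confirmed) for an IPv4 address.

    hostname is the PTR name for ip (None if there is no PTR record).
    confirmed is True only if that hostname resolves back to ip.
    The reverse/forward parameters are illustrative hooks so the
    lookups can be swapped out, e.g. for testing without a network.
    """
    try:
        # Both resolvers return (hostname, aliases, ip_addresses).
        hostname = reverse(ip)[0]
    except OSError:
        # socket.herror and socket.gaierror are subclasses of OSError.
        return None, False
    try:
        addresses = forward(hostname)[2]
    except OSError:
        return hostname, False
    return hostname, ip in addresses


if __name__ == "__main__":
    # The IP from the router log above; results depend on live DNS.
    print(forward_confirmed_rdns("144.217.135.152"))
```

A confirmed result only shows that whoever controls the domain's DNS also vouches for that IP. It says nothing about where the bare domain's website is hosted, which can be an entirely different address.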
Is there a deeper connection between the "data provider bot" and Google? Might Google be operating a different tier of bots (but not from their own IPs) that are perhaps more aggressive in terms of data mining?
And is the "dataproviderbot" the same bot as the dataprovider spider?
[dataprovider.com...]
See also:
[udger.com...]
There seems to be some sort of connection to lipperhey.com. I've come across hits from lipperhey.com before, but I don't really know what they do or what their business model is.