Forum Moderators: open

Message Too Old, No Replies

resolver

         

keyplyr

7:20 pm on Oct 28, 2016 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month




UA: resolver/0.1.12 (+http://github.com/andris9/resolver)
Protocol: HTTP/1.1
Robots.txt: No
Host: googleusercontent.com (Google-Cloud)
104.154.0.0 - 104.155.255.255
104.154.0.0/15

jonasjacek

9:48 am on Oct 29, 2016 (gmt 0)

5+ Year Member



This is an interesting bot:
"This module Resolves destination URLs. Feed it a source url and the resolver returns with the destination url if any redirects happened."

From https://github.com/andris9/resolver

2 two cents:
What the resolver does is actually useful for the health of the Internet. I consider shortURLs and redirects a bad thing. Link rot, cookie stuffing, and multiple redirects come to my mind.

I suggest not to enforce robots.txt for this bot. It can be used to do good.

keyplyr

10:04 am on Oct 29, 2016 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



What the resolver does is actually useful for the health of the Internet.
There are dozens of url resolvers. I allow them all.
I consider shortURLs and redirects a bad thing
Now that Twitter (the main reason for the rise of the short url) no longer counts urls as characters, they should start to disappear from popular use.

But here's the important thing - how/why is this agent being used? Is it being used to scrape all short urls from social media sites for some unknown purpose? Why is it attempting to crawl web sites?