Yellow Pages bot impersonating Googlebot?
I just saw this in my log:
126.96.36.199 - - [19/Apr/2011:04:37:59 -0400] "GET /filename.html HTTP/1.1" 403 380 "-" "YPBot/Raven1.1.3 (compatible; Googlebot/2.1;+http://www.yellowpages.com/about/legal/crawl)"
Yellow Pages IP with both their bot and Googlebot names...?
Do Yellow Pages and Google have a partnership?
What is going on... should I allow them access?
Any advice would be appreciated.
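For anyone who wants to pick entries like this out of a log automatically, here is a minimal sketch in Python. It assumes the log is in the common "combined" format, as the line above appears to be. The names `LOG_PATTERN`, `parse_line` and `claims_googlebot` are made up for illustration.

```python
import re

# Assumed layout: Apache/Nginx "combined" log format.
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) (?P<size>\S+) '
    r'"(?P<referer>[^"]*)" "(?P<agent>[^"]*)"'
)

def parse_line(line):
    """Return the log fields as a dict, or None if the line does not match."""
    match = LOG_PATTERN.match(line)
    return match.groupdict() if match else None

def claims_googlebot(agent):
    """True if the user-agent string mentions Googlebot anywhere."""
    return "googlebot" in agent.lower()

# The entry quoted in this thread, copied as posted.
SAMPLE = (
    '126.96.36.199 - - [19/Apr/2011:04:37:59 -0400] '
    '"GET /filename.html HTTP/1.1" 403 380 "-" '
    '"YPBot/Raven1.1.3 (compatible; Googlebot/2.1;'
    '+http://www.yellowpages.com/about/legal/crawl)"'
)

record = parse_line(SAMPLE)
if record and claims_googlebot(record["agent"]):
    print("Claims to be Googlebot:", record["ip"], record["agent"])
```

A match here only means the visitor claims the name. The user-agent string is trivially spoofed, so the IP still has to be checked separately.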
Anything calling itself Googlebot that doesn't pass a reverse DNS test gets blocked by us.
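A rough sketch of that kind of test in Python, for reference. It does a reverse lookup, checks the hostname suffix, then confirms with a forward lookup. The function name `is_verified_googlebot` and the injectable `reverse` and `forward` parameters are my own, added so the logic can be tried without touching the network. The suffix list is an assumption based on the domains Google documents for Googlebot and may need adjusting.

```python
import socket

# Assumed suffixes for genuine Googlebot hostnames.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")

def is_verified_googlebot(ip, reverse=socket.gethostbyaddr,
                          forward=socket.gethostbyname_ex):
    """Forward-confirmed reverse DNS check for an IPv4 address."""
    try:
        hostname = reverse(ip)[0]          # PTR lookup
    except OSError:
        return False                       # no rDNS at all: fail
    if not hostname.lower().rstrip(".").endswith(GOOGLE_SUFFIXES):
        return False                       # resolves, but not to Google
    try:
        addresses = forward(hostname)[2]   # A-record lookup
    except OSError:
        return False
    return ip in addresses                 # must map back to the same IP
```

The forward lookup matters because anyone who controls the reverse zone for their own IP range can set a PTR record to any name they like. Note that `socket.gethostbyname_ex` only handles IPv4, so an IPv6 visitor would need `socket.getaddrinfo` instead.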
I blocked them, but thought it was very odd that a relatively mainstream company would attempt to pull a fast one.
... actually, there is no hostname associated with the IP address you gave ...
Quite possibly not Yellow Pages either!
Yellowpages.com SAVV-S265785-1 (NET-64-209-132-0-1) 188.8.131.52 - 184.108.40.206
OK, so it is Yellow Pages, but with no hostname set up!
I have no bot recorded as having come from that range: no YP bot, and certainly no Googlebot, which would get rejected on that IP range.
There is no rDNS, as topr8 mentions, so it's due a blocking.
The parent of that IP range is Savvis, which is worth blocking anyway.
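If you block by range in application code rather than in the server config, the check can be sketched like this in Python. The networks below are placeholder documentation ranges, not the actual Yellow Pages or Savvis allocations, and `BLOCKED_NETWORKS` and `is_blocked` are illustrative names only.

```python
import ipaddress

# Placeholder ranges (RFC 5737 documentation blocks). Substitute the real
# CIDR blocks from a whois lookup; these are NOT the ranges in this thread.
BLOCKED_NETWORKS = [
    ipaddress.ip_network("198.51.100.0/24"),
    ipaddress.ip_network("203.0.113.0/24"),
]

def is_blocked(ip, networks=BLOCKED_NETWORKS):
    """True if ip parses as an address inside any of the given networks."""
    try:
        addr = ipaddress.ip_address(ip)
    except ValueError:
        return False  # unparseable input is treated as not blocked here
    return any(addr in net for net in networks)
```

Whether malformed input should count as blocked or allowed is a policy choice. This sketch lets it through and leaves that decision to the caller.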
Got a visit today from 220.127.116.11 - null.ev1.yellowpages.com
YPBot/Raven1.1.3 (compatible; Googlebot/2.1; http://www.yellowpages.com/about/legal/crawl)
It took robots.txt and left. Well, good luck to them when it comes back ;o)
The 208. range was never in my good books
Any possible relationship to this activity [webmasterworld.com]?
This is Ypages, the other thread is Ybook
There's a LOT of stuff in 208., much of it not good.
© Webmaster World 1996-2014 all rights reserved