Forum Moderators: open
I always saw requests for robots.txt - at least 3 a day, and nothing else.
For the past few days, I started seeing requests for random pages, but from a different IP block.
The regular robots.txt requests come from 66.*, while the new requests came from 209.*. The block of that IP is owned by Yahoo, so it's them. I just can't figure out if it's a new spider location or if it's a human editor reviewing our site.
User agent is always the same "Mozilla/5.0 (compatible; Yahoo! Slurp; [help.yahoo.com...]
and in my case it is always 209.131.40.31
while i don't think it is human, i belive that yahoo editors use 209.131* blocks too.
NetBlock ¦ IP Address ¦HTTP Accept¦User Agent
Yahoo ¦ 209.131.40.67 ¦ */* ¦ Mozilla/4.0 (compatible; MSIE 5.0; Windows NT)
Yahoo ¦ 209.131.40.82 ¦ */* ¦ Mozilla/4.0 (compatible; MSIE 5.0; Windows NT)
Yahoo ¦ 209.131.40.83 ¦ */* ¦ Mozilla/5.0 (compatible; Yahoo! Slurp/si-emb; [help.yahoo.com...]
Yahoo ¦ 209.131.40.132 ¦ */* ¦ Mozilla/5.0 (compatible; Yahoo! Slurp; [help.yahoo.com...]
Yahoo ¦ 209.131.40.179 ¦ */* ¦ Slurp/si-emb (slurp@inktomi.com; [inktomi.com...]
Yahoo ¦ 209.131.40.155 ¦ ¦ Randy
LocalLink¦ 209.131.227.250 ¦ */* ¦ Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1)
Who's RANDY?
Randy was the only user agent not to specify an HTTP Accept type.
It looks like the address block belonging to Yahoo is 209.131.40....
Peace,
Kaz