Forum Moderators: open
IP: 67.68.234.nnn (Bell Canada HSE DSL)
UA: Mozilla/5.0 (compatible; Dow Jones Searchbot)
Only one header field (HTTP_ACCEPT) present.
Took home page plus another (iframed page) THEN robots.
Took images, CSS, JS from home page.
Took subsequent pages but only HEAD for pics - no CSS/JS.
Nothing obvious on google, not even in bot directories. Looks new this month.
I know the financial crunch is hitting hard but operating Dow Jones from a Canadian broadband IP? I imagine a small shack in the woods surrounded by bears and bull elk... :)
Also, note cloaked UAs in --
-----
SEPTEMBER-OCTOBER, 2009
205.203.134.19n <== Plainsboro Dow Jones-telerate
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)
robots.txt? Yes BUT... Ignored.
Notes: HEADs for pics as described and GETs. See also Project Honey Pot [projecthoneypot.org].
-----
JANUARY, 2009
208.138.254.15n <= Richboro Dow Jones & Company
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.0)
robots.txt? Yes BUT... Ignored.
Notes: Ditto. See also Project Honey Pot [projecthoneypot.org].
Dow Jones-Telerate
205.203.96.0 - 205.203.159.255
DOW JONES & COMPANY (under Savvis)
208.138.254.0 - 208.138.254.255
In fact from a robtex Class C check it looks like the 205 range is a server farm, or at least a large virtual shared server. No rDNS for that specific IP. I can't get rDNS for the few IPs I've tried in the 208 range.
Don't suppose it could be tracking sites for DJ that show up well on twits?