I found Jakarta under "Included Software and Licenses for the Java Language Version of App Engine" here:
[code.google.com...]
Why are they looking for googlehostedservice.html? And who is "they"? Is this Google using their own App Engine, or a 3rd party hosted on Google's cloud computing?
With all the crap coming from Google now, how can we differentiate and verify Googlebot, Google Adsbot, Google stealth checks, Google manual site reviews, Google employees just browsing, Google Wireless Transcoder, translate.google.com, Google Keyword Tool and Google-Sitemaps -- several of which share the same IP addresses?
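For the "verify" half of that question, the usual approach is a forward-confirmed reverse DNS check: reverse-resolve the IP, make sure the hostname ends in a Google domain, then resolve that hostname forward and confirm it maps back to the same IP. Here is a rough Python sketch of the idea. The function name, the suffix list and the injectable lookup parameters are my own assumptions for illustration, not anything Google publishes as code.

```python
import socket

# Assumption: domains that Google-operated crawlers reverse-resolve to.
# Adjust this tuple to whatever you are willing to trust.
GOOGLE_SUFFIXES = (".googlebot.com", ".google.com")


def verify_google_ip(ip, reverse_lookup=None, forward_lookup=None):
    """Return True if `ip` passes a forward-confirmed reverse DNS check.

    The two lookup parameters are hypothetical hooks so the logic can be
    exercised without touching the network; by default real DNS is used.
    """
    if reverse_lookup is None:
        # gethostbyaddr returns (hostname, aliases, addresses)
        reverse_lookup = lambda addr: socket.gethostbyaddr(addr)[0]
    if forward_lookup is None:
        # gethostbyname_ex returns (hostname, aliases, addresses)
        forward_lookup = lambda host: socket.gethostbyname_ex(host)[2]

    try:
        host = reverse_lookup(ip).rstrip(".").lower()
    except OSError:
        return False  # no PTR record or lookup failure

    # The leading dot in each suffix stops lookalikes such as "evilgooglebot.com".
    if not host.endswith(GOOGLE_SUFFIXES):
        return False

    try:
        return ip in forward_lookup(host)
    except OSError:
        return False
```

Calling `verify_google_ip("74.125.46.81")` would run the check against live DNS for the first address logged below. Note what this does and does not tell you: a pass only says the address is Google's. It says nothing about which of the services above made the request, so it does not solve the "differentiate" half.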
However, it's really sloppy programming on Google's part not to identify the user agent so we can make some actual sense of what it's supposed to be doing.
That would get blocked on my server, and I would simply stop using Google Apps rather than let all the default Jakarta user agents run amok on my server.
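If you want to block default library user agents the way I describe, something along these lines would do it. This is a minimal Python sketch, not a drop-in filter: the prefix list is illustrative only (the stock strings sent by a few common HTTP libraries when nobody sets a custom agent), and the function name is made up for this example.

```python
# Illustrative list of stock user-agent prefixes; extend or trim to taste.
DEFAULT_LIBRARY_PREFIXES = (
    "jakarta commons-httpclient",
    "java/",
    "python-urllib",
    "libwww-perl",
)


def is_default_library_agent(user_agent):
    """Return True for a blank user agent or one left at a library default."""
    ua = (user_agent or "").strip().lower()
    # str.startswith accepts a tuple and matches any of its entries.
    return ua == "" or ua.startswith(DEFAULT_LIBRARY_PREFIXES)
```

A request for which this returns True would be refused. Browser-style strings like the two logged below pass straight through, so this only catches clients that never bothered to identify themselves.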
74.125.46.81
Mozilla/5.0 (X11; U; Linux i686 (x86_64); en-US; rv:1.9.1.3) Gecko/20090824 Firefox/3.5.3
robots.txt? NO
referer: None
74.125.46.82
Mozilla/4.0 (compatible; MSIE 6.0; Windows NT 5.1; SV1)
robots.txt? NO
referer: http://www.google.com/search?hl=en&q=www.mysitename.com+filename
(The filename in the referer was incomplete: both its name and its suffix were cut short.)