
Google News Archive Forum

    
216.239.45.4
Maybe this will shed more light on this IP
brandi01




msg:64084
 3:30 pm on Jan 8, 2003 (gmt 0)

I've seen some other threads about this IP, and I thought this might shed a little more light on it. Maybe not, but I hope so. :)

20:41:40 216.239.45.4 W3SVC242 80 GET /index.shtml <B>Failed+to+process+SSI+file+'/index.shtml'</B><BR>++ 200 9241 159 Java1.3.1_02 -
20:41:40 216.239.45.4 W3SVC242 80 GET /robots.txt - 200 359 169 Java1.3.1_02 -
20:41:40 216.239.45.4 W3SVC242 80 GET /index.shtml - 200 9188 154 Java1.3.1_02 -
20:41:40 216.239.45.4 W3SVC242 80 GET /index.shtml - 200 9188 159 Java1.3.1_02 -
20:41:41 216.239.45.4 W3SVC242 80 GET /robots.txt - 200 359 185 Googlebot/2.1+(+http://www.googlebot.com/bot.html) -
20:41:41 216.239.45.4 W3SVC242 80 GET /index.shtml - 200 9169 175 Googlebot/2.1+(+http://www.googlebot.com/bot.html) -
20:41:41 216.239.45.4 W3SVC242 80 GET /index.shtml - 200 9188 159 Java1.3.1_02 -
20:41:41 216.239.45.4 W3SVC242 80 GET /index.shtml - 200 9188 159 Java1.3.1_02 -

The above is from a website I administer that was recently dropped from the Google index for reasons unknown (not an SEO or spam penalty).

I e-mailed help@google.com about it and received a reply stating the site had not been manually removed and listing some reasons why it may have fallen out of the index. Anyway, that's not the point here.

The timestamp on the log entries above is approximately 2 minutes before the timestamp on the e-mail I received from Google. Interesting? Maybe.

My question is this:
Was this a human GooglePerson manually checking the site in question, and if so, how can it also be a GoogleBot using the same IP? I guess it could be; I'm not technically inclined when it comes to what one can do with IPs. If it was a human manually checking the site, did/could they send GoogleBot to reindex the site (the 2 log entries from GoogleBot)? I'm hoping the answer is yes here.

Thanks All!

 

Dreamquick




msg:64085
 11:42 pm on Jan 8, 2003 (gmt 0)

That appears to be something operating from a Google-owned IP, running an application or utility that identifies itself as "Java1.3.1_02".
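For anyone who wants to see the split between the two user agents at a glance, here is a minimal Python sketch that tallies the posted entries by user agent for one IP. The field order is an assumption on my part, since the post doesn't include the log's "#Fields:" header; `FIELDS`, `parse_line` and `tally_user_agents` are names made up for illustration.

```python
from collections import Counter

# ASSUMED column order (the post omits the "#Fields:" header line).
# Only "ip" and "user_agent" matter for this tally.
FIELDS = ("time", "ip", "site", "port", "method", "uri", "query",
          "status", "bytes_a", "bytes_b", "user_agent", "referer")

# The eight entries exactly as posted in the first message.
LOG = """\
20:41:40 216.239.45.4 W3SVC242 80 GET /index.shtml <B>Failed+to+process+SSI+file+'/index.shtml'</B><BR>++ 200 9241 159 Java1.3.1_02 -
20:41:40 216.239.45.4 W3SVC242 80 GET /robots.txt - 200 359 169 Java1.3.1_02 -
20:41:40 216.239.45.4 W3SVC242 80 GET /index.shtml - 200 9188 154 Java1.3.1_02 -
20:41:40 216.239.45.4 W3SVC242 80 GET /index.shtml - 200 9188 159 Java1.3.1_02 -
20:41:41 216.239.45.4 W3SVC242 80 GET /robots.txt - 200 359 185 Googlebot/2.1+(+http://www.googlebot.com/bot.html) -
20:41:41 216.239.45.4 W3SVC242 80 GET /index.shtml - 200 9169 175 Googlebot/2.1+(+http://www.googlebot.com/bot.html) -
20:41:41 216.239.45.4 W3SVC242 80 GET /index.shtml - 200 9188 159 Java1.3.1_02 -
20:41:41 216.239.45.4 W3SVC242 80 GET /index.shtml - 200 9188 159 Java1.3.1_02 -
"""


def parse_line(line):
    """Split one space-delimited log line into a dict keyed by FIELDS."""
    parts = line.split()
    if len(parts) != len(FIELDS):
        raise ValueError("unexpected field count: %d" % len(parts))
    return dict(zip(FIELDS, parts))


def tally_user_agents(log_text, ip):
    """Count requests per user agent for a single client IP."""
    counts = Counter()
    for line in log_text.splitlines():
        # Skip blank lines and "#" directive lines.
        if not line.strip() or line.startswith("#"):
            continue
        entry = parse_line(line)
        if entry["ip"] == ip:
            counts[entry["user_agent"]] += 1
    return counts


if __name__ == "__main__":
    for agent, hits in tally_user_agents(LOG, "216.239.45.4").most_common():
        print(hits, agent)
```

On the posted entries this counts six requests as "Java1.3.1_02" and two as Googlebot, all from the same address within two seconds.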

There could be a great many reasons why this is there - maybe they were manually checking your site, maybe they were testing a pet project, maybe they were testing a new version of the googlebot! Who knows...

To my untrained eye the first line appears to be an error of some description in your site code.

Was this a human GooglePerson manually checking the site in question

Might be, but they certainly weren't doing it with a browser. More likely it was some kind of in-house application, or perhaps an in-house bot.

How can it also be a GoogleBot using the same IP?

The most likely answer is that they have a series of proxies or load balancers, so that when the machines that are "GoogleBot" actually go out and crawl they can share IP addresses. That avoids assigning each machine its own address and wasting hundreds, if not thousands, of unique IPs on machines that don't really need one.

You have to figure that in order to do a monthly crawl they must have thousands of machines acting as GoogleBots and certain clusters of them get aggregated into a single IP address.

Also, if a human really was checking something, such as whether you cloak or not, then doing it from inside the GooglePlex (but not using the GoogleBot UA) would be the first step, since it lets them check whether you are using basic UA-only cloaking.
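To make the UA-only cloaking check concrete, here is a rough Python sketch of the idea: request the same URL with two different User-Agent strings and see whether the bodies differ. This is only an illustration, not a description of what Google runs; `fetch` and `differs_by_user_agent` are hypothetical helper names, and the URL in the usage comment is a placeholder.

```python
import urllib.request


def fetch(url, user_agent, timeout=10):
    """Return the response body for url, sending the given User-Agent."""
    request = urllib.request.Request(url, headers={"User-Agent": user_agent})
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()


def differs_by_user_agent(url, agent_a, agent_b, fetcher=fetch):
    """True if the same URL returns different bodies for the two agents.

    fetcher is injectable so the comparison can be tried without a network.
    """
    return fetcher(url, agent_a) != fetcher(url, agent_b)


# Example usage (placeholder URL, performs real requests):
# differs_by_user_agent(
#     "http://www.example.com/index.shtml",
#     "Googlebot/2.1 (+http://www.googlebot.com/bot.html)",
#     "Java1.3.1_02",
# )
```

A difference is only a hint, since dynamic pages can change between any two requests, and a check like this says nothing about cloaking keyed on the visitor's IP rather than the user agent.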

Look on the plus side: they say you didn't do anything wrong and weren't manually removed...

If it was a human manually checking the site, did/could they send GoogleBot to reindex the site (the 2 log entries from GoogleBot)?

GoogleBot is an application which most likely runs as a distributed system across many machines. If one of their techs wanted to start another copy and have it re-spider a certain list of URLs, I'd bet good money that's exactly what would happen.

- Tony

brandi01




msg:64086
 1:54 pm on Jan 9, 2003 (gmt 0)

Thanks for the detailed reply, Tony. I'm going to keep watching this IP to see when/if it shows up again and what it does.

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved