Forum Moderators: open

Message Too Old, No Replies

Are users visiting my site from Google.com?

Some browser like agents are visiting my site from Google.com

         

schnee

10:55 pm on Dec 27, 2007 (gmt 0)

10+ Year Member



Hi,

I may be missing something, but I wonder how such a user agent:
"Mozilla/5.0 (Windows; U; Windows NT 5.1; en-US; rv:1.8.1.11) Gecko/20071127 Firefox/2.0.0.11"
can be visiting my site from IP address 66.249.84.10
in the Google.com net range.

Furthermore, this agent does not read robots.txt, falls in robotraps, submits hidden forms, request pages within an average of 4 sec., so looks more like a robot, but also executes Javascipts and supports cookies, which looks more like a browser.

Is Google also acting as a provider?

Any thought?

wilderness

4:54 am on Dec 28, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Google (and most other SE's) have a variety of tools available to users, most of which have nothing to do with bots and crawling.

schnee

5:15 am on Dec 28, 2007 (gmt 0)

10+ Year Member



>>Google (and most other SE's) have a variety of tools available to users

and these tools make requests from google server, using the user's browser identification?

wilderness

1:32 pm on Dec 28, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



some do.

vincevincevince

1:56 pm on Dec 28, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



One thing that Google does do is manually review selected websites to determine categorically if they are engaging in search engine spam techniques such as hidden text or deceptive cloaking.

schnee

4:48 pm on Dec 28, 2007 (gmt 0)

10+ Year Member



>>some do

An axample?

schnee

4:52 pm on Dec 28, 2007 (gmt 0)

10+ Year Member



>>One thing that Google does do is manually review selected websites

Ok, but if it is done "manually", how come they open robot traps and forms action pages with no submit button?

schnee

5:11 pm on Dec 28, 2007 (gmt 0)

10+ Year Member



>>Google (and most other SE's) have a variety of tools available to users

Pardon me for my septicism, but there is still something not correct from a company suposedly serious as Google:

Either they provide client side tools to users, as addon to their browser, but in that case the visitor will use his own provider's IP, either they provide tools to be used from their own server, but in that case it is not correct that their server mimics the visitor's browser.

Something is not clear here.

wilderness

6:39 pm on Dec 28, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Something is not clear here.

In the event your not content with the activities of Google?

Simply go to ARIN
type in "google" in the Whois-Search box
Copy the results
and then add every range to your firewall.

wilderness

6:40 pm on Dec 28, 2007 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



>>some do

An axample?

search the archives.

schnee

7:52 pm on Dec 28, 2007 (gmt 0)

10+ Year Member



>>In the event your not content with the activities of Google?

C'mon, this is not the point.
I'm just trying to improve my robot evaluation module based on behaviour of robots.
I'm actually having problems with Google, which I do not want to tag as a bad robot of course, but the fact is that I get suspicious hits from an IP address in the google range, and I'm trying to understand why? That's all.

schnee

7:58 pm on Dec 28, 2007 (gmt 0)

10+ Year Member



>>search the archives.

Forget it, I'll rather search for another forum with more cooperative people giving sensible answers or with a decent search engine, or both ;-)

Your search engine is just crap: it mostly returns pages in which the content has scrolled away :-(

awaken

4:40 pm on Apr 10, 2008 (gmt 0)

10+ Year Member



I was about to start a new thread for this, but found that someone has already run into this. Here is my version:

Google (66.249.84.10), not Googlebot hit my site today while doing a "site:" search for a specific page URL. So, basically I'm assuming that this is a human review. They were checking to see if a certain URL was in their index or not. Funny thing is, that when I do the same search on my end the URL isn't indexed, but I guess the data center they are using shows the indexed URL, otherwise I wouldn't have seen the footprint in my analytics program.

Interesting :)

Hobbs

5:32 pm on Apr 10, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



or it was as simple as a real human at the plex looking for something not related to SE business, I hear there are still some humans left down there..

Other Google services:
Wireless Transcoder
Translation Service

awaken

7:13 pm on Apr 10, 2008 (gmt 0)

10+ Year Member



Ya, I'm not too sure about that Hobbs.

If the real human is searching for a product unrelated to SE business, wouldn't they just search using the product part# as the keyword?

Why on earth would they do something like this?:

site:www.bluewidgets.com/blue-widgets-blah-blah-blah-specifications-blah-blah-blah.html

Unless of course, they are checking to see if that URL exists in Google's index.

Ocean10000

4:15 pm on Apr 12, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



The Algo-Shmalgo, It Is Mostly Powered by Humans? [webmasterworld.com]
This might be related to explain some of the actions seen coming from the Googleplex.

Bewenched

4:12 am on Apr 13, 2008 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



I get alot of the translation ones myself.