Forum Moderators: DixonJones

Message Too Old, No Replies

Searching for Googlebot ips in database

My pages have been updated in Google, but I can't find Googlebot in my db

         

rubenski

8:44 pm on Dec 28, 2003 (gmt 0)

10+ Year Member



Hi,

My pages were updated in Google two times in the past three weeks. I am very sure about this, because part of my forum is now in Google's cache and I can tell by the dates on the forum pages that they have recently been indexed.

When searching my stats database I can't find the Googlebot ips anywhere.

I have performed the following search:

SELECT * FROM pageviews WHERE LEFT( ip, 7 ) = '209.185' OR LEFT( ip, 5 ) = '64.68' OR LEFT(ip, 7 ) = '216.239'

I got the starting numbers for the Googlebot ips from this site and this topic specifically: [searchengineworld.com...]

Am I searching for the wrong ip? This search yields only 1 result, namely from Googl.ca, the Google site that is, not Googlebot.

edit: perhaps i should also tell you that my site is on a .nl domain in the Netherlands. So this is about google.nl.

Dreamquick

2:54 pm on Dec 30, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



The IP address ranges look right, which just leaves "pageviews" as a possible culprit - how is this table being populated?

Also have you tried searching for googlebot by UA rather than IP?

- Tony

amoore

4:07 pm on Dec 30, 2003 (gmt 0)

10+ Year Member



I can think of a few things to try:

1) use a wildcard instead of the "left" operator like this:
"select * from pageviews where ip like '64.68%' ..."
I'm not sure what good it will do, but it's more the way I do it.

2) find out more about the IP addresses that (useragents that identify themselves as) googlebot uses: "select distinct ip from pageviews where useragent like 'Googlebot%'". That way, you can make sure that your wildcard above encompasses all necessary IP addresses.

3) I just use the useragent to pull stuff out:
"select timeserved, concat(server, urlpath), useragent, remoteip from requests where useragent like 'Google%' order by timeserved"

4) Randal Schwartz may have some more hints for you: [stonehenge.com...]

rubenski

12:21 pm on Dec 31, 2003 (gmt 0)

10+ Year Member



Hi all. Thanks for your hints. I will log User Agents in the future as this gives better evidence of Googlebot's having (or not) visited my site.

Can I get the UA from the IPs in my database or should i track the UA right at the moment a page is requested?