

Mozilla Googlebot and the New Index at 64.233.179.104

Moved on from Jagger

         

Dayo_UK

9:58 am on Dec 13, 2005 (gmt 0)



OK - Jagger is over - long live "Big Daddy" - as named by MC for the test DC.

The index growing on 64.233.179.104 does seem to be largely a Mozilla Googlebot generated index - and this new index is being built for the future - so can we say Mozilla Googlebot is now taking over from normal Googlebot?

OK, ignore supplementals etc for a moment - as all DCs have this problem - and have a look at the cache dates for pages that are indexed... some of these pages have only been fetched by Mozilla Googlebot (even on the same day as normal Googlebot visited).

Eg. On the test DC I have a homepage cached 30th November at 5:40 - fetched by Mozilla Googlebot - while on the other DCs it is cached on 30th November at 3:40 - fetched by normal Googlebot.

So in many ways this does look like building a whole new index parallel to the existing index - with largely Mozilla Googlebot crawl data.

Some pages appear very old - e.g. another page is cached on the test DC on 6th November, but on the other DCs it has a cache date in December. Checking the logs, 6th November was the last time Mozilla Googlebot visited this page.
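For anyone wanting to repeat the log check, here is a rough sketch, not the exact method used for the observations above. It assumes an Apache-style combined access log and that the two crawlers can be told apart by user-agent string alone: normal Googlebot sends "Googlebot/2.1 (+http://www.google.com/bot.html)", while Mozilla Googlebot sends "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)". The function names classify and last_visits are made up for illustration.

```python
import re
from datetime import datetime

# Assumed layout (combined log format):
# host ident user [time] "METHOD path PROTO" status bytes "referer" "user-agent"
LOG_RE = re.compile(
    r'\S+ \S+ \S+ \[(?P<time>[^\]]+)\] "(?:\S+) (?P<path>\S+)[^"]*" '
    r'\d{3} \S+ "[^"]*" "(?P<agent>[^"]*)"'
)


def classify(agent):
    """Return 'mozilla', 'normal', or None for a non-Googlebot user agent."""
    if "Googlebot" not in agent:
        return None
    # Assumption: the Mozilla variant is the one whose UA starts with Mozilla/5.0.
    return "mozilla" if agent.startswith("Mozilla/5.0") else "normal"


def last_visits(lines):
    """Map (path, crawler kind) to the most recent visit time found in the log."""
    latest = {}
    for line in lines:
        m = LOG_RE.match(line)
        if not m:
            continue  # skip lines that do not fit the assumed format
        kind = classify(m.group("agent"))
        if kind is None:
            continue
        when = datetime.strptime(m.group("time"), "%d/%b/%Y:%H:%M:%S %z")
        key = (m.group("path"), kind)
        if key not in latest or when > latest[key]:
            latest[key] = when
    return latest
```

Feed it the lines of an access log (for example, an open file object) and compare the "mozilla" and "normal" timestamps for a page against the cache date each DC shows. Matching on user agent alone does not prove the request really came from Google, so treat the result as a first pass.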

OK - there are pages in the test DC only visited by normal Googlebot - however, pages crawled by Mozilla Googlebot do not appear on other DCs.

The newest pages on the DC crawled by Mozilla Googlebot seem to be in November - eg no pages crawled by Mozilla Googlebot in December have made it to the index yet.

Some pages crawled by Mozilla Googlebot in November have not made it to the index - so I don't know if G are working with a sample of the data...

For confirmation that this is a whole new build of the index MC said on his blog:-

"the test data center certainly has some different crawling and indexing characteristics."

OK - folks, remember also that MC said that this index will roll out in months and is in a test state, so I guess there is no need for early panic stations or slagging off Google in this thread.

Now 301s, 302s and canonicals - for me, Google has crawled and indexed a lot more 301s correctly. 302s - still lots in the index (mainly supplementals) - but I am not seeing any new 302s that show the URL of the linking site with the content of the destination site (the newest I am seeing date from about August 2005) - no doubt others may find some.

What other observations have people made about the new crawling and indexing on this test DC?

Miop

5:30 pm on Jan 17, 2006 (gmt 0)

10+ Year Member



Does your domain forward to index.html or .php (or some other page)?

Miop

5:32 pm on Jan 17, 2006 (gmt 0)

10+ Year Member



Oops wrong thread...

webvivre

6:01 pm on Jan 17, 2006 (gmt 0)

10+ Year Member



Miop
The real page is www.domainname.co.uk/index.htm

All links are to www.domainname.co.uk

Miop

6:58 pm on Jan 17, 2006 (gmt 0)

10+ Year Member



Is your www.domainname.co.uk/index.htm found in G's index?

webvivre

8:39 pm on Jan 17, 2006 (gmt 0)

10+ Year Member



Miop
No - index.htm not found

sore66

8:45 pm on Jan 17, 2006 (gmt 0)

10+ Year Member



One of my sites is #3 for its keyword so I think this is an excellent update. Go live Google!

Google...who loves ya baby?:-))

ps...on a serious note, the site that "should" be #1 for this word is not on the first page of results.

This 126 message thread spans 5 pages.