homepage Welcome to WebmasterWorld Guest from 54.197.65.82
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member

Home / Forums Index / Google / Google SEO News and Discussion
Forum Library, Charter, Moderators: Robert Charlton & aakk9999 & brotherhood of lan & goodroi

Google SEO News and Discussion Forum

This 134 message thread spans 5 pages: 134 ( [1] 2 3 4 5 > >     
Google Datacenters Watch 2006-02-06
Observations, Analysis and Remarks
speedshopping

5+ Year Member



 
Msg#: 33045 posted 10:07 am on Feb 6, 2006 (gmt 0)

< continued from [webmasterworld.com...] >

has anyone noticed when on Big Daddy DC you do a site: command on your site it brings back a number.

Ok now refresh countless times and you will notice it flips to another number.

My site is currently switching between 850 pages and 420,000!

Any ideas anyone?

[edited by: tedster at 10:41 pm (utc) on Feb. 6, 2006]

 

Ellio

5+ Year Member



 
Msg#: 33045 posted 11:33 am on Feb 6, 2006 (gmt 0)

I guess that your switching between Big Daddy and Default results on refresh. The lower figure is probably Big Daddy.

speedshopping

5+ Year Member



 
Msg#: 33045 posted 11:45 am on Feb 6, 2006 (gmt 0)

so the lower figure is how many pages now exist in BD? Surely they would not drop 420,000 pages?

Eazygoin

5+ Year Member



 
Msg#: 33045 posted 11:54 am on Feb 6, 2006 (gmt 0)

Uhmmmm.....I have 60,000+ for BD, and 850 results for classic DB's

Ellio

5+ Year Member



 
Msg#: 33045 posted 12:03 pm on Feb 6, 2006 (gmt 0)

Some sites have reported an increase of pages indexed on BD other have reported a drastic reduction.

I was guessing.

speedshopping

5+ Year Member



 
Msg#: 33045 posted 12:06 pm on Feb 6, 2006 (gmt 0)

gutted if they have reduced my site so much why would they do that?

300m

5+ Year Member



 
Msg#: 33045 posted 12:10 pm on Feb 6, 2006 (gmt 0)

This is interesting...

I created a blog about 2 weeks ago and it was through blospot, but it is hosted on my domain and my blogspot profile is ranked #15 on a competitive keyword.

Thats nice and all, but why? The profile has nothing on there that should have it ranking for that keyword.Especially so soon. Granted the blog and the domain it is sitting on is relevant to the keyword, but I have other .html pages that have always doen well with that keyword and they are not anywhere to be found.

i think the one thing that is making that happen is that I submitted the blog to about 30 blog search engines last week. Which would kind of make sense to my point that big daddy results for my area are still inundated with link spam.

Very strange.

arbitrary

5+ Year Member



 
Msg#: 33045 posted 4:00 pm on Feb 6, 2006 (gmt 0)

Here is my are my thoughts on the BigDaddy move. This is based on observations and speculation and it is presented in a simplified form.

Google has two spidering systems now. BigDaddy spidering (Mozilla) and non-BigDaddy spidering (non-Mozilla). Google is moving towards only BigDaddy spidering and is trying to stop non-BigDaddy spidering.

As a result of non-BigDaddy spidering decreasing, some sites on non-BigDadddy datacenters seem to be de-indexed. In other words, the number of pages that are cached in non-BigDaddy datacenters is decreasing for some sites. This means a loss of traffic since non-BigDaddy datacenters still represent the majority of datacenters. This loss of traffic is really caused by less pages in the index versus pages no longer ranking.

The de-indexing of pages is not intentional and does not reflect a site that is not 'worthy' of being in the index. It just means that sites/pages are getting temporarily lost in the move.

I suspect that even BigDaddy spidering does not have an accurate snapshot of the web. Sites will also not rank 'properly' (as Google really thinks they should) because many pages are missing that act as backlinks to other pages.

This is a state of flux for Google. Google is moving. When you move, it is hard to find stuff.

If anyone can share their views, for or against the above, that would be appreciated.

BillyS

WebmasterWorld Senior Member billys us a WebmasterWorld Top Contributor of All Time 10+ Year Member



 
Msg#: 33045 posted 4:06 pm on Feb 6, 2006 (gmt 0)

>>BMW can buy 10 times google and sale it to the last toilete cleaner in Germany for 1 EURO.

Just for the record, Google is four times larger than BMW in terms of market capitalization.

Powdork

WebmasterWorld Senior Member powdork us a WebmasterWorld Top Contributor of All Time 10+ Year Member



 
Msg#: 33045 posted 4:09 pm on Feb 6, 2006 (gmt 0)

damn BillS, you must have read my mind. I didn't know it was that much. the latest numbers I found were 36 billion to 23 billion.

stakaman

5+ Year Member



 
Msg#: 33045 posted 4:10 pm on Feb 6, 2006 (gmt 0)

please keep in mind that the beginning of every Google update sees spam sites surfacing and then dissapearing again.

Google are bringing those to the public's and quality team's attention by possibly relaxing the algo and at the end of the update by re-introducing their advanced filters

frakilk

5+ Year Member



 
Msg#: 33045 posted 5:25 pm on Feb 6, 2006 (gmt 0)

>>> Google are bringing those to the public's and quality team's attention by possibly relaxing the algo and at the end of the update by re-introducing their advanced filters

This is what I think may be happening as well. But this causes a lot of unnecessary damage for webmasters who struggle during the time the spam has risen. There has got to be a better way to do this. And besides it's not as if every piece of spam will be reported via spam reports. Do the general public ever submit these reports? Not sure about that.

texasville

WebmasterWorld Senior Member 5+ Year Member



 
Msg#: 33045 posted 5:37 pm on Feb 6, 2006 (gmt 0)

I have been checking [64.233.179.99...] the past few days and was kinda po'd that Google was back to trashing my site again. Listing original content as supplemental and dropping one o my product pages again, AND then, it suddenly hit me. This is an indexing I saw a month ago. They had cleaned it up, added new pages and taken my content out of supplemental. BANG! It has reversed to the old index. Tests out as BD. So what gives?
>edit< Just checked the cache dates on the supplementals and they show Jun 30, 2005 but there was fresh cache dates just last week in BD.
They have rolled back to a previous index.

McMohan

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 33045 posted 5:47 pm on Feb 6, 2006 (gmt 0)

Just for the record, Google is four times larger than BMW in terms of market capitalization.

Interesting, I didn't know that as well. However I feel, BMW's market cap is more fundamental than Google's, which IMO includes a lot of hype and sentimentals and prone to a shake-up.

randle

WebmasterWorld Senior Member 10+ Year Member



 
Msg#: 33045 posted 6:00 pm on Feb 6, 2006 (gmt 0)

a return to september 22 rseults.

This is our experience as well, across several different sectors.

300m

5+ Year Member



 
Msg#: 33045 posted 6:16 pm on Feb 6, 2006 (gmt 0)

"a return to september 22 rseults"

Only if it was September 21st I would be in good shape.....

Dayo_UK

10+ Year Member



 
Msg#: 33045 posted 6:23 pm on Feb 6, 2006 (gmt 0)

arbitrary

That is how I see it.

Although at the moment I am not sure where the "ranking" - as in PR,BL and any other unseen to us ranking indicators etc - is coming from on BD.

Eg. Is the ranking of the sites on BD using the existing criteria/ranks with the BD crawled data or using the ranking as calculated by the BD crawl.

With MC saying that PR is independant of BD - I think the rankings as calculated by the Mozilla Googlebot crawl may not have been applied.

arbitrary

5+ Year Member



 
Msg#: 33045 posted 7:35 pm on Feb 6, 2006 (gmt 0)

Dayo, thanks for the confirmation.

...I think the rankings as calculated by the Mozilla Googlebot crawl may not have been applied.

I agree - it seems that during this move, a lot seems to be up in the air. I look forward to having thing settle down to get an idea of where things are really at.

lordpercy

5+ Year Member



 
Msg#: 33045 posted 9:53 pm on Feb 6, 2006 (gmt 0)

Been watching google.co.uk most of the day Big Daddy results were live from early morning until 21:30 GMT now the data has reverted back to pre BD results.

Looks like there will be flux in regional engines using BD data.

LP

Eazygoin

5+ Year Member



 
Msg#: 33045 posted 10:16 pm on Feb 6, 2006 (gmt 0)

lordpercy>>

Still got BD as the default DC here in London.

lordpercy

5+ Year Member



 
Msg#: 33045 posted 10:25 pm on Feb 6, 2006 (gmt 0)

Just checked again .co.uk is giving www.sfgiants.com/ for "SF giants" so no big daddy for me I'm in London too. While the BD results were up it really helped me was on for a bumper day!

Ellio

5+ Year Member



 
Msg#: 33045 posted 11:43 pm on Feb 6, 2006 (gmt 0)

>>>Just checked again .co.uk is giving www.sfgiants.com/ for "SF giants" so no big daddy for me I'm in London too. While the BD results were up it really helped me was on for a bumper day! <<<

In fact Big Daddy has been live (part time) on Google.co.uk every day this year so far.

If you get a default result a few refreshes will normally switch the result to Big Daddy although it sometimes seems to revolve in "Shifts".

Some of our terms rank very high on Big Daddy but are missing on default and we chack results every couple of hours every day so I can confirm 100% that BD has been on Google.co.uk since 1.1.06.

Check the cache links in the .co.uk results to see either:

Big Daddy = [64.233.179.104...]

Default = [72.14.207.104...]

These IP's do change but have not for about 10 days.

I have reported this several times earlier in this thread.

Eazygoin

5+ Year Member



 
Msg#: 33045 posted 12:19 am on Feb 7, 2006 (gmt 0)

Ellio>

There are a number of DC IP's hitting the UK, some of which are BD and others are classic DC's. The two you quote are but amongst a number of DC's showing in the UK.

g1smd

WebmasterWorld Senior Member g1smd us a WebmasterWorld Top Contributor of All Time 10+ Year Member



 
Msg#: 33045 posted 12:41 am on Feb 7, 2006 (gmt 0)

>> More interestingly.. some of my pages that are being listed - incorrectly - at the moment, with the only content being... "301 - this page has moved..." are now ranking again for their terms, even though the caption text in Google is still the "301 - this page has moved...". Strange... <<

It should not be possible to index the content that is sent out along with a "301" status code. The 301 code is a redirect, and includes the URL of where to go instead.

Make sure, using WebBug, that the page really does return a HTTP status code of "301". It sounds like you have a page with the words "301 Page Moved" on it, but you serve it with HTTP Status "200 OK" and then you are doing some sort of funky JavaScript or other type of redirect, not a proper 301.

.

>> Q: Is there an easy way to test if a datacenter is running Bigdaddy? <<

>> A: There’s not a definitive way outside of Google. But the [sf giants] example that I mentioned in my pre-BigDaddy 302 post remains a pretty good test. If the query [sf giants] returns giants.mlb.com, the odds are pretty good that you’re hitting Bigdaddy. <<

There is NOT a definitive way. So, that means that there is a chance that you are NOT looking at BigDaddy results. The [sfgiants] test isn't 100% conclusive.

annej

WebmasterWorld Senior Member annej us a WebmasterWorld Top Contributor of All Time 10+ Year Member



 
Msg#: 33045 posted 3:07 am on Feb 7, 2006 (gmt 0)

I think BD is giving old results in terms of ranking, yet it contains pages that I've published as recently as January 26.

Powdork

WebmasterWorld Senior Member powdork us a WebmasterWorld Top Contributor of All Time 10+ Year Member



 
Msg#: 33045 posted 6:36 am on Feb 7, 2006 (gmt 0)

It should not be possible to index the content that is sent out along with a "301" status code.
It sounds like a glich to me. I have one from listing for a robots.txt protected page where both the Title and the snippet are "X-Google-Crawl-Date: Wed, 24 Nov 2004 17:02:30 GMT"
cleanup

10+ Year Member



 
Msg#: 33045 posted 8:40 am on Feb 7, 2006 (gmt 0)

Gs1md
Thanks, but I would not know what "funky JavaScript" was these days if I tripped over it!

Powdork.
A glitch? yes I think so but it has been like this for six weeks or so. During which time I have had my fingers crossed for a bot to come around and get it right.

Dayo_UK
suggested that it may be between bots at the moment ie that one bot had passed by and picked up the new address to be followed sometime later by another more patient one to pick up the content again...mmm

Day_UK could that be your unshakable optimism talking or do you stil think that may happen.

->
Here is the code I am using in a .htaccess

Options +FollowSymLinks
RewriteEngine on
RewriteCond %{HTTP_HOST} ^mysite\.com
RewriteRule ^(.*)$ [mysite.com...] [R=permanent,L]

RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /.*index\.html\ HTTP/
RewriteRule ^(.*)index\.html$ /$1 [R=301,L]

->
And here is the response I get when using:
seoconsultants .com/tools/headers.asp#results

#1 Server Response: [mysite.com...]
HTTP Status Code: HTTP/1.1 301 Moved Permanently
Date: Tue, 07 Feb 2006 08:28:13 GMT
Server: Apache/1.3.34 (Unix) mod_auth_passthrough/1.8 mod_log_bytes/1.2 mod_bwlimited/1.4 FrontPage/5.0.2.2635 mod_ssl/2.8.25 OpenSSL/0.9.7a PHP-CGI/0.1b
Location: [mysite.com...]
Connection: close
Content-Type: text/html; charset=iso-8859-1
Redirect Target: [mysite.com...]

#2 Server Response: [mysite.com...]
HTTP Status Code: HTTP/1.1 200 OK
Date: Tue, 07 Feb 2006 08:28:13 GMT
Server: Apache/1.3.34 (Unix) mod_auth_passthrough/1.8 mod_log_bytes/1.2 mod_bwlimited/1.4 FrontPage/5.0.2.2635 mod_ssl/2.8.25 OpenSSL/0.9.7a PHP-CGI/0.1b
Last-Modified: Wed, 18 Jan 2006 18:08:07 GMT

Ellio

5+ Year Member



 
Msg#: 33045 posted 9:16 am on Feb 7, 2006 (gmt 0)

Ellio>
There are a number of DC IP's hitting the UK, some of which are BD and others are classic DC's. The two you quote are but amongst a number of DC's showing in the UK.

Eazygoin,

We are based in the UK and only use Google.co.uk and check results at least every one to two hours. For the last week at least the IPs I quoted are the only ones that have been used.

I fully understand and am aware that various IPs are being used on Google.co.uk I was simply pointing out "Current" experience.

The use of various IPs appears very "Structured" rather than random and that an interesting factor.

We are data centre watching after all.

Dayo_UK

10+ Year Member



 
Msg#: 33045 posted 9:34 am on Feb 7, 2006 (gmt 0)

>>>>Dayo_UK
>>>>suggested that it may be between bots at the moment ie that one bot had passed by and picked up the new address to be followed sometime later by another more patient one to pick up the content again...mmm

>>>>Day_UK could that be your unshakable optimism talking or do you stil think that may happen.

Cleanup

Yes it may happen - GG/MC have said before that 301 redirects are not followed immediately and that they are added to a queue for indexing.

I would hate to guess how long this process may take.

I try to stay optomistic nowadays.

curlykarl

10+ Year Member



 
Msg#: 33045 posted 9:37 am on Feb 7, 2006 (gmt 0)

We are based in the UK and only use Google.co.uk and check results at least every one to two hours. For the last week at least the IPs I quoted are the only ones that have been used.

I've been seeing the same since last week

and check results at least every one to two hours

Is that all!, you should be checking more than that :)

This 134 message thread spans 5 pages: 134 ( [1] 2 3 4 5 > >
Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Google / Google SEO News and Discussion
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved