homepage Welcome to WebmasterWorld Guest from 54.198.42.105
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Visit PubCon.com
Home / Forums Index / Google / Google SEO News and Discussion
Forum Library, Charter, Moderators: Robert Charlton & aakk9999 & brotherhood of lan & goodroi

Google SEO News and Discussion Forum

This 134 message thread spans 5 pages: 134 ( [1] 2 3 4 5 > >     
Google Datacenters Watch 2006-02-06
Observations, Analysis and Remarks
speedshopping




msg:750104
 10:07 am on Feb 6, 2006 (gmt 0)

< continued from [webmasterworld.com...] >

has anyone noticed when on Big Daddy DC you do a site: command on your site it brings back a number.

Ok now refresh countless times and you will notice it flips to another number.

My site is currently switching between 850 pages and 420,000!

Any ideas anyone?

[edited by: tedster at 10:41 pm (utc) on Feb. 6, 2006]

 

Ellio




msg:750105
 11:33 am on Feb 6, 2006 (gmt 0)

I guess that your switching between Big Daddy and Default results on refresh. The lower figure is probably Big Daddy.

speedshopping




msg:750106
 11:45 am on Feb 6, 2006 (gmt 0)

so the lower figure is how many pages now exist in BD? Surely they would not drop 420,000 pages?

Eazygoin




msg:750107
 11:54 am on Feb 6, 2006 (gmt 0)

Uhmmmm.....I have 60,000+ for BD, and 850 results for classic DB's

Ellio




msg:750108
 12:03 pm on Feb 6, 2006 (gmt 0)

Some sites have reported an increase of pages indexed on BD other have reported a drastic reduction.

I was guessing.

speedshopping




msg:750109
 12:06 pm on Feb 6, 2006 (gmt 0)

gutted if they have reduced my site so much why would they do that?

300m




msg:750110
 12:10 pm on Feb 6, 2006 (gmt 0)

This is interesting...

I created a blog about 2 weeks ago and it was through blospot, but it is hosted on my domain and my blogspot profile is ranked #15 on a competitive keyword.

Thats nice and all, but why? The profile has nothing on there that should have it ranking for that keyword.Especially so soon. Granted the blog and the domain it is sitting on is relevant to the keyword, but I have other .html pages that have always doen well with that keyword and they are not anywhere to be found.

i think the one thing that is making that happen is that I submitted the blog to about 30 blog search engines last week. Which would kind of make sense to my point that big daddy results for my area are still inundated with link spam.

Very strange.

arbitrary




msg:750111
 4:00 pm on Feb 6, 2006 (gmt 0)

Here is my are my thoughts on the BigDaddy move. This is based on observations and speculation and it is presented in a simplified form.

Google has two spidering systems now. BigDaddy spidering (Mozilla) and non-BigDaddy spidering (non-Mozilla). Google is moving towards only BigDaddy spidering and is trying to stop non-BigDaddy spidering.

As a result of non-BigDaddy spidering decreasing, some sites on non-BigDadddy datacenters seem to be de-indexed. In other words, the number of pages that are cached in non-BigDaddy datacenters is decreasing for some sites. This means a loss of traffic since non-BigDaddy datacenters still represent the majority of datacenters. This loss of traffic is really caused by less pages in the index versus pages no longer ranking.

The de-indexing of pages is not intentional and does not reflect a site that is not 'worthy' of being in the index. It just means that sites/pages are getting temporarily lost in the move.

I suspect that even BigDaddy spidering does not have an accurate snapshot of the web. Sites will also not rank 'properly' (as Google really thinks they should) because many pages are missing that act as backlinks to other pages.

This is a state of flux for Google. Google is moving. When you move, it is hard to find stuff.

If anyone can share their views, for or against the above, that would be appreciated.

BillyS




msg:750112
 4:06 pm on Feb 6, 2006 (gmt 0)

>>BMW can buy 10 times google and sale it to the last toilete cleaner in Germany for 1 EURO.

Just for the record, Google is four times larger than BMW in terms of market capitalization.

Powdork




msg:750113
 4:09 pm on Feb 6, 2006 (gmt 0)

damn BillS, you must have read my mind. I didn't know it was that much. the latest numbers I found were 36 billion to 23 billion.

stakaman




msg:750114
 4:10 pm on Feb 6, 2006 (gmt 0)

please keep in mind that the beginning of every Google update sees spam sites surfacing and then dissapearing again.

Google are bringing those to the public's and quality team's attention by possibly relaxing the algo and at the end of the update by re-introducing their advanced filters

frakilk




msg:750115
 5:25 pm on Feb 6, 2006 (gmt 0)

>>> Google are bringing those to the public's and quality team's attention by possibly relaxing the algo and at the end of the update by re-introducing their advanced filters

This is what I think may be happening as well. But this causes a lot of unnecessary damage for webmasters who struggle during the time the spam has risen. There has got to be a better way to do this. And besides it's not as if every piece of spam will be reported via spam reports. Do the general public ever submit these reports? Not sure about that.

texasville




msg:750116
 5:37 pm on Feb 6, 2006 (gmt 0)

I have been checking [64.233.179.99...] the past few days and was kinda po'd that Google was back to trashing my site again. Listing original content as supplemental and dropping one o my product pages again, AND then, it suddenly hit me. This is an indexing I saw a month ago. They had cleaned it up, added new pages and taken my content out of supplemental. BANG! It has reversed to the old index. Tests out as BD. So what gives?
>edit< Just checked the cache dates on the supplementals and they show Jun 30, 2005 but there was fresh cache dates just last week in BD.
They have rolled back to a previous index.

McMohan




msg:750117
 5:47 pm on Feb 6, 2006 (gmt 0)

Just for the record, Google is four times larger than BMW in terms of market capitalization.

Interesting, I didn't know that as well. However I feel, BMW's market cap is more fundamental than Google's, which IMO includes a lot of hype and sentimentals and prone to a shake-up.

randle




msg:750118
 6:00 pm on Feb 6, 2006 (gmt 0)

a return to september 22 rseults.

This is our experience as well, across several different sectors.

300m




msg:750119
 6:16 pm on Feb 6, 2006 (gmt 0)

"a return to september 22 rseults"

Only if it was September 21st I would be in good shape.....

Dayo_UK




msg:750120
 6:23 pm on Feb 6, 2006 (gmt 0)

arbitrary

That is how I see it.

Although at the moment I am not sure where the "ranking" - as in PR,BL and any other unseen to us ranking indicators etc - is coming from on BD.

Eg. Is the ranking of the sites on BD using the existing criteria/ranks with the BD crawled data or using the ranking as calculated by the BD crawl.

With MC saying that PR is independant of BD - I think the rankings as calculated by the Mozilla Googlebot crawl may not have been applied.

arbitrary




msg:750121
 7:35 pm on Feb 6, 2006 (gmt 0)

Dayo, thanks for the confirmation.

...I think the rankings as calculated by the Mozilla Googlebot crawl may not have been applied.

I agree - it seems that during this move, a lot seems to be up in the air. I look forward to having thing settle down to get an idea of where things are really at.

lordpercy




msg:750122
 9:53 pm on Feb 6, 2006 (gmt 0)

Been watching google.co.uk most of the day Big Daddy results were live from early morning until 21:30 GMT now the data has reverted back to pre BD results.

Looks like there will be flux in regional engines using BD data.

LP

Eazygoin




msg:750123
 10:16 pm on Feb 6, 2006 (gmt 0)

lordpercy>>

Still got BD as the default DC here in London.

lordpercy




msg:750124
 10:25 pm on Feb 6, 2006 (gmt 0)

Just checked again .co.uk is giving www.sfgiants.com/ for "SF giants" so no big daddy for me I'm in London too. While the BD results were up it really helped me was on for a bumper day!

Ellio




msg:750125
 11:43 pm on Feb 6, 2006 (gmt 0)

>>>Just checked again .co.uk is giving www.sfgiants.com/ for "SF giants" so no big daddy for me I'm in London too. While the BD results were up it really helped me was on for a bumper day! <<<

In fact Big Daddy has been live (part time) on Google.co.uk every day this year so far.

If you get a default result a few refreshes will normally switch the result to Big Daddy although it sometimes seems to revolve in "Shifts".

Some of our terms rank very high on Big Daddy but are missing on default and we chack results every couple of hours every day so I can confirm 100% that BD has been on Google.co.uk since 1.1.06.

Check the cache links in the .co.uk results to see either:

Big Daddy = [64.233.179.104...]

Default = [72.14.207.104...]

These IP's do change but have not for about 10 days.

I have reported this several times earlier in this thread.

Eazygoin




msg:750126
 12:19 am on Feb 7, 2006 (gmt 0)

Ellio>

There are a number of DC IP's hitting the UK, some of which are BD and others are classic DC's. The two you quote are but amongst a number of DC's showing in the UK.

g1smd




msg:750127
 12:41 am on Feb 7, 2006 (gmt 0)

>> More interestingly.. some of my pages that are being listed - incorrectly - at the moment, with the only content being... "301 - this page has moved..." are now ranking again for their terms, even though the caption text in Google is still the "301 - this page has moved...". Strange... <<

It should not be possible to index the content that is sent out along with a "301" status code. The 301 code is a redirect, and includes the URL of where to go instead.

Make sure, using WebBug, that the page really does return a HTTP status code of "301". It sounds like you have a page with the words "301 Page Moved" on it, but you serve it with HTTP Status "200 OK" and then you are doing some sort of funky JavaScript or other type of redirect, not a proper 301.

.

>> Q: Is there an easy way to test if a datacenter is running Bigdaddy? <<

>> A: There’s not a definitive way outside of Google. But the [sf giants] example that I mentioned in my pre-BigDaddy 302 post remains a pretty good test. If the query [sf giants] returns giants.mlb.com, the odds are pretty good that you’re hitting Bigdaddy. <<

There is NOT a definitive way. So, that means that there is a chance that you are NOT looking at BigDaddy results. The [sfgiants] test isn't 100% conclusive.

annej




msg:750128
 3:07 am on Feb 7, 2006 (gmt 0)

I think BD is giving old results in terms of ranking, yet it contains pages that I've published as recently as January 26.

Powdork




msg:750129
 6:36 am on Feb 7, 2006 (gmt 0)

It should not be possible to index the content that is sent out along with a "301" status code.
It sounds like a glich to me. I have one from listing for a robots.txt protected page where both the Title and the snippet are "X-Google-Crawl-Date: Wed, 24 Nov 2004 17:02:30 GMT"
cleanup




msg:750130
 8:40 am on Feb 7, 2006 (gmt 0)

Gs1md
Thanks, but I would not know what "funky JavaScript" was these days if I tripped over it!

Powdork.
A glitch? yes I think so but it has been like this for six weeks or so. During which time I have had my fingers crossed for a bot to come around and get it right.

Dayo_UK
suggested that it may be between bots at the moment ie that one bot had passed by and picked up the new address to be followed sometime later by another more patient one to pick up the content again...mmm

Day_UK could that be your unshakable optimism talking or do you stil think that may happen.

->
Here is the code I am using in a .htaccess

Options +FollowSymLinks
RewriteEngine on
RewriteCond %{HTTP_HOST} ^mysite\.com
RewriteRule ^(.*)$ [mysite.com...] [R=permanent,L]

RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /.*index\.html\ HTTP/
RewriteRule ^(.*)index\.html$ /$1 [R=301,L]

->
And here is the response I get when using:
seoconsultants .com/tools/headers.asp#results

#1 Server Response: [mysite.com...]
HTTP Status Code: HTTP/1.1 301 Moved Permanently
Date: Tue, 07 Feb 2006 08:28:13 GMT
Server: Apache/1.3.34 (Unix) mod_auth_passthrough/1.8 mod_log_bytes/1.2 mod_bwlimited/1.4 FrontPage/5.0.2.2635 mod_ssl/2.8.25 OpenSSL/0.9.7a PHP-CGI/0.1b
Location: [mysite.com...]
Connection: close
Content-Type: text/html; charset=iso-8859-1
Redirect Target: [mysite.com...]

#2 Server Response: [mysite.com...]
HTTP Status Code: HTTP/1.1 200 OK
Date: Tue, 07 Feb 2006 08:28:13 GMT
Server: Apache/1.3.34 (Unix) mod_auth_passthrough/1.8 mod_log_bytes/1.2 mod_bwlimited/1.4 FrontPage/5.0.2.2635 mod_ssl/2.8.25 OpenSSL/0.9.7a PHP-CGI/0.1b
Last-Modified: Wed, 18 Jan 2006 18:08:07 GMT

Ellio




msg:750131
 9:16 am on Feb 7, 2006 (gmt 0)

Ellio>
There are a number of DC IP's hitting the UK, some of which are BD and others are classic DC's. The two you quote are but amongst a number of DC's showing in the UK.

Eazygoin,

We are based in the UK and only use Google.co.uk and check results at least every one to two hours. For the last week at least the IPs I quoted are the only ones that have been used.

I fully understand and am aware that various IPs are being used on Google.co.uk I was simply pointing out "Current" experience.

The use of various IPs appears very "Structured" rather than random and that an interesting factor.

We are data centre watching after all.

Dayo_UK




msg:750132
 9:34 am on Feb 7, 2006 (gmt 0)

>>>>Dayo_UK
>>>>suggested that it may be between bots at the moment ie that one bot had passed by and picked up the new address to be followed sometime later by another more patient one to pick up the content again...mmm

>>>>Day_UK could that be your unshakable optimism talking or do you stil think that may happen.

Cleanup

Yes it may happen - GG/MC have said before that 301 redirects are not followed immediately and that they are added to a queue for indexing.

I would hate to guess how long this process may take.

I try to stay optomistic nowadays.

curlykarl




msg:750133
 9:37 am on Feb 7, 2006 (gmt 0)

We are based in the UK and only use Google.co.uk and check results at least every one to two hours. For the last week at least the IPs I quoted are the only ones that have been used.

I've been seeing the same since last week

and check results at least every one to two hours

Is that all!, you should be checking more than that :)

This 134 message thread spans 5 pages: 134 ( [1] 2 3 4 5 > >
Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Google / Google SEO News and Discussion
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved