Forum Moderators: Robert Charlton & goodroi
has anyone noticed when on Big Daddy DC you do a site: command on your site it brings back a number.
Ok now refresh countless times and you will notice it flips to another number.
My site is currently switching between 850 pages and 420,000!
Any ideas anyone?
[edited by: tedster at 10:41 pm (utc) on Feb. 6, 2006]
I created a blog about 2 weeks ago and it was through blospot, but it is hosted on my domain and my blogspot profile is ranked #15 on a competitive keyword.
Thats nice and all, but why? The profile has nothing on there that should have it ranking for that keyword.Especially so soon. Granted the blog and the domain it is sitting on is relevant to the keyword, but I have other .html pages that have always doen well with that keyword and they are not anywhere to be found.
i think the one thing that is making that happen is that I submitted the blog to about 30 blog search engines last week. Which would kind of make sense to my point that big daddy results for my area are still inundated with link spam.
Very strange.
Google has two spidering systems now. BigDaddy spidering (Mozilla) and non-BigDaddy spidering (non-Mozilla). Google is moving towards only BigDaddy spidering and is trying to stop non-BigDaddy spidering.
As a result of non-BigDaddy spidering decreasing, some sites on non-BigDadddy datacenters seem to be de-indexed. In other words, the number of pages that are cached in non-BigDaddy datacenters is decreasing for some sites. This means a loss of traffic since non-BigDaddy datacenters still represent the majority of datacenters. This loss of traffic is really caused by less pages in the index versus pages no longer ranking.
The de-indexing of pages is not intentional and does not reflect a site that is not 'worthy' of being in the index. It just means that sites/pages are getting temporarily lost in the move.
I suspect that even BigDaddy spidering does not have an accurate snapshot of the web. Sites will also not rank 'properly' (as Google really thinks they should) because many pages are missing that act as backlinks to other pages.
This is a state of flux for Google. Google is moving. When you move, it is hard to find stuff.
If anyone can share their views, for or against the above, that would be appreciated.
This is what I think may be happening as well. But this causes a lot of unnecessary damage for webmasters who struggle during the time the spam has risen. There has got to be a better way to do this. And besides it's not as if every piece of spam will be reported via spam reports. Do the general public ever submit these reports? Not sure about that.
That is how I see it.
Although at the moment I am not sure where the "ranking" - as in PR,BL and any other unseen to us ranking indicators etc - is coming from on BD.
Eg. Is the ranking of the sites on BD using the existing criteria/ranks with the BD crawled data or using the ranking as calculated by the BD crawl.
With MC saying that PR is independant of BD - I think the rankings as calculated by the Mozilla Googlebot crawl may not have been applied.
In fact Big Daddy has been live (part time) on Google.co.uk every day this year so far.
If you get a default result a few refreshes will normally switch the result to Big Daddy although it sometimes seems to revolve in "Shifts".
Some of our terms rank very high on Big Daddy but are missing on default and we chack results every couple of hours every day so I can confirm 100% that BD has been on Google.co.uk since 1.1.06.
Check the cache links in the .co.uk results to see either:
Big Daddy = [64.233.179.104...]
Default = [72.14.207.104...]
These IP's do change but have not for about 10 days.
I have reported this several times earlier in this thread.
It should not be possible to index the content that is sent out along with a "301" status code. The 301 code is a redirect, and includes the URL of where to go instead.
Make sure, using WebBug, that the page really does return a HTTP status code of "301". It sounds like you have a page with the words "301 Page Moved" on it, but you serve it with HTTP Status "200 OK" and then you are doing some sort of funky JavaScript or other type of redirect, not a proper 301.
.
>> Q: Is there an easy way to test if a datacenter is running Bigdaddy? <<
>> A: There’s not a definitive way outside of Google. But the [sf giants] example that I mentioned in my pre-BigDaddy 302 post remains a pretty good test. If the query [sf giants] returns giants.mlb.com, the odds are pretty good that you’re hitting Bigdaddy. <<
There is NOT a definitive way. So, that means that there is a chance that you are NOT looking at BigDaddy results. The [sfgiants] test isn't 100% conclusive.
Powdork.
A glitch? yes I think so but it has been like this for six weeks or so. During which time I have had my fingers crossed for a bot to come around and get it right.
Dayo_UK
suggested that it may be between bots at the moment ie that one bot had passed by and picked up the new address to be followed sometime later by another more patient one to pick up the content again...mmm
Day_UK could that be your unshakable optimism talking or do you stil think that may happen.
->
Here is the code I am using in a .htaccess
Options +FollowSymLinks
RewriteEngine on
RewriteCond %{HTTP_HOST} ^mysite\.com
RewriteRule ^(.*)$ [mysite.com...] [R=permanent,L]
RewriteCond %{THE_REQUEST} ^[A-Z]{3,9}\ /.*index\.html\ HTTP/
RewriteRule ^(.*)index\.html$ /$1 [R=301,L]
->
And here is the response I get when using:
seoconsultants .com/tools/headers.asp#results
#1 Server Response: [mysite.com...]
HTTP Status Code: HTTP/1.1 301 Moved Permanently
Date: Tue, 07 Feb 2006 08:28:13 GMT
Server: Apache/1.3.34 (Unix) mod_auth_passthrough/1.8 mod_log_bytes/1.2 mod_bwlimited/1.4 FrontPage/5.0.2.2635 mod_ssl/2.8.25 OpenSSL/0.9.7a PHP-CGI/0.1b
Location: [mysite.com...]
Connection: close
Content-Type: text/html; charset=iso-8859-1
Redirect Target: [mysite.com...]
#2 Server Response: [mysite.com...]
HTTP Status Code: HTTP/1.1 200 OK
Date: Tue, 07 Feb 2006 08:28:13 GMT
Server: Apache/1.3.34 (Unix) mod_auth_passthrough/1.8 mod_log_bytes/1.2 mod_bwlimited/1.4 FrontPage/5.0.2.2635 mod_ssl/2.8.25 OpenSSL/0.9.7a PHP-CGI/0.1b
Last-Modified: Wed, 18 Jan 2006 18:08:07 GMT
Ellio>
There are a number of DC IP's hitting the UK, some of which are BD and others are classic DC's. The two you quote are but amongst a number of DC's showing in the UK.
Eazygoin,
We are based in the UK and only use Google.co.uk and check results at least every one to two hours. For the last week at least the IPs I quoted are the only ones that have been used.
I fully understand and am aware that various IPs are being used on Google.co.uk I was simply pointing out "Current" experience.
The use of various IPs appears very "Structured" rather than random and that an interesting factor.
We are data centre watching after all.
>>>>Day_UK could that be your unshakable optimism talking or do you stil think that may happen.
Cleanup
Yes it may happen - GG/MC have said before that 301 redirects are not followed immediately and that they are added to a queue for indexing.
I would hate to guess how long this process may take.
I try to stay optomistic nowadays.
We are based in the UK and only use Google.co.uk and check results at least every one to two hours. For the last week at least the IPs I quoted are the only ones that have been used.
I've been seeing the same since last week
and check results at least every one to two hours
Is that all!, you should be checking more than that :)