I see huge traffic from MSN.
Yes, I see quite a nice intermittent stream of traffic (primarily from Europe) when the BD test happen. My site is sandboxed... but not on BD.
when more BD results are online I see more G referrals to sites I have launched in the last 3 months - almost as though the sandbox has been lifted for those sites.
I not one hit from MSN, but my site appears to be b***ered there too!
300m - Hello again :)
You are seeing exactly what my take is on the current results I watch..
>>> I see an awful lot of pages that do not belong there. However, when you get beyond the pages that do not belong there, I see th ones that do.
Exactly. Granted the first page or two contains big guns, but then this is followed by unrelated results / spam and then 6 or 7 pages down you will see results that should be first page results.
It puzzles me. But here is my theory... Google wants to push crap to the top in order to receive spam reports that will help them clean up the index. However at the same time they want to keep users happy enough that they do not become frustrated with crap results, therefore the big guns on the first / second pages.
Big guns -> Spam -> Relevant results
My long-time #8 which dropped to #360 while the cache was out, is back up to #9 today.
The number of results returned in that search has been 110 million for a long time, inflated to 200 million late last week, went back to 120 million in recent days, and now shows as 277 million in some DCS (205 in others). What is going on?
The cache is still out at 18.104.22.168 at least. It is back in many other DCs where it was missing yesterday.
Looks like bigdaddy is spreading seeing it on 8 datacenters now
Seeing this on 10 DC's now at McDar
I am really suprised there rolling it out this quick, seeing alot of old data, cached dates are really old too.
Looks like a roll-back rather than a roll-out to me...
I'm not sure what people are seeing because I can't see anything!
>>But here is my theory... Google wants to push crap to the top in order to receive spam reports that will help them clean up the index.
I agree with this. It makes more sense for Google to let the spam and junk float up because then they will know if their right in their assumption as to what is and what is not spam. Having a group of people submit spam and dissatisfied reports is what I beleive to be a way of confirming algo changes and index updates.
However, it seems tha you and I are the only people who has posted with regards to the December 27 datarefresh. Which in my opinion may have been updated because they wanted to see how big daddy fished out the spam etc... with a much larger index.
I personally think that it would benefit everyone that has anything to do with google regardless if you are a webmaster or a searcher to have that spam go to page 2, and let people that do provide feedback, know that is where it is at and to please submit it there.
Fact is, not many people will even bother to go to page 2 on a search, unless they are hunting for something.
By having the spam on page one, Google may be inadvertantly be costing some companies thousands and thousands of dollars on a daily, weekly or monthly basis.
In my instances, i only do organic seo and i depend on google for natural revenue. We do have ppc and that has been covering the difference for us during this frustrating time frame, but the simple fact is, if its not organically ranking in googgle soon, my heads gonna roll.
Dont get me wrong though, if i am doing something wrong, I will admit to it, and also recognize it. However, since bd and the dc refresh prior, there is something else going on beyond my control.
>>>I am really suprised there rolling it out this quick,
Lol - this has been anything but a quick process.
Although I do get your point about the data - I hope that a big crawl/data refresh and PR update follow the roll out relatively quickly.
300m & frakilk
I think what you are seeing is that sites which have been effected by Google problems (Hijack, Canonical etc) seem to have a downranking effect - this downranking does not really effect the big big players (kelkoo, amazon etc) in the same way.
So you have serps which show:-
- Big Players
- Decent results (Downranked due to problems)
So I think it is not the spam being promoted up as such.
I am not 100% in agreement with that. I say this because there is a lot of spam on page one.
For one particular term and the #1 result, i go to msn and look at the link opertaor results and what do I see? I see about 20,000 anchor text links from irrelevant fourms. The term is not anywhere on the page that is doing this either. Unless i am not understanding the concept of fourm spamming, this is a clear cut case of spam.
However, I can somewhat agree with the rest of what you said, but as far as spam floating to the top, in the field i am looking at, it is apparent.
Also keep in mind that for me, this did not start until December 27th. Prior to that, i saw the same results on page one on the small index dc's and the inflated index dc's. When i say same results i mean the results that have historically been there for almost a year.
[edited by: 300m at 3:19 pm (utc) on Jan. 25, 2006]
I was talking about what Matt said, 1 to 2 months before its rolled out, think its been around 3 weeks.
I am trying to put a list of non-test english datacenters together. So far, this is what I have, can someone indicate what are the test DCs and what non-test DCs are missing from the list?
The real list is a LOT longer than that.
There are several lists out there already.
I would like to confirm that the 'good' big daddy results have been spreading (or my site keywords).
Yesterday 7 additional DCs and today 9 additional DCs.
I'm seeing some significant fluctuations as well, one of my tools produced the graph at the URL below for a key phrase for one of my sites across a large number of datacenters including the two big daddy IPs mentioned on Matt Cutts blog.
Now we can all dance in the streets! Google I love you long time! Big Daddy is FINALLY LIVE!
My site that has had small canonical issues that were fixed on BD when MC mentioned mlb.com, but then went completely awol with only 8 pages returned with the site:example.com search is now at #1 in the BD results that are spreading. The other two indexes I am seeing on mcdar are the one that does not contain my site (and does not meet the sfgiants test), and one that contains my site at the position it held with mild canonical issues, but now it is a url only listing.
However, even with the spreading results, my quick test of traffic levels, Adsense Stats, have not shown any increases. They are sending out payments though, which seems to cause a stat lag.
>> Now we can all dance in the streets! Google I love you long time! Big Daddy is FINALLY LIVE!
I wouldn't go that far ;-) They are probably just testing again!
I only love 22.214.171.124!
It's still the only DC which shows the correct pages for my site and the only one which shows a realistic number of pages for it too, no supplementals or url-only - full titles and descriptions. I want that one!
>> Now we can all dance in the streets! Google I love you long time! Big Daddy is FINALLY LIVE!
Don't get too excited. I have a feeling BigDaddy's results will change dramatically very soon. IMO
Can someone please explain what this means to someone who deosn't watch datacenters? Is this an update? I haven't seen any changes in my referrals.
>>>>I only love 126.96.36.199! <<<<
Yes but look how old the cache dates are. I see some as old as last june.
Not an update yet. Have a feeling this is stage two of the new infrastructure with phase three coming with the bigdaddy spreading to all data centers with a deep crawl following that and then....the update.
I think we will know it happens when caches update first.
>>> Yes but look how old the cache dates are. I see some as old as last june.
That seems to be a stretch... Cache dates seem to be about 7-10 days old, although some are current.
frakilk if you are right, which I think you may be, I would like to know how it turns out for you. I will do likewise.
>> Yes but look how old the cache dates are. I see some as old as last June. <<
Whaaaaa? June 2005? Is that all?
That's nothing. I have some stuff from December 2003 still there!
<Yes but look how old the cache dates are. I see some as old as last june.>
The canonical issue first hit my site on June 12th - my non-www home page is still cached as of that date.
Three of my sites had 120-190,000 pages indexed (site: query) ... now down 65-85%. Many competitors are also down (or up) by a large margin.
I also note that URLs further from the homepage don't appear with as much detail as before ... and that my rankings are lower than usual.
Strangely, my traffic is actually up, and my analytics show it coming from the same mix as usual.
I'm guessing these are also symptoms of BD changes ... but as the BD DC's differ from eachother, and the G SERPs fluctuate from showing BD to not, it's rather hard to be sure.
When y'all say definitively that "BD is showing on [insert location] right now [insert specific time]" ... how do you know? I'm new to the datacenter-watch habit, and for now it's over my head (unless you're referring to similar things as I am).
I am also seeing that using the site: command too. I have seen page count fluxuate huge on the test datacenters day to day, up or down 10,000 or more pages. Hope that settles down
| This 173 message thread spans 6 pages: < < 173 ( 1 2  4 5 6 ) > > |