Forum Moderators: Robert Charlton & goodroi
So: if your pages are still supplemental, feel free to write to sesnyc06 [at] gmail.com with the subject line of "stillsupplemental" (all one word), and I'll ask someone to check the emails out.
Hope that helps, and I'm glad that lots of people are seeing a full recovery,
GoogleGuy
[edited by: Brett_Tabke at 5:20 pm (utc) on Mar. 22, 2006]
Bigdaddy is a software upgrade to Google’s infrastructure that provides the framework for a lot of improvements to core search quality in the coming months
I understand that as saying that SERPs changes will be introduced on BD in the short term, and seeing our positions lately, I think its fair to infer that changes are already incorporated.
I think this is going to be the longest update saga in the history of long update sagas.
http:// www.mattcutts. com/blog/gone-supplemental/ (spaces added)
Some site owners over at WebmasterWorld have been discussing an issue where on Bigdaddy data centers, the site wouldn’t be crawled as much in the main index. That would result in Google showing more pages from the supplemental results for that site. GoogleGuy requested feedback with concrete details, and several people responded with enough details that we identified and changed a threshold in Bigdaddy to crawl more pages from those sites.I checked in that email queue tonight to see how the “gonesupplemental” feedback looked. I looked at an emergency responder site, a truck site, a ticket site, a karate site, a silver site, a T-shirt site, a site about memory, a site selling a type of document, a boating site, and a jewelry site. All were getting more pages crawled, and I expect over time that we’ll crawl more pages from these sites and similar sites that people mentioned. The biggest site that I saw had 711K pages reported, and I saw other sites with 40,400 estimated pages and 52,700 estimated pages for a site: search.
So the upshot is that if you’re one of these people who was paying attention to this issue, I think it has already improved quite a bit, and I would expect to see more pages indexed in the coming week or two. Some sites may see improvements earlier than others because of where a site happens to be in Google’s crawl cycle.
Matt Cutts has just posted a new thread dedicated to the "Gone Supplemental" issue.
Judging by this it was a wrongly set threshold that was causing the problem. But that could indeed be spin.
site:domain.com
site:domain.com -inurl:www
site:www.domain.com
to see which pages are listed as www and which are listed as non-www first.
Add &num=100 to the end of the Google search URL to get 100 results per page.
Next check how far you get before you see the Repeat search to show omitted results message, as that is giving you a clue that all pages NOT shown are classed as duplicates (just having the same title and/or meta description on multiple pages is enough for them to be hidden in this search).
Finally, click that link and look at all the listings. At 100 results per page, you'll only have 10 pages to look at for each search.
Of course, you need the site to have less than 1000 pages for this to be useful.
If the site is larger than that, then add folder names to the search to restrict it, like site:www.domain.com/widgets/ or exclusions using "-inurl", like site:www.domain.com -inurl:widgets.
It also seems to me that, thus far, Google remains entirely oblivious to these problems. None of Matt Cutt's feedback, for example, demonstrates any grasp of the actual problems. Instead they seem to go off on a tangent, assuming that the problem is simply a matter of sites not being crawled for example.
From all that I've read, and based on my personal experience, my guess is as follows:
1. All Big Daddy datacentres were kick-started from a December 2005 index.
2. Any pages that pre-date January 2006 are likely to have survived Big Daddy, although many have now been erroneously marked as "supplemental".
3. Any pages that are new since December 2005 are being completely ignored. The crawlers crawl them, but they fail to make it into the index at all.
4. These problems are present on ALL BD datacentres and absent on all non-BD datacentres. The bug is therefore somewhere in the Big Daddy code.
5. Nothing so far has managed to bring these serious issues to the attention of either Google, or the press. Unless this happens, this could theoretically go on for ever.
"pgaz, I’m not trying to minimize that this affects people. But some of this happens in the crawl/index cycle and I can’t force that to run differently. T2DMan, that was the best estimate I would have made at the time. There were some things about the Bigdaddy crawl/index cycle that I wasn’t aware of that made it take longer. I was in a meeting yesterday and re-emphasized that I thought it was important to get more pages from those sites crawled as soon as we could, because I know that this is stressful for the webmasters who were affected.
Steve, the Mozilla Bot is what fetches pages for the Bigdaddy data centers."
And therefore Big Daddy has a different calculation of BL/PR etc.
Has this been applied to the serps ranking though - I wonder.
[edited by: Dayo_UK at 5:22 pm (utc) on Mar. 23, 2006]
This is not true, i started after december and pages are indexed but dropped away to the bottom of SERP's since 8-3
4. These problems are present on ALL BD datacentres and absent on all non-BD datacentres. The bug is therefore somewhere in the Big Daddy code.
Rankings lost on all DC
5. Nothing so far has managed to bring these serious issues to the attention of either Google, or the press. Unless this happens, this could theoretically go on for ever.
Very true, Matt don't realize what is going on.
I heard today from a lot off other people that when they search they have to dig deeper in SERP's to find what they are looking for.(dutch and English results)
This people don't know SEO and just use the SE, they also said that this was not before the case.
Google is going to loose marketshare(and we loose income) if they don't take quick action and bring webmasters the appropiate info
We're ranked well for a particular KW but where they are pulling the title from has me completely stumped!
Normally its the META title, then the description is either from DMOZ, or the META, or from content.
In this case we don't even have the same words on the page thats listed, nor is it in the META tags.
Anyone have any ideas on where else they might be pulling this title from?
This seems to be exactly what's happened to me, thousands of pages since December, seems like none or very few in the index now, although heavy crawling.
Also the keywords for referals nowadays is very similar to pre December.
In my opinion the BD index is at least as good as it was prior to BD... I don't think the 'press' is going to write about the collapse of Google any time soon.
Also, I have, to date, seen many supplemental pages in my google travels... I can't think of a single one that I thought was put there wrongfully... In my opinion it would serve Google well to send MANY other auto generated 'filler' pages out to pasture.
Has it crossed anyone's mind that maybe this is just another, quite intentional, step in Google algorithmic evolution? Could it be time to reevalute your strategy too?
I sincerly hope the days of the 500,000 page two month old site is gone for good...
Hmmm. Nice sentiments, but somewhat wide of the mark I'm afraid. According to Google's own statements BD contains no algorithmic tweaks. Instead it is a fundamental change to their infrastructure, new bot, new indexes, etc. Trouble is, their new bot and/or index contains a serious bug that indiscriminately discards perfectly good pages (possible even all pages that are new since January-ish).
Yes this may well get rid of some Spam, but it will also get rid of all of the quailty content too. I guess you are just lucky that you aren't effefcted (yet). Maybe you should just count your blessings instead if coming over all smug?
Will they return? has anybodys page count started to increase?, mine has gone up by around 5 pages which isnt even 0.5% of whats missing
I just looked at a search that has been returning about 680 results from just those forums alone, and now it returns only about 250 results.
[google.com...]