Quadrille

msg:3617035 | 12:08 pm on Apr 2, 2008 (gmt 0) |
site: operator, like most webmaster searches, is not and never has been very reliable. Use webmaster tools. Or Yahoo! ;)
|
BillyS

msg:3617071 | 12:48 pm on Apr 2, 2008 (gmt 0) |
I don't think it's a "bug" only proof that Google is constantly testing / tweaking their index and / or their estimating logic for the site: command.
|
reseller

msg:3617255 | 3:38 pm on Apr 2, 2008 (gmt 0) |
BillyS Of course the other possibility is that the "site:" operator is functioning well, but the two data centers I mentioned contains different volume of data. As such we might expect [72.14.207.104...] to contain around 20% more data than [64.233.161.104...] . Having said that, I'm aware of what Matt Cutts wrote once in 2006 [mattcutts.com]: In the middle of that session, I talked about the frustration that modern data center watchers will encounter these days (because there are often slightly different things at different places) and I mentioned a slide from Boston Pubcon...... Can you imagine trying to monitor that, especially when the same IP address can query different data centers for different people? It wouldn’t be my preferred hobby. |
| [edited by: reseller at 3:51 pm (utc) on April 2, 2008]
|
tedster

msg:3617261 | 3:47 pm on Apr 2, 2008 (gmt 0) |
I don't think 72.14.207.104 really contains more data. I did a search for "the", one of the most common English words: 64.233.161.104 - 12.27 billion 72.14.207.104 - 12.63 billion In other words, they're just about the same size. I think BillyS has a good idea when he mentions "tweaking their...estimating logic for the site: command." With the current "flux" in Google, many webmasters have commented that thoe estimates. which had improved, have recently become less accurate.
|
reseller

msg:3617285 | 4:06 pm on Apr 2, 2008 (gmt 0) |
tedster and BillyS But that leaves us with the thought; which of the two DCs the folks at the plex are doing the tweaking on? because I can't imagine they are tweaking all over the place. I say [72.14.207.104...] in that case. However, we had witnessed high site: results problem before. And I wish to recall another interesting 2006 post [mattcutts.com] of Matt Cutts, were he mentioned the high site: results estimates - high site: results estimates. I believe that more accurate site: results estimates are live everywhere now. |
|
|
BillyS

msg:3617638 | 10:51 pm on Apr 2, 2008 (gmt 0) |
reseller - I do remember the problem back in 2006 because I believed it affected my site directly. Back then Google would show my site as having 10,000 pages where it really only had around 1,000. I felt, perhaps wrongly, that Google thought I was spamming their engine because I went from 980 pages to over 10,000 overnight. I believe that some of the dialog that took place here caused Google to rethink the accuracy of the site: command. I'm also from the camp that these small observations - especially on such a relatively obscure query (one used a lot by webmasters....) sometimes are fallout from larger changes behind the scenes. In other words, an unintentional change occuring from an intentional change. I also think this is why Matt is interested in these observations.
|
reseller

msg:3617653 | 11:02 pm on Apr 2, 2008 (gmt 0) |
BillyS I'm also from the camp that these small observations - especially on such a relatively obscure query (one used a lot by webmasters....) sometimes are fallout from larger changes behind the scenes. In other words, an unintentional change occuring from an intentional change. I also think this is why Matt is interested in these observations. |
| Agreed. Power to you! I'm beginning to think of a software update (infrastructure update) similar to BigDaddy might have been taking place during the last two weeks or so.
|
reseller

msg:3617822 | 6:55 am on Apr 3, 2008 (gmt 0) |
Just wish to mention that I have reported the current case of site: operator behavior to Google WebSpam Team as per Matt Cutts request [webmasterworld.com].
|
|