For those who may have missed this in another thread:
GoogleGuy: I'm expecting SJ results to show up at other data centers, then gradually over time we're going to pull in newer spam filters, backlinks, etc.
Critter: Is all the data from the last deepcrawl in -sj?
GoogleGuy: Um, I doubt it's all in there right now, Critter. I would expect more pages to be pulled in over time.
Dolemite: I'm taking it to mean that newer backlinks will eventually be factored in, so SERPs will change
GoogleGuy: You bet. Dolemite, I think there are still backlinks to be truly added in over time. My hunch is that as that happens, SERPs will change as a result.
albert: Is it possible to give us some rough time line?
GoogleGuy: albert, I just can't commit to any future dates. If you think about the sheer magnitude of data to be processed, that's a lot of data--easily several terabytes. That's not the sort of thing you process in a day or two. But we'll be going through it as quickly as we can.
I have only taken selected quotes above; see the whole thread for the full dialog. From this exchange, it seems safe to make the following assumptions (IMO):
1. -sj does not contain the complete results from the deep crawl.
2. Pages from the deep crawl will be added to the index in coming days.
3. As pages are added, backlinks will be added.
4. (At least some) SPAM filters have not been applied to the index yet. Thus -sj is currently more spammy than the final index.
5. Given the preceding 4 assumptions, the SERPs of the final index could be drastically different from what we currently see on -sj (IMO).
For my 2c:
www-sj is def different to www2 and www3. I have run numerous queries and can see vast changes. Things are still very much in flux as well: my results changed from yesterday to today in certain areas.
Good luck to all, but I suggest that we all settle down a bit before we collectively blow a gasket. Remember, we have all been asking for algo changes, assuming that they would help us... but what if they help other people more than us? It seems we sometimes do not consider that this might happen. Now that they are here, let's wait and see what happens before we condemn them.
We covered some of this in a summary about 9 hours ago: [webmasterworld.com...]
And the last time I checked the IPs I got:
Site      IP 9 hours ago     IP just now
www-sj    216.239.47.166     216.239.47.166
www2      216.239.47.166     216.239.47.166
www3      216.239.47.166     216.239.47.166
whereas:
www       216.239.48.242     216.239.47.2
www-ex    216.239.47.2       216.239.47.2
I can't see how you can be seeing different results on www-sj, www2 and www3 when all three resolve to the same IP?
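For anyone who wants to repeat the IP check above without doing it by hand, here is a minimal sketch of how it could be scripted. It only uses Python's standard `socket` module. The full hostnames in the usage part are my assumption (the posts above only give the short names like www-sj), and the helper name `group_by_ip` is made up for illustration. A single lookup can also return a different address from one moment to the next, so treat the output as a snapshot, not proof of which index a name is serving.

```python
import socket
from collections import defaultdict


def group_by_ip(hosts, resolve=socket.gethostbyname):
    """Return a dict mapping each resolved IP to the hostnames pointing at it.

    `resolve` can be swapped out (e.g. for testing); by default it does a
    normal IPv4 lookup via the standard library.
    """
    groups = defaultdict(list)
    for host in hosts:
        try:
            ip = resolve(host)
        except OSError:
            # socket.gaierror is a subclass of OSError; the name did not resolve.
            ip = "unresolved"
        groups[ip].append(host)
    return dict(groups)


if __name__ == "__main__":
    # Assumed full hostnames; the thread only mentions the short names.
    hosts = [
        "www-sj.google.com",
        "www2.google.com",
        "www3.google.com",
        "www.google.com",
        "www-ex.google.com",
    ]
    for ip, names in group_by_ip(hosts).items():
        print(ip, ", ".join(names))
```

Names that end up listed under the same IP are being answered by the same address at that moment, which is the comparison made in the table above.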
I said before: "THIS one is being built differently - and for the first time - we are getting an insight into how it is being built. Normally - we see it after it's built - we see it getting replicated. This time - we are seeing an index actually being built - ingredient by ingredient. Don't waste the opportunity!"
I agree with J_H_Maccan's 'world of continuous updates' hypothesis here [webmasterworld.com...]
Chris_D
Sorry - my last post was a little 'short' - I wasn't trying to humble you - or anyone! It's late, I'm tired, and I'm just trying to get a better handle on how this 'brave new world of continuous updates' is all playing out.
I've been watching IPs and datacentres and indexes and comparing results for hours. Then I saw your post & suddenly thought I'd missed something really really important!
Clearly now time for bed!
Best regards
Chris_D