Forum Moderators: open

Message Too Old, No Replies

Why does the 'Google Lag' exist?

Trying to understand its purpose.

         

bakedjake

1:43 am on Sep 29, 2004 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



I had some in-depth discussion this weekend with some friends about the sandbox. Every theory on how to beat it kept coming back to one central problem - no one is sure why it exists.

I feel very strongly that until we have a good grasp on why it exists, it will be very hard to beat.

I don't buy the explanation that it's intended to be a method of stopping spam. Why? One, there's too much collateral damage it is doing. Two, if you accept the 80/20 principle (20% of spammers are doing 80% of the spamming), and you realize that there are multiple ways already of beating the sandbox that all of those spammers are aware of, it doesn't make sense anymore.

So, why does the sandbox exist?

The most obvious effect of the sandbox is that it prevents new domains (not pages) from ranking for any relatively competitive term. So, start thinking like a search engine - what would be the benefit of this?

bakedjake

2:29 pm on Oct 1, 2004 (gmt 0)

WebmasterWorld Administrator 10+ Year Member Top Contributors Of The Month



The sandbox is a PR flaw, nothing else - when PR is updated, loads of problems will get sorted - I think.

Explain this - how can you even speculate that PR is the cause?

renee

2:38 pm on Oct 1, 2004 (gmt 0)

10+ Year Member



>>One way to beat the sandbox would be to have high PR.

looks like google does not calculate pr for sites/pages in the 'box' index. if so, this does not make sense.

>>The other sure shot way is placement in every relevant category of DMOz.

this has already been proved wrong by the previous posts.

>>the calculation of the query time importance score

pages in the 'box' index don't appear in normal queries so query time is not meaningful.

again, it's fun to postulate complicated theories but it's really a simple case of google indigestion - that a separate index is used to hide the fact that g has reached a capacity limit - and its is not due to some pea-brain reason like running out of disc space!

just ask yourself how your new sites are behaving compared to your supplemental pages.

the good news - it's been over a year of g multiple indices, so (i hope) they are close to a solution. every once in a while, i see supplementas mixed in with the normal serps. this must be an indication that they are testing.

BeeDeeDubbleU

2:48 pm on Oct 1, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



The sandbox is a PR flaw, nothing else - when PR is updated, loads of problems will get sorted - I think.

Some of the the "new" sites where I am seeing this problem are seven months old. There have been two or three PR updates during this period but the problem is still apparent

renee

4:23 pm on Oct 1, 2004 (gmt 0)

10+ Year Member



pr has noting to do with the sandbox. sites/pages in the sandbox are not included at all in the pr calculations. this "sandbox" index has the following characteristics:

- "sites/pages" are in the google index. yes, a separate index, just like supplemental pages.
- sandboxed pages show up in the google serps for non-competitive keywords (meaning low number of results) but won't show up for competitive terms. that's because g goes to the other indices (supplemental, sandbox) only if the number of results is low.
- sites/pages do not have any pr (or pr=0) since they are not included at all in the pr calculations. backlinks also will not show(?)
- only way a site gets out of the sandbox and into the main index is if room is created when old sites are dropped. we see a lot of "my site is gone from google" posts in this forum.

A big question is how google chooses which site leave the sandbox and enter the main index:

- the posts here indicate it is not age related
- definitely not pr since no pr is calculated
- quality of sites? spam metrics?

we can't seem to find a pattern and probably supports the theory that the selection process is a random process. this is probably the fairest way in the absence of a rational algorith.

subway

4:31 pm on Oct 1, 2004 (gmt 0)

10+ Year Member



>>> Explain this - how can you even speculate that PR is the cause? <<<

Ok, my explanation ...

- New sites and new pages are still indexed and ranked within 1 week - That's a fact. Sometimes they're ranked very, very well if they are *not* money terms.

- If you want your site to rank well for the money terms you need a strong page rank - a strong 5 or above normally does it. Once you've achieved a high PR, all the old tricks work like they always did.

- Because page rank has not been updated for so long, it's created the illusion of a "sandbox". I think webmasters on these forums always without fail give G much much more credit than they are due - it's not some amazing sandbox algo tweak we're witnessing, just an intentionally delayed page rank update.

renee

4:41 pm on Oct 1, 2004 (gmt 0)

10+ Year Member



>> New sites and new pages are still indexed and ranked within 1 week - That's a fact. Sometimes they're ranked very, very well if they are *not* money terms.

So do supplemental pages! That has nothing to do with PR.

>> If you want your site to rank well for the money terms you need a strong page rank - a strong 5 or above normally does it. Once you've achieved a high PR, all the old tricks work like they always did

Of course! Once you have PR, it means that you are out of the sandbox and included in the main index. Lack of PR is just a symptom that you are in the "sandbox" index!

pmac

5:16 pm on Oct 1, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



>Once you have PR, it means that you are out of the sandbox and included in the main index.<

No it doesn't.

jaina2

5:33 pm on Oct 1, 2004 (gmt 0)

10+ Year Member



So I don't think DMOZ is necessarily the magic pill you suggest

Its actually the G directory your site should appear in. The last time the sites that came out of the box, were the ones that were picked up by the G directory update aroud April. Sites came out in May.

And dont forget the category. A regional listing is not going to be very effective.

renee

5:34 pm on Oct 1, 2004 (gmt 0)

10+ Year Member



>>No it doesn't.

PMAC,

there is absolutely no way you can prove your statement. why? because there is no way for you to prove that your site is in the sandbox!

pmac

5:41 pm on Oct 1, 2004 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



>prove that your site is in the sandbox! <

Site with loads of decent inbounds, TBPR of 6, scores allinanchor in the top 5. Nowhere to be seen for any query worth a damn. Proof enough for me.

This 354 message thread spans 36 pages: 354