Forum Moderators: Robert Charlton & goodroi

Message Too Old, No Replies

"Real" pages vs. Supplemental Pages

         

freelistfool

4:51 pm on Nov 14, 2006 (gmt 0)

10+ Year Member



Am I correct in assuming that the difference between these two commands are the number of pages in the "real" G index?

site:www.mydomain.com
This returns 1500 pages

site:www.mydomain.com -inurl:www
This returns 1000 pages

So the difference is 1500 - 1000 = 500 real pages.

Is that correct? If so, is there a way to just see the pages in G's primary index?

Thanks in advance for your help.

daveVk

12:06 am on Nov 15, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



To determine the figures I would do the following 3 seaches

site:www.mydomain.com
inurl:www.mydomain.com
site:www.mydomain.com -inurl:www.mydomain.com

site: normaly returns real+sup, but dont trust it.
inurl: normaly returns real only, but dont trust it.
third should give sups only, but dont trust it.

So do all 3 and see if figures tally. Note inurl: will return pages containing www.mydomain.com anywhere in the url.

Welcome to forum

g1smd

12:25 am on Nov 15, 2006 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



For a site that should have all pages listed as www, I use these:

site:domain.com
site:www.domain.com
site:domain.com -inurl:www

It is important to do each one both with, and without, the &filter=0 parameter and for each search to look at both page 1 (1 - 100) and page 10 (901 - 1000) of the results (I use the &num=100 parameter too).

Additionally, I do sometimes do all of those searches at several datacentres, because there can be differences.

You need to be very familiar with the site, and you need to know which URLs return 200, 301, 404, etc, in order to work out what is happening.

Don't be caught out by "historical supplemental" results, for pages that have been reindexed after content has changed. They can show in both indexes.

freelistfool

1:46 am on Nov 15, 2006 (gmt 0)

10+ Year Member



Thanks. That sorted it out for me.

Halfdeck

5:02 am on Nov 15, 2006 (gmt 0)

10+ Year Member



Now I got at least one supplemental page temporarily back into the main index after rewiring the internal links on the site plus PageRank bombing a couple of other urls on the site from external domains I own four days ago. The ex-supp page's cache is dated Nov 13, 2006. The page's been supplemental since spring 2006. It's a 100% original blog post with no meta description/title or cannonical issues whatsoever, just no page strength (blog root TBPR 2) and high percentage of outbound, largely unreciprocated links with over 70% of the total PageRank leaving the site.

EDIT: Hell, this post was meant for the Supplemental Skinny thread.

[edited by: Halfdeck at 5:10 am (utc) on Nov. 15, 2006]