Forum Moderators: open
By accident, i've come to track one Google search: "
bbc site:bbc.co.uk" Oct 12, 2003 [webmasterworld.com]: 3,100,000 pages
Apr 09, 2004 [webmasterworld.com]: 823,000 pages
Jul 12, 2004 [google.com]: 696,000 pages
I don't believe that BBC has deleted about 80% of their pages in nine months. So:
A) given that Google will only display 1,000 pages in any case, is this decline in number of pages indexed real, or
B) is their "about 696,000" statement from the serps highly unreliable (in the Google API, this number is referred to as an estimate and not a count)
If (A) then: Is the limit PR or folder level (path length) or a combination? And, what PR or path lenght seems to be the threshold?
Any opinions?
Not sure, but I think nowadays Google also filters out the subdomains with this "site:" search funtion?
BBC 2002 [webmasterworld.com...]
<<added>> sorry that in part seems to be the case with "www" added to the query..
So, for the search:
site:bbc.co.uk (without keywords) it is: Oct 12, 2003: 3,100,000 pages plus (*est)
Apr 09, 2004: 1,350,000 pages
Jul 13, 2004: 1,210,000 pages
Which is not an 80% drop, but still more than 50%.
------------
(*est) Estimate, as the figure without the keyword would logically have to be higher than the figure with the keyword. As i recall, you couldn't perform that search without a keyword in October 2003