Forum Moderators: open

Message Too Old, No Replies

How many sites per pagerank toolbar value?

The new internet Fortune 5000? (pagerank 8 to 10)...

         

vitaplease

9:02 am on Apr 19, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Side tracking from Brett's value of websites according to PR. [webmasterworld.com] How about guestimates on actual numbers of sites of which the highest ranking page has a PR of:

10 - Three
9 - About seventy (see this overview of PR9 & PR10 sites) [webmasterworld.com]
8 - about 4.200
7 - about 180.000
6 - about 850.000
5 - about 2.500.000

I am also interested in ways/methods of estimating these numbers - any ideas? and how did you reach that number?.

The PR10 and PR 9 sites are easy. Just look for the highest ranking SERP's on stopwords [webmasterworld.com] add a search for "university" and "news" and check Google's/Dmoz rankings for major publicly traded companies. [directory.google.com]

One way of estimating the total number for the lower PR5 to PR8 sites is checking the above mentioned company rankings [directory.google.com] and estimating how many fall into which pagerank value. However these are listed companies and therefor more important.

DMOZ lists 383.000 categories so they claim. I would guess only approx 1 % of these categories carry one PR8 site on average – which would be 3.800 sites. (Science e.g. is a well represented category) – do not forget many high ranking sites list several times in different categories! Add some PR 8 sites not listed in DMOZ (unlikely, but say another 400).

Guessing the number of PR5, 6 and 7 sites is much more difficult. I recall Google saying they were using the most topical/interesting +/- 3 million pages for their Fresh label and that in general, in the beginning, most sites (but not all) had a minimal PR of 5 or 6 to qualify for this Freshness. Saying 40% of the PR5 and PR6 sites have Fresh and some have more than one Fresh page per site I estimated the above numbers.

Google claims to have indexed approx 2 billion webpages. Another estimate would be the number of pages per website. I would say 10. That would mean 200 million websites in the Goolge index. In an earlier thread [webmasterworld.com] we discussed if the real pagerank follows a log 6 of 7 scale. That should also allow for some guessing on the number of sites per toolbar pagerank digit – however in this case webpages are counted and not sites.

ciml

4:43 pm on Apr 19, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Good work, vitaplease. I take it that for PR5, you mean PR5 and below?

It seems bizarre to have only 3 PR10s. If there are no more it possibly indicates that the distribution is not just logarithmic (the base doesn't matter to this), or that the normalisation is wierd.

Either that or the zipf/pareto distribution needs log's of both scales to work. I think this makes sense, but I can't explain or justify math's on Friday afternoons...

vitaplease

4:06 pm on Apr 20, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Ciml,

If the three PR 10 sites: Google, Adobe and Apple have proportionally way more links to them then the rest of the PR9 sites would it not be normal to have only three in the PR 10 league?

The problem is, Normalisation is something I remember I forgot 20 years ago..

With PR5 I mean only PR5. You think 2,5 million is way to high?

ciml

4:17 pm on Apr 20, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Indeed, it just strikes me as interesting that the front runners seem to be so far ahead. If that is the case then I expect there to be a larger proportion of very low PR pages than a typical log distribution would suggest.

It's hard to speculate on the curves when we can't get a representative sample, but the affects that you describe may well be to do with the increase in Google's index. The first billion URLs are likely to contain the most well-linked URLs, so the second billion must contribute much more rank source than rank sink.

<added>Sorry, I read the PR5 as being 2.5 billion, hence PR5 and below. 2.5 million sounds sensible, but it feels too low.</added>

muesli

4:39 pm on Oct 25, 2002 (gmt 0)

10+ Year Member



nature.com has made it to the PR10 club. (i would have posted it in the overview thread but it's locked.)

aek

4:58 pm on Oct 25, 2002 (gmt 0)

10+ Year Member



Theres is a lot more than 3 PR 10 sites around. i.e gammett.com, macromedia.com, ox.ac.uk, yahoo.com, w3.org, sun.com, real.com, netscape.com, lycos.com.

There's probably about 25 or so.

bridge98

5:04 pm on Oct 25, 2002 (gmt 0)

10+ Year Member



Hi, with regards to PR10, you can add Macromedia. Strangely www.linux.org is greyed out?
P.S. As the quote is 2.5 million, would I be correct in saying: "I'm quite proud to have two PR5 sites"?

jimh009

8:47 pm on Oct 25, 2002 (gmt 0)

10+ Year Member



ODP is also a PR10

[dmoz.org...]

Jane_Doe

9:09 pm on Oct 25, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



nasa.gov has a PR10.

MOOSBerlin

9:38 pm on Oct 25, 2002 (gmt 0)

10+ Year Member



Here [webmasterworld.com ] they have called 21 PR-10 pages, may be, it could be 50!

ScottM

9:43 pm on Oct 25, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I'm kinda curious as to why this thread has a page rank of 2?

All the rest I check are PR5?

Hmmmm...do we have a 'poison' word here?

lgn

3:06 am on Oct 26, 2002 (gmt 0)



Rumor has it, the spinaltap website has a PR of 11.

Hawkgirl

8:43 pm on Oct 29, 2002 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Rumor has it, the spinaltap website has a PR of 11.

I was searching for an update to this thread for a few reasons... however, having iced tea fly out of my nose was not one of them.

Thanks for the laugh. :)