Any ideas?
However, please do realize that a lot of small changes have been added to the PR calculation since that document was written.
Turning to the technology developments that Google has planned for the future, Sullivan asked Brin to elaborate on the work that goes into the constant development of the famed PageRank system. Brin said that it was still very much an important part of Google's ranking system and that more than half a dozen new ranking technologies are tested each month, with roughly half of these being integrated into Google's PageRank algorithm.
Source: Inside Search Engine Strategies, San Jose - Day Three [seotoday.com] Aug 21, 2003
BTW, why should you worry about PR if the contents of the page cannot be indexed?
1) Page A ---> Page B ---> Page C
2) Page A ---> Page B
If Page A has a link to Page B, which in turn links to Page C, that is obviously the top diagram. If nothing links to Page C, that is obviously the bottom diagram.
But what happens if Page B has a link to Page C AND Page B has a NOFOLLOW metatag AND Page C has a NOINDEX,NOFOLLOW metatag? Robots would not see a link between Page B and C, but the HTML would. So in this case, does the PR flow according to robot crawling behavior (=bottom diagram) or pure HTML links (=top diagram)?
2. If Page C has no links to it other than from Page B, then it doesn't matter whether Page C has a NOINDEX,NOFOLLOW metatag or not, because the page will not be spidered by bots like Googlebot that obey the NOFOLLOW metatag on Page B. For Google, Page C doesn't exist (there are no known links to it) and therefore Page C has no PageRank.
3. Page B is only linked from Page A, and Page A lacks the NOFOLLOW metatag, so Page B receives a PageRank that is related to the PageRank of Page A. There is no transfer of PageRank in the sense that the PageRank of Page B is incremented at the expense of the PageRank of Page A.
4. Page B lacks a NOINDEX metatag, so it will be indexed and it can be found in the SERPs (Search Engine Results Pages) for a relevant but not too competitive query.
5. Because there is a NOFOLLOW on Page B, no other page will get any PageRank from Page B. So even if a link to Page A were added on Page B, Page A would not benefit from that link.
6. Page A has a PageRank due to its inbound links (see point 1), but could have had a higher PageRank if Page B and Page C both linked to Page A AND there were no NOFOLLOW and/or NOINDEX on Page B and Page C.
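To make the points above concrete, here is a rough sketch of what a crawler that obeys the robots metatags would see for the three pages in the question. Everything in it is an assumption for illustration: the `html_links` and `robots_meta` tables, the `crawl` helper, and the idea that the bot reaches Page A from some outside link. It is a toy model of the behavior described in this thread, not how Googlebot is actually implemented.

```python
from collections import deque

# Hypothetical site from the question: raw HTML links and robots metatag directives.
html_links = {"A": ["B"], "B": ["C"], "C": []}
robots_meta = {"A": set(), "B": {"nofollow"}, "C": {"noindex", "nofollow"}}


def crawl(start, html_links, robots_meta):
    """Simulate a bot that obeys NOFOLLOW and NOINDEX metatags.

    Returns (discovered pages, links the bot followed, indexable pages).
    """
    discovered = {start}
    followed = []
    queue = deque([start])
    while queue:
        page = queue.popleft()
        # A page-level NOFOLLOW means none of its links are followed.
        if "nofollow" in robots_meta.get(page, set()):
            continue
        for target in html_links.get(page, []):
            followed.append((page, target))
            if target not in discovered:
                discovered.add(target)
                queue.append(target)
    # NOINDEX only matters for pages the bot actually reached.
    indexable = {p for p in discovered if "noindex" not in robots_meta.get(p, set())}
    return discovered, followed, indexable


# Assumption: the bot arrives at Page A via some external inbound link.
discovered, followed, indexable = crawl("A", html_links, robots_meta)
print(sorted(discovered))  # ['A', 'B']
print(followed)            # [('A', 'B')]
print(sorted(indexable))   # ['A', 'B']
```

In this model the bot ends up with the bottom diagram: Page C is never discovered, so its own NOINDEX,NOFOLLOW never comes into play.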
My two yen.
We assume page A has pages T1...Tn which point to it (i.e., are citations). The parameter d is a damping factor which can be set between 0 and 1. We usually set d to 0.85. Also C(A) is defined as the number of links going out of page A. The PageRank of a page A is given as follows:
PR(A) = (1-d) + d (PR(T1)/C(T1) + ... + PR(Tn)/C(Tn))
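For anyone who wants to play with the numbers, the formula above can be iterated on a small link graph. This is only a sketch of the published formula with d = 0.85; the `pagerank` function, the starting value of 1.0, and the two toy graphs are my own assumptions, and it says nothing about the later changes to Google's real calculation. Page A has no inbound links in these toy graphs, so it just gets the (1-d) baseline.

```python
def pagerank(links, d=0.85, iterations=50):
    """Iterate PR(A) = (1-d) + d * sum(PR(T)/C(T)) over all pages T linking to A.

    `links` maps each page to the list of pages it links to;
    C(T) is taken as the number of outgoing links listed for T.
    """
    pages = set(links) | {t for targets in links.values() for t in targets}
    pr = {page: 1.0 for page in pages}  # arbitrary starting values
    for _ in range(iterations):
        new_pr = {}
        for page in pages:
            inbound = sum(
                pr[src] / len(targets)
                for src, targets in links.items()
                if page in targets
            )
            new_pr[page] = (1 - d) + d * inbound
        pr = new_pr
    return pr


# Diagram 1: Page A ---> Page B ---> Page C
top = pagerank({"A": ["B"], "B": ["C"], "C": []})
# Diagram 2: Page A ---> Page B
bottom = pagerank({"A": ["B"], "B": []})

for name in sorted(top):
    print(name, round(top[name], 6))     # A 0.15, B 0.2775, C 0.385875
for name in sorted(bottom):
    print(name, round(bottom[name], 6))  # A 0.15, B 0.2775
```

Note that Page B ends up with the same value in both diagrams: under this formula, linking out to Page C does not lower Page B's own PR, it only decides whether Page C receives anything.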
Val