Forum Moderators: open

Message Too Old, No Replies

Pagerank Question - Swapping Links

If a PR 4 swaps with a PR 4, wouldn't it just cancel out any benefit?

         

rise2it

8:28 pm on Aug 26, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Example:

If a PR4 site swaps links with aanother PR4 site, wouldn't it just cancel out any benefit? Would either (or neither) get a pagerank increase?

Views/Opinions?

(Please no references to the Stanford Google Pagerank paper, for those of us who do not have the IQ's to absorb it!)

dougmcc1

9:07 pm on Aug 26, 2003 (gmt 0)

10+ Year Member



I don't think it works like that. There are a lot of other factors that are considered besides just PR. For example, if there are 100 links on the page they are linking to you from but there are only 5 other links on the page you link back to them from then obviously that wouldn't be equal. You would pass more PR to them then they would pass back to you.

Also, the stuff they put in their anchor text is probably going to be different than the stuff you put in your anchor text, and what your site is about is probably a bit different, or completely different than what their site is about. So the link relevence is going to be different as well.

But all things being equal, I still think two identical sites linking to each other in an identical manner would still benefit each other. And to take it a step further, even if they did cancel each other out then you should take into consideration whether or not it will benefit your visitors, which is what you should be considering in the first place.

ukgimp

8:40 am on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



dont worry about TBPR and any cancelling out. What you want is good anchor text from the other site and you can reciprocate that.

The anchor text will do you no end of favours so make sure it says

"widgets blue" and not "click here".

John_Creed

9:00 am on Aug 27, 2003 (gmt 0)

10+ Year Member



No they don't cancel out any benefits.

newsphinx

9:16 am on Aug 27, 2003 (gmt 0)

10+ Year Member



Just my two points.
I think PR is an award of your hardwork for your contribution to the internet by internet users, simply manifested by links indexed by Google. Since new sites keep appearing and if one links to a new site and loses PR, there will be no new sites be found. The internet will be dead.

James_Dale

11:17 am on Aug 27, 2003 (gmt 0)

10+ Year Member



There is no problem with this. Sites absolutely cannot lose PR by linking out.

This is probably the most common misconception that does the rounds in these forums.

dirkz

3:59 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



rise2it, what are benefits?

A reciprocal link means traffic. Sometimes we all forget that good old traffic stuff. Before there were any search engines we just linked to show our visitors other useful stuff, and so we sent this way traffic.

I'm assuming two pages, A and B, linking to each other and not to any other page. Both have a PR of 4.

As far as the PR algo goes (I won't cite it :), page A linking to B definitely leaks PR. Because that link is reciprocated, B also leaks PR to A, so in the end they remain at the same level.

If both have different PR, the PR flows a little bit from the higher PR to the lower, very simple. If each page also links to other pages, it gets more complex, because they leak more or less to other pages as well. Reciprocal linking is no special thing, it is covered in the simple basic PR formula without exception.

So in the example the PR will stay the same in this simplistic example. But the main benefit is traffic AND the message in the anchor text, signalling google that the page at the other site is relevant for "widget".

2_much

4:27 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



As Daniel from Google would say, How does it benefit your visitors? (at SES that's all he would say about the different optimization techniques).

Really, you want to link out and get links in, but most important than the actual link is the anchor text, make sure you get your keywords in there.

Gus_R

4:29 pm on Aug 27, 2003 (gmt 0)

10+ Year Member



Whatever scenario presents:

page A gets a vote from B >> A increases pr
page B gets a vote from A >> B increases pr

'leaks' only when you talk about pr distribution.

Gus

dirkz

4:39 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



We're always talking about PR distribution :)

page A gets a vote from B >> A increases pr

B decreases pr

page B gets a vote from A >> B increases pr

A decreases pr

doc_z

5:19 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



As recently explained [webmasterworld.com], the benefit for the two sites (i.e. summing up all real PR of all pages of both sites) is neglectible. The increase of PR is only due to the devaluation of other external links. For the case that the transferred PR to other sites isn't effect (due to the additional links), the total PR of the two sites isn't changed. Of course, the distribution might be changed.

Of course, there are other benefits apart from PR.

Statements like "Sites absolutely cannot lose PR by linking out." are wrong.

Gus_R

5:33 pm on Aug 27, 2003 (gmt 0)

10+ Year Member



A decreases pr

not true

Only lose pr possible pages A could link in same site and don't by linking external B.

nick420

5:41 pm on Aug 27, 2003 (gmt 0)

10+ Year Member



Statements like "Sites absolutely cannot lose PR by linking out." are wrong.

This may be correct if you define loosing PR as not distributing it as effectively as possible to other internal pages. But I think the statement "A page absolutely cannot lose PR by linking out." is certainly correct.

Please correct me if I am wrong.

dirkz

5:49 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Sites don't have PR. PR is only given to single pages/URLs. So yourdomain.com and yourdomain.com/index.html and yourdomain.com/index.shtml are three different pages/URLs with different PR for google. I think this is very important to understand.

Every page that has a certain PR before linking to another site, then links to another site, will lose some PR (all other things remaining equal).

So the sentence:
"A page absolutely cannot lose PR by linking out."
is wrong.
It will always lose some. But of course, if the linked page links to another page, which links to another page, which links eventually back a.s.o., this leak can be very minimal.

I don't want to bring in the PR algo, I promised it :)

doc_z

5:49 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



A page absolutely cannot lose PR by linking out

This statement isn't correct too. In any normal case(*) not only the PR of the other pages of the site but also the PR of the linking page is decreased.

(*) normal case: the page has also some internal links (i.e. it isn't a dead end when all external links are removed)

nick420

6:21 pm on Aug 27, 2003 (gmt 0)

10+ Year Member



dirkz... maybe I need you to bring in the algo :).

Just so I get this straight:

If I have two "dead end" pages with the identical inbound links from identical external pages, they will have the same PR. Then if on one of my two "dead end" pages I link to 20 external sites, that page will now have a lower PR then my other page?

I thought Google considered a link as a vote. I don't recall ever voting and having to give up something for the priveledge (insert your comment here about wars to protect freedom / forefathers / etc.). Doesn't this discourage voting?

mil2k

6:46 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I had to do this! ;) From the original Pagerank Paper (emphasis mine) :-

Note that the PageRanks form a probability distribution over web pages, so the sum of all web pages' PageRanks will be one.

So now if you give a link to another site, someone somewhere loses a tiny bit (or probably more) of his Pagerank. Now it depends on your web graph to finally determine if you lose some PR or not. These are just my views.

Having said that and as pointed by innumerous others in this thread give more importance to Anchor Text than PR. If google increases it's web index then you would generally see a drop of PR across Board.

[edited by: mil2k at 6:49 pm (utc) on Aug. 27, 2003]

doc_z

6:47 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



nick420,

I'm always referring to 'normal' (realistic) situations (as mentioned in msg#15), i.e. link pages that have also internal links. Of course, in the case you mentioned, PR is not decreased. However, this is only be true because you compare with a situation where PR is wasted (due to the dead link scenario).

doc_z

6:53 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



Note that the PageRanks form a probability distribution over web pages, so the sum of all web pages' PageRanks will be one

One have to add that this statement is only correct if dead end pages are removed.

dirkz

7:21 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



nick420, what exactly do you mean by dead-end pages? Pages with no outgoing links, only inbound ones?

doc_z, what do you mean by "pages with internal links"?

Now I throw it in, the algo :)

PR(A) = (1-d) + d (PR(T1)/C(T1) + ... + PR(Tn)/C(Tn))

(search for page rank explained on google)

A is the site for which to calculate the PR, T1...Tn are sites linking to A. C is the number of links on the given page T1..Tn.

The algo is very simple, we can for the moment omit the unnecessary stuff:

PR(A) = PR(T1)/C(T1) + ... + PR(Tn)/C(Tn)

That's it! So the PR of every page in the system is calculated from the PR of the linking pages, very simple. The sum of all PR is always the same. If the number of pages grows and you don't get more inbound links (you just stay the same with your page), you lose PR, because your relative share gets smaller.

From the formula above and the fact that the sum is always the same, you can deduct that every time you link to another page, this page's PR gets higher, right? (look at the formula). And if any page in the system grows in PR, other pages' PR will decrease, because the sum stays the same. Does that make sense?

So if you link out, you will always enable other sites to increase PR, and the logical consequence is you loose some.

I'm not saying that you should not link! In fact, you know that you win far more than the little PR you lose. You get traffic, and eventually even the lost PR comes back.

In order to grasp this, it's a good idea to just build a simplified net. Think of two or more pages that link in different sorts to each other, and for the moment assume that there PR is fixed at first (e.g., it's 1 or whatever). And then you can calculate the PR of one page through the PR of the linking one, and so on. Google has to do this about 50 times I guess, until all PRs don't move anymore :)

Hope that helps (as I said, search for "page rank explained").

dirkz

7:23 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



But I think all this PR is not very relevant in everyday webmaster life. To put it short: Just link as much as you can, purely inbound links are better than reciprocal ones, reciprocal ones are better than no links, no links are better than links from "bad neighborhood" :)

dirkz

7:26 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



nick420, you are absolutely right, voting is of course somewhat discouraged. This is one of the reasons a PR9 site won't easily link to your PR2 site (except it has really high quality fresh contenet), because it will lose a lot, and you will gain a lot :)

nick420

7:39 pm on Aug 27, 2003 (gmt 0)

10+ Year Member



nick420, what exactly do you mean by dead-end pages? Pages with no outgoing links, only inbound ones?

dirkz... yes that is what I meant. Not that I would ever have any "dead end" pages. I was really just trying to simplify my question. Apparently I just made it more complicated with an unrealistic scenario.

Thanks for all the info on the algo.

doc_z

7:46 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



dirkz,

"dead-end pages" are pages with no link on it; "pages with internal links" are pages which link to other pages of the site.

The sum of all PR is always the same.

The total PR is N=the number of pages (if there are no dead end pages).

If the number of pages grows and you don't get more inbound links (you just stay the same with your page), you lose PR, because your relative share gets smaller.

That's not correct. The real PR in unaffected by the total number of pages. However, the ToolbarPR is probably related to the page with highest real PR and therefore the ToolbarPR normally (in realistic scenarios) depends on the total number of pages.

In order to grasp this, it's a good idea to just build a simplified net. ... Google has to do this about 50 times I guess, until all PRs don't move anymore :)

The number of iterations strongly depends on the algorithm used. For the original mentioned Jacobi algorithm, this is a valid guess. However, state of the art algorithms are much faster.

matuloo

7:52 pm on Aug 27, 2003 (gmt 0)

10+ Year Member



Let me join the debate with mny understanding of the PR distribution.

Imagine a site with 1 index page and lets say 10 content pages. Index site has a PR of 4 for example, while there are 10 links on the index page pointing to the content pages and all 10 content pages link back to the index page.

Now you add an external link on the index page pointing to a completely different site, the PR of of the index page as such is not influenced at this moment but now there are 11 links on it which means that the content pages are getting a smaller portion of the PR thus their PR is decreased a bit and at the same time they return a smaller PR back to the index page which actually decreases the PR of the index page a little bit. So if you look at it this way the PR of the index page got decreased by placing a link on it.

On the other hand, if you gained a new reciprocal link at the same time, your PR might actually increase if the link is good enough to make up for the loss.

dirkz

8:17 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



matuloo, that's correct. That's what I'm trying to conveye here.

doc_z, you are probably right about the iterations.

By total number of pages, I mean the total number of pages in the WWW. I think there is a misunderstanding here, you mean the pages of a site. So you are right when you say that your PR in fact increases when you have more pages (on your site).

From the Brin and Page paper, the average Actual PR of all pages in the index is 1.0!

By "paper", Brins and Pages original publication is meant. So the PR of any given page of your site will decrease as the web grows bigger (assumed that the number of backlinks etc. for your page stay the same).

The google toolbar always gives you the PR of the page (URL) you are viewing, NOT the average of your site (except for that at the moment it behaves very weird :))

Dead-End pages don't complicate the computation, they are just a valid part on the Web. In fact, they make calculation easier (that is, for human beings) :)

Btw, the more pages your site has, the more PR altogether you can reach (because your potential share of the total PR of the Web is much bigger). It's up to you to distribute it from your index.html or whatever you have as main page. Pages of your own site (and links from them to a page of your site) count in the calculation just like any other page on the Web, it doesn't make any difference. So the more pages link to your main page, the more PR it will reach.

doc_z

9:16 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member Top Contributors Of The Month



dirkz,

of course, the average Actual PR of all pages in the index is 1.0. (assuming that there are no dead ends). And therefore the total PR is N=the number of indexed pages of the WWW.

So the PR of any given page of your site will decrease as the web grows bigger (assumed that the number of backlinks etc. for your page stay the same).

As already said, this isn't correct. The real PR (i.e. the PR of the formula you gave) of the page is unaffected (i.e. constant). The fraction of PR of the page to the total PR is only decreased (i.e. PR(A) is unaffected, PR(A) / total PR is decreased).

The google toolbar always gives you the PR of the page (URL) you are viewing, NOT the average of your site

Of course, the ToolbarPR is page related - no need to mention obvious facts. However, the toolbar is not showing the real PR. The toolbar shows in integer on a logarithmic scale. This scale is probably (not necessarily) related to the page with the highest real PR (e.g. by fixing this real PR with ToolbarPR 11.0). Therefore, a change in the total number of pages probably (i.e. in realistic scenarios) leads to a change in the ToolbarPR because it is rescaled (i.e. the same real PR corresponds now to a different ToolbarPR).

Dead-End pages don't complicate the computation, they are just a valid part on the Web. In fact, they make calculation easier (that is, for human beings) :)

I never said that dead end pages complicate the computation - I just said that dead ends lead to a decrease of the average PR (i.e. <PR> < 1 ). Also, in case of (originally) dead end pages you can link out without a decrease of PR, because PR was already wasted.

However, dead end pages can indeed lead to a complication. At least in the case when the calculation is done as computation of eigen vectors instead of an inversion of the transition matrix. For example, in most of those Stanford papers (e.g. by Kamvar et.al.) they are considering the calculation as the determation of eigen vectors. In this case dead end pages have to be treaten differently (i.e. they must be taken out of the calculation in the first step) and therefore are indeed more complicated.

Btw, the more pages your site has, the more PR altogether you can reach

Yes, according to the original PR algorithm you are 'producing' a PR of one when creating a new page. However, if you are studying the current implementation you'll find that Google has modified this algorithm significantly.

dirkz

9:29 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



doc_z, fully agreed :)


As already said, this isn't correct. The real PR (i.e. the PR of the formula you gave) of the page is unaffected (i.e. constant). The fraction of PR of the page to the total PR is only decreased (i.e. PR(A) is unaffected, PR(A) / total PR is decreased).

Ok, this is valid. But if your fraction of PR is decreased, then your "overall PR" decreases, doesn't it?

Maybe we have to keep real PR as you call it different from "PR share", which is then expressed on scale by the Toolbar. I think we talked about two different things.

Thanks for the discussion (what was the initial question?) :)

MikeNoLastName

10:06 pm on Aug 27, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Just my 2 cents...
You all are avoiding two things.
1. The dampening factor takes it's percent (supposedly 12-15%) right off the top at each link, so that PR5 is actually a PR4.25 (divided among the outgoing links of course) by the time it impacts the first set of linked sites and a PR3.61 at the next and so on. IMHO you DO lose some PR if for instance you otherwise only link back to your own pages. That one tenth (if you link to one other site and 9 links to your own) goes offsite.

2. It depends on whether off-site links are considered to have "infinite sucking power" as I like to call it. This appears to still be an argument in some circles. Anyone who has not studied the original Brin and Page thesis and actually run spreadsheet PR simulations of their own will probably not grasp this. Basically the difference, when running interations, is whether the PR5 coming into the site REMAINS a FULL PR5 (does not lose it's PR) with each successive iteration. Which of course is what this thread is all about.
A. If it doesn't, then definitely there is an obvious LOSS of rank to it with respect to the rest of the index, as the rank of that site is "transferred" to the destination site instead of its own site, and basically becomes an extension of the destination site.
B. If it DOES remain a full PR5 at each iteration, that means it pocesses this "infinite sucking power" (ISP?) and thus DOES NOT EVER lose PR. On each iteration it is still PR5 no matter how many iterations are run and EACH TIME TRANSFERS (PR5-dampening)/number-links to the other site! If it is true that the total PR of the web is equal to the number of pages, as the thesis claims, then it must be "sucking" more PR from itself and all sites linking to it. This leads to simulations I've run, with sites constructed in a precise manner, which result in some pretty interesting PR MULTIPLICATION phenomena with only a couple incoming links as the PR is imported from the rest of the net, which I don't think Google would permit.
This whole difference is discussed and argued in detail by a guy named Ian Rogers on a very nice website with examples so you can decide for yourself. His documented conclusion is that the calling SITE DOES lose PR by linking.

Since we cannot post URLs, those who are looking for a "lower IQ" version for explaining Pagerank which was requested earlier in this thread, might find a very good site by googling for Pagerank Explained Correctly with Examples.

Mike

berli

10:47 pm on Aug 27, 2003 (gmt 0)

10+ Year Member



deleted because it was already answered. I need some sleep.
This 32 message thread spans 2 pages: 32