Forum Moderators: open
Mayby this BlockRank is the main reason why things are weird right now?
The conclusion of the paper is:
"We have shown that the hyperlink graph of the web has a nested block structure, something that has not yet been thoroughly investigated in studies of the web. We exploit this structure to compute PageRank in a fast manner using an algorithm we call BlockRank. We show empirically that BlockRank speeds up PageRank computations by factors of 2 and higher, depending on the particular scenario.
There are a number of areas for future work: finding the "best" blocks for BlockRank by splitting up what would be slow-mixing blocks with internal nested block structure; using the block structure for hyperlink-based algorithms other than web search, such as in clustering or classification; and exploring more fully the topics of of updates and personalized PageRank."
Full paper in pdf format
[stanford.edu...]
Incredible read and they had a press release about it as some of the funding came from the national science foundation.
:) It's nice to see such cool research being published that will help, imho, search engines advance.
We now present the BlockRank algorithm that exploits the
empirical findings of the previous section to speed up the
computation of PageRank. This work is motivated by and
builds on aggregation/disaggregation techniques [5, 17]
and domain decomposition techniques [6] in numerical linear
algebra.
Speeding up the calculations must be one of the main concerns within Google.
[edited by: msgraph at 3:07 pm (utc) on May 20, 2003]
www.widget.com gets blockranked with all it's pages
www.widget2.com gets blockranked with all it's pages
and then the whole pagerank algo starts off with these values (saving time)
I would have thought that such a drastic change would have needed considerable testing before attempting to use it on real data. I suspect "Block Ranking" may be something they want to do, but I wouldn't expect to see it for several months.
But, who knows, some people ride by the seat of their pants!
It's about time i'll contribute something so here it is
Algorithm tweaks could boost Google's speed article:
[newscientist.com...]
Hope this helps
BroadProspect
Have to say I'm not getting to concerned about the personalised results any time soon, but I'm not looking forward to the time when we have different SERPs for different people. We will all be setting up multiple generic profiles to see how well we are targetting various groups. Oh boy.
I think the most significant point is the idea that the pagerank calculation is being improved so much - fits in with a more fluid update policy really.
Oh, and how long before we get the 'How do I check my sites BlockRank?' threads? ;)
Google's translation of the article: Researchers want to accelerate Google [translate.google.com]