homepage Welcome to WebmasterWorld Guest from 54.211.68.132
register, free tools, login, search, pro membership, help, library, announcements, recent posts, open posts,
Become a Pro Member
Home / Forums Index / Google / Google Desktop Tools and Google Labs Projects
Forum Library, Charter, Moderator: open

Google Desktop Tools and Google Labs Projects Forum

This 65 message thread spans 3 pages: < < 65 ( 1 2 [3]     
PageRank Indicator for Linux and Mac OSX
new extension for Mozilla browser displays PR
MarshallClark




msg:1100014
 7:39 pm on Jun 9, 2004 (gmt 0)

I just read about a new extension for the Mozilla FireFox browser that allows you to view PageRank on Mac OSX and Linux systems. Here's what's involved:

1. Download FireFox browser from [mozilla.org...]

2. Install the GoogleBar Mozilla extension from [googlebar.mozdev.org...]

3. Install the PR indicator extension from [toolbar.nickstallman.net...]

The PageRank indicator bar will appear right next to the PageInfo button on the GoogleBar when you restart the browser.

 

cheater512




msg:1100074
 6:06 am on Jun 15, 2004 (gmt 0)

LOL. Nice one trillianjedi. :)

cheater512




msg:1100075
 10:27 am on Jun 18, 2004 (gmt 0)

Ok I have a new server, new site and new version up.

Check [prgooglebar.org...] for details.

If you have any privacy concerns please post them in the forums there so everyone can see them.

At the moment it is not being logged. I will be putting in hostname logging soon tho. The full URL will not be logged.

Alex_T




msg:1100076
 6:45 am on Jun 28, 2004 (gmt 0)

Actually, the checksum algorithm was made available and can thus be used by other linux- and macox-based apps. See [mobileread.com...] and specifically here: [mobileread.com...]

bcolflesh




msg:1100077
 12:40 pm on Jun 28, 2004 (gmt 0)

Now that is some interesting reading - of course, G is now scrambling to change the algo and put out toolbar updates...

Scarecrow




msg:1100078
 2:27 am on Jul 6, 2004 (gmt 0)

You're correct about Sony, jgstyle, but times have changed. In the US, you've got the Digital Millenium Copyright Act (DMCA), and Australia undoubtedly either has or is in the process of implementing similar legislation. Reverse-engineering of this kind (if it is indeed the case - I can't tell at the moment) is clearly illegal.

Google has no case whatsoever if they try to use the "hacking" portion of the DMCA against anyone who fetches PageRank. That's because Google is using a public domain hash program written by Bob Jenkins in the mid-1990s. The algorithm was explained in an article in Dr.Dobbs magazine in September 1997. The code is on Jenkins' site [burtleburtle.net] and it's clearly marked as public domain, free for the taking, for any use whatsoever.

There's an "initval" in the code that seeds the algorithm. Google's initval is 0xe6359a60. This is one of two things that are specific to Google in the code. The other is that they stick the word "info:" in front of the URL before hashing it. That's no secret either, since they also do this in the QUERY_STRING that phones home.

The initval probably sticks out like a sore thumb when you decompile the toolbar (I wouldn't know because I don't have a decompiler). That would mean that Google is making no effort to hide anything. It's only used because the code as written defines an arbitrary initval.

Google's toolbar is about as "secret" as Alexa's Traffic Rank. If you look at Alexa's page, they've put all sorts of garbage HTML tags between the digits of the ranking number, which is a hilarious attempt to conceal it from any screen-scraper programmers whose careers are less than two weeks old. This too would not fly under the DMCA, because it is simply not a serious effort by Alexa.

The applicable laws, therefore, are copyright laws. "Fair use" comes into play. If you are noncommercial, if your scraping is primarily for monitoring the social role and functioning of search engines, if the traffic from your scraping doesn't load Google's or Alexa's servers (not possible, compared to all the toolbars out there that phone home with every new web page seen by the browser), then Google or Alexa cannot stop you through their legal department. All they can do is block you or your server.

In Google's case, they could write a new hash algorithm. They could also change the initval, but that would get discovered very quickly, and requires only a tiny change to all the scraper programs that are now out there. I've got my own working nicely, and will put it online in a couple of weeks if the PageRank portion keeps working. I'm also scraping Alexa's Traffic Rank and Yahoo's rather thorough external backlink count in the same program.
( link:http://www.example.com/anypage.html -site:www.example.com )

I don't think Google will bother this time around. No one takes PageRank too seriously anymore. The chances are better that the PageRank indicator will disappear entirely with the next toolbar update, and the "phone home" lookup discontinued. That would be a good thing for Google to do anyway.

The only thing keeping it going is Google's ambition to profile everyone with their immortal cookie, and their use of the toolbar to find new domains for crawling, and maybe some Alexa-style traffic-tracking that we don't know about. Without the PageRank indicator, none of this can be done as effortlessly, because then they cannot justify the "phone home" feature of the toolbar.

If you can't justify it, that means it more clearly gets labeled as "spyware" -- which, of course, it was since day one, in December 2000. The only difference between bad spyware and acceptable spyware is having a good cover story. That's the real function of the PageRank indicator on the toolbar.

This 65 message thread spans 3 pages: < < 65 ( 1 2 [3]
Global Options:
 top home search open messages active posts  
 

Home / Forums Index / Google / Google Desktop Tools and Google Labs Projects
rss feed

All trademarks and copyrights held by respective owners. Member comments are owned by the poster.
Home ¦ Free Tools ¦ Terms of Service ¦ Privacy Policy ¦ Report Problem ¦ About ¦ Library ¦ Newsletter
WebmasterWorld is a Developer Shed Community owned by Jim Boykin.
© Webmaster World 1996-2014 all rights reserved