Welcome to WebmasterWorld Guest from 188.8.131.52
At that time, I was beginning to write my university thesis/dissertation about the Google search engine, and WebmasterWorld was more useful than any library or bookstore on the planet for my research on that subject (a search for "google" I did on February 28, 2002 on Amazon.com gave zero results): this board was, and still is for me, an invaluable source of information, ideas, and clever insight.
In November, 2002 I got my "laurea" degree in Communication Sciences at Bologna University.
Four years later, I've finally managed to convert my thesis/dissertation from .DOC to .html, and published it at [tesi-google.it...] (under a Creative Commons license).
Note to mods: Brett encouraged me to post that link here.
Even though my thesis/dissertation is in Italian, I hope you will be able to find some useful links in the footnotes or bibliography. This is my way of "giving back" to this community, and saying thank you to Brett for creating such an awesome place.
About the translation: I really wish I had the time to translate my thesis into English. :(
If anybody is interested in translating it and/or publishing it, however, please feel free to stickymail me (my thesis is licensed under Creative Commons). You can also request to be a translator for my thesis at Tesionline [tesionline.com], by submitting a translation of the first 15 pages. If approved, your (complete) translation will then be published by Tesionline, and will become available for download in PDF format. Tesionline will share the download fee for each translated copy between me and the translator (FAQs here [tesionline.com]).
About "Data Refresh": back then (2002), I simply wrote that Brin and Page and other reserchers at Stanford improved crawling efficiency by introducing techniques that allow Googlebot to order its crawl starting from the most important URLs (citations: Cho, Garcia-Molina, Page, Efficient Crawling Through URL Ordering [scholar.google.com] and Arasu, Cho, Garcia-Molina, Paepcke, Searching the Web [scholar.google.com]). I also quoted this interview with Eric Schmidt [pcworld.com] (from PC World, January 30, 2002):
We are working on algorithms to detect which sites are having high traffic or high page rank or high change rates. We want to make sure those pages are as current as possible.
[edited by: Giacomo at 4:57 pm (utc) on July 29, 2006]