Forum Moderators: open

Message Too Old, No Replies

Google Archive

A What if fantasy scenario

         

Clark

11:40 pm on Mar 7, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



If google one day opens up their archive similar to archive.org, the idea of a web search would take on a whole new set of dimensions. I don't know if they ever plan to do this, but just as a scenario, they could for example use a once monthly snapshot of the web since they started collecting data. Then they could calculate the PR for the full database and you could search for information across the history of the internet.

While Archive.org is cool, not much to talk about in the way of search.

This brings up many thoughts and ideas.

Can you imagine how long the update process would be? And how complicated the PR calculation?

Say domainb.com only made it into the update march 2003 but links to domaina.com which is out of march 2003 but in feb 2003, does the PR passthrough?

eraldemukian

12:55 am on Mar 8, 2003 (gmt 0)

10+ Year Member



Hello,

google has indeed something terribly valuable: a snapshot of the
web at any give time. Already trimed down to the essentials of max
100K html/text per page.

But I am afraid that they simply overwrite it.
Just guessing, not knowing.

Future archeologiests will hate them for that:
At some point people will have enough storage and computing power that they could run their own analysis on these old data sets.
But, if its gone its gone.

The web is not really about history. Which is unfortunate, since it could be. Takes just a little bit more care.

Maybe google could spend some of the money that they make to
pile up even more data in form of tapes or something for future generations?

[Which is expensive: storing stuff on harddrives is cheaper then to save it on high speed data tape like Sony DTF for instance]

"The google museum"

or one other nice thing to do with your wealth, if you happen to be a successful (!) internet startup that owns a copy of the web.

Clark

1:03 am on Mar 8, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



I heard (here) that they keep copies of everything.

vitaplease

3:33 pm on Mar 9, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Who knows maybe ther're teaming together with:
National Digital Information Infrastructure and Preservation Program (NDIIPP) [webmasterworld.com]

Shak

3:43 pm on Mar 9, 2003 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



But I am afraid that they simply overwrite it.
Just guessing, not knowing.

NO Way, Google are Like Steptoe & Son. (collect, collect, and even more collect)

Shak

(naturally GG is the son)
[home.achilles.net...]