An interesting read on the task at hand every second for Google:
Over four billion Web pages, each an average of 10KB, all fully indexed.
Up to 2,000 PCs in a cluster.
Over 30 clusters.
104 interface languages including Klingon and Tagalog.
One petabyte of data in a cluster -- so much that hard disk error rates of 10^-15 begin to be a real issue (a quick arithmetic sketch follows this list).
Sustained transfer rates of 2Gbps in a cluster.
An expectation that two machines will fail every day in each of the larger clusters.
No complete system failure since February 2000.
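A quick back-of-envelope sketch (Python) showing how those numbers hang together. Everything in it comes from the list above, except two labelled assumptions: the error rate is interpreted as per bit read, and a per-machine MTBF of roughly three years is assumed only to show how "about two failures a day" falls out.

# Back-of-envelope arithmetic for the figures quoted above.

PAGES = 4e9              # "over four billion Web pages"
AVG_PAGE_BYTES = 10e3    # "an average of 10KB" per page
corpus_bytes = PAGES * AVG_PAGE_BYTES
print(f"Raw corpus size: {corpus_bytes / 1e12:.0f} TB")        # ~40 TB before any index structures

CLUSTER_BYTES = 1e15     # "one petabyte of data in a cluster"
BIT_ERROR_RATE = 1e-15   # assumption: 10^-15 read as unrecoverable errors per bit read
errors_per_sweep = CLUSTER_BYTES * 8 * BIT_ERROR_RATE
print(f"Expected bit errors per full read of the cluster: {errors_per_sweep:.0f}")  # ~8

MACHINES = 2000               # "up to 2,000 PCs in a cluster"
ASSUMED_MTBF_DAYS = 3 * 365   # assumption: ~3-year mean time between failures per PC
failures_per_day = MACHINES / ASSUMED_MTBF_DAYS
print(f"Expected machine failures per day: {failures_per_day:.1f}")  # ~1.8, i.e. "about two"

In other words, at this scale disk errors and machine deaths are not rare events to be debugged; they are a routine, daily cost that the system has to be designed around.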
It is one of the largest computing projects on the planet, arguably employing more computers than any other single, fully managed system (distributed computing projects aside), along with some 200 computer science PhDs and 600 other computer scientists.
This has always been, IMO, the best way to approach understanding a system as large as Google's: the individual components are very simple. Believing that they do, or can do, very complex things, or use very complex elements, makes understanding the system almost impossible. However, there seems to be a certain group of posters here who, day in and day out, insist on believing otherwise.