plasma@billy:~$ for i in a b c d e f g h i j k l m n o p q r s t u v w x y z; do for j in a b c d e f g h i j k l m n o p q r s t u v w x y z; do dig www-$i$j.google.com; done ¦grep www- ¦egrep -v "^;"; done
www-ab.google.com. 60 IN A 188.8.131.52
www-cw.google.com. 60 IN A 184.108.40.206
www-dc.google.com. 60 IN A 220.127.116.11
www-ex.google.com. 60 IN A 18.104.22.168
www-fi.google.com. 41 IN A 22.214.171.124
www-gv.google.com. 60 IN A 126.96.36.199
www-in.google.com. 60 IN A 188.8.131.52
www-kr.google.com. 60 IN A 184.108.40.206
www-lm.google.com. 60 IN A 220.127.116.11
www-mc.google.com. 60 IN A 18.104.22.168
www-va.google.com. 60 IN A 22.214.171.124
www-zu.google.com. 60 IN A 126.96.36.199
P.S.: It is quite unstable at the moment (got a server error already :)
toolbar PR is not up to date
index has the same SERPs as -va for my keywords
[edited by: plasma at 12:09 am (utc) on Nov. 22, 2003]
As i said in the other thread, wtg plasma.
13 DCs..there musta be some mind-blowing data accumulated on these.
Nice discovery, Plasma. That line of code tries all the combinations of all the letters, eh? Very cool.
It's not working for me right now... I'm putting it on the list though.
Added: was typing in -im, doh, -lm is live. 26 squared = 676 combinations... I take it you're not on dial-up ;-)
[edited by: Stefan at 12:25 am (utc) on Nov. 22, 2003]
With 'tracert' it looks to be another Irish datacenter. In August [webmasterworld.com] 2003 Google started with 'gv' their first datacenter in Ireland, and in September [webmasterworld.com] they added 'kr'. Maybe 'lm' stands for 'Limerick'?
As for 13 DCs, there used to be another datacenter called 'sj' (San Jose; 216.239.35.*), but that one is not responding now.
[edited by: takagi at 1:38 am (utc) on Nov. 22, 2003]
Maybe they just moved sj to a cheaper provider ;)
A query send to Google ends up at one of the datacenters. With the huge number of queries sent to Google (in April it was 200 million searches a day), they need to do some load balancing. Preferably queries should be sent to one of the nearest datacenters to give a quick response to the user. At this moment there seems to be 3 clusters of datacenters: California (ex, [sj], in, cw, mc), Virginia/DC (va, ab, dc) and Ireland (gv, kr, lm).
So what does that mean to Google's users? Not that much, unless you are interested in what results are to be found at the different DCs. Sometimes one DC has already new PageRank information, or new back links.
[edited by: takagi at 2:12 am (utc) on Nov. 22, 2003]
Okay, I'll volunteer my services for an explanation of the datacentres, and their interest to webmasters, even though there are others here much more capable than me of doing it.
Google does not have one location for the servers that supply the Search Engine Result Pages, (serps). For reasons of a logistical nature, it is better to have a number of "datacenters" spread around the planet. They can be found by entering URI's such as www-in.google.com, www-ex.google.com etc.
Traditionally during the "dance", new serps would not appear in all the datacentres at one time, they would "propagate" across the various DC's. This could give a heads-up on what one could expect in the final version. Things changed in the Dominic/Esmeralda updates.
Now, it appears as though Google will use certain DC's for experimental "tweaks" of the algo that may, or may not, appear in the final "Index". Thus, the great interest by webmasters in what they see for the serps in the various datacenters.
Hope this helps. Others here could have given you a better explanation, but it is expected that new members will take advantage of the Site Search to learn these things for themselves before posting and this is why you might not have been getting the answers you wanted.
<edit>Takagi beat me to it... typed too slowly</edit>
And a datacenter is about 10,000 (or a bucket load of)computers, is that right?
|And a datacenter is about 10,000 (or a bucket load of)computers, is that right? |
I'm not sure, but I'd give just about anything for a shell on one of those computers :\
|I'd give just about anything for a shell on one of those computers |
$ telnet www-lm.google.com
Connected to www-lm.google.com
Escape character is '^]'.
Last login: Fri Nov 21 09:43:36 from 188.8.131.52
And what would anything be? :)
A pat on the back :D
I do find it amazing just how much the datacentres differ; it means that there is no one Google as such - there are in fact 12 Googles, each different.
I have also, with little success, been trying to put them in some sort of order. I had a theory that the datacentres now work in a string; -dc gets the results first and then passes them on to -gv and then finally on to -ex and -in, or something like that.
This would make sense because it would give them a week's worth of backups. If something goes wrong with new data arriving in -dc, they can simply revert to the older data from -gv or even -in.
Just an idea, and I am sure that someone here will have a better idea of the datacentre landscape.