Nothing really new, but nicely summarized and technically detailed information on how Google responds to a query.
It's a partial list of different papers that Googlers have published. There are enough technical papers there to overload most SEOs, but that page is like honey to pull in great engineers. :)
The question I have always had, and if someone could shed some light: Up until now, were the various data center index servers all querying the same document servers?
This forum has high PageRank, right? And we all *know* that if you get a link from a related site that is high in PageRank, it will benefit you in the SERPs, right?
GoogleGuy, I never thought I would see you doing SEO - and in public!
Do you feel bad now or what? If I see that page riding high in the SERPs for 'money keywords', what will happen if I spam report you? lol.
webmasterworld nick: jeremy_goodrich
subject: GoogleGuy link dropping
message: I saw him do it - right out in the open!
trying to inflate the PR of Google.com, as if it's
not high enough!
Expect the action on the SERPs to be found shortly, lol.
Thanks for the link seriously. :) One more thing on the evening 'to do' list.
Now I am dreaming about finding a paper breaking down the list of those 100 variables :)
On a more serious note, this massively parallel computing technique using commodity Intel boxes could very well be extended beyond the web to other compute-intensive but stateless fields like genome mapping, crash test simulation (auto companies use expensive Cray supercomputers for this), SETI-like projects, etc.
But it's very unsuitable for big database applications like finance/payroll (this is a killer money-making area where Sun/HP/IBM servers rule!)
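To make the "stateless" point concrete, here is a minimal sketch of the scatter-gather pattern. It is only a toy and not a description of how Google actually does it: the shard data, `search_shard`, and `search` are all made-up names, and the threads merely stand in for separate commodity machines. The point it tries to show is that each shard answers on its own, without needing anything from the others, so the work splits cleanly.

```python
from concurrent.futures import ThreadPoolExecutor

# Hypothetical toy data: each "shard" is an independent slice of a tiny
# inverted index mapping a term to the document ids that contain it.
SHARDS = [
    {"google": [1, 4], "cluster": [4]},
    {"google": [7], "paper": [7, 9]},
    {"cluster": [12], "paper": [15]},
]


def search_shard(shard, term):
    """Look up one term in one shard; needs no state from any other shard."""
    return shard.get(term, [])


def search(term, shards=SHARDS):
    """Scatter the query to every shard in parallel, then gather and merge."""
    with ThreadPoolExecutor(max_workers=len(shards)) as pool:
        partials = pool.map(lambda s: search_shard(s, term), shards)
        # Merge the partial hit lists into one sorted list of document ids.
        return sorted(doc_id for part in partials for doc_id in part)


if __name__ == "__main__":
    print(search("google"))
```

A payroll-style workload does not split this way, because transactions have to see and update shared, consistent state, which is roughly the contrast the post is drawing.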
The part that I personally like the most is this: this page has been up for a little while. At the same time, some article quoted another SE rep saying "Google never publishes any papers now; they squirrel away their knowledge" or something like that. The juxtaposition was a little humorous to me, at least, esp. given this page and the details in the IEEE paper. I'm not aware of any other search engines publishing papers like that lately. :)
Oh well. People knock on Google sometimes. If you just keep doing what you know is right, things seem to work out just fine. :)
As for knocking Google, rest assured that if you achieve success, people will knock you down or try. Take it as a compliment. No sense letting it bother you. If it *doesn't* happen then it means you're nobody or you're doing something wrong :)
P.S. Have you ever heard of a database called "R"? I always wondered if Google ever used tools like that which are great for processing batch data like PR should be, or if it was 100% custom made.
I was once looking to try out R for a project and searched Google (a couple of years back) but couldn't find it! I had a team of people search for it and finally found it. But while speaking I just did a search on "r" (not even adding the word database) and it was the first SERP! So I guess you folks have made some progress in the intervening time! I mean, how much harder can it be to find a page than using one letter search?
I am an engineer by profession, with a Bachelor's rather than a Master's.
Would love to one day work in Google. :)
Sorry moderator if I crossed TOS.
After all, Google is the best-known corporation in the whole world.
I doubt anyone can match your popularity worldwide.
What interested me about the papers section is the Genetic Algorithm and Artificial Intelligence approach. It's truly remarkable that Google has somebody who did some research in this area. This approach coupled with Evolutionary Algorithms becomes the next century's science, called Complexity... my favourite area. To learn more about complexity visit www.santafe.edu, the institute opened by Nobel laureates.
My point is that Google has an outstanding resource pool of engineers, going by the papers alone.
I am really happy that Google has such a wide variety of people among its resources.
It seems like they not only are looking for great engineers but also can use a new webmaster/SEO specialist as well ;)
Anyway, great resource; this must definitely keep everybody out of the current whining-and-exiting threads about the movement on SJ and other datacenters...
Matt Cutts mentioned at Pubcon that it should probably have been 101 kb.
I've seen quite a few of these papers already from the Stanford repositories though. They also publish a lot of Google-related things.
Besides John Koza and more recent papers based on his work, Google-related publications are my favourite fodder :)
Now I can compare my own search engine and database engines to see how close I got to the Google system. I'm a big fan of clustering and am still dreaming of the day I can buy a few dozen old PCs to try out my clustered GP algos :)
What do you do with the previous generation PCs? Give 'em all to unis? Where can I apply for a "hardware grant" hehe...
Seriously, working at Google is a bit like dying and going to heaven... Nothing left after that; after all, it seems the place to develop and make real ideas.
Keep up the good work, I'll get back to you when I'm done reading the papers ;)
John Koza and GP algos :).
If John Holland invented the genetic algorithm, then John Koza brought it to life at Stanford.
That's the match I have been looking for for a long time, mate.
>How close to Google System
Same here, I am trying at a smaller scale though.
Do report here about your computation time and results.
>>>Seriously, working at Google is a bit like dying and going to heaven... Nothing left after that; after all, it seems the place to develop and make real ideas.
Very well said, KillRoy. :)
damn you Google, another sleepless night...
>>I bet you can build a kick-ass automatic text categorizer using genetic programming, much more accurate than Bayesian networks.
Perhaps the best search engine, one that evolves on its own with the web, can be built using GA. That's like an Adaptive Search Engine for me.
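For anyone who has not played with GAs, here is a minimal sketch of the basic loop (selection, crossover, mutation). To be clear about what it is: a plain bit-string genetic algorithm on the toy "OneMax" problem (maximise the number of 1 bits), not Koza-style tree-based genetic programming and nowhere near a text categorizer or a search engine. The function names and parameter values are just assumptions picked for illustration.

```python
import random


def fitness(bits):
    """OneMax toy fitness: the number of 1 bits in the individual."""
    return sum(bits)


def evolve(n_bits=20, pop_size=30, generations=40, mutation_rate=0.02, seed=0):
    """Run a small elitist GA; return the best individual and fitness history."""
    rng = random.Random(seed)
    pop = [[rng.randint(0, 1) for _ in range(n_bits)] for _ in range(pop_size)]
    history = []
    for _ in range(generations):
        pop.sort(key=fitness, reverse=True)
        history.append(fitness(pop[0]))
        # Selection: the fitter half survives unchanged (elitism).
        elite = pop[: pop_size // 2]
        children = []
        while len(elite) + len(children) < pop_size:
            mum, dad = rng.sample(elite, 2)
            # One-point crossover between two surviving parents.
            cut = rng.randrange(1, n_bits)
            child = mum[:cut] + dad[cut:]
            # Mutation: flip each bit with a small probability.
            child = [b ^ 1 if rng.random() < mutation_rate else b for b in child]
            children.append(child)
        pop = elite + children
    pop.sort(key=fitness, reverse=True)
    history.append(fitness(pop[0]))
    return pop[0], history


if __name__ == "__main__":
    best, history = evolve()
    print(fitness(best), history[0], history[-1])
```

Because the elite half is carried over untouched, the best fitness never drops from one generation to the next. Swapping the bit strings for program trees and the fitness for classification accuracy is where the GP text categorizer idea would start, and that is the hard part.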