| This 35 message thread spans 2 pages: < < 35 ( 1  ) || |
|Google's Knowledge Graph Demonstrated with 'Bacon Number'|
| 8:48 am on Sep 14, 2012 (gmt 0)|
Google has demonstrated its knowledge graph with 'Bacon Number'
|To use Google's system, the user simply types in the words "Bacon number" followed by the name of the actor. By way of example, typing "Bacon number Simon Pegg" reveals that Bacon and the British actor are linked by Tom Cruise, because the latter appeared in 1992's A Few Good Men with Bacon and in 2006's Mission: Impossible III with Pegg. Pegg therefore has a Bacon number of two, indicating two degrees of separation. |
Lead engineer Yossi Matias said the project was about showcasing the power of Google's search engine by flagging up the deep-rooted connections between people in the film industry. "If you think about search in the traditional sense, for years it has been to try and match, find pages and sources where you would find the text," he told the Hollywood Reporter. "It's interesting that this small-world phenomena when applied to the world of actors actually shows that in most cases, most actors aren't that far apart from each other. And most of them have a relatively small Bacon number."Google builds Six Degrees of Kevin Bacon into its search system [guardian.co.uk]
[edited by: Robert_Charlton at 8:11 am (utc) on Sep 15, 2012]
[edit reason] fixed Google search link [/edit]
| 2:00 pm on Sep 17, 2012 (gmt 0)|
i think google's PR department is the best in the world.
when a 10-year old website does something, no one cares. no one even pays any attention. but when google comes along and does exactly the same thing everyone starts talking about natural language processing and quantum computers. and when panda came out i remember people drooling about artifical AI and machine learning, like google's computers had a mind of their own.
even the littlest change (and lets face it, this bacon number thing is even less than little) gets puffed up so much that it suddenly becomes the future of search.
whatever they are paying their PR people, its not enough!
| 5:41 pm on Sep 17, 2012 (gmt 0)|
londrum - don't misunderstand me. I am not at all convinced that Google is the one who is going to get this right. I am 100% convinced that
1. Their survival as a search engine depends on getting this right
2. They know that
3. This is fundamental to their long-term strategy
4. They are willing to pour serious resources into making sure they stay out ahead in this type of computing.
That said, their capabilities are clearly rudimentary and it's unclear how much of an advance this is. I should temper all of my comments with a "if they are doing it the way they say they are doing it" caveat.
Even if it turns out this is just a focussed algo that only calculates a Bacon Number (which, frankly, I don't believe for a second because there's no ROI for that for Google, so I can't see why they would bother), this still tells us something fairly big.
Think back to our perceptions of Google from, say, 5-7 years ago. We tended to think of Google as a company who was always going to seek an automated solution that scaled with computing technology and did not depend on scaling with human minds working on problems. Their model was to have a few brilliant minds develop automated tools and leverage that.
Look at where we are now. The Bacon Number Calculator almost certainly depends on data that has been scrubbed by humans and is a small subset of what's in the Knowledge Graph (whatever the hell that is). Google is pouring huge amounts of labor into their maps, both the data, the technology and the human labor to massage that data and cross-check it with the real world (if you missed the article in The Atlantic from last week, it is a must-read: [theatlantic.com...] ). They are buying structured data as Brotherhood of the Lan pointed out. So I'm seeing this in the context of a wider effort and see the Bacon Number Calculator as the tiny peek we're getting. It's possible that you're right and there's no wizard behind the curtain, but other signs and efforts from Google make me believe provisionally that there is some powerful stuff behind the curtain and for all sorts of reasons (SEO, privacy rights, data monopolies, "data infrastructure fail points" - a term I just came up with) this all bears serious watching.
To me, all of this adds up to Google placing big bets on being able to see relationships not just between web pages, but out in the real world as well. It is a move from a few guys and a lot of computers, to a company that is a lot of computers and a lot of people hand-feeding data into those computers.
Why all the hub bub because it's Google? Well, because there are only maybe three companies in the world that have the technology and the cash on hand to mount an effort like this (Google, Apple, MS). MS either isn't much of a player, or it's keeping quiet. Apple is a player (think Siri). They have the cash and they have the access to lots of user data that Google doesn't have. What they don't have is the search technology.
That leaves Google in the lead. Take away their phones (think patent infringement lawsuits) and all that user data and suddenly things look good for Apple. Let Google have its phones and I don't see anyone else winning this race. Someone could appear tomorrow with a technology that just crushes this problem, but in my opinion, Google will break the bank to buy that technology because if Apple gets it, Google is dead. On the other hand, if Google gets it, I think Apple is still viable (assuming the loss of Steve Jobs isn't fatal in any case).
Even if someone comes up with the killer algo, by putting all this energy into data that simply can't be obtained just by crawling the web, Google actually puts itself in a position where it can fall behind a bit on algorithmic technology and still have the data juggernaut to make up for that. It's scary how much data Google has and how rapidly they're increasing the size of that data set, but it's also impressive.
I know that's long but I could sum it up briefly by saying
1. Yes, Google's PR is all over this because it is vitally important that they be perceived as the leader here.
2. The resources are pouring in because it is, in fact, vitally important that they stay or become the leader in this because their long-term survival depends on gettting this right for way more complex cases than Bacon Numbers.
3. No matter how it shakes out, the Knowledge Graph effort shows that Google is willing to pour actual human labor into something at very high levels. To me, that is a valuable insight into shifts in the Google ethos.
Like them or not, they are still the biggest player in the room and when Google sneezes, we all catch colds.
| 7:15 pm on Sep 17, 2012 (gmt 0)|
i prefer to stick with that old "Occam's razor" thing... that the simplest answer is probably the right one.
and seeing as this has already been done a decade ago, the simplest answer is that it's just more of the same, with a bigger database of cast lists.
the fact that google appears to be returning exactly the same results as the ten-year-old website when there's more than 70,000 different possibilities to choose from (...when any one of them would have been equally correct, remember), is pretty good proof that not a lot has changed under the hood.
| 8:03 pm on Sep 17, 2012 (gmt 0)|
the "razor" is about cutting assumptions, not complexity.
| 12:51 am on Sep 18, 2012 (gmt 0)|
Everything ergophobe said..especially
|To me, all of this adds up to Google placing big bets on being able to see relationships not just between web pages, but out in the real world as well. |
is what they are going for..
|Google will break the bank to buy that technology because if Apple gets it, Google is dead. On the other hand, if Google gets it, I think Apple is still viable (assuming the loss of Steve Jobs isn't fatal in any case). |
Said as much to Ted in a thread a while back..G does not fear MS, or Amazon, G fears Apple...especially Apple getting into search properly, which they are moving towards..
| This 35 message thread spans 2 pages: < < 35 ( 1  ) |