Forum Moderators: bakedjake
I was wondering whether any search engine owners (I know some visit this forum) had done this type of thing before, how was it done?
As for how it is done, I am not familiar with the technical aspects, but from a higher level, clustering is just identifying commonality. I have seen (enterprise) clustering done on things as simple as the file system directory and on things as abstract as 'topical themes' (like Vivisimo). Interestingly enough, the simpler solutions seemed to get more usage.
Sorry I don't have any answers for you, but hopefully this gets your thoughts rolling again.
-bB
After running my search engine for the past 2 years and looking at the query history it's a little scary how bad people search. Very rarely are people searching for an answer to something vs searching for everything about something.
For example a lot of people just search for "boat" "car" "mortgage" and simple terms like that. Clustering on by default either by sorting the results in your serps or presenting them graphically would save these people lots of time by creating categories of results that match what they're looking for.
BTW, google isn't hard to compete against because they don't graphically cluster, they're just so branded into everyones mind and huge as a business that the little search engines that do more than advertising and marketing technologies don't get much "face time"
google isn't hard to compete against
Send your resume/CV to billg @ microsoft.com immediately :)
Perhaps clustering may help, however people who cluster do not use their own search engines, they are in effect just presenting other search engine's output differently and I see no point to use some site that will produce similar results to the search engine that I use anyway.
People got used to simple text searching - it is very hard to abandon habits and this is why clustering search engines have big problem of acceptance.
I'd prefer search engine to index twice as many pages rather than cluster existing.
Send your resume/CV to billg @ microsoft.com immediately :)
Are you a politician?
Google does cluster, they just don't represent it graphically. The algo they run and the process that happens behind the scenes clusters the results to build a standard output page.
I said google isn't hard to compete against because they do or don't "visually" cluster, but because they're a behemouth of a brand ;)
Clustering, Stemming, Ontollogy are all parts of the semantics used to create an understandable & searchable index of the web. While google may minimize the impact of these technologies i'm betting it does happen.
Are you a politician?
Not sure about this - when I make promises I firmly intend to keep them, depending on where you live it may or may not be integral part of politicians ;)
Google sure clusters, but not necesserily clusters by topic -- they cluster by domain for fast site: searches, they cluster data by similiarity, but do they cluster by page topic? I don't think so - there were a few recent examples of split screens that seem like coming from different topics, but this is done (now) very rarely. I am actually in favour of that type of mild clustering where its matters, ie when good matches fall into very distinct topics.
Not sure about this -
Just asking because you were good at cherrypicking my comment hehe
when I make promises I firmly intend to keep them, depending on where you live it may or may not be integral part of politicians ;)
good man!
Google sure clusters, but not necesserily clusters by topic -- they cluster by domain for fast site: searches, they cluster data by similiarity, but do they cluster by page topic? I don't think so - there were a few recent examples of split screens that seem like coming from different topics, but this is done (now) very rarely. I am actually in favour of that type of mild clustering where its matters, ie when good matches fall into very distinct topics.
I'm not sure how that differes from visually representing these as clusters of the search vs pretending they're "natural" serps.
While mozdex isn't anywhere near the scale of google yet, try a search with clustered enabled and let us know how/why that would deterr from the overall experience.
It would be just as easy to remove the side display and show them as natural results expanded from the query if we wanted to as well.
But yes.. clustering won't really make or break someone these days. It's really a technology inherant of a large search engine and almost unknown to most users. However some people like to be visually navigated through related searches and that is where some of the niche engines really excell.
cherrypicking my comment
ehehe, a politician has to be careful about making comments ;)
I'd say mozdex clustering is pretty reasonable - its an addition to the main listing and your results pages are pretty clean, I'd say well done!
I suppose the reason clustering is not picking up is due to most people using non-clustering search engine and clustering alone is not good enough, especially considering that many clustering search engines are merely meta-searchers (not mozdex).
Clustering is nice, but I'd rather index twice as much data or improve anti-spam algorithms: clustering or not, but if the stuff I want is not in Top 10 then I am going to rephrase my query, it is faster to type it for me than to click mouse on one of the clustered groups.