These look like keywords related to my site, keywords that appear close together. I suppose it would be good for me to start targeting those keywords as well, but I wonder what the story is on these "clusters".
Reminds me of a Google Labs thing with "clusters", but it never made much sense to me.
>which methodology FAST is using
No, only FAST knows. Anybody can take that ODP dump and slice and dice it to fit their needs. Who knows what the guys at FAST do with it?
- I'd like to know though ;)
This has been implemented recently, or rather, I only noticed it recently when Alltheweb crawled most of my site (I did not even know I had a site 775 pages big, mostly old forum postings). I think there is an algorithm that checks which words tend to appear in "clusters" or groups within each site.
Some specific examples from my site:
year, camping, heard (3)
There is no way that these could have come from a DMOZ taxonomy, but FAST can find these three words in some of my pages, so this looks like an attempt to find a theme for the site. I suppose that the fewer clusters a site has, the more targeted (or narrow) its scope.
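To make the guess above concrete, here is a rough Python sketch of counting which three-word groups show up together on a site's pages. This is not FAST's method, which nobody here knows. The function name `cooccurring_triples`, the `vocabulary` of candidate words, the `min_pages` cutoff and the sample pages are all made up for illustration, and reading the number in parentheses as a page count is only an assumption.

```python
from collections import Counter
from itertools import combinations


def cooccurring_triples(pages, vocabulary, min_pages=2):
    """Count how many pages contain each 3-word combination from vocabulary.

    pages: list of page texts (hypothetical input).
    vocabulary: set of candidate keywords (hypothetical; a real system
    would need its own way of picking candidate terms).
    """
    counts = Counter()
    for text in pages:
        words = set(text.lower().split())
        # Keep only candidate words found on this page, in a stable order
        present = sorted(words & vocabulary)
        for triple in combinations(present, 3):
            counts[triple] += 1
    # Drop triples that appear on too few pages to suggest a theme
    return [(t, n) for t, n in counts.most_common() if n >= min_pages]


# Invented sample pages, only to exercise the function
pages = [
    "Last year we went camping and heard owls",
    "I heard the camping season opens early this year",
    "This year I heard about a new camping stove",
    "Forum rules and posting guidelines",
]
vocabulary = {"year", "camping", "heard", "forum"}
clusters = cooccurring_triples(pages, vocabulary)
for triple, page_count in clusters:
    print(", ".join(triple), f"({page_count})")
```

With this kind of counting, a site whose pages keep returning to the same few triples would end up with few clusters, which fits the idea that fewer clusters means a narrower scope.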
WebmasterWorld's clusters are:
yahoo, help & tutorials, promotion (6)
search engine submitting, internet, uk (5)
server side, internet, programming (4)
tips, resources, internet marketing (4)
google, search engines, searching (4)
registrering og positionering, internet, edb (4)
development, macintosh, systems (3)
cloaking, promotion, web design & development (3)
webmaster resources, web design, internet (3)
chats & forums, writers resources, arts (3)
log analysis, internet, software (3)
looksmart, grub, acquired (3)
webmasterworld forums index, search engines (3)
Take a look at:
registrering og positionering, internet, edb (4)
The exact same words appear in this ODP cat [dmoz.org].
Same goes for:
webmaster resources, web design, internet (3)
ODP category [dmoz.org]
Tells me there's some ODP stuff going on there :)
I'm quite certain that they use ODP.
If the cluster engine finds several pages in the results that are listed in, and linked from, the same ODP category, it is very likely that this information is used for the cluster. That does not necessarily mean that they use the ODP taxonomy directly. It may enter the calculation indirectly, and it is probably not the only method they use.
Btw, I think I have heard that they also use some semantic logic to build their clusters.
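The indirect ODP route described above, where result pages that share an ODP category lend that category to the cluster, could look roughly like the Python sketch below. It is pure speculation about how such a thing might be done, not a description of FAST's engine. The function `label_clusters`, the `odp_listing` lookup, the `min_hits` cutoff, the example URLs and the category paths are all invented for illustration and are not taken from the real ODP dump.

```python
from collections import Counter


def label_clusters(result_urls, odp_listing, min_hits=2):
    """Build cluster labels from the ODP categories of result pages.

    result_urls: URLs returned for a query (hypothetical input).
    odp_listing: dict mapping a URL to the category path it is listed
    under (hypothetical; assumes someone has parsed the ODP dump).
    """
    counts = Counter()
    for url in result_urls:
        category = odp_listing.get(url)
        if category is not None:
            counts[category] += 1

    labels = []
    for category, hits in counts.most_common():
        if hits < min_hits:
            continue  # too few listed pages to call it a cluster
        parts = [p.replace("_", " ").lower() for p in category.split("/")]
        # Use the three deepest levels, most specific first
        labels.append((", ".join(reversed(parts[-3:])), hits))
    return labels


# Invented listing data, only to exercise the function
odp_listing = {
    "http://a.example/": "Top/Recreation/Outdoors/Camping",
    "http://b.example/": "Top/Recreation/Outdoors/Camping",
    "http://c.example/": "Top/Computers/Internet/Searching",
}
result_urls = [
    "http://a.example/",
    "http://b.example/",
    "http://c.example/",
    "http://d.example/",  # not listed in the directory at all
]
labels = label_clusters(result_urls, odp_listing)
for label, hits in labels:
    print(label, f"({hits})")
```

A label built this way would carry the directory's own wording, which would explain clusters that match an ODP category name word for word, while pages outside the directory would need some other method.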