
Home / Forums Index / Code, Content, and Presentation / Databases

Databases Forum

    
"Did You Mean" database
Where can I find one ..
getxb




msg:3521820
 6:23 pm on Dec 6, 2007 (gmt 0)

Hi,

I have been asked by my boss to download and try out a reasonably good "Did You Mean" database. It will act as a suggestion tool for site searches, so if you type "Helllo" it will suggest the right word, which is "Hello". Is one available for free on the internet? Where can I find one?

What I found is a "COM ActiveX Server Component" which we don't need.

Please help me out.

Regards,
getxb

 

stajer




msg:3521855
 7:14 pm on Dec 6, 2007 (gmt 0)

The database itself is generated by you - it contains a list of words that relate to your content. Then you take each search and compare it to that database. If the search contains a word or phrase not in your db, you present the user with the closest match in "Did you mean..." format.
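A minimal sketch of that idea, assuming Python and its standard `difflib` module for the "closest match" step. The post does not say how the closest match is found, so the similarity measure, the `SITE_WORDS` list, and the `did_you_mean` name are all illustrative assumptions.

```python
import difflib

# Hypothetical vocabulary; in practice, build it from your own site content.
SITE_WORDS = ["hello", "help", "database", "search", "soundex"]


def did_you_mean(query, vocabulary=SITE_WORDS, cutoff=0.6):
    """Return a suggested word for an unknown query word, or None."""
    word = query.lower()
    if word in vocabulary:
        return None  # the word is already known, so nothing to suggest
    # get_close_matches ranks candidates by difflib's similarity ratio
    # and drops anything scoring below `cutoff`.
    matches = difflib.get_close_matches(word, vocabulary, n=1, cutoff=cutoff)
    return matches[0] if matches else None


suggestion = did_you_mean("Helllo")
if suggestion:
    print(f"Did you mean: {suggestion}?")
```

Raising `cutoff` makes suggestions more conservative; lowering it returns looser matches.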

Demaestro




msg:3521864
 7:33 pm on Dec 6, 2007 (gmt 0)

You should check out databases that have a soundex module.

It is basically a "sounds like" function. That coupled with a dictionary lookup should be enough for someone to make a custom app that would do what you want.

I doubt you will find something for free, off the shelf, that does what you want.

Tapolyai




msg:3521866
 7:39 pm on Dec 6, 2007 (gmt 0)

May I suggest that instead of downloading a database you calculate the "did you mean" on the fly?

You can use soundex, metaphone, or similar concepts.

What you will need is a database of the words in your language, along with each word's code from the selected phonetic algorithm.

Since this is for your web site, you can even create this through a program, and periodically update it.

This would allow you to create the "did you mean" results, on the fly.

That is, let's presume you select soundex as the algorithm.

* Go through all pages of your site and create a database of words and their soundex codes (filter non-relevant pages, duplicates, stop words, etc.)
* The lookup key would be the soundex code. Several words can share one code, so each code maps to a list of words.

* user enters "helllo".
* Calculate soundex.
* look up matching soundex in DB.
* display words as "did you mean" with appropriate links to pages...
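The steps above can be sketched roughly as follows, assuming Python and an in-memory dict standing in for the database table. The `soundex` function is a plain implementation of the classic 4-character American Soundex; `build_index` and `suggestions` are hypothetical helper names, not part of any library.

```python
from collections import defaultdict

# Letter -> digit table used by American Soundex.
_CODES = {}
for _letters, _digit in (("bfpv", "1"), ("cgjkqsxz", "2"), ("dt", "3"),
                         ("l", "4"), ("mn", "5"), ("r", "6")):
    for _ch in _letters:
        _CODES[_ch] = _digit


def soundex(word):
    """Return the 4-character Soundex code, e.g. 'Robert' -> 'R163'."""
    letters = [c for c in word.lower() if c.isalpha()]
    if not letters:
        return ""
    code = letters[0].upper()
    prev = _CODES.get(letters[0], "")
    for ch in letters[1:]:
        digit = _CODES.get(ch, "")
        if digit and digit != prev:
            code += digit
        if ch not in "hw":  # h and w do not separate letters with equal codes
            prev = digit
    return (code + "000")[:4]  # pad with zeros, then truncate to 4 characters


def build_index(words):
    """Map each soundex code to the set of site words that share it."""
    index = defaultdict(set)
    for w in words:
        index[soundex(w)].add(w.lower())
    return index


def suggestions(query, index):
    """Return 'did you mean' candidates, or [] if the query is a known word."""
    q = query.lower()
    matches = index.get(soundex(q), set())
    return [] if q in matches else sorted(matches)


index = build_index(["hello", "hollow", "database", "search"])
print(suggestions("helllo", index))
```

Soundex is coarse: here "hello" and "hollow" share the code H400, so both come back for "helllo". A second ranking step (edit distance, or how often each word appears on the site) would be needed to pick one.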

edit: eh, Demaestro already said while I was typing it up. :D

[edited by: Tapolyai at 7:40 pm (utc) on Dec. 6, 2007]

getxb




msg:3521921
 8:21 pm on Dec 6, 2007 (gmt 0)

Thanks everyone for your replies. I hadn't thought of the "did you mean" on-the-fly concept; I will ask my boss to have a look at this thread so that I can pass it to our senior programmer for development.

Does anyone know of any other possible ways? And is this the way the search engines work too?

Regards,
getxb

getxb




msg:3522242
 7:29 am on Dec 7, 2007 (gmt 0)

Is there a paid version of something like this that we can purchase? Or are there any libraries / modules we can purchase that will help speed up the process and make it more accurate?

Kindly suggest.

Regards,
getxb

volatilegx




msg:3522738
 9:28 pm on Dec 7, 2007 (gmt 0)

Here is some soundex info... [creativyst.com...]

The above link includes several source code examples in various languages.

getxb




msg:3522959
 7:18 am on Dec 8, 2007 (gmt 0)

Please have a look at [discuss.joelonsoftware.com...] where someone mentioned that "Trying to make your own poor Soundex is asking for trouble." They also commented: "It might be nice to make a dictionary (for that proper spell checker library you use) based upon recent searches, or other domain knowledge (such as an index of content on the site). Obviously this dictionary needs maintaining, and old search terms expired periodically etc."

Right now we do maintain a dictionary, and we have two people who monitor the site searches and enter the misspelt words and their suggested corrections. We now plan to automate this and enhance the whole procedure.
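One way the automated dictionary could look, as a rough sketch in Python: it records search terms with a timestamp and periodically drops terms that have not been seen recently. The class name, the 30-day default, and the decision about which searches to record (for example, only those that returned results) are all assumptions, not something the quoted comment specifies.

```python
import time


class SearchTermDictionary:
    """Keeps recently searched terms and expires the stale ones."""

    def __init__(self, max_age_seconds=30 * 24 * 3600):
        self.max_age = max_age_seconds
        self.last_seen = {}  # term -> timestamp of its most recent search

    def record(self, term, now=None):
        """Note that `term` was searched (call this for searches you trust)."""
        self.last_seen[term.lower()] = time.time() if now is None else now

    def expire(self, now=None):
        """Remove terms older than max_age and return the removed terms."""
        now = time.time() if now is None else now
        stale = [t for t, seen in self.last_seen.items()
                 if now - seen > self.max_age]
        for t in stale:
            del self.last_seen[t]
        return stale

    def words(self):
        """Current dictionary contents, sorted, ready for a spell checker."""
        return sorted(self.last_seen)
```

Running `expire()` from a scheduled job, then feeding `words()` to whatever matching step is used, would replace the manual entry of misspellings.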
