To add to the other comments: this functionality should also take into account how common a term is in the corpus. Treating the corpus as the "correct" set of terms to search on isn't always what you want if the corpus is unclean (misspellings, etc.).
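To make the frequency point concrete, here is a rough sketch rather than anything taken from Lucene itself. It assumes you already have a term-to-count mapping extracted from the index. The names `suggest`, `max_distance` and `min_ratio`, and the threshold values, are made up for illustration. A term is only replaced when a near-match is much more common in the corpus, so a rare misspelling that happens to be indexed does not count as "correct".

```python
from collections import Counter


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance, computed row by row."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute (or match)
        prev = curr
    return prev[-1]


def suggest(term, term_freqs, max_distance=2, min_ratio=10):
    """Return a much more common near-match for `term`, or None.

    `max_distance` and `min_ratio` are illustrative assumptions,
    not values taken from any library.
    """
    own = term_freqs.get(term, 0)
    best, best_freq = None, 0
    for candidate, freq in term_freqs.items():
        if candidate == term:
            continue
        if edit_distance(term, candidate) > max_distance:
            continue
        # Only suggest candidates that are far more frequent than the
        # query term; unseen terms are treated as having a count of 1.
        if freq >= min_ratio * max(own, 1) and freq > best_freq:
            best, best_freq = candidate, freq
    return best


# Toy corpus: "lucine" is indexed (so it "exists"), but it is rare.
freqs = Counter(["lucene"] * 500 + ["lucine"] * 3 + ["search"] * 200 + ["index"] * 150)

print(suggest("lucine", freqs))  # lucene
print(suggest("lucene", freqs))  # None
```

In a real setup the counts would come from the index's own term statistics, and the thresholds would need tuning against actual query logs.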
I believe corpus frequency is also why, if you search on an uncommon term, Google will try to suggest something more common, even if you spelled the term correctly.

On 8/29/05, Chris Lu <[EMAIL PROTECTED]> wrote:
>
> Constructing a separate index as a dictionary is one part of the solution.
>
> The other part is to construct a dictionary with a list of possible
> "good words".
> By "good words", I mean all legal queries, not necessarily "correct
> words".
> Two approaches I can think of:
> * Use a word list (it may not be the word list you want, but it is just
>   a compromise).
> * Analyze your original index, listing out all words inside.
>
> There should be other approaches. Anyone?
>
> --
> Chris Lu
> ------------
> Lucene Search RAD on Any Database
> http://www.dbsight.net
>
> On 8/29/05, Joseph B. Ottinger <[EMAIL PROTECTED]> wrote:
> > java.net had an article on this not long ago. See
> > http://today.java.net/pub/a/today/2005/08/09/didyoumean.html .
> >
> > On Mon, 29 Aug 2005, Martin Rode wrote:
> >
> > > Hi everybody,
> > >
> > > Has anyone tried to code a solution like Google's "Did you mean?" in
> > > Lucene?
> > >
> > > I would be very happy to hear your ideas, approaches, suggestions.
> > >
> > > Best,
> > > Martin
> > >
> > > ---------------------------------------------------------------------
> > > To unsubscribe, e-mail: [EMAIL PROTECTED]
> > > For additional commands, e-mail: [EMAIL PROTECTED]
> >
> > -----------------------------------------------------------------------
> > Joseph B. Ottinger http://enigmastation.com
> > Editor, http://www.TheServerSide.com [EMAIL PROTECTED]