[ 
http://issues.apache.org/jira/browse/LUCENE-285?page=comments#action_12418682 ] 

Cédrik LIME commented on LUCENE-285:
------------------------------------

Not: not sure this is the right way of proceeding with JIRA. Should I open a 
new bug report instead of commenting?

This implementation (as of Lucene 1.9.1) uses an un-optimized implementation of
the Levenshtein Distance algorithm (it uses way too much memory). Please see Bug
38911 (http://issues.apache.org/bugzilla/show_bug.cgi?id=38911) for more 
information and the new implementation.

> David Spencer Spell Checker improved
> ------------------------------------
>
>          Key: LUCENE-285
>          URL: http://issues.apache.org/jira/browse/LUCENE-285
>      Project: Lucene - Java
>         Type: Improvement

>   Components: Search
>     Versions: unspecified
>  Environment: Operating System: other
> Platform: All
>     Reporter: Nicolas Maisonneuve
>     Priority: Minor
>  Attachments: spellchecker.zip
>
> hy,
> i developed a SpellChecker based on the David Spencer code (DSc) but more 
> flexible.
> the structure of the index is inspired of the DSc (for a 3-4 gram):
> word:
> gram3:
> gram4:
>  
> 3start:
> 4start:
> ..
> 3end:
> 4end:
> ..
> transposition:
>  
> This index is a dictonary so there isn't the "freq" field like with DSc 
> version.
> it's independant of the user index. So we can add words becoming to several
> fields of several index for example or, why not, to a file with a list of 
> words.
> The suggestSimilar method return a list of suggests word sorted by the
> Levenshtein distance and optionaly to the popularity of the word for a 
> specific
> field in a user index. More of that, this list can be restricted only to words
> present in a specific field of a user index.
>  
> See the test case.
>  
> i hope this code will be put in the lucene sandbox. 
>  
> Nicolas Maisonneuve

-- 
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
   http://www.atlassian.com/software/jira


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to