Date: 2004-10-11T11:08:09
Editor: NicolasMaisonneuve <[EMAIL PROTECTED]>
Wiki: Jakarta Lucene Wiki
Page: SpellChecker
URL: http://wiki.apache.org/jakarta-lucene/SpellChecker

no comment

Change Log:

------------------------------------------------------------------------------
@@ -16,7 +16,7 @@
 == add words to the dictionary ==
 we can add words becoming to several fields of several index for example or, why not, to a file with a list of words.
- * Example: we can add all the keywords of a specific field of your index.
+ * Example: we can add all the keywords of a specific field of your index.
 {{{
 SpellChecker spell= new SpellChecker(dictionaryDirectory);
 spell.addWords(myIndex_Reader, myField);
@@ -29,21 +29,21 @@
 More of that, this list can be restricted only to words present in a specific field of a user index.
-* First example: the suggestSimilar(misspelled_word, num_list) method.
+ * First example: the suggestSimilar(misspelled_word, num_list) method.
 The "num_list" is the maximum number of words returned. In this example (the simplest) the list is just sorted with the levenshtein distance.
-{{{
+ {{{
 String[] l=spellChecker.suggestSimilar("sevanty", 10);
 //l[0] = "seventy" , l[1] = "seven" , l[2]="seventeen"
-}}}
+ }}}
-* Second example: the suggestSimilar(misspelled_word, num_list, myIndex_Redear,myField, morePopular)
+ * Second example: the suggestSimilar(misspelled_word, num_list, myIndex_Redear,myField, morePopular)
-1. ""Note"": if myIndex_reader and myField are null this method is the same as the first method
-2. The returned words are restricted only to the words presents in the field "myfield" of the user index "myIndex_Reader"
-3. the list is sorted with the second criteria
-4. If "morePopular" is true and the mispelled word exist in the field of the user index , return only the word more frequent than this.
+ 1. ""Note"": if myIndex_reader and myField are null this method is the same as the first method
+ 2. The returned words are restricted only to the words presents in the field "myfield" of the user index "myIndex_Reader"
+ 3. the list is sorted with the second criteria : the popularity (the frequence) of the word in the user field
+ 4. If "morePopular" is true and the mispelled word exist in the field of the user index , return only the word more frequent than this.
-See the test case code for example
+ See the test case code for example
 == Download ==
 download file to [http://issues.apache.org/bugzilla/show_bug.cgi?id=31617]
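
As a rough illustration of the first suggestSimilar example in the diff above, where the list is sorted by Levenshtein distance, here is a minimal self-contained Java sketch. It does not use or reproduce the SpellChecker class: the class name LevenshteinRanking and the methods distance and rank are hypothetical names chosen for this example, and the candidate words are supplied by hand rather than read from a dictionary index.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;

// Hypothetical helper, not part of Lucene: ranks candidate words by edit distance.
class LevenshteinRanking {

    // Classic dynamic-programming edit distance: insertion, deletion and
    // substitution each cost 1. Only two rows of the table are kept.
    static int distance(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // turning "" into b[0..j) takes j insertions
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i; // turning a[0..i) into "" takes i deletions
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1),
                                   prev[j - 1] + cost);
            }
            int[] tmp = prev;
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    // Returns at most max candidates, closest to the misspelled word first.
    // The sort is stable, so ties keep their input order.
    static List<String> rank(String misspelled, List<String> candidates, int max) {
        List<String> sorted = new ArrayList<>(candidates);
        sorted.sort(Comparator.comparingInt((String w) -> distance(misspelled, w)));
        return new ArrayList<>(sorted.subList(0, Math.min(max, sorted.size())));
    }

    public static void main(String[] args) {
        List<String> words = Arrays.asList("seventeen", "seven", "seventy");
        System.out.println(rank("sevanty", words, 10)); // [seventy, seven, seventeen]
    }
}
```

With these three candidates the distances from "sevanty" are 1, 3 and 4, which gives the same order as the comment in the wiki example. The second suggestSimilar form described in the diff would additionally need word frequencies from the user index, which this sketch leaves out.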