We did extensive comparison in the past for Snowball, KStem and Hunspell
and there are cases where one of them works better but not other or
vice-versa. You may utilise all three of them by having 3 different fields
(fieldTypes) and during query, search in all of them.

For some of the cases where none of them works (e.g wolves, wolf etc)., use
StemOverriderFactory.

HTH.

Thanks,
Susheel

On Thu, Dec 15, 2016 at 11:32 AM, Ahmet Arslan <iori...@yahoo.com.invalid>
wrote:

> Hi,
>
> KStemFilter returns legitimate English words, please use it.
>
> Ahmet
>
>
>
> On Thursday, December 15, 2016 6:17 PM, Lasitha Wattaladeniya <
> watt...@gmail.com> wrote:
> Hello devs,
>
> I'm trying to develop this indexing and querying flow where it converts the
> words to its original form (lemmatization). I was doing bit of research
> lately but the information on the internet is very limited. I tried using
> hunspellfactory but it doesn't convert the word to it's original form,
> instead it gives suggestions for some words (hunspell works for some
> english words correctly but for some it gives multiple suggestions or no
> suggestions, i used the en_us.dic provided by openoffice)
>
> I know this is a generic problem in searching, so is there anyone who can
> point me to correct direction or some information :)
>
> Best regards,
> Lasitha Wattaladeniya
> Software Engineer
>
> Mobile : +6593896893
> Blog : techreadme.blogspot.com
>

Reply via email to