analyzers-common VS analyzers-icu

Daniel Bigham Wed, 01 Jun 2016 09:57:18 -0700

Hi,

I recently setup my code to choose the appropriate analyzer fromanalyzers-common depending on the language of the user's index/field.I then extended the existing source code to allow, for any language, toturn on/off things like stemming, case sensitivity, etc.

Today I discovered analyzers-icu, and I don't understand how tounderstand analyzers-common VS analyzers-icu.

Are they drop in replacements of each other? Are there features in onethat aren't available in the other? What are the pros and cons of usingone or the other?


In a nutshell, the features I care about are:

- The ability to specify a language and have tokenization performedaccording to that language

- Obviously the more languages supported the better

- The ability to turn on/off stemming for any language (implementedmyself for analyzers-common)- The ability to turn on/off case sensitivity for any language(implemented myself for analyzers-common)


Thanks,
Daniel

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org

analyzers-common VS analyzers-icu

Reply via email to