Hi Dima,
The example code you mentioned in your other recent email is pretty close.
The only thing you'd probably want to add is access to the CharTermAttribute:
CharTermAttribute termAtt = addAttribute(CharTermAttribute.class);
and then in the loop over ts.incrementToken(), you can get to the output tokens
using termAtt.buffer() and termAtt.length(), or if you're going to Stringify
tokens anyway, termAtt.toString().
Steve
On Dec 18, 2012, at 1:16 PM, dokondr <[email protected]> wrote:
> Hello,
> I am looking for an example of using Tokenizer + Analyzer (in particular
> org.apache.lucene.analysis.ru.RussianAnalyzer) for standalone stemming.
> Can't find such an example here:
> http://lucene.apache.org/core/4_0_0/core/org/apache/lucene/analysis/package-summary.html?is-external=true#package_description
>
> Thanks!
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]