> Hello,
>
> I want to filter my tokens and keep only string tokens (remove
> numbers, etc.).
> I use this:
>
> public TokenStream tokenStream(String fieldName, Reader reader) {
>     return new PorterStemFilter(
>         new StopFilter(
>             new LowerCaseFilter(
>                 new StandardFilter(
>                     new StandardTokenizer(reader))),
>             stopset));
> }
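
Why not use LowerCaseTokenizer [1] instead of StandardTokenizer +
StandardFilter + LowerCaseFilter? It splits the text at non-letter
characters and lowercases what is left, so purely numeric tokens are
never produced in the first place.
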
[1]http://lucene.apache.org/java/2_9_2/api/core/org/apache/lucene/analysis/LowerCaseTokenizer.html
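
To make the effect concrete, here is a rough plain-Java sketch of the
"split at non-letters, then lowercase" rule that the LowerCaseTokenizer
Javadoc describes. It is only an illustration and is not Lucene's code:
the class name LetterLowerCaseSketch and the method tokenize are made up
for this example, and it relies on Character.isLetter, which may not
match Lucene's behaviour for every Unicode character.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration class; not part of Lucene.
public class LetterLowerCaseSketch {

    // Collects maximal runs of letters, lowercased. Every non-letter
    // character (digits, punctuation, whitespace) ends the current token
    // and is itself discarded.
    public static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetter(c)) {
                current.append(Character.toLowerCase(c));
            } else if (current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        // Flush the last token if the text ends with a letter.
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // prints [lucene, indexes, documents, not, abc]
        System.out.println(tokenize("Lucene 2.9 indexes 42 Documents, not abc123"));
    }
}
```

Note that a mixed token such as "abc123" is not dropped entirely: its
letter part "abc" survives. If such tokens should disappear completely,
a letter-only tokenizer is probably not enough on its own.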