> Hello,
> 
> I want to filter my tokens and keep only string tokens (remove
> numbers, etc.).
> I use this:
> 
> public TokenStream tokenStream(String fieldName, Reader reader) {
>     return new PorterStemFilter(
>       new StopFilter(
>         new LowerCaseFilter(
>           new StandardFilter(
>             new StandardTokenizer(reader))), stopset));
> }


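Why not use LowerCaseTokenizer [1] instead of StandardTokenizer +
StandardFilter + LowerCaseFilter? It splits the text at non-letter
characters and lowercases each token, so digits never reach the rest of
the chain. You can then wrap it directly in StopFilter and
PorterStemFilter.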

[1]http://lucene.apache.org/java/2_9_2/api/core/org/apache/lucene/analysis/LowerCaseTokenizer.html
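
To make the effect concrete, here is a rough plain-Java sketch of the rule LowerCaseTokenizer is documented to apply (split at non-letters, lowercase each token). It is not Lucene code: the class name LetterOnlyTokens and the method tokenize are made up for illustration, and the real tokenizer works on a Reader rather than a String.

```java
import java.util.ArrayList;
import java.util.List;

public class LetterOnlyTokens {
    // Hypothetical helper: emits maximal runs of letters, lowercased.
    // It mimics the documented behaviour of LowerCaseTokenizer, not its source.
    static List<String> tokenize(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (Character.isLetter(c)) {
                // Letters extend the current token.
                current.append(Character.toLowerCase(c));
            } else if (current.length() > 0) {
                // Any non-letter (digit, space, punctuation) ends the token.
                tokens.add(current.toString());
                current.setLength(0);
            }
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        // prints [lucene, indexes, docs, abc, def]
        System.out.println(tokenize("Lucene 2.9 indexes 42 Docs, abc123def"));
    }
}
```

Note that a mixed token such as "abc123def" is not dropped but split into "abc" and "def", because the digits act as separators. If that is not what you want, a custom TokenFilter that discards tokens containing digits may be a better fit.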


  
