Massive Code Duplication in Contrib Analyzers - unifly the analyzer ctors
-------------------------------------------------------------------------

                 Key: LUCENE-2034
                 URL: https://issues.apache.org/jira/browse/LUCENE-2034
             Project: Lucene - Java
          Issue Type: Improvement
          Components: contrib/analyzers
    Affects Versions: 2.9
            Reporter: Simon Willnauer
            Priority: Minor
             Fix For: 3.0


Due to the variouse tokenStream APIs we had in lucene analyzer subclasses need 
to implement at least one of the methodes returning a tokenStream. When you 
look at the code it appears to be almost identical if both are implemented in 
the same analyzer.  Each analyzer defnes the same inner class (SavedStreams) 
which is unnecessary.
In contrib almost every analyzer uses stopwords and each of them creates his 
own way of loading them or defines a large number of ctors to load stopwords 
from a file, set, arrays etc.. those ctors should be removed / deprecated and 
eventually removed.




-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: java-dev-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-dev-h...@lucene.apache.org

Reply via email to