improve performance of contrib/TestCompoundWordTokenFilter
----------------------------------------------------------

                 Key: LUCENE-1786
                 URL: https://issues.apache.org/jira/browse/LUCENE-1786
             Project: Lucene - Java
          Issue Type: Test
          Components: contrib/analyzers
            Reporter: Robert Muir
            Priority: Minor


contrib/analyzers/compound has some tests that use a hyphenation grammar file.

The tests currently target German, and they are good ones: they show how the
hyphenation rules and the dictionary work in tandem.
The issue is that the German grammar file is not Apache-licensed:
http://offo.sourceforge.net/hyphenation/licenses.html
As a result, it cannot be included in the source, and the test must download
the entire offo zip file from SourceForge in order to run.

I happen to think the test is a great example of how this filter works (with a
language where it matters), but we could consider using a different grammar
file, for a language whose grammar file is Apache-licensed.
That way the file could be included in the source alongside the test, which
would be more practical.


