[ 
https://issues.apache.org/jira/browse/LUCENE-1166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12591936#action_12591936
 ] 

Thomas Peuss commented on LUCENE-1166:
--------------------------------------

The error is
{code}
    [junit] Testsuite: 
org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter
    [junit] Tests run: 4, Failures: 0, Errors: 2, Time elapsed: 2,139 sec
    [junit]
    [junit] Testcase: 
testHyphenationCompoundWordsDE(org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter):
  Caused an ERROR
    [junit] File not found: 
/home/thomas/projects/lucene-trunk-compound/hyphenation.dtd (No such file or 
directory)
    [junit] 
org.apache.lucene.analysis.compound.hyphenation.HyphenationException: File not 
found: /home/thomas/projects/lucene-trunk-compound/hyphenation.dtd (No such 
file or directory)
    [junit]     at 
org.apache.lucene.analysis.compound.hyphenation.PatternParser.parse(PatternParser.java:123)
    [junit]     at 
org.apache.lucene.analysis.compound.hyphenation.HyphenationTree.loadPatterns(HyphenationTree.java:138)
    [junit]     at 
org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter.getHyphenationTree(HyphenationCompoundWordTokenFilter.java:142)
    [junit]     at 
org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter.testHyphenationCompoundWordsDE(TestCompoundWordTokenFilter.java:70)
    [junit]
    [junit]
    [junit] Testcase: 
testHyphenationCompoundWordsDELongestMatch(org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter):
      Caused an ERROR
    [junit] File not found: 
/home/thomas/projects/lucene-trunk-compound/hyphenation.dtd (No such file or 
directory)
    [junit] 
org.apache.lucene.analysis.compound.hyphenation.HyphenationException: File not 
found: /home/thomas/projects/lucene-trunk-compound/hyphenation.dtd (No such 
file or directory)
    [junit]     at 
org.apache.lucene.analysis.compound.hyphenation.PatternParser.parse(PatternParser.java:123)
    [junit]     at 
org.apache.lucene.analysis.compound.hyphenation.HyphenationTree.loadPatterns(HyphenationTree.java:138)
    [junit]     at 
org.apache.lucene.analysis.compound.HyphenationCompoundWordTokenFilter.getHyphenationTree(HyphenationCompoundWordTokenFilter.java:142)
    [junit]     at 
org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter.testHyphenationCompoundWordsDELongestMatch(TestCompoundWordTokenFilter.java:96)
    [junit]
    [junit]
    [junit] Test 
org.apache.lucene.analysis.compound.TestCompoundWordTokenFilter FAILED
{code}

So it does not find the hyphenation.dtd. I have to investigate how I can make 
that DTD know to the parser without copying the hyphenation.dtd to Lucene's 
base directory.

> A tokenfilter to decompose compound words
> -----------------------------------------
>
>                 Key: LUCENE-1166
>                 URL: https://issues.apache.org/jira/browse/LUCENE-1166
>             Project: Lucene - Java
>          Issue Type: New Feature
>          Components: Analysis
>            Reporter: Thomas Peuss
>            Assignee: Grant Ingersoll
>            Priority: Minor
>         Attachments: CompoundTokenFilter.patch, CompoundTokenFilter.patch, 
> CompoundTokenFilter.patch, CompoundTokenFilter.patch, 
> CompoundTokenFilter.patch, CompoundTokenFilter.patch, 
> CompoundTokenFilter.patch, de.xml, hyphenation.dtd
>
>
> A tokenfilter to decompose compound words you find in many germanic languages 
> (like German, Swedish, ...) into single tokens.
> An example: Donaudampfschiff would be decomposed to Donau, dampf, schiff so 
> that you can find the word even when you only enter "Schiff".
> I use the hyphenation code from the Apache XML project FOP 
> (http://xmlgraphics.apache.org/fop/) to do the first step of decomposition. 
> Currently I use the FOP jars directly. I only use a handful of classes from 
> the FOP project.
> My question now:
> Would it be OK to copy this classes over to the Lucene project (renaming the 
> packages of course) or should I stick with the dependency to the FOP jars? 
> The FOP code uses the ASF V2 license as well.
> What do you think?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.


---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to