[
https://issues.apache.org/jira/browse/OPENNLP-200?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13092381#comment-13092381
]
Joern Kottmann commented on OPENNLP-200:
----------------------------------------
The test is using the platform default encoding to read the data set. Since the
default encoding is platform and location dependent this test will fail on
other machines, or produce different results.
To fix this always specify the encoding when opening the data, and it should be
retrieved via the class path instead.
> Addition of prepositional phrase attachment dataset and unit test for it
> ------------------------------------------------------------------------
>
> Key: OPENNLP-200
> URL: https://issues.apache.org/jira/browse/OPENNLP-200
> Project: OpenNLP
> Issue Type: New Feature
> Components: Maxent
> Reporter: Jason Baldridge
> Priority: Minor
> Labels: data, testing
> Fix For: tools-1.5.2-incubating, maxent-3.0.2-incubating
>
> Attachments: OPENNLP-200.patch, ppa.tar.gz
>
>
> I have obtained permission from Adwait Ratnaparkhi to include his
> prepositional phrase attachment dataset in the distribution as a test case.
> Jorn correctly points out that we need to see whether this is ASF compliant.
> Here is the original dataset:
> http://sites.google.com/site/adwaitratnaparkhi/publications/ppa.tar.gz?attredirects=0
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira