[ https://issues.apache.org/jira/browse/OPENNLP-1702?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Richard Zowalla closed OPENNLP-1702. ------------------------------------ Resolution: Fixed > BratDocumentStream should process files in bratCorpusDir deterministically > -------------------------------------------------------------------------- > > Key: OPENNLP-1702 > URL: https://issues.apache.org/jira/browse/OPENNLP-1702 > Project: OpenNLP > Issue Type: Bug > Components: Build, Packaging and Test > Affects Versions: 2.5.3 > Reporter: Martin Wiesner > Assignee: Martin Wiesner > Priority: Minor > Fix For: 2.5.4 > > Time Spent: 50m > Remaining Estimate: 0h > > With the recent addition of {{BratNameSampleStreamFactoryTest}} via > OPENNLP-1695, it became obvious (Eval test run), that the code in > BratDocumentStream is prone to non-determinism. This stems from the fact that > {{java.util.File#listFiles(..)}} does not guarantee any order of the returned > elements. > A potential fix for achieving determinism again is to sort the result of > listFiles(..) alphabetically in ASC order. -- This message was sent by Atlassian Jira (v8.20.10#820010)