Martin Wiesner created OPENNLP-1702:
---------------------------------------

             Summary: BratDocumentStream should process files in bratCorpusDir 
deterministically
                 Key: OPENNLP-1702
                 URL: https://issues.apache.org/jira/browse/OPENNLP-1702
             Project: OpenNLP
          Issue Type: Bug
          Components: Build, Packaging and Test
    Affects Versions: 2.5.3
            Reporter: Martin Wiesner
            Assignee: Martin Wiesner
             Fix For: 2.5.4


With the recent addition of {{BratNameSampleStreamFactoryTest}} via 
OPENNLP-1695, an Eval test run revealed that the code in 
{{BratDocumentStream}} is prone to non-determinism. This stems from the fact 
that {{java.io.File#listFiles(..)}} does not guarantee any order of the 
returned elements. 

A potential fix to restore determinism is to sort the result of 
{{listFiles(..)}} alphabetically in ascending order.
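
A minimal sketch of the proposed sorting, for illustration only. The class 
{{SortedFileListing}} and the method {{listFilesSorted}} are hypothetical 
names, not part of OpenNLP, and the actual change in {{BratDocumentStream}} 
may look different. The sketch assumes that ordering by file name is what is 
wanted.

```java
import java.io.File;
import java.util.Arrays;
import java.util.Comparator;

// Hypothetical helper, not the actual OpenNLP implementation.
class SortedFileListing {

  // Returns the entries of dir sorted by file name in ascending order.
  static File[] listFilesSorted(File dir) {
    File[] files = dir.listFiles();
    if (files == null) {
      // listFiles() returns null if dir is not a directory
      // or if an I/O error occurs.
      return new File[0];
    }
    // String's natural order is lexicographic and case-sensitive.
    Arrays.sort(files, Comparator.comparing(File::getName));
    return files;
  }
}
```

With an explicit sort, the processing order no longer depends on what the 
underlying file system happens to return.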



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
