[ 
https://issues.apache.org/jira/browse/OPENNLP-1132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suneel Marthi resolved OPENNLP-1132.
------------------------------------
    Resolution: Fixed

> Fail with exception if not enough lines in leipzig parser
> ---------------------------------------------------------
>
>                 Key: OPENNLP-1132
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-1132
>             Project: OpenNLP
>          Issue Type: Bug
>          Components: Language Detector
>    Affects Versions: 1.8.2
>            Reporter: Peter Thygesen
>            Assignee: Peter Thygesen
>             Fix For: 1.8.4
>
>
> Exception in thread "main" java.lang.IndexOutOfBoundsException: toIndex = 
> 100000
>       at java.util.ArrayList.subListRangeCheck(ArrayList.java:1004)
>       at java.util.ArrayList.subList(ArrayList.java:996)
>       at 
> opennlp.tools.formats.leipzig.LeipzigLanguageSampleStream$LeipzigSentencesStream.<init>(LeipzigLanguageSampleStream.java:65)
>       at 
> opennlp.tools.formats.leipzig.LeipzigLanguageSampleStream.read(LeipzigLanguageSampleStream.java:157)
>       at 
> opennlp.tools.formats.leipzig.LeipzigLanguageSampleStream.read(LeipzigLanguageSampleStream.java:42)
>       at 
> opennlp.tools.formats.leipzig.SampleShuffleStream.<init>(SampleShuffleStream.java:38)
>       at 
> opennlp.tools.formats.leipzig.LeipzigLanguageSampleStreamFactory.create(LeipzigLanguageSampleStreamFactory.java:76)
>       at 
> opennlp.tools.cmdline.AbstractConverterTool.run(AbstractConverterTool.java:106)
>       at opennlp.tools.cmdline.CLI.main(CLI.java:256)
> line 65:
> Set<Integer> selectedLines = new HashSet<>(
>           indexes.subList(0, sentencesPerSample * numberOfSamples));
> Fails if sentencesPerSample x numberOfSamples is larger than size of indexes 
> (source file).



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to