[ https://issues.apache.org/jira/browse/OPENNLP-753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17948499#comment-17948499 ]
Amarpreet Bassan commented on OPENNLP-753:
------------------------------------------

Hey [Jörn Kottmann|https://issues.apache.org/jira/secure/ViewProfile.jspa?name=joern],

Could you please help assign this issue to me?

Best,
Amar

> USGS Indexer fails with ArrayIndexOutOfBoundsException
> ------------------------------------------------------
>
>                 Key: OPENNLP-753
>                 URL: https://issues.apache.org/jira/browse/OPENNLP-753
>             Project: OpenNLP
>          Issue Type: Bug
>          Components: Entity Linker
>            Reporter: Jörn Kottmann
>            Priority: Major
>
> While indexing a recent version of the USGS data (NationalFile_20141202.txt)
> the following exception is thrown:
> java.lang.ArrayIndexOutOfBoundsException: 2
>       at opennlp.addons.geoentitylinker.indexing.USGSProcessor.readFile(USGSProcessor.java:77)
>       at opennlp.addons.geoentitylinker.indexing.USGSProcessor.process(USGSProcessor.java:54)
>       at opennlp.addons.geoentitylinker.indexing.GazetteerIndexer.index(GazetteerIndexer.java:168)
>       at opennlp.addons.geoentitylinker.indexing.GazetteerIndexer.main(GazetteerIndexer.java:44)
> A couple of lines in this file are now shorter than the expected minimum
> length. I added a few lines to print those out:
> Skipped line: 2717280|Sullivan Field
> Skipped line: Sullivan Field
> Skipped line: Sullivan Field
> Skipped line:
> |Airport|FL|12|Alachua|001|294359N|0823358W|29.7330798|-82.5661417|||||28|92|Newberry|10/16/2014|

--
This message was sent by Atlassian Jira
(v8.20.10#820010)