Nutch 2 DbUpdateMapper throws ArrayOutOfBoundsException when running update
---------------------------------------------------------------------------
Key: NUTCH-1077
URL: https://issues.apache.org/jira/browse/NUTCH-1077
Project: Nutch
Issue Type: Bug
Components: fetcher
Affects Versions: 2.0
Environment: CentOS 5 Linux with CDH3 Hadoop.
Reporter: Tom Davidson
I got this error when running a simple nutch update after doing a small fetch
and parse.
java.lang.ArrayIndexOutOfBoundsException: 0
at
org.apache.nutch.util.TableUtil.reverseAppendSplits(TableUtil.java:126)
at org.apache.nutch.util.TableUtil.reverseUrl(TableUtil.java:66)
at org.apache.nutch.util.TableUtil.reverseUrl(TableUtil.java:43)
at org.apache.nutch.crawl.DbUpdateMapper.map(DbUpdateMapper.java:70)
at org.apache.nutch.crawl.DbUpdateMapper.map(DbUpdateMapper.java:36)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:647)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:323)
at org.apache.hadoop.mapred.Child$4.run(Child.java:270)
at java.security.AccessController.doPrivileged(Native Method)
at javax.security.auth.Subject.doAs(Subject.java:396)
at
org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1127)
at org.apache.hadoop.mapred.Child.main(Child.java:264)
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira