Hello,
I get the following error when I do bin/nutch updatedb in nutch-2.0 with hbase
java.lang.ArrayIndexOutOfBoundsException: 1
at org.apache.nutch.util.TableUtil.unreverseUrl(TableUtil.java:98)
at org.apache.nutch.crawl.DbUpdateMapper.map(DbUpdateMapper.java:54)
at org.apache.nutch.crawl.DbUpdateMapper.map(DbUpdateMapper.java:37)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:144)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
at
org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
I see this is because of reversing and unreversing urls. What is the idea
behind this reversal and unreversal in nutch-2.0?
Thanks.
Alex.