Hi Lewis, >From quickly checking out the code (Host.java + HostDB + HostDBUpdateReducer) it would seems like there is a bug exactly where you pointed.
Renato M. 2014-12-08 20:53 GMT+01:00 Lewis John Mcgibbney <lewis.mcgibb...@gmail.com>: > Hi Folks, > I was looking into the code within Nutch 2.X HostDbUpdateReducer and > 'think' I've discovered a bug in the way we output Host data. > > https://github.com/apache/nutch/blob/2.x/src/java/org/apache/nutch/host/HostDbUpdateReducer.java#L87 > I feel that the follwoing code > > host.getInlinks().put(new Utf8(outlink), new > Utf8(Integer.toString(outlinkCount.getCount(outlink)))); > > should be changed to the following > > host.getOutlinks().put(new Utf8(outlink), new > Utf8(Integer.toString(outlinkCount.getCount(outlink)))); > > Is anyone actively using the HostDb and can comment? > Thank you > Lewis > > -- > *Lewis* >