Hi Alex, My first thought: duplicate entries (urls) in crawldb1 and crawldb2?
Mathijs On Oct 28, 2011, at 6:54 , [email protected] wrote: > Hello, > > I have merged two cralwldb using bin/nutch mergedb crawldb crawldb1 crawldb2 > > I noticed that stats numbers in crawldb1+crawldb2 are not equal to numbers in > crawldb. For example df_unfetched1+ df_unfetched2 is not equal to > df_unfetched > > Any comments on this issue? > > Thanks. > Alex.

