Hello,
I noticed that updatedb command must remove gen, parse and fetch marks and put
UPDATEDB_MARK mark.
according to the code
Utf8 mark = Mark.PARSE_MARK.removeMarkIfExist(page);
if (mark != null) {
Mark.UPDATEDB_MARK.putMark(page, mark);
}
in DbUpdateReducer.java
However, outputting markers in Hbase shows that updatedb removes all marks,
except injector one and does not put UPDATEDB_MARK.
Thanks.
Alex.
- updatedb fails to put UPDATEDB_MARK in nutch-2.0 alxsss
- Re: updatedb fails to put UPDATEDB_MARK in nutch-2.0 Ferdy Galema

