Author: ferdy
Date: Fri Sep 7 14:19:47 2012
New Revision: 1382037
URL: http://svn.apache.org/viewvc?rev=1382037&view=rev
Log:
NUTCH-1456 Updater not setting batchId in markers correctly. (Alexander Kingson
via ferdy)
Modified:
nutch/branches/2.x/CHANGES.txt
nutch/branches/2.x/src/java/org/apache/nutch/crawl/DbUpdateReducer.java
Modified: nutch/branches/2.x/CHANGES.txt
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/CHANGES.txt?rev=1382037&r1=1382036&r2=1382037&view=diff
==============================================================================
--- nutch/branches/2.x/CHANGES.txt (original)
+++ nutch/branches/2.x/CHANGES.txt Fri Sep 7 14:19:47 2012
@@ -2,6 +2,8 @@ Nutch Change Log
Release 2.1 - Current Development
+* NUTCH-1456 Updater not setting batchId in markers correctly. (Alexander
Kingson via ferdy)
+
* NUTCH-1459 Remove dead code (phase2) from InjectorJob (ferdy)
* NUTCH-1431 Introduce link 'distance' and add configurable max distance in
the generator (ferdy)
Modified:
nutch/branches/2.x/src/java/org/apache/nutch/crawl/DbUpdateReducer.java
URL:
http://svn.apache.org/viewvc/nutch/branches/2.x/src/java/org/apache/nutch/crawl/DbUpdateReducer.java?rev=1382037&r1=1382036&r2=1382037&view=diff
==============================================================================
--- nutch/branches/2.x/src/java/org/apache/nutch/crawl/DbUpdateReducer.java
(original)
+++ nutch/branches/2.x/src/java/org/apache/nutch/crawl/DbUpdateReducer.java Fri
Sep 7 14:19:47 2012
@@ -192,9 +192,10 @@ extends GoraReducer<UrlWithScore, NutchW
}
Mark.GENERATE_MARK.removeMarkIfExist(page);
Mark.FETCH_MARK.removeMarkIfExist(page);
- Utf8 mark = Mark.PARSE_MARK.removeMarkIfExist(page);
- if (mark != null) {
- Mark.UPDATEDB_MARK.putMark(page, mark);
+ Utf8 parse_mark = Mark.PARSE_MARK.checkMark(page);
+ if (parse_mark != null) {
+ Mark.UPDATEDB_MARK.putMark(page, parse_mark);
+ Mark.PARSE_MARK.removeMark(page);
}
context.write(keyUrl, page);