[
https://issues.apache.org/jira/browse/NUTCH-1533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13612151#comment-13612151
]
Hudson commented on NUTCH-1533:
-------------------------------
Integrated in Nutch-nutchgora #541 (See
[https://builds.apache.org/job/Nutch-nutchgora/541/])
NUTCH-1533 - Implement getPrevModifiedTime(), setPrevModifiedTime(),
getBatchId() and setBatchId() accessors in o.a.n.storage.WebPage (Revision
1460380)
Result = FAILURE
fenglu : http://svn.apache.org/viewvc/nutch/branches/2.x/?view=rev&rev=1460380
Files :
* /nutch/branches/2.x/CHANGES.txt
* /nutch/branches/2.x/conf/gora-accumulo-mapping.xml
* /nutch/branches/2.x/conf/gora-cassandra-mapping.xml
* /nutch/branches/2.x/conf/gora-hbase-mapping.xml
* /nutch/branches/2.x/conf/gora-sql-mapping.xml
* /nutch/branches/2.x/src/gora/webpage.avsc
* /nutch/branches/2.x/src/java/org/apache/nutch/crawl/AdaptiveFetchSchedule.java
* /nutch/branches/2.x/src/java/org/apache/nutch/crawl/DbUpdateReducer.java
* /nutch/branches/2.x/src/java/org/apache/nutch/crawl/DefaultFetchSchedule.java
* /nutch/branches/2.x/src/java/org/apache/nutch/crawl/GeneratorReducer.java
* /nutch/branches/2.x/src/java/org/apache/nutch/storage/ParseStatus.java
* /nutch/branches/2.x/src/java/org/apache/nutch/storage/ProtocolStatus.java
* /nutch/branches/2.x/src/java/org/apache/nutch/storage/WebPage.java
> Implement getPrevModifiedTime(), setPrevModifiedTime(), getBatchId() and
> setBatchId() accessors in o.a.n.storage.WebPage
> ------------------------------------------------------------------------------------------------------------------------
>
> Key: NUTCH-1533
> URL: https://issues.apache.org/jira/browse/NUTCH-1533
> Project: Nutch
> Issue Type: Bug
> Components: storage
> Affects Versions: 2.1
> Reporter: Lewis John McGibbney
> Assignee: lufeng
> Priority: Minor
> Fix For: 2.2
>
> Attachments: NUTCH-1533.patch, NUTCH-1533v2.patch, NUTCH-1533-v3.patch
>
>
> NUTCH-1532 needs to obtain a batchId to add to NutchDocument prior to
> indexing. This is currently not available as we do not store the information
> in the WebPage. Additionally, we do not store the other ModifiedTime's but
> incorrectly set them in o.a.n.crawl.FetchSchedule#setFetchSchedule.
> All the above accessors should be implemented.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira