[ 
https://issues.apache.org/jira/browse/NUTCH-1533?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13605982#comment-13605982
 ] 

Lewis John McGibbney commented on NUTCH-1533:
---------------------------------------------

Hi Feng.
I am +1 for committing the most recent patch. Can you please commit this?
This way we can check if you have been set up with your Apache account, etc. 
properly.
Thank you and great work on this one.
                
> Implement getPrevModifiedTime(), setPrevModifiedTime(), getBatchId() and 
> setBatchId() accessors in o.a.n.storage.WebPage
> ------------------------------------------------------------------------------------------------------------------------
>
>                 Key: NUTCH-1533
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1533
>             Project: Nutch
>          Issue Type: Bug
>          Components: storage
>    Affects Versions: 2.1
>            Reporter: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 2.2
>
>         Attachments: NUTCH-1533.patch, NUTCH-1533v2.patch, NUTCH-1533-v3.patch
>
>
> NUTCH-1532 needs to obtain a batchId to add to NutchDocument prior to 
> indexing. This is currently not available as we do not store the information 
> in the WebPage. Additionally, we do not store the other ModifiedTime's but 
> incorrectly set them in o.a.n.crawl.FetchSchedule#setFetchSchedule.
> All the above accessors should be implemented.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to