[ https://issues.apache.org/jira/browse/NUTCH-1690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Nguyen Manh Tien updated NUTCH-1690:
------------------------------------

    Attachment: NUTCH-1690.patch

> IndexClean: mark url as unindexed after clean to not delete again
> -----------------------------------------------------------------
>
>                 Key: NUTCH-1690
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1690
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>            Reporter: Nguyen Manh Tien
>            Priority: Minor
>             Fix For: 2.3
>
>         Attachments: NUTCH-1690.patch
>
>
> We should mark a deleted page so that it is not deleted again and again.
> That can be done simply by removing the index marker when we delete.
> I also changed solrclean to delete duplicate URLs.
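
The idea in the description can be sketched roughly as follows. This is only
an illustration and not the code in the attached patch: the Page class, the
indexMarker field and the clean() method are hypothetical stand-ins, and the
index is modelled as a plain in-memory set of URLs instead of Solr.

```java
import java.util.ArrayList;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

public class IndexCleanSketch {

    /** Hypothetical stand-in for a stored page; not the Nutch WebPage class. */
    static class Page {
        final String url;
        final boolean gone;   // true if the page should no longer be in the index
        boolean indexMarker;  // true while the page is believed to be indexed

        Page(String url, boolean gone, boolean indexMarker) {
            this.url = url;
            this.gone = gone;
            this.indexMarker = indexMarker;
        }
    }

    /**
     * Deletes every gone page that still carries the index marker, then
     * clears the marker so that later runs skip the page.
     * Returns the number of deletes issued in this run.
     */
    static int clean(List<Page> pages, Set<String> index) {
        int deletes = 0;
        for (Page p : pages) {
            if (p.gone && p.indexMarker) {
                index.remove(p.url);    // assumed stand-in for the delete request
                p.indexMarker = false;  // mark as unindexed: no repeat delete
                deletes++;
            }
        }
        return deletes;
    }

    public static void main(String[] args) {
        List<Page> pages = new ArrayList<>();
        pages.add(new Page("http://example.com/a", false, true));
        pages.add(new Page("http://example.com/b", true, true));

        Set<String> index = new HashSet<>();
        index.add("http://example.com/a");
        index.add("http://example.com/b");

        System.out.println("first run deletes:  " + clean(pages, index));
        System.out.println("second run deletes: " + clean(pages, index));
    }
}
```

In this sketch the first run issues one delete and the second run issues
none, because the marker was cleared. Without the marker removal the same URL
would be sent for deletion on every run.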



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)
