Nguyen Manh Tien created NUTCH-1690:
---------------------------------------
Summary: IndexClean: mark url as unindexed after clean to not
delete again
Key: NUTCH-1690
URL: https://issues.apache.org/jira/browse/NUTCH-1690
Project: Nutch
Issue Type: Improvement
Components: indexer
Reporter: Nguyen Manh Tien
Priority: Minor
We should marked a deleted page to not delete it again and again. That can
simply done by remove Index marker when we delete.
I also change to delete duplicated url in solrclean.
--
This message was sent by Atlassian JIRA
(v6.1.5#6160)