[
https://issues.apache.org/jira/browse/NUTCH-1294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13266889#comment-13266889
]
Lewis John McGibbney commented on NUTCH-1294:
---------------------------------------------
I think this is a really neat patch. The new extension point is a great
addition to this often desired aspect of maintaining your index. The script in
bin/nutch requires to be updated with the correct command, and the patch needs
to be tested before we commit. I would be happy to get this tested once the
blocker NUTCH-1205 has be resolved (which looks to be very soon). It would be
great to get this into 2.0. Thanks Dan.
> IndexClean job with solr implementation.
> ----------------------------------------
>
> Key: NUTCH-1294
> URL: https://issues.apache.org/jira/browse/NUTCH-1294
> Project: Nutch
> Issue Type: Improvement
> Affects Versions: nutchgora
> Reporter: Dan Rosher
> Priority: Minor
> Fix For: nutchgora
>
> Attachments: NUTCH-1294.patch
>
>
> I started by copying/altering the trunk version of SolrClean, though is was
> inadequate for our needs. We needed to mark particular pages as gone even
> though they still might be visible on the web, this implementation abstracts
> the index cleaning process, has a Solr implementation, and adds a clean index
> plugin extension that allows others to tailor how pages might be removed from
> their store.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira