[ 
https://issues.apache.org/jira/browse/NUTCH-1294?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13266889#comment-13266889
 ] 

Lewis John McGibbney commented on NUTCH-1294:
---------------------------------------------

I think this is a really neat patch. The new extension point is a great 
addition to this often desired aspect of maintaining your index. The script in 
bin/nutch requires to be updated with the correct command, and the patch needs 
to be tested before we commit. I would be happy to get this tested once the 
blocker NUTCH-1205 has be resolved (which looks to be very soon). It would be 
great to get this into 2.0. Thanks Dan.   
                
> IndexClean job with solr implementation.
> ----------------------------------------
>
>                 Key: NUTCH-1294
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1294
>             Project: Nutch
>          Issue Type: Improvement
>    Affects Versions: nutchgora
>            Reporter: Dan Rosher
>            Priority: Minor
>             Fix For: nutchgora
>
>         Attachments: NUTCH-1294.patch
>
>
> I started by copying/altering the trunk version of SolrClean, though is was 
> inadequate for our needs. We needed to mark particular pages as gone even 
> though they still might be visible on the web, this implementation abstracts 
> the index cleaning process, has a Solr implementation, and adds a clean index 
> plugin extension that allows others to tailor how pages might be removed from 
> their store.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to