[ 
https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17834020#comment-17834020
 ] 

Hudson commented on NUTCH-3032:
-------------------------------

SUCCESS: Integrated in Jenkins build Nutch » Nutch-trunk #156 (See 
[https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/156/])
NUTCH-3032 Code for an ArbitraryIndexingFilter to index values resolved by user 
POJO code at index time (#810) (github: 
[https://github.com/apache/nutch/commit/c9e2f4ed693014e9dcb9d6f68ae918e0c0eedd26])
* (edit) build.xml
* (add) src/plugin/index-arbitrary/ivy.xml
* (add) 
src/plugin/index-arbitrary/src/test/org/apache/nutch/indexer/arbitrary/Multiplier.java
* (edit) conf/nutch-default.xml
* (add) 
src/plugin/index-arbitrary/src/java/org/apache/nutch/indexer/arbitrary/ArbitraryIndexingFilter.java
* (add) 
src/plugin/index-arbitrary/src/java/org/apache/nutch/indexer/arbitrary/package-info.java
* (edit) src/plugin/build.xml
* (add) src/plugin/index-arbitrary/build.xml
* (add) src/plugin/index-arbitrary/plugin.xml
* (add) 
src/plugin/index-arbitrary/src/test/org/apache/nutch/indexer/arbitrary/TestArbitraryIndexingFilter.java
* (add) 
src/plugin/index-arbitrary/src/test/org/apache/nutch/indexer/arbitrary/Echo.java


> Indexing plugin as an adapter for end user's own POJO instances
> ---------------------------------------------------------------
>
>                 Key: NUTCH-3032
>                 URL: https://issues.apache.org/jira/browse/NUTCH-3032
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>            Reporter: Joe Gilvary
>            Assignee: Joe Gilvary
>            Priority: Major
>              Labels: indexing
>             Fix For: 1.20
>
>         Attachments: NUTCH-3032.patch
>
>
> It could be helpful to let end users manipulate information at indexing time 
> with their own code without the need for writing their own indexing plugin. I 
> mentioned this on the dev mailing list 
> (https://www.mail-archive.com/dev@nutch.apache.org/msg31190.html) with some 
> description of my work in progress.
> One potential use is to address some of the same concerns that NUTCH-585 
> discusses regarding an alternative approach to picking and choosing which 
> content to index, but this approach would allow making index time decisions, 
> rather than setting the configuration for all content at the start of the 
> indexing run.
>  



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to