[ https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17834020#comment-17834020 ]

Hudson commented on NUTCH-3032:
-------------------------------

SUCCESS: Integrated in Jenkins build Nutch » Nutch-trunk #156 (See [https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/156/])
NUTCH-3032 Code for an ArbitraryIndexingFilter to index values resolved by user POJO code at index time (#810) (github: [https://github.com/apache/nutch/commit/c9e2f4ed693014e9dcb9d6f68ae918e0c0eedd26])
* (edit) build.xml
* (add) src/plugin/index-arbitrary/ivy.xml
* (add) src/plugin/index-arbitrary/src/test/org/apache/nutch/indexer/arbitrary/Multiplier.java
* (edit) conf/nutch-default.xml
* (add) src/plugin/index-arbitrary/src/java/org/apache/nutch/indexer/arbitrary/ArbitraryIndexingFilter.java
* (add) src/plugin/index-arbitrary/src/java/org/apache/nutch/indexer/arbitrary/package-info.java
* (edit) src/plugin/build.xml
* (add) src/plugin/index-arbitrary/build.xml
* (add) src/plugin/index-arbitrary/plugin.xml
* (add) src/plugin/index-arbitrary/src/test/org/apache/nutch/indexer/arbitrary/TestArbitraryIndexingFilter.java
* (add) src/plugin/index-arbitrary/src/test/org/apache/nutch/indexer/arbitrary/Echo.java


> Indexing plugin as an adapter for end user's own POJO instances
> ---------------------------------------------------------------
>
>                 Key: NUTCH-3032
>                 URL: https://issues.apache.org/jira/browse/NUTCH-3032
>             Project: Nutch
>          Issue Type: Improvement
>          Components: indexer
>            Reporter: Joe Gilvary
>            Assignee: Joe Gilvary
>            Priority: Major
>              Labels: indexing
>             Fix For: 1.20
>
>         Attachments: NUTCH-3032.patch
>
>
> It could be helpful to let end users manipulate information at indexing time
> with their own code without the need for writing their own indexing plugin. I
> mentioned this on the dev mailing list
> (https://www.mail-archive.com/dev@nutch.apache.org/msg31190.html) with some
> description of my work in progress.
> One potential use is to address some of the same concerns that NUTCH-585
> discusses regarding an alternative approach to picking and choosing which
> content to index, but this approach would allow making index time decisions,
> rather than setting the configuration for all content at the start of the
> indexing run.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)