[
https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17834020#comment-17834020
]
Hudson commented on NUTCH-3032:
-------------------------------
SUCCESS: Integrated in Jenkins build Nutch » Nutch-trunk #156 (See
[https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/156/])
NUTCH-3032 Code for an ArbitraryIndexingFilter to index values resolved by user
POJO code at index time (#810) (github:
[https://github.com/apache/nutch/commit/c9e2f4ed693014e9dcb9d6f68ae918e0c0eedd26])
* (edit) build.xml
* (add) src/plugin/index-arbitrary/ivy.xml
* (add)
src/plugin/index-arbitrary/src/test/org/apache/nutch/indexer/arbitrary/Multiplier.java
* (edit) conf/nutch-default.xml
* (add)
src/plugin/index-arbitrary/src/java/org/apache/nutch/indexer/arbitrary/ArbitraryIndexingFilter.java
* (add)
src/plugin/index-arbitrary/src/java/org/apache/nutch/indexer/arbitrary/package-info.java
* (edit) src/plugin/build.xml
* (add) src/plugin/index-arbitrary/build.xml
* (add) src/plugin/index-arbitrary/plugin.xml
* (add)
src/plugin/index-arbitrary/src/test/org/apache/nutch/indexer/arbitrary/TestArbitraryIndexingFilter.java
* (add)
src/plugin/index-arbitrary/src/test/org/apache/nutch/indexer/arbitrary/Echo.java
> Indexing plugin as an adapter for end user's own POJO instances
> ---------------------------------------------------------------
>
> Key: NUTCH-3032
> URL: https://issues.apache.org/jira/browse/NUTCH-3032
> Project: Nutch
> Issue Type: Improvement
> Components: indexer
> Reporter: Joe Gilvary
> Assignee: Joe Gilvary
> Priority: Major
> Labels: indexing
> Fix For: 1.20
>
> Attachments: NUTCH-3032.patch
>
>
> It could be helpful to let end users manipulate information at indexing time
> with their own code without the need for writing their own indexing plugin. I
> mentioned this on the dev mailing list
> (https://www.mail-archive.com/[email protected]/msg31190.html) with some
> description of my work in progress.
> One potential use is to address some of the same concerns that NUTCH-585
> discusses regarding an alternative approach to picking and choosing which
> content to index, but this approach would allow making index time decisions,
> rather than setting the configuration for all content at the start of the
> indexing run.
>
--
This message was sent by Atlassian Jira
(v8.20.10#820010)