[ https://issues.apache.org/jira/browse/NUTCH-3032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17830964#comment-17830964 ]
ASF GitHub Bot commented on NUTCH-3032: --------------------------------------- lewismc commented on code in PR #810: URL: https://github.com/apache/nutch/pull/810#discussion_r1539390873 ########## src/plugin/index-arbitrary/ivy.xml: ########## @@ -0,0 +1,41 @@ +<?xml version="1.0" ?> + +<!-- + Licensed to the Apache Software Foundation (ASF) under one or more + contributor license agreements. See the NOTICE file distributed with + this work for additional information regarding copyright ownership. + The ASF licenses this file to You under the Apache License, Version 2.0 + (the "License"); you may not use this file except in compliance with + the License. You may obtain a copy of the License at + + http://www.apache.org/licenses/LICENSE-2.0 + + Unless required by applicable law or agreed to in writing, software + distributed under the License is distributed on an "AS IS" BASIS, + WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. + See the License for the specific language governing permissions and + limitations under the License. +--> + Review Comment: Please remove whitespace. ########## src/plugin/index-arbitrary/src/test/org/apache/nutch/indexer/arbitrary/Multiplier.java: ########## @@ -0,0 +1,31 @@ +package org.apache.nutch.indexer.arbitrary; Review Comment: Please add ALv2 license header. ########## build.xml: ########## @@ -44,7 +44,7 @@ <property name="spotbugs.home" value="${ivy.dir}/spotbugs-${spotbugs.version}" /> <property name="spotbugs.jar" value="${spotbugs.home}/lib/spotbugs-ant.jar" /> - <property name="apache-rat.version" value="0.16.1" /> + <property name="apache-rat.version" value="0.15" /> Review Comment: Please sync with `master` branch. This regression is tangential to NUTCH-3032. Thanks ########## src/plugin/index-arbitrary/src/test/org/apache/nutch/indexer/arbitrary/Echo.java: ########## @@ -0,0 +1,24 @@ +package org.apache.nutch.indexer.arbitrary; Review Comment: Please add ALv2 license header. ########## src/plugin/index-arbitrary/ivy.xml: ########## @@ -0,0 +1,41 @@ +<?xml version="1.0" ?> + Review Comment: Please remove whitespace. ########## src/plugin/index-arbitrary/build.xml: ########## @@ -0,0 +1,6 @@ +<?xml version="1.0"?> Review Comment: Please add ALv2 license header. ########## src/plugin/index-arbitrary/src/java/org/apache/nutch/indexer/arbitrary/ArbitraryIndexingFilter.java: ########## @@ -0,0 +1,266 @@ +package org.apache.nutch.indexer.arbitrary; Review Comment: Please add ALv2 license header. > Indexing plugin as an adapter for end user's own POJO instances > --------------------------------------------------------------- > > Key: NUTCH-3032 > URL: https://issues.apache.org/jira/browse/NUTCH-3032 > Project: Nutch > Issue Type: Improvement > Components: indexer > Reporter: Joe Gilvary > Priority: Major > Labels: indexing > Attachments: NUTCH-3032.patch > > > It could be helpful to let end users manipulate information at indexing time > with their own code without the need for writing their own indexing plugin. I > mentioned this on the dev mailing list > (https://www.mail-archive.com/dev@nutch.apache.org/msg31190.html) with some > description of my work in progress. > One potential use is to address some of the same concerns that NUTCH-585 > discusses regarding an alternative approach to picking and choosing which > content to index, but this approach would allow making index time decisions, > rather than setting the configuration for all content at the start of the > indexing run. > -- This message was sent by Atlassian Jira (v8.20.10#820010)