[
https://issues.apache.org/jira/browse/NUTCH-1403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17276667#comment-17276667
]
Hudson commented on NUTCH-1403:
-------------------------------
SUCCESS: Integrated in Jenkins build Nutch ยป Nutch-trunk #24 (See
[https://ci-builds.apache.org/job/Nutch/job/Nutch-trunk/24/])
fix for NUTCH-1403 contributed by aalbahem (ameer.albahem:
[https://github.com/apache/nutch/commit/598bbc40a3d3438233813b607cb031a6bb0a2f84])
* (add) src/plugin/scoring-metadata/pom.xml
* (add) src/plugin/scoring-metadata/plugin.xml
* (add)
src/plugin/scoring-metadata/src/test/org/apache/nutch/scoring/metadata/MetadataScoringFilterTest.java
* (add) src/plugin/scoring-metadata/build.xml
* (add)
src/plugin/scoring-metadata/src/java/org/apache/nutch/scoring/metadata/package.html
* (add)
src/plugin/scoring-metadata/src/java/org/apache/nutch/scoring/metadata/MetadataScoringFilter.java
* (add) src/plugin/scoring-metadata/ivy.xml
* (edit) build.xml
* (edit) src/plugin/build.xml
Improve fix for NUTCH-1403 (ameer.albahem:
[https://github.com/apache/nutch/commit/cdb6b52b02958385497804ef7cd6a6b646616208])
* (edit) default.properties
* (delete) src/plugin/scoring-metadata/pom.xml
* (edit) conf/nutch-default.xml
* (delete)
src/plugin/scoring-metadata/src/test/org/apache/nutch/scoring/metadata/MetadataScoringFilterTest.java
* (edit)
src/plugin/scoring-metadata/src/java/org/apache/nutch/scoring/metadata/package.html
* (add)
src/plugin/scoring-metadata/src/test/org/apache/nutch/scoring/metadata/TestMetadataScoringFilter.java
* (edit)
src/plugin/scoring-metadata/src/java/org/apache/nutch/scoring/metadata/MetadataScoringFilter.java
Improve NUTCH-1403, add ASLv2 header (ameer.albahem:
[https://github.com/apache/nutch/commit/93aa2ab41097511f3afe8d34c9c13cafd735cec9])
* (edit)
src/plugin/scoring-metadata/src/test/org/apache/nutch/scoring/metadata/TestMetadataScoringFilter.java
* (edit)
src/plugin/scoring-metadata/src/java/org/apache/nutch/scoring/metadata/package.html
> Add default ScoringFilter for manipulating metadata
> ----------------------------------------------------
>
> Key: NUTCH-1403
> URL: https://issues.apache.org/jira/browse/NUTCH-1403
> Project: Nutch
> Issue Type: Improvement
> Reporter: Julien Nioche
> Priority: Major
> Fix For: 1.19
>
>
> This is currently done by the urlmeta plugin, which has too vague a name and
> a redundant indexing filter now that we have the index-metadata plugin. This
> scoring filter would help defining which metadata to pass from :
> - the crawl metadata to the content metadata
> - the content metadata to the parse metadata
> - the parse metadata to the crawldatum for the outlinks
> I'd make this scoring filter available by default i.e. not in a separate
> plugin as its functionalities are commonly used.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)