[
https://issues.apache.org/jira/browse/NUTCH-2479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sebastian Nagel updated NUTCH-2479:
-----------------------------------
Fix Version/s: 2.5
> urlmeta plugin port from 1.x to 2.x
> -----------------------------------
>
> Key: NUTCH-2479
> URL: https://issues.apache.org/jira/browse/NUTCH-2479
> Project: Nutch
> Issue Type: New Feature
> Components: nutch server, plugin, REST_api
> Affects Versions: 2.3.1
> Reporter: Ninaad Joshi
> Priority: Minor
> Labels: patch, plugin
> Fix For: 2.5
>
> Attachments: Ninaad.Joshi.plugin.urlmeta.patch
>
>
> I have ported urlmeta plugin available in 1.x to 2.x
> It is designed to do two things:
> * Meta Tags that are supplied with your Crawl URLs, during injection either
> through seed.txt or through REST API, will be propagated throughout the
> out-links of those Crawl URLs
> * When you index your URLs, the meta tags that you specified with your URLs
> will be indexed alongside those URLs--and can be directly queried, assuming
> you have done everything else correctly.
> I have also added support through the NutchServer REST-API. Have Attached
> patch along with this issue.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)