Ninaad Joshi created NUTCH-2479:
-----------------------------------
Summary: urlmeta plugin port from 1.x to 2.x
Key: NUTCH-2479
URL: https://issues.apache.org/jira/browse/NUTCH-2479
Project: Nutch
Issue Type: New Feature
Components: nutch server, plugin, REST_api
Affects Versions: 2.3.1
Reporter: Ninaad Joshi
Priority: Minor
Attachments: Ninaad.Joshi.plugin.urlmeta.patch
I have ported urlmeta plugin available in 1.x to 2.x
It is designed to do two things:
* Meta Tags that are supplied with your Crawl URLs, during injection either
through seed.txt or through REST API, will be propagated throughout the
out-links of those Crawl URLs
* When you index your URLs, the meta tags that you specified with your URLs
will be indexed alongside those URLs--and can be directly queried, assuming you
have done everything else correctly.
I have also added support through the NutchServer REST-API. Have Attached patch
along with this issue.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)