Ninaad Joshi created NUTCH-2479:
-----------------------------------

             Summary: urlmeta plugin port from 1.x to 2.x
                 Key: NUTCH-2479
                 URL: https://issues.apache.org/jira/browse/NUTCH-2479
             Project: Nutch
          Issue Type: New Feature
          Components: nutch server, plugin, REST_api
    Affects Versions: 2.3.1
            Reporter: Ninaad Joshi
            Priority: Minor
         Attachments: Ninaad.Joshi.plugin.urlmeta.patch

I have ported urlmeta plugin available in 1.x to 2.x

It is designed to do two things:
* Meta Tags that are supplied with your Crawl URLs, during injection either 
through seed.txt or through REST API, will be propagated throughout the 
out-links of those Crawl URLs
* When you index your URLs, the meta tags that you specified with your URLs 
will be indexed alongside those URLs--and can be directly queried, assuming you 
have done everything else correctly. 

I have also added support through the NutchServer REST-API. Have Attached patch 
along with this issue.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to