Jorge Luis Betancourt Gonzalez created NUTCH-2443:
-----------------------------------------------------

             Summary: Extract links from the video tag with the parse-html 
plugin
                 Key: NUTCH-2443
                 URL: https://issues.apache.org/jira/browse/NUTCH-2443
             Project: Nutch
          Issue Type: Improvement
          Components: parser, plugin
    Affects Versions: 1.13
            Reporter: Jorge Luis Betancourt Gonzalez
            Assignee: Jorge Luis Betancourt Gonzalez
            Priority: Minor
             Fix For: 1.14


At the moment the {{parse-html}} extracts links from the tags {{a, area, form}} 
(configurable){{, frame, iframe, script, link, img}}. Since we allow extracting 
links to binary files (images) extracting links also from the {{video}} tag 
should be supported.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to