Joseph Naegele created NUTCH-2248:
-------------------------------------
Summary: CSS parser plugin
Key: NUTCH-2248
URL: https://issues.apache.org/jira/browse/NUTCH-2248
Project: Nutch
Issue Type: New Feature
Components: parser, plugin
Affects Versions: 1.12
Reporter: Joseph Naegele
This plugin allows for collecting {{uri}} links from CSS (stylesheets). This is
useful for collecting parent stylesheets, fonts, and images needed to display
web pages as intended.
Parsed Outlinks do not have associated anchors, and no additional text/content
is parsed from the stylesheet.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)