SWF parser
----------
Key: NUTCH-198
URL: http://issues.apache.org/jira/browse/NUTCH-198
Project: Nutch
Type: New Feature
Components: fetcher
Versions: 0.8-dev
Reporter: Andrzej Bialecki
Assigned to: Andrzej Bialecki
Attachments: parse-swf.zip
This is a parser for Flash SWF files. It uses the JavaSWF2 library (BSD
license) and applies some heuristics to extract as much text as possible
(including potential links) from ActionScript sections.
If there are no objections, I'd like to add it soon.
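
To illustrate the kind of heuristic meant above, here is a minimal sketch.
It is not the code in parse-swf.zip and it does not call JavaSWF2 at all: it
assumes the string constants have already been pulled out of the
ActionScript sections by some other means, and only shows how text and
potential links might then be picked out of them. The class name, the
method names and the URL pattern are all hypothetical.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Set;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical helper, not part of the attached plugin.
class SwfStringHeuristics {

    // Assumption: anything that looks like an absolute http/https/ftp URL
    // inside a string constant is treated as a potential link.
    private static final Pattern LINK =
        Pattern.compile("(?i)\\b(?:https?|ftp)://[^\\s\"'<>]+");

    // Collects distinct URL-like substrings, in order of first appearance.
    static List<String> extractLinks(List<String> strings) {
        Set<String> links = new LinkedHashSet<>();
        for (String s : strings) {
            if (s == null) {
                continue;
            }
            Matcher m = LINK.matcher(s);
            while (m.find()) {
                links.add(m.group());
            }
        }
        return new ArrayList<>(links);
    }

    // Joins all non-empty strings into one space-separated text blob.
    static String extractText(List<String> strings) {
        StringBuilder sb = new StringBuilder();
        for (String s : strings) {
            if (s == null) {
                continue;
            }
            String t = s.trim();
            if (t.isEmpty()) {
                continue;
            }
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(t);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        List<String> constants = new ArrayList<>();
        constants.add("Click here: http://example.com/page");
        constants.add("  Welcome  ");
        System.out.println(extractLinks(constants));
        System.out.println(extractText(constants));
    }
}
```

Keeping the link pattern deliberately loose favours recall over precision,
which seems to match the stated goal of extracting as much as possible; a
stricter pattern would miss fewer false positives but also drop relative
or partially built URLs.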