[ http://issues.apache.org/jira/browse/NUTCH-52?page=all ]
Rohit Kulkarni updated NUTCH-52:
--------------------------------
Attachment: parse-msexcel.zip
The plugin is tested with the latest nutch SVN and seems to work
fine. Currently only STRING and NUMERIC Excel cell data types are being
considered.
Please try it out and let me know if anyone has any suggestions.
Plugin is attached as a zip file
thanks,
Rohit
> Parser plugin for MS Excel files
> --------------------------------
>
> Key: NUTCH-52
> URL: http://issues.apache.org/jira/browse/NUTCH-52
> Project: Nutch
> Type: Improvement
> Components: fetcher
> Reporter: Rohit Kulkarni
> Priority: Trivial
> Attachments: parse-msexcel.zip
>
> Nutch plugin to parse MSExcel files (using jakarta poi) and based on the
> MSPowerPointParser plugin by Stephan Strittmatter.
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
For more information on JIRA, see:
http://www.atlassian.com/software/jira