Hi,
The ppt parser from Stephan Strittmatter (NUTCH-21) seems to work OK. I
would suggest adding it to the regular plugins. (Important: if you
download the plugin from JIRA, be careful to take the current version,
which is at the moment the second attachment, from 2 Aug 2005.)
The xls plugin from Rohit Kulkarni (NUTCH-52) still needs some work. The
latest changes concerning ParseStatus are not integrated, so it
won't run under nutch-0.7. There are also some null-pointer problems.
The zip plugin from Rohit Kulkarni (NUTCH-53) seems to work for me, but
I ran only a few tests on it.
Regards
Michael
Jérôme Charron wrote:
Are any parser plugins available for parsing xsl, ppt and
zip files?
Some patches are available for xsl, ppt and zip plugins:
(JIRA is currently down, so I can't give you URLs to the related issues
and patches.)
If people are interested in having these patches committed to trunk, please vote
for them.
http://issues.apache.org/jira/browse/Nutch
Regards
Jérôme
--
Michael Nebel
http://www.nebel.de/
http://www.netluchs.de/