Hi,

The ppt-parser from Stephan Strittmatter (NUTCH-21) seems to work ok - I would suggest to add him to the regular plugins (Important: if you download the plugin from jira - be careful to take the current version (by now the second attachment from 2.Aug.2005!))

The xls-plugin from Rohit Kulkarni (NUTCH-52) needs still some work. The latest changes concerning the ParseStatus are not integrated, so it won't run under nutch-0.7. There are also some null-pointer-problems.

The zip-Plugin from Rohit Kulkarni (NUTCH-53) seems to work for me, but I gave him only a few tests.

Regards

        Michael



Jérôme Charron wrote:

Any parser plugins available for parsing xsl,ppt and
zip extension files.


Some patches are available for xsl, ppt and zip plugins:
(JIRA is actually down, so that I can't give you URLs to the related issues and patches).

If people are intersted in this patches to be commited in trunk, please vote for them.
http://issues.apache.org/jira/browse/Nutch

Regards

Jérôme




--
Michael Nebel
http://www.nebel.de/
http://www.netluchs.de/



-------------------------------------------------------
SF.Net email is Sponsored by the Better Software Conference & EXPO
September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to