Hi Jérôme,

I think, the ppt-parser is ready to go. Now!

But the xls-plugin won't work without further modifications. It's a "pre-Andrzej"-version which still uses the ParseException. I'm not sure, if my "hacks" to get them running are ok. I still see many errors (null-pointer-exceptions while crawling). Ok - I see... I should update NUTCH-52 :-)

Regards

        Michael



The ppt-parser from Stephan Strittmatter (NUTCH-21) seems to work ok - I
would suggest to add him to the regular plugins (Important: if you
download the plugin from jira - be careful to take the current version
(by now the second attachment from 2.Aug.2005!))

The xls-plugin from Rohit Kulkarni (NUTCH-52) needs still some work. The
latest changes concerning the ParseStatus are not integrated, so it
won't run under nutch-0.7. There are also some null-pointer-problems.

The zip-Plugin from Rohit Kulkarni (NUTCH-53) seems to work for me, but
I gave him only a few tests.


Thanks Michael for this status about these plugins.
Since the best way to widely test and improve these plugins is to widely using them,
I thing it's time to commit them.
If there is no objections in the next days, I will commit them next week.
However, my first idea was to commit these patches in the trunk in order to avoid introducing some new bugs in the future 0.7.1 release. Committers (especially Piotr, our release expert) and developpers, what do you think about this point? (trunk for 0.8, or 0.7 branch for 0.7.1)

Regards

Jérôme



--
Michael Nebel
http://www.nebel.de/
http://www.netluchs.de/



-------------------------------------------------------
SF.Net email is Sponsored by the Better Software Conference & EXPO
September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to