Bugs item #1109321, was opened at 2005-01-25 20:39 Message generated for change (Comment added) made by stritti You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=491356&aid=1109321&group_id=59548
Category: plugin: other Group: None Status: Open Resolution: None Priority: 5 Submitted By: Stephan Strittmatter (stritti) Assigned to: Nobody/Anonymous (nobody) Summary: [patch] parser plugin for MS PowerPoint slides Initial Comment: I attached my latest version of the PowerPoint-Parser. It supports also UniCode content. I tested it succesful with a Chinese slide and several other different slides which I googled. Additional to the attached files Jakarta-POI is required: - poi-2.5.1-final-20040804.jar - poi-scratchpad-2.5.1-final-20040804.jar These files can be found at: http://www.apache.org/dyn/closer.cgi/jakarta/poi/ For the JUnit-Tests, I use additional - junitperf-1.9.1.jar to meshure the performance. One problem is, like at Word-Parser, that the whole stream is needed. So the limits of downloaded stream size has to be changed within the nutch configuration. Probably this feature should be changed to be mime type specific? Kind regards, Stephan Strittmatter ---------------------------------------------------------------------- >Comment By: Stephan Strittmatter (stritti) Date: 2005-02-07 20:15 Message: Logged In: YES user_id=290357 The JDK1.4 compatible version is already uploaded since some days. Are there any coments about this implementation. ---------------------------------------------------------------------- Comment By: Stephan Strittmatter (stritti) Date: 2005-01-26 09:54 Message: Logged In: YES user_id=290357 Sorry, I have overseen in the current version some JDK5 specific parts. I was shure I had removed all. I will check the sources to be JDK1.4 compatible. ---------------------------------------------------------------------- You can respond by visiting: https://sourceforge.net/tracker/?func=detail&atid=491356&aid=1109321&group_id=59548 ------------------------------------------------------- SF email is sponsored by - The IT Product Guide Read honest & candid reviews on hundreds of IT Products from real users. Discover which products truly live up to the hype. Start reading now. http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click _______________________________________________ Nutch-developers mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-developers
