Hi Stephan,
I also didn't find anything regarding PowerPoint parsers. Since I'm working
on a project that uses Lucene and the Nutch parsers, I investigated this too.
I actually created an MS PowerPoint parser this morning, using POI and the code
posted at this URL:
http://www.mail-archive.com/[email protected]/msg04809.html
Unfortunately, my parser plugin interface is slightly different from the
Nutch one, so it's not really ready to commit. I've also fixed the MS
Excel plugin submitted by Werner Ramaekers (with a fix to POI as well).
I could send you these files so you can fix bug 1018611 and submit the
PowerPoint parser.
Just ask if you want them and I'll send them to you.

Regards,
Stephan Lagraulet


On Mon, January 10, 2005 13:30, Strittmatter, Stephan said:
> Dear Nutch developers,
>
> Over the last few weeks I have been looking into Nutch and found that
> MS PowerPoint slides are currently not supported. I am not sure, but I have
> also not found any hint in the mailing lists that someone has already
> implemented a parser plugin for this document type.
>
> I created such a plugin based on POI, which works fine for Latin-character
> text. There are currently some problems with slides containing other
> characters, such as Chinese or Cyrillic, but I want to solve this too in the
> next few days.
>
> After doing some more tests and improving the javadocs, I would be glad to
> hand over the sources to the Nutch project on behalf of Sybit GmbH.
>
> Kind regards,
>
> Stephan Strittmatter
> Senior Developer
> -----
> Sybit GmbH, Waldstraße 28, D-78315 Radolfzell
> Fon: +49 (7732) 9508-00           Fax: -29
> mailto:[EMAIL PROTECTED]
> http://www.sybit.de
> Sybit - a bit better
>




_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers
