--- Jérôme Charron <[EMAIL PROTECTED]> a écrit :
> Yes, you are rigth, but my response was a short time solution.
> 1. A quick solution could be to checsk that a plugin can be
> associated to
> many content-types (if so, there's just to add application/powerpoint
> in the
> mspowerpoint plugin xml).
I remember having played with that a wee bit, but the problem was that
the plugins themselves are riddled with pieces of code like the one
below, found in MSWordParser in release 0.7:
if (contentType != null &&
!contentType.startsWith("application/msword"))
return new ParseStatus(ParseStatus.FAILED,
ParseStatus.FAILED_INVALID_FORMAT,
"Content-Type not application/msword: " +
contentType).getEmptyParse();
which means that, whatever you do, you're screwed - Excuse my French (:
- since the MIME type is hard-coded in the plugin. It also means that
if you want to add a MIME type (say application/vnd.ms-word), you have
to edit the code.
> 2. Remember that powerpoint plugin is not part of the Nutch-0.7
> release...
Now, you'll have to find a better one than that, Jerome! :)
Slán agat,
Sebastien
___________________________________________________________________________
Appel audio GRATUIT partout dans le monde avec le nouveau Yahoo! Messenger
Téléchargez cette version sur http://fr.messenger.yahoo.com
-------------------------------------------------------
SF.Net email is Sponsored by the Better Software Conference & EXPO
September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general