--- Jérôme Charron <[EMAIL PROTECTED]> a écrit :

> Yes, you are rigth, but my response was a short time solution.
> 1. A quick solution could be to checsk that a plugin can be
> associated to 
> many content-types (if so, there's just to add application/powerpoint
> in the 
> mspowerpoint plugin xml).


I remember having played with that a wee bit, but the problem was that
the plugins themselves are riddled with pieces of code like the one
below, found in MSWordParser in release 0.7:

 if (contentType != null &&
!contentType.startsWith("application/msword"))
      return new ParseStatus(ParseStatus.FAILED,
ParseStatus.FAILED_INVALID_FORMAT,
        "Content-Type not application/msword: " +
contentType).getEmptyParse();

which means that, whatever you do, you're screwed - Excuse my French (:
- since the MIME type is hard-coded in the plugin.  It also means that
if you want to add a MIME type (say application/vnd.ms-word), you have
to edit the code.


> 2. Remember that powerpoint plugin is not part of the Nutch-0.7
> release... 

Now, you'll have to find a better one than that, Jerome! :)


Slán agat,
Sebastien


        

        
                
___________________________________________________________________________ 
Appel audio GRATUIT partout dans le monde avec le nouveau Yahoo! Messenger 
Téléchargez cette version sur http://fr.messenger.yahoo.com


-------------------------------------------------------
SF.Net email is Sponsored by the Better Software Conference & EXPO
September 19-22, 2005 * San Francisco, CA * Development Lifecycle Practices
Agile & Plan-Driven Development * Managing Projects & Teams * Testing & QA
Security * Process Improvement & Measurement * http://www.sqe.com/bsce5sf
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to