1) Add the following to conf/parse-plugins.xml:
<mimeType name="audio/mpeg">
  <plugin id="parse-mp3" />
</mimeType>
2) Make sure the following alias is also in conf/parse-plugins.xml:
<alias name="parse-mp3"
       extension-id="org.apache.nutch.parse.mp3.MP3Parser" />
3) plugins/parse-mp3/plugin.xml should contain:
<plugin
   id="parse-mp3"
   name="MP3 Parse Plug-in"
   version="1.0.0"
   provider-name="nutch.org">
   <runtime>
      <library name="parse-mp3.jar">
         <export name="*"/>
      </library>
      <library name="jid3lib-0.5.4.jar"/>
   </runtime>
   <requires>
      <import plugin="nutch-extensionpoints"/>
   </requires>
   <extension id="org.apache.nutch.parse.mp3"
              name="MP3Parse"
              point="org.apache.nutch.parse.Parser">
      <implementation id="org.apache.nutch.parse.mp3.MP3Parser"
                      class="org.apache.nutch.parse.mp3.MP3Parser">
         <parameter name="contentType" value="audio/mpeg"/>
         <parameter name="pathSuffix" value=""/>
      </implementation>
   </extension>
</plugin>
4) Make sure jid3lib-0.5.4.jar and parse-mp3.jar are both in the
plugins/parse-mp3 directory.
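
If it helps, the four steps above can be checked mechanically. The following is only a rough sketch in Python, not part of Nutch: the function names check_parse_plugins and missing_jars are made up for illustration, and the script only looks for the elements and files listed above, wherever they appear in the file.

```python
import os
import xml.etree.ElementTree as ET


def check_parse_plugins(xml_text,
                        mime_type="audio/mpeg",
                        plugin_id="parse-mp3",
                        extension_id="org.apache.nutch.parse.mp3.MP3Parser"):
    """Return a list of problems found in the text of parse-plugins.xml.

    Hypothetical helper: it only checks for the mimeType mapping (step 1)
    and the alias (step 2), at any depth in the document.
    """
    root = ET.fromstring(xml_text)
    problems = []

    # Step 1: the MIME type must be mapped to the plugin id.
    mapped = [p.get("id")
              for m in root.iter("mimeType") if m.get("name") == mime_type
              for p in m.iter("plugin")]
    if plugin_id not in mapped:
        problems.append("no <plugin id=%r> under <mimeType name=%r>"
                        % (plugin_id, mime_type))

    # Step 2: the plugin id must be aliased to the parser's extension id.
    aliases = {a.get("name"): a.get("extension-id") for a in root.iter("alias")}
    if aliases.get(plugin_id) != extension_id:
        problems.append("alias %r does not point to %r"
                        % (plugin_id, extension_id))
    return problems


def missing_jars(plugin_dir, names=("parse-mp3.jar", "jid3lib-0.5.4.jar")):
    """Return the jar names from step 4 that are not present in plugin_dir."""
    return [n for n in names
            if not os.path.isfile(os.path.join(plugin_dir, n))]
```

To use it, read conf/parse-plugins.xml in binary mode, pass the contents to check_parse_plugins, and call missing_jars on the plugins/parse-mp3 directory. An empty list from both means steps 1, 2 and 4 look right; plugin.xml from step 3 still has to be compared by eye.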
-----Original Message-----
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
Sent: Wednesday, December 12, 2007 3:25 PM
To: [email protected]
Subject: Re: problem with mp3 parser
It did not help. Also, I checked that the search.dir value does not change in
C:\Tomcat\webapps\ROOT\WEB-INF\classes\nutch-default.xml, although I changed
it in nutch/conf/nutch-default.xml. Should the size of the nutch*.war file
change depending on how many sites are fetched? Also, if I put all the nutch
commands in a file and execute it, nutch gives errors saying that some
directory is not found, although the directory is there.
Thanks for any ideas.
Alex.
-----Original Message-----
From: Hasan Diwan <[EMAIL PROTECTED]>
To: [email protected]
Sent: Wed, 12 Dec 2007 9:34 am
Subject: Re: problem with mp3 parser
On 12/12/2007, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote:
> I have this file file:///C:/nutch/plugins/parse-mp3/jid3lib-0.5.4.jar
Try putting it in file:///C:/nutch/plugins/parse-mp3/lib/jid3lib-0.5.4.jar
--
Cheers,
Hasan Diwan <[EMAIL PROTECTED]>