Shi Wang wrote:

Hi! Hickman,
You can download it here:
_http://nutch.cvs.sourceforge.net/nutch/nutch/src/plugin/parse-rtf/lib/_
Actually, You will have this problem if you use the version 0.9, and, the other plugin you may miss is the mp3 parser, you can download it here:
_http://nutch.cvs.sourceforge.net/nutch/nutch/src/plugin/parse-mp3/lib/_
For more information, you can see the Nutch wiki:
http://wiki.apache.org/nutch/RunNutchInEclipse0.9
2008/1/15, Chaz Hickman <[EMAIL PROTECTED] <mailto:[EMAIL PROTECTED]>>:

    I'm relatively new to using Nutch, but I've managed to successfully
    deploy it and use it so far. I've been asked to add rtf parsing to it,
    and I'm having problems.

    As far as I can tell, I need to get hold of a file called
    rtf_parser_src.jar, but I can find nowhere to get it from. The source
    referenced in the build file and the README,
    http://www.cobase.cs.ucla.edu/pub/javacc/rtf_parser_src.jar, no longer
    exists, and so I'm left unable to successfully compile.

    Can anyone point out where I'm gong wrong, or direct me to where I can
    download the required file?

Shawn,

Thanks for the pointer. I'd looked through the wiki, but missed the link on that page. I've downloaded the file and managed to get the rtf plugin built, although it wasn't 100% straightforward as the build.xml for it insists on trying to download the parser source from that old stale url I mentioned. Putting a dummy file in the tmp directory fixed that and allowed the plugin to build.

Reply via email to