I'm relatively new to using Nutch, but I've managed to successfully deploy it and use it so far. I've been asked to add rtf parsing to it, and I'm having problems.

As far as I can tell, I need to get hold of a file called rtf_parser_src.jar, but I can find nowhere to get it from. The source referenced in the build file and the README, http://www.cobase.cs.ucla.edu/pub/javacc/rtf_parser_src.jar, no longer exists, and so I'm left unable to successfully compile.

Can anyone point out where I'm gong wrong, or direct me to where I can download the required file?

Thanks,
Chaz

Reply via email to