I'm relatively new to using Nutch, but I've managed to successfully
deploy it and use it so far. I've been asked to add rtf parsing to it,
and I'm having problems.
As far as I can tell, I need to get hold of a file called
rtf_parser_src.jar, but I can find nowhere to get it from. The source
referenced in the build file and the README,
http://www.cobase.cs.ucla.edu/pub/javacc/rtf_parser_src.jar, no longer
exists, and so I'm left unable to successfully compile.
Can anyone point out where I'm gong wrong, or direct me to where I can
download the required file?
Thanks,
Chaz
- Problems building the parse-rtf plugin Chaz Hickman
-