Hi John,

Thanks for your reply. Actually, I already have the feedparser working from the command line. I also included a program, test2.java, with my original email that shows how I can dynamically load the class and call the feedparser method. So I already have that tool.
Any help on this issue would be greatly appreciated.

Thanks,
Chris

----- Original Message -----
From: John X <[EMAIL PROTECTED]>
Date: Saturday, March 26, 2005 1:07 am
Subject: Re: Huge Problem trying to develop plugin for Nutch

> Why try it the hard way? You may want to
> create a simple tool, just calling feedparser to parse your hi.rss?
> Have that work first, then worry about dynamic loading and nutch
> plugin system.
> Let us know when you have the simple tool.
>
> John
>
> On Fri, Mar 25, 2005 at 06:08:50PM -0800, Chris Mattmann wrote:
> > Hi Folks,
> >
> > My name is Chris Mattmann: I work at the Jet Propulsion Laboratory in
> > Pasadena, CA, U.S.A. I'm new to the list. Nice to meet you all.
> >
> > I am having some * major * trouble trying to build an RSS content parser
> > plugin for nutch. My plugin is based on the parse-pdf plugin structure and
> > uses the apache commons-feedparser library out of the Jakarta sandbox to try
> > and parse rss feeds and send them to nutch for indexing. The problem that I
> > am having is * very * strange. Basically after about 2 days of going around
> > the Nutch source code I've tracked my problem down to basically the fact
> > that for whatever reason, the jdom.jar library the commons-feedparser relies
> > on, is not accessible via the Nutch Plugin runtime. I keep getting the same
> > error whenever I run the crawler to crawl Rss pages. I've set up a dummy web
> > page with a single link to an rss file. Here's the webpage:
> >
