Hi Andrzej, At the time that I was working diligently on this plugin (April/May), I had done some thorough research into finding what I felt would be the most flexible, reliable way to parse RSS files. The RSS feed parser out of the jakarta-commmons sandbox was what I found, and I stand by it. I understand your concerns however about its reliance on several libraries, but it just comes with the territory in this case. However, as noted in: http://issues.apache.org/jira/browse/NUTCH-30 by Kevin Burton, when feedparser 2.0 comes out, the reliance on the external libraries will be removed, so I think that by adopting the feedparser based plugin right now, we have a clear upgrade path that leads us to the plugin's independence of external libraries, without changing (much of) the underlying source code.
That's my two cents. Thanks! Cheers, Chris Mattmann On 7/20/05 11:58 PM, "Andrzej Bialecki" <[EMAIL PROTECTED]> wrote: > [EMAIL PROTECTED] wrote: >> Hi, >> >> Does anyone know why Chris Mattmann's RSS plugin ( >> http://issues.apache.org/jira/browse/NUTCH-30 ) wasn't put in the >> repository, and whether there are plans to revive it and include it? > > That's probably my fault. I was almost ready to import it, but then > during the final review I hesitated - I'm wary of pulling in so many > dependencies. Then other things got in the way, and I sort of dropped it > for the moment... > > If there's no way to parse RSS reliably other than using these dozens of > libraries, so be it. Is this the case? ______________________________________________ Chris A. Mattmann [EMAIL PROTECTED] Staff Member Modeling and Data Management Systems Section (387) Data Management Systems and Technologies Group _________________________________________________ Jet Propulsion Laboratory Pasadena, CA Office: 171-266B Mailstop: 171-246 _______________________________________________________ Disclaimer: The opinions presented within are my own and do not reflect those of either NASA, JPL, or the California Institute of Technology. ------------------------------------------------------- SF.Net email is sponsored by: Discover Easy Linux Migration Strategies from IBM. Find simple to follow Roadmaps, straightforward articles, informative Webcasts and more! Get everything you need to get up to speed, fast. http://ads.osdn.com/?ad_id=7477&alloc_id=16492&op=click _______________________________________________ Nutch-general mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-general
