Hi Andrzej,

  Back when I was actively working on this plugin (April/May), I researched
what I felt would be the most flexible, reliable way to parse RSS files. The
RSS feed parser from the jakarta-commons sandbox was what I found, and I stand
by it. I understand your concerns about its reliance on several libraries, but
that comes with the territory in this case. As Kevin Burton noted in
http://issues.apache.org/jira/browse/NUTCH-30 , though, when feedparser 2.0
comes out, the reliance on the external libraries will be removed. So I think
that by adopting the feedparser-based plugin right now, we have a clear
upgrade path toward the plugin's independence from external libraries,
without changing (much of) the underlying source code.

That's my two cents.

Thanks!

Cheers,
  Chris Mattmann



On 7/20/05 11:58 PM, "Andrzej Bialecki" <[EMAIL PROTECTED]> wrote:

> [EMAIL PROTECTED] wrote:
>> Hi,
>> 
>> Does anyone know why Chris Mattmann's RSS plugin (
>> http://issues.apache.org/jira/browse/NUTCH-30 ) wasn't put in the
>> repository, and whether there are plans to revive it and include it?
> 
> That's probably my fault. I was almost ready to import it, but then
> during the final review I hesitated - I'm wary of pulling in so many
> dependencies. Then other things got in the way, and I sort of dropped it
> for the moment...
> 
> If there's no way to parse RSS reliably other than using these dozens of
> libraries, so be it. Is this the case?

______________________________________________
Chris A. Mattmann
[EMAIL PROTECTED]
Staff Member
Modeling and Data Management Systems Section (387)
Data Management Systems and Technologies Group
 
_________________________________________________
Jet Propulsion Laboratory            Pasadena, CA
Office: 171-266B                        Mailstop:  171-246
_______________________________________________________
 
Disclaimer:  The opinions presented within are my own and do not reflect
those of either NASA, JPL, or the California Institute of Technology.
 
 





_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
