[ http://issues.apache.org/jira/browse/NUTCH-30?page=comments#action_62140 ]

Kevin Burton commented on NUTCH-30:
-----------------------------------
I'd recommend against using an XSLT approach to your RSS issue. Use the FeedParser. There are a score of issues that an XSLT approach won't fix.

WRT external XML libraries, we're using Jaxen and JDOM right now. FeedParser 2.0 won't have any external libraries and will only use SAX internally.

I'd just bite the bullet and take the dependencies because IMO you won't find another parser that comes anywhere close to solving all the issues that the FeedParser will. Rome comes close, but then you'd have the same number of dependencies.

Kevin

> rss feed parser
> ---------------
>
>          Key: NUTCH-30
>          URL: http://issues.apache.org/jira/browse/NUTCH-30
>      Project: Nutch
>         Type: Improvement
>   Components: fetcher
>     Reporter: Stefan Groschupf
>     Priority: Minor
>  Attachments: RSSParserPatch.txt, RSS_Parser.zip, parse-rss-patch.txt,
>               parse-rss.zip
>
> A simple rss feed parser supporting:
> rss and atom:
> + version 0.3
> + version 09
> + version 10
> + version 20
> Converting of different rss versions is done via xslt.
> The xslt was contributed by Frank Henze - Thanks!

--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
   http://issues.apache.org/jira/secure/Administrators.jspa
-
If you want more information on JIRA, or have a bug to report see:
   http://www.atlassian.com/software/jira
