[ http://issues.apache.org/jira/browse/NUTCH-30?page=comments#action_62111 ] Chris A. Mattmann commented on NUTCH-30: ----------------------------------------
Hi Folks, One more comment on the parse-rss plugin that I've just attached: I also included a junit test drawn from John X's junit test on the PDF parser. The rss junit test parses a sample rss file and makes sure it reads the correct amount of outlinks, and the correct outlinks from the RSS file. Thanks, Chris > rss feed parser > --------------- > > Key: NUTCH-30 > URL: http://issues.apache.org/jira/browse/NUTCH-30 > Project: Nutch > Type: Improvement > Components: fetcher > Reporter: Stefan Grroschupf > Priority: Minor > Attachments: RSSParserPatch.txt, RSS_Parser.zip, parse-rss-patch.txt, > parse-rss.zip > > A simple rss feed parser supporting: > rss and atom: > + version 0.3 > + version 09 > + version 10 > + version 20 > Converting of different rss versions is done via xslt. > The xslt was contributed by Frank Henze - Thanks! -- This message is automatically generated by JIRA. - If you think it was sent incorrectly contact one of the administrators: http://issues.apache.org/jira/secure/Administrators.jspa - If you want more information on JIRA, or have a bug to report see: http://www.atlassian.com/software/jira
