[ http://issues.apache.org/jira/browse/NUTCH-30?page=history ]
Chris A. Mattmann updated NUTCH-30:
-----------------------------------
Attachment: parse-rss-srcbin-incl-path.zip
Hi John,
Here ya go. The zip file includes:
1. up-to-date zipped up src of the plugin, incl. required binary jars in
lib directory, tested against the latest SVN of nutch
2. text output of running the unit tests to test the plugin
3. patch file against the latest SVN source.
You should be good to go with this. Let me know if there are any troubles.
BTW, I changed the content type in the plugin.xml to be "application/rss+xml"
as oppossed to "text/xml", as it was before. I'm sure we'll need to think more
about what the most appropriate seting for this is, but for now, it should be
fine (as it can always be tailored to the user's env by changing the attribute).
Take care and thanks!
Cheers,
Chris
> rss feed parser
> ---------------
>
> Key: NUTCH-30
> URL: http://issues.apache.org/jira/browse/NUTCH-30
> Project: Nutch
> Type: Improvement
> Components: fetcher
> Reporter: Stefan Grroschupf
> Priority: Minor
> Attachments: RSSParserPatch.txt, RSS_Parser.zip, parse-rss-1.0-040605.zip,
> parse-rss-patch.txt, parse-rss-srcbin-incl-path.zip, parse-rss.zip
>
> A simple rss feed parser supporting:
> rss and atom:
> + version 0.3
> + version 09
> + version 10
> + version 20
> Converting of different rss versions is done via xslt.
> The xslt was contributed by Frank Henze - Thanks!
--
This message is automatically generated by JIRA.
-
If you think it was sent incorrectly contact one of the administrators:
http://issues.apache.org/jira/secure/Administrators.jspa
-
If you want more information on JIRA, or have a bug to report see:
http://www.atlassian.com/software/jira