[ https://issues.apache.org/jira/browse/NUTCH-1053?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Tejas Patil updated NUTCH-1053: ------------------------------- Attachment: NUTCH-1053.trunk.patch A tiny change in ivy file for feeds plugin fixes the problem. Attached a patch for trunk. {noformat}$ wget http://feeds.bbci.co.uk/news/scotland/rss.xml $ bin/nutch plugin feed org.apache.nutch.parse.feed.FeedParser rss.xml key: http://www.bbc.co.uk/sport/0/football/22477429 data: Version: 5 Status: success(1,0) Title: The man who floored Alex Ferguson Outlinks: 0 Content Metadata: Parse Metadata: CharEncodingForConversion=utf-8 OriginalCharEncoding=utf-8 feed=http://www.bbc.co.uk/news/scotland/ published=1368226806000 text: How Sir Alex's temper helped build his legend - and success ................ ................ {noformat} > Parsing of RSS feeds fails > --------------------------- > > Key: NUTCH-1053 > URL: https://issues.apache.org/jira/browse/NUTCH-1053 > Project: Nutch > Issue Type: Bug > Components: parser > Affects Versions: 1.4 > Reporter: Julien Nioche > Assignee: Julien Nioche > Fix For: 1.7 > > Attachments: nutch-1053.patch, NUTCH-1053.trunk.patch, seed.txt > > > See discussion on > http://lucene.472066.n3.nabble.com/RSS-feed-parsing-on-Nutch-1-3-td3166487.html > Have been able to reproduce the problem and will look into it -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira