[ https://issues.apache.org/jira/browse/NUTCH-583?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Lewis John McGibbney updated NUTCH-583: --------------------------------------- Fix Version/s: 2.2 1.7 > FeedParser empty links for items > -------------------------------- > > Key: NUTCH-583 > URL: https://issues.apache.org/jira/browse/NUTCH-583 > Project: Nutch > Issue Type: Bug > Affects Versions: 1.0.0 > Reporter: Enis Soztutar > Assignee: Enis Soztutar > Fix For: 1.7, 2.2 > > > FeedParser in feed plugin just discards the item if it does not have <link> > element. However Rss 2.0 does not necessitate the <link> element for each > <item>. > Moreover sometimes the link is given in the <guid> element which is a > globally unique identifier for the item. I think we can search the url for an > item first, then if it is still not found, we can use the feed's url, but > with merging all the parse texts into one Parse object. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira