item description element not indexed
------------------------------------
Key: CONNECTORS-235
URL: https://issues.apache.org/jira/browse/CONNECTORS-235
Project: ManifoldCF
Issue Type: Improvement
Components: RSS connector
Affects Versions: ManifoldCF 0.2
Reporter: Kate McGonigal
The RSS feed's *item* description is not written to any field in the Solr
index.
I have a typical RSS feed with the general structure:
<rss>
  <channel>
    <title></title>
    <link></link>
    <description></description>
    <item>
      <title></title>
      <link></link>
      <pubDate></pubDate>
      <description> *** the description I do want *** </description>
      <author></author>
      <category></category>
    </item>
  </channel>
</rss>
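To make the distinction between the two description elements concrete, here is a minimal sketch in Python (not the connector's actual code, which is Java) that reads both from a feed with this structure. The sample feed contents and the function names item_descriptions and channel_description are hypothetical, chosen only for illustration.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample feed mirroring the structure above; all values are made up.
SAMPLE_FEED = """<rss>
  <channel>
    <title>Sample channel</title>
    <link>http://example.com/</link>
    <description>Channel-level description</description>
    <item>
      <title>First item</title>
      <link>http://example.com/1</link>
      <description>Item-level description</description>
    </item>
  </channel>
</rss>"""


def item_descriptions(feed_xml):
    """Return a (link, description) pair for each rss/channel/item."""
    root = ET.fromstring(feed_xml)  # root is the <rss> element
    pairs = []
    for item in root.findall("./channel/item"):
        # findtext returns the default when the child element is absent
        link = item.findtext("link", default="")
        description = item.findtext("description", default="")
        pairs.append((link.strip(), description.strip()))
    return pairs


def channel_description(feed_xml):
    """Return the rss/channel/description text (the feed-level description)."""
    root = ET.fromstring(feed_xml)
    return root.findtext("./channel/description", default="").strip()


if __name__ == "__main__":
    print(channel_description(SAMPLE_FEED))
    for link, description in item_descriptions(SAMPLE_FEED):
        print(link, "->", description)
```

The item-level text returned by item_descriptions is the value this issue asks to have passed to the index alongside each item's link.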
Example:
For the RSS feed:
http://www.onemansjazz.ca/component/option,com_rss/feed,RSS2.0/no_html,1/
the rss/channel/item/description field is not indexed into Solr.
Example notes:
- What does get written to the Solr "description" field is the description
metadata from the website, which in this case is "Jazz radio show from
Winnipeg on CKUW 95.9 FM, hosted by Maurice Hogue."
- On the "Dechromed Content" tab of the job, "No dechromed content" is
selected. I'm not sure whether that is relevant.