[
https://issues.apache.org/jira/browse/CONNECTORS-235?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13079141#comment-13079141
]
Karl Wright commented on CONNECTORS-235:
----------------------------------------
Just to be clear, here's an example of the Solr log line for indexing one of
the documents from the above mentioned feed. You can, of course, configure the
job to map the field names to whatever you like. This is with no mapping
whatsoever.
INFO: [] webapp=/solr path=/update/extract
params={literal.source=http://www.onemansjazz.ca/component/option,com_rss/feed,RSS2.0/no_html,1/&literal.category=Radio+-+Play+lists&literal.summary=I+had+a+lot+of+fun+putting+this+show+together+this+week.+Hope+you+enjoy+it,+too.&literal.id=http://www.onemansjazz.ca/content/view/332/30/&literal.title=July+23,+2011+Playlist&literal.pubdate=1311339967000}
status=0 QTime=13
I'm pretty certain you must have a metadata value set for "description" in your
job, because there is absolutely no mechanism (and never was one) for picking
up the channel description from the feed. So you will have to remove that in
order to get all this to work for you.
> item description element not indexed
> ------------------------------------
>
> Key: CONNECTORS-235
> URL: https://issues.apache.org/jira/browse/CONNECTORS-235
> Project: ManifoldCF
> Issue Type: Improvement
> Components: RSS connector
> Affects Versions: ManifoldCF 0.2
> Reporter: Kate McGonigal
> Assignee: Karl Wright
> Fix For: ManifoldCF 0.3
>
>
> The RSS feed's *item* description is not written to any field in the Solr
> index.
> I have a typical RSS feed with the general structure:
> <rss>
> <channel>
> <title></title>
> <link></link>
> <description></description>
> <item>
> <title></title>
> <link></link>
> <pubDate></pubDate>
> <description> *** the description I do want *** </description>
> <author></author>
> <category></category>
> </item>
> </channel>
> </rss>
> Example:
> For the RSS feed:
> http://www.onemansjazz.ca/component/option,com_rss/feed,RSS2.0/no_html,1/
> the rss/channel/item/description field is not indexed into Solr.
> Example notes:
> - what does get written to the Solr "description" field is the description
> metadata from the website, i.e. "Jazz radio show from Winnipeg on CKUW 95.9
> FM, hosted by Maurice Hogue." in this case.
> - on the "Dechromed Content" tab of the job, "No dechromed content" is
> selected. I'm not sure if that is relevant.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira