[
https://issues.apache.org/jira/browse/CONNECTORS-171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Fuad Efendi updated CONNECTORS-171:
-----------------------------------
Status: Patch Available (was: Open)
See also org.apache.solr.common.util.XML class from SOLR which uses arrays (so
it's faster) and which doesn't escape chars below ' '
> RSS Connector dechromed mode can corrupt data prior to indexing
> ---------------------------------------------------------------
>
> Key: CONNECTORS-171
> URL: https://issues.apache.org/jira/browse/CONNECTORS-171
> Project: ManifoldCF
> Issue Type: Bug
> Components: Framework core, RSS connector
> Affects Versions: ManifoldCF 0.2
> Reporter: Karl Wright
>
> When the contents of the <description> or <content> fields of the feed
> contains entity references, the content may be turned into invalid XML before
> being handed to the index. The cause of this is decoding of the entity
> references by the XML parser that parses the feed, while the output of text
> in-between tags is not re-encoded.
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira