[ 
https://issues.apache.org/jira/browse/CONNECTORS-171?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Fuad Efendi updated CONNECTORS-171:
-----------------------------------

    Status: Patch Available  (was: Open)

See also org.apache.solr.common.util.XML class from SOLR which uses arrays (so 
it's faster) and which doesn't escape chars below ' '

> RSS Connector dechromed mode can corrupt data prior to indexing
> ---------------------------------------------------------------
>
>                 Key: CONNECTORS-171
>                 URL: https://issues.apache.org/jira/browse/CONNECTORS-171
>             Project: ManifoldCF
>          Issue Type: Bug
>          Components: Framework core, RSS connector
>    Affects Versions: ManifoldCF 0.2
>            Reporter: Karl Wright
>
> When the contents of the <description> or <content> fields of the feed 
> contains entity references, the content may be turned into invalid XML before 
> being handed to the index. The cause of this is decoding of the entity 
> references by the XML parser that parses the feed, while the output of text 
> in-between tags is not re-encoded.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to