Hi all,
here at ADS we have a problem with some metadata that contain CDATA elements.
The problem is caused by the export procedure of Invenio that doesn't
properly encode these elements.

What happens is that all the elements like
'<![CDATA[ foobar ]]>'
are converted to
'&lt;![CDATA[ foobar ]]>'
and this in XML is an error.

After reading a very similar discussion from 2010 (started by Benoit),
I suppose that the problem is still in
invenio.textutils.encode_for_xml()
which is used in
bibformat_utils.record_get_xml().

I honestly don't understand why all the tags inside a subflield are
not escaped (but I suppose there is a good reason) but in case of
CDATA the tag should be completely escaped.

Thanks for your help,

Giovanni




--------------------------------------------------------------
Giovanni Di Milia
IT Specialist at SAO/NASA ADS
Harvard-Smithsonian Center for Astrophysics
60 Garden Street, MS 83
Cambridge, MA 02138 USA
email: [email protected]
--------------------------------------------------------------

Reply via email to