[ 
https://issues.apache.org/jira/browse/XERCESC-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Scott Cantor updated XERCESC-2120:
----------------------------------
    Affects Version/s: 3.2.3
                       3.2.2
                       3.2.1
                       3.2.4

> DOM Serialization does not correctly validate Surrogate Pairs
> -------------------------------------------------------------
>
>                 Key: XERCESC-2120
>                 URL: https://issues.apache.org/jira/browse/XERCESC-2120
>             Project: Xerces-C++
>          Issue Type: Bug
>          Components: DOM
>    Affects Versions: 3.2.0, 3.2.1, 3.2.2, 3.2.3, 3.2.4
>            Reporter: Andrew Blackton
>            Priority: Major
>         Attachments: DOMCharacterValidationTest.cpp, DomStringValidation.patch
>
>
> When attempting to write an xml document containing valid UTF-16 surrogate 
> pairs an error occurs during validation. This causes the write to fail.
> It appears as though this issue was introduced with 
> https://issues.apache.org/jira/browse/XERCESC-1854 in the following commit 
> http://svn.apache.org/viewvc/xerces/c/trunk/src/xercesc/dom/impl/DOMLSSerializerImpl.cpp?r1=768978&r2=1226891.
> I have supplied a reproducible and a potential patch. The string validator 
> should be responsible for determining if the codepoint is part of a surrogate 
> pair. However, I may also like to make the argument that this may not be the 
> right location to be doing the string validation. As it will leave the output 
> document in an inconsistent (half-written) state.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: c-dev-unsubscr...@xerces.apache.org
For additional commands, e-mail: c-dev-h...@xerces.apache.org

Reply via email to