[jira] [Updated] (XERCESC-2120) DOM Serialization does not correctly validate Surrogate Pairs

2022-10-06 Thread Scott Cantor (Jira)


 [ 
https://issues.apache.org/jira/browse/XERCESC-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Scott Cantor updated XERCESC-2120:
--
Affects Version/s: 3.2.3
   3.2.2
   3.2.1
   3.2.4

> DOM Serialization does not correctly validate Surrogate Pairs
> -
>
> Key: XERCESC-2120
> URL: https://issues.apache.org/jira/browse/XERCESC-2120
> Project: Xerces-C++
>  Issue Type: Bug
>  Components: DOM
>Affects Versions: 3.2.0, 3.2.1, 3.2.2, 3.2.3, 3.2.4
>Reporter: Andrew Blackton
>Priority: Major
> Attachments: DOMCharacterValidationTest.cpp, DomStringValidation.patch
>
>
> When attempting to write an xml document containing valid UTF-16 surrogate 
> pairs an error occurs during validation. This causes the write to fail.
> It appears as though this issue was introduced with 
> https://issues.apache.org/jira/browse/XERCESC-1854 in the following commit 
> http://svn.apache.org/viewvc/xerces/c/trunk/src/xercesc/dom/impl/DOMLSSerializerImpl.cpp?r1=768978=1226891.
> I have supplied a reproducible and a potential patch. The string validator 
> should be responsible for determining if the codepoint is part of a surrogate 
> pair. However, I may also like to make the argument that this may not be the 
> right location to be doing the string validation. As it will leave the output 
> document in an inconsistent (half-written) state.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

-
To unsubscribe, e-mail: c-dev-unsubscr...@xerces.apache.org
For additional commands, e-mail: c-dev-h...@xerces.apache.org



[jira] [Updated] (XERCESC-2120) DOM Serialization does not correctly validate Surrogate Pairs

2017-09-15 Thread Andrew Blackton (JIRA)

 [ 
https://issues.apache.org/jira/browse/XERCESC-2120?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Blackton updated XERCESC-2120:
-
Description: 
When attempting to write an xml document containing valid UTF-16 surrogate 
pairs an error occurs during validation. This causes the write to fail.

It appears as though this issue was introduced with 
https://issues.apache.org/jira/browse/XERCESC-1854 in the following commit 
http://svn.apache.org/viewvc/xerces/c/trunk/src/xercesc/dom/impl/DOMLSSerializerImpl.cpp?r1=768978=1226891.

I have supplied a reproducible and a potential patch. The string validator 
should be responsible for determining if the codepoint is part of a surrogate 
pair. However, I may also like to make the argument that this may not be the 
right location to be doing the string validation. As it will leave the output 
document in an inconsistent (half-written) state.

  was:
When attempting to write an xml document containing valid UTF-16 surrogate 
pairs an error occurs during validation. This causes the write to fail.

It appears as though this issue was introduced with 
https://issues.apache.org/jira/browse/XERCESC-1854 in the following commit 
http://svn.apache.org/viewvc/xerces/c/trunk/src/xercesc/dom/impl/DOMLSSerializerImpl.cpp?r1=768978=1226891.

I have supplied a reproducible and a potential patch. The string validator 
should be responsible for determining if the codepoint is part of a surrogate 
pair.


> DOM Serialization does not correctly validate Surrogate Pairs
> -
>
> Key: XERCESC-2120
> URL: https://issues.apache.org/jira/browse/XERCESC-2120
> Project: Xerces-C++
>  Issue Type: Bug
>  Components: DOM
>Affects Versions: 3.2.0
>Reporter: Andrew Blackton
> Attachments: DOMCharacterValidationTest.cpp, DomStringValidation.patch
>
>
> When attempting to write an xml document containing valid UTF-16 surrogate 
> pairs an error occurs during validation. This causes the write to fail.
> It appears as though this issue was introduced with 
> https://issues.apache.org/jira/browse/XERCESC-1854 in the following commit 
> http://svn.apache.org/viewvc/xerces/c/trunk/src/xercesc/dom/impl/DOMLSSerializerImpl.cpp?r1=768978=1226891.
> I have supplied a reproducible and a potential patch. The string validator 
> should be responsible for determining if the codepoint is part of a surrogate 
> pair. However, I may also like to make the argument that this may not be the 
> right location to be doing the string validation. As it will leave the output 
> document in an inconsistent (half-written) state.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: c-dev-unsubscr...@xerces.apache.org
For additional commands, e-mail: c-dev-h...@xerces.apache.org