[jira] [Commented] (XERCESC-2071) Unable to transcode an UTF-8 multibyte string

2017-12-11 Thread Scott Cantor (JIRA)

[ 
https://issues.apache.org/jira/browse/XERCESC-2071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16286007#comment-16286007
 ] 

Scott Cantor commented on XERCESC-2071:
---

Of course, what you're doing doesn't make sense. That call transcodes into the 
local code page, and those characters can't be represented there.
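
To illustrate the point about the local code page, here is a minimal, library-independent sketch. It assumes a single-byte Latin-1 target as a stand-in for the local code page; `narrowToLatin1` is a hypothetical helper, not a Xerces-C++ function, and whether a real transcoder throws or substitutes a replacement character is not something this sketch claims to reproduce. It only shows why a code unit above 0x00FF has no representation in such an encoding.

```cpp
#include <stdexcept>
#include <string>

// Hypothetical helper (not part of Xerces-C++): narrows UTF-16 code units
// to a single-byte Latin-1 string, used here as a stand-in for a local code page.
std::string narrowToLatin1(const std::u16string& in) {
    std::string out;
    out.reserve(in.size());
    for (char16_t unit : in) {
        if (unit > 0x00FF) {
            // No single byte can hold this value, so the conversion has to
            // fail (or substitute a replacement character).
            throw std::range_error("code unit not representable in Latin-1");
        }
        out.push_back(static_cast<char>(unit));
    }
    return out;
}
```

Under that assumption, `narrowToLatin1(u"Happy")` succeeds, while any input containing 0x03A6 or the symbols from the issue description raises `std::range_error`.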

> Unable to transcode an UTF-8 multibyte string
> -
>
> Key: XERCESC-2071
> URL: https://issues.apache.org/jira/browse/XERCESC-2071
> Project: Xerces-C++
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 3.1.3
> Environment: Linux 32 bit, Armv6. Release version of the library
>Reporter: SandBeavers
>
> Xerces-C++ is unable to transcode a multibyte UTF-8 string. For example, 
> ❀★☀Happy❀★☀ throws an exception.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: c-dev-unsubscr...@xerces.apache.org
For additional commands, e-mail: c-dev-h...@xerces.apache.org



[jira] [Commented] (XERCESC-2071) Unable to transcode an UTF-8 multibyte string

2017-12-11 Thread SandBeavers (JIRA)

[ 
https://issues.apache.org/jira/browse/XERCESC-2071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16286003#comment-16286003
 ] 

SandBeavers commented on XERCESC-2071:
--

Every Unicode character above 0x00FF will throw an exception.
https://unicode-table.com/en/ 

{code:java}
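// U+03A6 (GREEK CAPITAL LETTER PHI); transcode() expects a null-terminated XMLCh string
XMLCh Text[] = { 0x03A6, 0 };
char* cs = xercesc::XMLString::transcode(Text);
xercesc::XMLString::release(&cs);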
{code}
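
For comparison, a self-contained sketch of what encoding the same code point as UTF-8 involves, independent of Xerces-C++. `encodeUtf8` is a hypothetical helper, and it assumes the input is a single code point in the Basic Multilingual Plane (no surrogate pairs), which covers 0x03A6 and the symbols in the issue description.

```cpp
#include <string>

// Hypothetical helper: encodes one BMP code point (<= 0xFFFF) as UTF-8.
// Surrogate pairs are not handled in this sketch.
std::string encodeUtf8(char16_t cp) {
    std::string out;
    if (cp < 0x80) {
        // 1 byte: 0xxxxxxx
        out.push_back(static_cast<char>(cp));
    } else if (cp < 0x800) {
        // 2 bytes: 110xxxxx 10xxxxxx
        out.push_back(static_cast<char>(0xC0 | (cp >> 6)));
        out.push_back(static_cast<char>(0x80 | (cp & 0x3F)));
    } else {
        // 3 bytes: 1110xxxx 10xxxxxx 10xxxxxx
        out.push_back(static_cast<char>(0xE0 | (cp >> 12)));
        out.push_back(static_cast<char>(0x80 | ((cp >> 6) & 0x3F)));
        out.push_back(static_cast<char>(0x80 | (cp & 0x3F)));
    }
    return out;
}
```

With this helper, `encodeUtf8(0x03A6)` yields the two bytes 0xCE 0xA6, so the character needs a multibyte target encoding rather than a single-byte code page.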










[jira] [Commented] (XERCESC-2071) Unable to transcode an UTF-8 multibyte string

2017-07-12 Thread Scott Cantor (JIRA)

[ 
https://issues.apache.org/jira/browse/XERCESC-2071?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16084007#comment-16084007
 ] 

Scott Cantor commented on XERCESC-2071:
---

Somebody would have to explain what the actual failure is here, and possibly 
attach a real sample that doesn't risk cut-and-paste corruption.

Literally pasting that text into an XML file doesn't cause any issue. DOMPrint 
handles it, and a manual XMLString::transcode of the content doesn't fail, 
though I have no idea what the actual non-UTF-8 output would even be meant to be.

I doubt I can fix it anyway, and wouldn't risk it, but if somebody understands 
this stuff and can provide a patch, great.



