[ 
https://issues.apache.org/jira/browse/XERCESC-2258?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17901441#comment-17901441
 ] 

Boris Kolpackov commented on XERCESC-2258:
------------------------------------------

Uh, I just noticed you are talking about emojis in the _file name_, not the file 
content. I suspect that either the file-opening API Xerces-C++ uses does not 
handle Unicode, or Xerces-C++ doesn't pass the file name in the correct encoding.

You can probably work around this bug by opening the file yourself and 
passing Xerces-C++ a stream object to read from.
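
A minimal sketch of that workaround, assuming C++17 and that reading the whole 
document into memory is acceptable. The helper names (readFileBytes, 
parseWithXerces), the HAVE_XERCESC macro and the "in-memory-document" buffer id 
are made up for illustration. The Xerces-C++ part is guarded by that macro so 
the file-reading helper also builds on its own. I have not verified this 
against the reporter's setup.

```cpp
#include <filesystem>
#include <fstream>
#include <iterator>
#include <stdexcept>
#include <vector>

// Read the whole file with the standard library, so the file name never
// passes through Xerces-C++'s own file-opening code.
std::vector<unsigned char> readFileBytes(const std::filesystem::path& path)
{
    std::ifstream in(path, std::ios::binary);
    if (!in)
        throw std::runtime_error("cannot open file");
    return std::vector<unsigned char>(std::istreambuf_iterator<char>(in),
                                      std::istreambuf_iterator<char>());
}

#ifdef HAVE_XERCESC  // hypothetical macro: define it when building against Xerces-C++
#include <xercesc/framework/MemBufInputSource.hpp>
#include <xercesc/parsers/XercesDOMParser.hpp>

// Assumes XMLPlatformUtils::Initialize() has already been called by the caller.
void parseWithXerces(const std::filesystem::path& path)
{
    using namespace xercesc;

    // The buffer must outlive the input source, which does not copy it.
    const std::vector<unsigned char> bytes = readFileBytes(path);

    // The third argument is only a fake system id used in error messages.
    MemBufInputSource source(bytes.data(), bytes.size(), "in-memory-document");

    XercesDOMParser parser;
    parser.parse(source);  // parse(const InputSource&) overload
}
#endif
```

On macOS a UTF-8 encoded narrow string can be passed directly as the path. 
Relative references inside the document (for example an external DTD) will be 
resolved against the fake buffer id rather than the real file location, so this 
may need adjusting for documents that rely on them.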

> XercesDOMParser fails to parse file with emojis and other symbols on mac
> ------------------------------------------------------------------------
>
>                 Key: XERCESC-2258
>                 URL: https://issues.apache.org/jira/browse/XERCESC-2258
>             Project: Xerces-C++
>          Issue Type: Bug
>         Environment: macOS 14.4
>            Reporter: Uma Kant
>            Priority: Critical
>
> h2. File with valid Unicode characters in filename (✅💖😆🤒🎇🌬👥🧒🏼🇺🇸) fails to load
> On Windows it works fine and the file opens successfully, but on macOS it 
> fails with an error.
> Xerces version 3.2.5
> macOS version 14.4


