I think this is caused by ICU. Please check which ICU version you are using.
Thanks and regards,
Sachin Mittal

-----Original Message-----
From: Vadapalli, Deepthi [mailto:deepthi.vadapa...@ca.com]
Sent: Friday, January 03, 2014 6:41 PM
To: c-users@xerces.apache.org
Subject: XercesDOMParser's parse function throws UTFDataFormatException with czech republic characters

Hi,

When the input contains Czech characters, XercesDOMParser's parse function throws a UTFDataFormatException. My test program code snippet is:

    ...
    XercesDOMParser* parser = new XercesDOMParser();
    xercesc::MemBufInputSource* inputSource = NULL;
    std::string mystr = "<test>hello íčty</test>";
    try
    {
        inputSource = new xercesc::MemBufInputSource(
            (const XMLByte*) mystr.c_str(),
            (unsigned int) mystr.length(),
            "inputsource",
            false);
        parser->parse(*inputSource);
    }
    catch (...)
    {
        cout << "Unexpected Exception \n";
        return -1;
    }
    ...

I am using Xerces 3.1. Can you please suggest what could be wrong with this program?

Thank you,
Deepthi