On 21/02/12 16:15, Jens Grivolla wrote:
> On 02/21/2012 04:08 PM, Thilo Goetz wrote:
>> On 21/02/12 15:59, Jens Grivolla wrote:
>>> it appears that InlineXMLCasConsumer depends on the system locale for
>>> some internal transformations. The output appears to be written in UTF8
>>> (outStream.write(xmlAnnotations.getBytes("UTF-8"));) but when used on a
>>> machine with a locale of ASCII all accented characters get broken.
>>>
>>> I suspect that it has to do with the XMLSerializer working on a
>>> ByteArrayOutputStream, but haven't been able to track it down yet.
>>
>> Have you checked that it's really the writing end where things
>> get corrupted, and not the reading end? Just a thought...
>
> Yes, we have an XmiWriterCasConsumer in parallel that works fine.
>
> Jens
>
Ah yes, eyeballing the source gives:
// return XML string
return new String(byteArrayOutputStream.toByteArray());
This is in CasToInlineXml.java. I stopped after I found this,
maybe there's more. Jira, patch, you know the drill :-)
--Thilo