Hi All
I have two problems related to control characters handling in POI.
1. control characters replaced by '?' , ideally I want to show control
characters.
2. String pattern _x00HH_ gets converted to control characters (here H is
0-9 or A-F)
I investigated the problem and find out that following facts that causes
this problem
1. Excel internally uses OOXML and XML does not support control characters.
2. To overcome this XML limitation Excel escapes the control character.
3. If that escape of control character comes in a string then this also has
to be escaped so that it does not gets converted to control character.
Excel does it using following logic
# Excel escapes control characters with _xHHHH_ and also escapes any
# literal strings of that type by encoding the leading underscore. So
# "\0" -> _x0000_ and "_x0000_" -> _x005F_x0000_.
Is it a bug or is it intended? If it's a bug then, are you going to solve
it in future releases?
Regards
Rajneesh