https://bz.apache.org/bugzilla/show_bug.cgi?id=60260

--- Comment #4 from Javen O'Neal <[email protected]> ---
The formula parse tree assumes the formula contains only ASCII. It reads one
**char** from the formula string at a time via GetChar() [1]. For multi-byte
symbols in the formula, a char is returned for each byte in the symbol [2].
Perhaps we should be using String.codePointAt(int index) instead [3].

> private char look;
> ...
> private void GetChar() {
> ...
>     look=_formulaString.charAt(_pointer);

[1]
https://svn.apache.org/viewvc/poi/trunk/src/java/org/apache/poi/ss/formula/FormulaParser.java?revision=1776796&view=markup#l1134

[2] http://docs.oracle.com/javase/6/docs/api/java/lang/Character.html#unicode
[3]
http://docs.oracle.com/javase/6/docs/api/java/lang/String.html#codePointAt(int)

-- 
You are receiving this mail because:
You are the assignee for the bug.
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to