https://bz.apache.org/bugzilla/show_bug.cgi?id=47950

--- Comment #3 from Trejkaz (pen name) <[email protected]> ---
toUpperCase() is the devil... along with every other method in the JRE which
has an overload where you can omit the locale, the file encoding, the time
zone, etc. :(
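
A minimal sketch of why the no-argument overload bites, assuming a Turkish locale stands in for whatever default the JVM happens to run with. The class and method names are made up for illustration:

```java
import java.util.Locale;

final class LocaleUpperDemo {
    // Uppercases with an explicit, fixed locale so the result does not
    // depend on the JVM's default locale.
    static String upperRoot(String s) {
        return s.toUpperCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        // Stand-in for a machine whose default locale is Turkish.
        Locale turkish = Locale.forLanguageTag("tr");

        // Under Turkish rules 'i' maps to U+0130 (capital I with dot above),
        // so this string differs from the plain ASCII "TITLE".
        String turkishUpper = "title".toUpperCase(turkish);
        System.out.println(turkishUpper.equals("TITLE"));       // false

        // With an explicit locale the result is the same on every machine.
        System.out.println(upperRoot("title").equals("TITLE")); // true
    }
}
```

Calling "title".toUpperCase() with no argument behaves like the first case whenever the default locale is Turkish, which is why the overload without a locale is risky for anything that is compared or stored.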

I also wish it were possible in Java to do Unicode operations against a
specific Unicode version. This is something I ran into very early on, when I
wanted to make a Lucene tokeniser for grapheme clusters and realised that Java
was going to screw me on backwards compatibility, so I had to archive my own
copy of the tables for all that data.

I guess the same tactic could work here: save a copy of the uppercasing table.
Microsoft even said there are special cases, so a saved table is probably the
easiest way to know the result is correct. Depending on how you store it, the
storage cost might not even be terribly bad. ICU uses some kind of trie
encoding to store this sort of thing.
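
A rough sketch of the "save a copy of the table" idea, under stated assumptions: the class name, the map-based storage and the handful of entries are all hypothetical placeholders. A real table would be loaded from an archived data file, would have to include the special cases, and could use a more compact encoding than a HashMap:

```java
import java.util.HashMap;
import java.util.Map;

final class FrozenUpperCase {
    // Pinned table: code point -> uppercase code point. Because the table
    // ships with the application, a JRE upgrade cannot change the results.
    private static final Map<Integer, Integer> TABLE = new HashMap<>();

    static {
        // Placeholder entries only; the real data would come from the
        // archived copy of the table.
        for (int cp = 'a'; cp <= 'z'; cp++) {
            TABLE.put(cp, cp - ('a' - 'A'));
        }
        TABLE.put(0x00E9, 0x00C9); // e with acute -> E with acute
    }

    // Maps each code point through the pinned table; code points that are
    // not in the table pass through unchanged.
    static String toUpper(String s) {
        StringBuilder out = new StringBuilder(s.length());
        s.codePoints().forEach(cp -> out.appendCodePoint(TABLE.getOrDefault(cp, cp)));
        return out.toString();
    }

    public static void main(String[] args) {
        System.out.println(toUpper("title")); // TITLE, regardless of default locale
    }
}
```

This only handles one-to-one mappings. Mappings that expand to more than one character would need a table from code point to String instead.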
