Il 03/03/2014 07:13, Paul Eggert ha scritto:
Norihiro Tanaka wrote:
However I don't understand why the optimization isn't completed on
non-UTF8 locale only. Can you explain it?
Sorry, no; there's a lot about that code I don't yet understand.
IIRC it's because a CSET matches any byte, while the corresponding
MBCSET only matches that byte if it is a single-byte character. So for
example, say "\x83A" is a two-byte character. The CSET "A" will match
it but the corresponding MBCSET will not.
This can happen in the Shift-JIS encoding.
Paolo