On Sun, 28 Sep 2014 12:38:25 -0700, Walter Bright <[email protected]> wrote:
> I suggest that in the future write code that is explicit about the intention
> - by character or by decoded character - by using adapters .byChar or .byDchar.

... or by "user perceived character" or by "word" or by "line".

I'm always on the fence with code points. Sure they are the code points, but what does it mean in practice? Is it valid to start a Unicode string with just a diacritical mark? Does it make sense to split in the middle of Korean symbols, effectively removing parts of the glyphs and rendering them invalid?

Bearophile, what does your code _do_ with the dchar ranges? How is it not rendered into a caricature of its own attempts to support non-ASCII by the above?

--
Marco
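To make the distinction concrete, here is a minimal sketch of the three ways of counting "characters". It assumes a Phobos version that provides `std.utf.byChar`, `std.utf.byDchar` and `std.uni.byGrapheme`; the helper names `codeUnits`, `codePoints` and `graphemes` are made up for illustration and are not part of any library.

```d
import std.range : dropOne, walkLength;
import std.uni : byGrapheme, isMark;
import std.utf : byChar, byDchar;

// Hypothetical helpers: count the same string at three levels.
size_t codeUnits(string s)  { return s.byChar.walkLength; }     // UTF-8 code units
size_t codePoints(string s) { return s.byDchar.walkLength; }    // decoded code points
size_t graphemes(string s)  { return s.byGrapheme.walkLength; } // user-perceived characters

void main()
{
    // "e" followed by U+0301 COMBINING ACUTE ACCENT: one perceived character.
    string accented = "e\u0301";
    assert(codeUnits(accented) == 3);
    assert(codePoints(accented) == 2);
    assert(graphemes(accented) == 1);

    // Dropping one code point leaves a string that starts with a bare
    // diacritical mark.
    assert(accented.byDchar.dropOne.front.isMark);

    // One Hangul syllable written as three conjoining jamo (L + V + T).
    string hangul = "\u1112\u1161\u11AB";
    assert(codeUnits(hangul) == 9);
    assert(codePoints(hangul) == 3);
    assert(graphemes(hangul) == 1);
}
```

Slicing or popping by code point is therefore safe against broken UTF-8 sequences, but it can still cut a user-perceived character apart; only the grapheme-level range treats each of the two sample strings as a single unit.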
