On 1/13/11 10:26 PM, Nick Sabalausky wrote: [snip]
[ 'f', {u with the umlaut}, 'n', 'f' ]Or: [ 'f', 'u', {umlaut combining character}, 'n', 'f' ] Those *both* get rendered exactly the same, and both represent the same four-letter sequence. In the second example, the 'u' and the {umlaut combining character} combine to form one grapheme. The f's and n's just happen to be single-code-point graphemes. Note that while some characters exist in pre-combined form (such as the {u with the umlaut} above), legend has it there are others than can only be represented using a combining character. It's also my understanding, though I'm not certain, that sometimes multiple combining characters can be used together on the same "root" character.
Thanks. One further question is: in the above example with u-with-umlaut, there is one code point that corresponds to the entire combination. Are there combinations that do not have a unique code point?
Andrei
