On Fri, Dec 6, 2013 at 3:40 AM, Naz Gassiep <[email protected]> wrote:
>
> I favour using a single method for all things, and so I am attracted to the
> idea of using combining characters for everything. However, language parsing
> tools for languages where those combined characters are used may be fooled
> when presented with U+0061 combined with U+0304 instead of the usual U+0101.

In Unicode, characters with precomposed diacritics are defined as
"canonically equivalent" to the corresponding sequences of a base character
followed by separate combining diacritics. So Unicode-compliant parsing tools
should not distinguish between the two.

--
Shriramana Sharma ஶ்ரீரமணஶர்மா श्रीरमणशर्मा
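
To illustrate the canonical equivalence mentioned above, here is a minimal sketch in Python using the standard `unicodedata` module. The helper name `canonically_equal` is my own, not part of any library; it only shows one way a tool might compare U+0061 U+0304 with U+0101 after normalizing both sides, and is not meant as how any particular parser does it.

```python
import unicodedata

# The two spellings of "a with macron" from the quoted message.
decomposed = "a\u0304"   # U+0061 LATIN SMALL LETTER A + U+0304 COMBINING MACRON
precomposed = "\u0101"   # U+0101 LATIN SMALL LETTER A WITH MACRON


def canonically_equal(s: str, t: str) -> bool:
    """Hypothetical helper: compare two strings after NFC normalization."""
    return unicodedata.normalize("NFC", s) == unicodedata.normalize("NFC", t)


# A naive code-point comparison treats the two spellings as different ...
print(decomposed == precomposed)                      # False

# ... but they compare equal once both are normalized.
print(canonically_equal(decomposed, precomposed))     # True

# NFD goes the other way: the precomposed form is split into base + mark.
print(unicodedata.normalize("NFD", precomposed) == decomposed)  # True
```

A tool that normalizes its input (to NFC or NFD, as long as it is consistent) before comparing or tokenizing would not be fooled by the difference; one that compares raw code points would.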

