A few minutes ago I wrote that: >I think that as a general rule, the "combining" accents should be disregared >during collation. > > etc.
I just read that "collation" page from Unicode.org and it seems to be completely at odds with what I suggested, e.g. in its insistence that some sequences of code points are "canonically equivalent." In light of this fact, I do not see how Unicode can ever really be considered "collated." And it follows that it cannot be reversed. At least, this is the case if one follows the advice at Unicode.org. The "collation" that Unicode.org seems to suggest is basically the invention of some academics. It does not seem to correspond to any human alphabet. Please, please correct me if I am wrong on this. I have never been one of those to just ignore Unicode. But I am starting to see that it does not really work so well in the real world once one leaves the realm of "ASCII-with-zeroes-on-top." ________________________________________ From: sqlite-users-boun...@sqlite.org [sqlite-users-boun...@sqlite.org] On Behalf Of Igor Tandetnik [itandet...@mvps.org] Sent: Tuesday, November 17, 2009 1:01 PM To: sqlite-users@sqlite.org Subject: Re: [sqlite] Unicode support Simon Slavin <slav...@bigfraud.org> wrote: > On 17 Nov 2009, at 6:37pm, Igor Tandetnik wrote: > >> Simon Slavin <slav...@bigfraud.org> wrote: >>> First split the string into characters, then reassemble them in >>> reverse order. >> >> The problem is, in Unicode it's not quite clear what constitutes a >> "character". Are we talking about codepoints, sort elements, >> graphemes? Depending on the application, either definition might >> make sense. > > I agree about the problem, but sort elements is the obvious answer in > this case. This would mean that the result of the hypothetical flip() function would be locale-dependent. E.g. in Spanish Traditional sort, a combination 'ch' sorts as if it were a single letter between 'c' and 'd', forming a single sort element (a so-called contraction). So should 'a ch b' reverse to 'b ch a' under Spanish Traditional sort, and to 'b hc a' otherwise? Would you pass a desired locale as a parameter to flip(), in order to achieve that? Igor Tandetnik _______________________________________________ sqlite-users mailing list sqlite-users@sqlite.org http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users The information contained in this e-mail is privileged and confidential information intended only for the use of the individual or entity named. If you are not the intended recipient, or the employee or agent responsible for delivering this message to the intended recipient, you are hereby notified that any disclosure, dissemination, distribution, or copying of this communication is strictly prohibited. If you have received this e-mail in error, please immediately notify the sender and delete any copies from your system. _______________________________________________ sqlite-users mailing list sqlite-users@sqlite.org http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users The information contained in this e-mail is privileged and confidential information intended only for the use of the individual or entity named. If you are not the intended recipient, or the employee or agent responsible for delivering this message to the intended recipient, you are hereby notified that any disclosure, dissemination, distribution, or copying of this communication is strictly prohibited. If you have received this e-mail in error, please immediately notify the sender and delete any copies from your system. _______________________________________________ sqlite-users mailing list sqlite-users@sqlite.org http://sqlite.org:8080/cgi-bin/mailman/listinfo/sqlite-users