On 09/19/2013 08:18 AM, Chris wrote:
> Short question in case anyone knows the answer straight away:
>
> How do I sort text so that non-ascii characters like "á" are treated in
> the same way as "a"?
>
> Now I'm getting this:
>
> [wow, ara, ába, marca]
> ===> sort(listAbove);
> [ara, marca, wow, ába]
>
> I'd like to get:
>
> [ába, ara, marca, wow]
>
> Thanks.

I have a project that tries to do exactly that:
https://code.google.com/p/trileri/source/browse/trunk/tr/dizgi.d#823
However, the source is written in Turkish and is in need of a rewrite. :/
For the whole thing to work, every character must belong to a known
alphabet. Here is the English alphabet:
https://code.google.com/p/trileri/source/browse/trunk/tr/alfabe.d#747
Here is how I define e.g. á to be an accented version of a:
https://code.google.com/p/trileri/source/browse/trunk/tr/harfler.d#23
However, some characters stand on their own because they are not accented
variants but letters in their own right (e.g. ç of the Turkish alphabet):
https://code.google.com/p/trileri/source/browse/trunk/tr/harfler.d#44
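
To illustrate the idea (this is not the trileri code, just a minimal sketch): an alphabet fixes the order of its letters, accented variants are mapped to their base letter, and letters like ç keep their own slot. The names alphabet, baseLetter, rank and alphabetLess are made up for this example, and the accent table is deliberately tiny.

```d
import std.algorithm : cmp, map, sort;
import std.stdio : writeln;
import std.string : indexOf;

// Hypothetical data: lowercase Turkish alphabet in dictionary order.
// ç, ğ, ı, ö, ş and ü are letters of their own, so each has its own slot.
enum dstring alphabet = "abcçdefgğhıijklmnoöprsştuüvyz"d;

// Hypothetical accent table: map an accented variant to its base letter.
dchar baseLetter(dchar c)
{
    switch (c)
    {
        case 'á', 'à', 'â': return 'a';
        case 'î':           return 'i';
        case 'û':           return 'u';
        default:            return c;
    }
}

// Position of a letter in the alphabet; -1 for anything not in it.
ptrdiff_t rank(dchar c)
{
    return alphabet.indexOf(baseLetter(c));
}

// Compare two words letter by letter using alphabet positions.
bool alphabetLess(string a, string b)
{
    return cmp(a.map!rank, b.map!rank) < 0;
}

void main()
{
    auto words = ["zor", "çay", "cam", "âlem", "dal"];
    words.sort!alphabetLess;
    // â sorts together with a, while ç sorts between c and d.
    writeln(words);
}
```

Words that differ only by an accent compare as equal here, so their relative order is unspecified; a real implementation would need a tie-breaker.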
Well... I hope to get back to it at some point, taking advantage of the
new std.uni as well.
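
For the original question, something along these lines might already be enough with std.uni. This is only a sketch under the assumption that dropping all combining marks is acceptable: decompose each string with normalize!NFD, filter out the code points whose combiningClass is non-zero, and compare what is left. foldAccents is a name made up for this example.

```d
import std.algorithm : filter, sort;
import std.array : array;
import std.conv : to;
import std.stdio : writeln;
import std.uni : NFD, combiningClass, normalize;

// Hypothetical helper: canonical decomposition, then drop combining marks,
// so "ába" becomes "aba".
string foldAccents(string s)
{
    return s.normalize!NFD
            .filter!(c => combiningClass(c) == 0)
            .array
            .to!string;
}

void main()
{
    auto words = ["wow", "ara", "ába", "marca"];
    // Sort by the accent-free form; the original strings are kept as they are.
    words.sort!((a, b) => foldAccents(a) < foldAccents(b));
    writeln(words);
    assert(words == ["ába", "ara", "marca", "wow"]);
}
```

Note that this also turns ç into c, because ç decomposes into c plus a combining cedilla. That is fine for the list above but wrong for Turkish, which is why an alphabet-aware approach is still needed there. The predicate also recomputes the folded form on every comparison; for long lists it would be better to compute it once per word.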
Ali