Every so often someone asks on this list for Unicode to be handled properly.  I 
have done it myself.  Then other people have to explain how hard this is.  So 
here’s an article which, after introductory material, discusses three hard 
questions:


Are two strings the same?
How long is a string?
How do you sort things in alphabetical order?

Answering the first and third questions is a requirement for implementing 
COLLATE in SQLite.  And the fact that even the second question is difficult 
shows that one shouldn’t assume Unicode is simple.
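
To make the three questions concrete, here is a small sketch in Python (my 
choice of language for illustration; it is not taken from the article or from 
SQLite).  It assumes only the standard unicodedata module, and the sample 
strings and variable names are made up.  It shows two spellings of the same 
visible character comparing as unequal, several defensible answers for 
"length", and a naive sort that puts an accented word after "zebra".

```python
import unicodedata

# Two ways to write the same visible character "é".
composed = "\u00e9"       # one code point: LATIN SMALL LETTER E WITH ACUTE
decomposed = "e\u0301"    # two code points: "e" + COMBINING ACUTE ACCENT

# 1. Are two strings the same?  A raw comparison says no;
#    comparing after NFC normalization says yes.
same_raw = composed == decomposed
same_nfc = (unicodedata.normalize("NFC", composed)
            == unicodedata.normalize("NFC", decomposed))

# 2. How long is a string?  It depends on what you count.
len_code_points = len(decomposed)             # code points
len_utf8 = len(decomposed.encode("utf-8"))    # bytes in UTF-8
len_after_nfc = len(unicodedata.normalize("NFC", decomposed))

# 3. Alphabetical order?  Python's default sort compares code points,
#    so the accented word lands after "zebra".
words = ["zebra", "\u00e9clair", "apple"]
by_code_point = sorted(words)

print(same_raw, same_nfc)                        # False True
print(len_code_points, len_utf8, len_after_nfc)  # 2 3 1
print(by_code_point)                             # ['apple', 'zebra', 'éclair']
```

Normalization is only one ingredient here: it does not by itself give 
language-aware ordering, which is why the third question needs a proper 
collation algorithm rather than a code point comparison.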

