Danilo Segan wrote:
You have problems to do google searchs in Serbian because a text
can be in two different scripts; 
    

I'm actually more concerned with the display and input problem,
rather than doing Google searches (I mentioned Google only to show
that people care about language more, yet Google is not able
to deduce such information correctly with the current state of
encodings).  I want to type "letters", and display it using any of
the scripts simply by changing a font.  I'm native Serbian, and most
native Serbian speakers tend to think of it as a display property (you
certainly know that, since I know you're well clued about Serbian
problems :).
  
The only possible reform that would alleviate this condition would
be to completely unify the latin and cyrillic scripts or to
eliminate one of them. Then cyrillic or latin could simply be a font
setting for users of all languages.

(I dont know if this is even possible, you said yourself they are
only "almost" bijective)

If you cannot eliminate the possibility of serbian being written in
a cyrillic or latin script then the original problem will only be
compounded.

Well, it's not up to encoding to enforce correct usage of it.  After
all, one can type text using "small caps" region of Unicode standard,
but how often does that happen?
Small caps and Zenkaku latin do not look the same as regular script.
It would be difficult to do so accidentally, afaik.





--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Reply via email to