---
"Any sufficiently advanced technology is indistinguishable from magic."

                                              -- Clarke's Third Law

> The only non-printable characters in Unicode are control characters
> whose general category code in the Unicode database starts with C, i.e.
> those characters that get printed with
> 
> $ egrep '^[^;]*;[^;]*;C' UnicodeData-Latest.txt

What about characters like U+E000, First Private Use character? That gets
listed with your 'egrep'.

--roozbeh


-
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/lists/

Reply via email to