Hi,

       As reported in a previous mail I found the libunicode library
that provides support for handling unicode strings + conversion from
various charsets. Written in C.
       This is a possible alternative to the iconv/wchar functions
doing the same thing in glibc-2.1.
       There is also a recently released (June) library called ICU from
IBM (licence is IBM Public Licence, rated Open Source compliant by 
www.opensource.org, and a reasonable one IMHO) 
http://www.alphaworks.ibm.com/tech/icu/. Written in C++ and provides
Java style interface functions/classes. It also implements the functionalities
of gettext with the ability to handle unicode.

       Assuming that htdig ultimately wants to handle various charsets
from HTML/XML documents I'd like your opinion to make the right choice.
The tradeoffs are:

       glibc iconv/wchar : requires work to make it a standalone library

       libunicode : not 100% stable although quite mature

       icu : reimplements gettext which is probably not desirable

       I tend to think that icu is the right solution because it seems
much more mature and complete and is written in C++. It does not make use
of templates (thanks god). 

       Hopefully I will not find a fourth alternative by tomorrow :-)

             Cheers,

-- 
                Loic Dachary

                ECILA
                100 av. du Gal Leclerc
                93500 Pantin - France
                Tel: 33 1 56 96 09 80, Fax: 33 1 56 96 09 61
                e-mail: [EMAIL PROTECTED] URL: http://www.senga.org/


------------------------------------
To unsubscribe from the htdig3-dev mailing list, send a message to
[EMAIL PROTECTED] containing the single word "unsubscribe" in
the SUBJECT of the message.

Reply via email to