Re: UTF to ASCII?

James G. Sack (jim) Wed, 09 Jan 2008 18:44:01 -0800

Christopher Smith wrote:
> James G. Sack (jim) wrote:
>> Gregory K. Ruiz-Ade wrote:
>>   
>>> Anyone know of a library or utility that can be fed a UTF string, and
>>> stomp it down to the most-visually-equivalent ASCII string?
>>>
>>> I'm looking for something that will basically take all foreign language
>>> gylphs and replace them with the nearest-looking ASCII equivalent.
>>>
>>> Perl is my current tool of choice, but either my search-fu is weak, or
>>> there simply isn't anything available right now.
>>>     
>> I'm not sure I know how to interpret your question.
>>
>> What would you want to do with, say
>>   BENGALI LETTER NGA (ঙ)?
>>
>> Maybe you were thinking only of latin letters with accents? That might
>> make more sense, I guess. There's only maybe(?) a few hundred of those.
>>
>> But offhand, I think you are wishing for the moon. :-)
>>   
> There are lots of Unicode characters which have ASCII equivalents. The
> trade markand copyright logo are examples. This kind of thing is really
> important.


I suppose you must mean that "(C)" is an equivalent for the copyright
sign. Well ..um, ..maybe.

But I'm sorry, I would not agree to the there being lots of Unicode
characters which have ASCII equivalents. I would say that
(overwhelmingly) most unicode characters are not anything like ASCII,
and have nothing that could be called ASCII equivalent.  The unicode
folks _have_ come up with an english name for every character though. It
does get a bit tedious spelling it out all the time .. "LATIN SMALL
LETTER E WITH ACUTE"?


Regards,
..jim

-- 
[email protected]
http://www.kernel-panic.org/cgi-bin/mailman/listinfo/kplug-lpsg

Re: UTF to ASCII?

Reply via email to