Christopher Smith wrote: > James G. Sack (jim) wrote: >> Gregory K. Ruiz-Ade wrote: >> >>> Anyone know of a library or utility that can be fed a UTF string, and >>> stomp it down to the most-visually-equivalent ASCII string? >>> >>> I'm looking for something that will basically take all foreign language >>> gylphs and replace them with the nearest-looking ASCII equivalent. >>> >>> Perl is my current tool of choice, but either my search-fu is weak, or >>> there simply isn't anything available right now. >>> >> I'm not sure I know how to interpret your question. >> >> What would you want to do with, say >> BENGALI LETTER NGA (ঙ)? >> >> Maybe you were thinking only of latin letters with accents? That might >> make more sense, I guess. There's only maybe(?) a few hundred of those. >> >> But offhand, I think you are wishing for the moon. :-) >> > There are lots of Unicode characters which have ASCII equivalents. The > trade markand copyright logo are examples. This kind of thing is really > important.
I suppose you must mean that "(C)" is an equivalent for the copyright sign. Well ..um, ..maybe. But I'm sorry, I would not agree to the there being lots of Unicode characters which have ASCII equivalents. I would say that (overwhelmingly) most unicode characters are not anything like ASCII, and have nothing that could be called ASCII equivalent. The unicode folks _have_ come up with an english name for every character though. It does get a bit tedious spelling it out all the time .. "LATIN SMALL LETTER E WITH ACUTE"? Regards, ..jim -- [email protected] http://www.kernel-panic.org/cgi-bin/mailman/listinfo/kplug-lpsg
