Re: How to remove accents while conforming to language standards?

Jukka K. Korpela Mon, 04 Nov 2013 12:02:44 -0800

2013-11-04 21:00, Jennifer Wong wrote:

The use case is that customers want to integrate data from our
enterprise solution to their ASCII-based downstream systems.

This is very different from the question about removing accents whileconforming to language standards. The very goal makes it impossible toconform to language standards. The next question should be what the datawill be used for, and how.

Thus all accents need to be removed.

I would not jump into that conclusion. Just because some system isASCII-based does not mean that you cannot in any way handle non-ASCIIdata. You can encode non-ASCII characters in ASCII in many ways. To takea trivial example, you could convert È to E` and later possibly convertit back, though in such approaches you need to be careful to make theconversion reversible (if it needs to be). In some cases, out-of-bandinformation could be included, e.g. entering a name in a simplified formin ASCII but accompanied with a note (in ASCII) describing accents thathave been omitted.

Even if it is acceptable to do lossy mappings (like just dropping allaccents, or mapping, say, Ä to AE without worrying about possible AE inoriginal data), the crucial question is how the data will be used, nowand in the future.


Yucca

Re: How to remove accents while conforming to language standards?

Reply via email to