Benno Schulenberg <[EMAIL PROTECTED]> writes:

> Follow-up Comment #1, bug #19977 (project grep):
>
> That is a good request.  Grep not matching accented characters is not only a
> problem in Arabic with Tashkeel signs, but also in many European languages
> with diacritical signs.  One would like to have an option to ignore any of
> those signs so that the following command outputs five lines instead of just
> one:
>
> $ echo -e " a\n á\n à\n â\n ä" | grep --ignore-diacritics a

Actually you should be able to use equivalence classes:

$ printf "%s\n" a á à â ä | grep '[[=a=]]'

Alas, that does not seem to work yet.

Andreas.

-- 
Andreas Schwab, SuSE Labs, [EMAIL PROTECTED]
SuSE Linux Products GmbH, Maxfeldstraße 5, 90409 Nürnberg, Germany
PGP key fingerprint = 58CA 54C7 6D53 942B 1756  01D3 44D5 214B 8276 4ED5
"And now for something completely different."


Reply via email to