[email protected] writes:

> I am not at all secure about how the standard GNU utilities will handle
> non-ascii characters. For example, 'wc -c', just counts bytes. True, the man
> page talks about bytes, not characters, but I am still left uncomfortable.

GNU wc(1) has -m for that:

% echo $LANG
en_US.UTF-8
% echo 'äöü' | wc -c
7
% echo 'äöü' | wc -m
4

-- 
Christian Neukirchen  <[email protected]>  http://chneukirchen.org


_______________________________________________
Nmh-workers mailing list
[email protected]
https://lists.nongnu.org/mailman/listinfo/nmh-workers

Reply via email to