> For ext2 and almos all "traditional" unix fs they don't care about
> encodings, it is just bytes. so if you type in utf-8 they are stored
> in utf-8.
>
This behaviour is mandated by POSIX. Any nonzero sequence of bytes
except '/' and '\0' are legal in filenames, in any order. The only
ones which have special meaning are "." and "..".
I am not so sure of that. POSIX-2001 says:
Filename
A name consisting of 1 to {NAME_MAX} bytes used to name a file.
The characters composing the name may be selected from the set
of all character values excluding the slash character and the null byte.
The filenames dot and dot-dot have special meaning.
Byte
An individually addressable unit of data storage that is exactly an octet,
used to store a character or a portion of a character
Character
A sequence of one or more bytes representing a single graphic symbol
or control code.
Andries
--
Linux-UTF8: i18n of Linux on all levels
Archive: http://mail.nl.linux.org/linux-utf8/