marco <[EMAIL PROTECTED]>:

> I need to make a scan of all the files on a Linux system (independently
> of the type of filesystem and the options given at mount time) and record
> all the filenames. I'm using the readdir() syscall that returns a pointer
> to a struct dirent. My question is: what should I assume about the
> format/encoding of the d_name[] field?

Assume it's a null-terminated octet string. It shouldn't be empty, and
it shouldn't contain (ASCII) '/'. You can't assume the string is valid
character data in any particular encoding. However, if it is valid as
UTF-8, then it probably really is UTF-8, but it might not be
printable, so you'll still have to process it before displaying it.

Edmund
--
Linux-UTF8:   i18n of Linux on all levels
Archive:      http://mail.nl.linux.org/linux-utf8/

Reply via email to