On Thu Oct 13 07:38:54 EDT 2011, [email protected] wrote:
> On Thursday 13 of October 2011 13:15:57 slash wrote:
> > I have some files on an external ext2 drive that have whitespace and
> > umlauts (ä, ö) in them. trfs took care of the whitespace. But ext2srv
> > presents umlauts as a question mark symbol (�) and won't let me access
> > the file (error: file does not exist).
> 
> i believe -- but i am not sure! -- that linux stores and reads names on 
> ext2/3/4 without any conversion between filesystem and I/O syscalls like 
> open(). if you have iso8859-1 or similar single-byte locale on linux, your 
> ext2 contains iso8859-1 encoded filenames.

correct.

if you know what the charset on disk is, you could probablly hack ext2fs
into translating names.  or (less hacky) you could write a transliterating fs,
or add this to trfs' duties.

i don't know if this i helpful, but if you use p9p tools you will always get 
utf8,
without any oddness.  it used to be easier because the system tools weren't
trying so hard to break utf-8.  it used to just all work.  ymmv with a utf-8
locale.  i found it messed up some scripts because the beauty of locale is that
you just can't count on the format of anything.

- erik

Reply via email to