On Thu Oct 13 07:38:54 EDT 2011, [email protected] wrote: > On Thursday 13 of October 2011 13:15:57 slash wrote: > > I have some files on an external ext2 drive that have whitespace and > > umlauts (ä, ö) in them. trfs took care of the whitespace. But ext2srv > > presents umlauts as a question mark symbol (�) and won't let me access > > the file (error: file does not exist). > > i believe -- but i am not sure! -- that linux stores and reads names on > ext2/3/4 without any conversion between filesystem and I/O syscalls like > open(). if you have iso8859-1 or similar single-byte locale on linux, your > ext2 contains iso8859-1 encoded filenames.
correct. if you know what the charset on disk is, you could probablly hack ext2fs into translating names. or (less hacky) you could write a transliterating fs, or add this to trfs' duties. i don't know if this i helpful, but if you use p9p tools you will always get utf8, without any oddness. it used to be easier because the system tools weren't trying so hard to break utf-8. it used to just all work. ymmv with a utf-8 locale. i found it messed up some scripts because the beauty of locale is that you just can't count on the format of anything. - erik
