Re: accents in file names
2009/2/16 Mihai Donțu mihai.do...@gmail.com: On Friday 13 February 2009, Chuck Swiger wrote: On Feb 12, 2009, at 2:50 PM, Wojciech Puchar wrote: accented letter to my freebsd box, the accented letter simply disappear. UFS supports 8-bit characters except for / and \0, but you also need to run a terminal with UTF8 support and use a correct font to view such things. why? i use ISO-8859-2 You've answered why when you state that you set up a locale which supports ISO Latin-X charset. If you are running in the default C/ POSIX locale, using the US-ASCII character set and a font that only knows about 7-bit ASCII glyphs, then you won't get accented characters. UFS doesn't deal with encoding at all, just store what you give That's right, which means you need to use filenames encoded in UTF8 rather than in arbitrary Unicode. UTF-8 is what we prefer these days, but the filesystem can handle anything that is ASCII compatible (like you said: Shift_JIS, EUC-JP etc.). Now, I assume Daniel was copying filé.txt from a non-UFS (Windows box, FAT32, NTFS etc) filesystem to UFS, because this is the only case I can think of and in which such a problem might appear. I assume this is why scp and nfs worked on my example above, but samba didn't? -- R $h ! $- ! $+ $@ $2 @ $1 .UUCP. (sendmail.cf) ___ freebsd-questions@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org
Re: accents in file names
On Thursday 12 February 2009 13:44:40 Chuck Swiger wrote: Perhaps I'm biased, but I've long been of the opinion that the Mac platform with HFS+ has very good internationalization support. Then maybe this is your cue to take over the HFS support in FreeBSD. I've only seen this support degrading over the years. ;) -- Mel Problem with today's modular software: they start with the modules and never get to the software part. ___ freebsd-questions@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org
Re: accents in file names
On Friday 13 February 2009, Chuck Swiger wrote: On Feb 12, 2009, at 2:50 PM, Wojciech Puchar wrote: accented letter to my freebsd box, the accented letter simply disappear. UFS supports 8-bit characters except for / and \0, but you also need to run a terminal with UTF8 support and use a correct font to view such things. why? i use ISO-8859-2 You've answered why when you state that you set up a locale which supports ISO Latin-X charset. If you are running in the default C/ POSIX locale, using the US-ASCII character set and a font that only knows about 7-bit ASCII glyphs, then you won't get accented characters. UFS doesn't deal with encoding at all, just store what you give That's right, which means you need to use filenames encoded in UTF8 rather than in arbitrary Unicode. UTF-8 is what we prefer these days, but the filesystem can handle anything that is ASCII compatible (like you said: Shift_JIS, EUC-JP etc.). Now, I assume Daniel was copying filé.txt from a non-UFS (Windows box, FAT32, NTFS etc) filesystem to UFS, because this is the only case I can think of and in which such a problem might appear. People in Asia tend to want UTF-16 or UTF-32 encoding (although historical encodings like Big5, Shift- JIS, and now GB18030 for China are still rather popular, and those are multibyte encodings), and things like gcc's implementation of widechars or Python are standardizing on UTF-32. -- Mihai Donțu ___ freebsd-questions@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org
Re: accents in file names
Yes, that's right. I copied the files from win4bsd system. Mihai Donțu wrote: On Friday 13 February 2009, Chuck Swiger wrote: On Feb 12, 2009, at 2:50 PM, Wojciech Puchar wrote: accented letter to my freebsd box, the accented letter simply disappear. UFS supports 8-bit characters except for / and \0, but you also need to run a terminal with UTF8 support and use a correct font to view such things. why? i use ISO-8859-2 You've answered why when you state that you set up a locale which supports ISO Latin-X charset. If you are running in the default C/ POSIX locale, using the US-ASCII character set and a font that only knows about 7-bit ASCII glyphs, then you won't get accented characters. UFS doesn't deal with encoding at all, just store what you give That's right, which means you need to use filenames encoded in UTF8 rather than in arbitrary Unicode. UTF-8 is what we prefer these days, but the filesystem can handle anything that is ASCII compatible (like you said: Shift_JIS, EUC-JP etc.). Now, I assume Daniel was copying filé.txt from a non-UFS (Windows box, FAT32, NTFS etc) filesystem to UFS, because this is the only case I can think of and in which such a problem might appear. People in Asia tend to want UTF-16 or UTF-32 encoding (although historical encodings like Big5, Shift- JIS, and now GB18030 for China are still rather popular, and those are multibyte encodings), and things like gcc's implementation of widechars or Python are standardizing on UTF-32. ___ freebsd-questions@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org
Re: accents in file names
Ok, Thanks a lot... I will read carefully these articles... bye. daniel Chuck Swiger wrote: On Feb 12, 2009, at 1:46 PM, Daniel Leal wrote: is there a way to have a freebsd system with file names with accented words. Like filé.txt instead of file.txt. Now if I copy a file with an accented letter to my freebsd box, the accented letter simply disappear. UFS supports 8-bit characters except for / and \0, but you also need to run a terminal with UTF8 support and use a correct font to view such things. Perhaps this might give you some insight: http://www.cl.cam.ac.uk/~mgk25/unicode.html Note that other file systems have more comprehensive Unicode support: http://en.wikipedia.org/wiki/Comparison_of_file_systems#Limits Perhaps I'm biased, but I've long been of the opinion that the Mac platform with HFS+ has very good internationalization support. Regards, ___ freebsd-questions@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org
Re: accents in file names
On Feb 12, 2009, at 1:46 PM, Daniel Leal wrote: is there a way to have a freebsd system with file names with accented words. Like filé.txt instead of file.txt. Now if I copy a file with an accented letter to my freebsd box, the accented letter simply disappear. UFS supports 8-bit characters except for / and \0, but you also need to run a terminal with UTF8 support and use a correct font to view such things. Perhaps this might give you some insight: http://www.cl.cam.ac.uk/~mgk25/unicode.html Note that other file systems have more comprehensive Unicode support: http://en.wikipedia.org/wiki/Comparison_of_file_systems#Limits Perhaps I'm biased, but I've long been of the opinion that the Mac platform with HFS+ has very good internationalization support. Regards, -- -Chuck ___ freebsd-questions@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org
Re: accents in file names
how and from what do you copy. UFS generally doesn't have any limits for filename characters. i do have files with polish letters on my disk - no problem On Thu, 12 Feb 2009, Daniel Leal wrote: Hi. is there a way to have a freebsd system with file names with accented words. Like filé.txt instead of file.txt. Now if I copy a file with an accented letter to my freebsd box, the accented letter simply disappear. thanks, daniel ___ freebsd-questions@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org ___ freebsd-questions@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org
Re: accents in file names
accented letter to my freebsd box, the accented letter simply disappear. UFS supports 8-bit characters except for / and \0, but you also need to run a terminal with UTF8 support and use a correct font to view such things. why? i use ISO-8859-2 UFS doesn't deal with encoding at all, just store what you give ___ freebsd-questions@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org
Re: accents in file names
2009/2/12 Chuck Swiger cswi...@mac.com: On Feb 12, 2009, at 1:46 PM, Daniel Leal wrote: is there a way to have a freebsd system with file names with accented words. Like filé.txt instead of file.txt. Now if I copy a file with an accented letter to my freebsd box, the accented letter simply disappear. UFS supports 8-bit characters except for / and \0, but you also need to run a terminal with UTF8 support and use a correct font to view such things. Perhaps this might give you some insight: http://www.cl.cam.ac.uk/~mgk25/unicode.html Note that other file systems have more comprehensive Unicode support: http://en.wikipedia.org/wiki/Comparison_of_file_systems#Limits Perhaps I'm biased, but I've long been of the opinion that the Mac platform with HFS+ has very good internationalization support. Regards, -- -Chuck ___ freebsd-questions@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org Yeah, I love the way you're allowed a / in filenames on the Mac. Makes me snigger... How are you copying the files over? On my Mac: [ch...@zeus]~% touch bluurgh\303\251#\303\251 is what comes up when I make an é [ch...@zeus]~% ls 5500plugin.tar NetBeansProjects/ hist200.txt Applications/ Pictures/ hist300.txt Desktop/Public/ hist600.txt Documents/ Rips/ hist900.txt Library/Sites/ public_html@ Movies/ bluurghe?? xcodeJava/ Music/ drop/ [ch...@zeus]~% ls |grep bluu bluurghé [ch...@zeus]~% Look! grep supports that character, but ls doesn't show it properly... so scp works fine... [ch...@zeus]~% scp bluurghe\314\201 amnesiac.bayofrum.net:. bluurghé100%0 0.0KB/s 00:00 [ch...@zeus]~% ssh amnesiac.bayofrum.net 'ls |grep blu' bluurghé [ch...@zeus]~% ssh amnesiac.bayofrum.net 'rm blu*' Samba seems to do strange things though; copied it over with samba [ch...@zeus]~% ssh amnesiac.bayofrum.net 'ls |grep blu' bluurghé* [ch...@zeus]~% What's with the *? and after nfs: [ch...@zeus]~% sudo mount -t nfs amnesiac.bayofrum.net:/usr/home/chris Applications [ch...@zeus]~% cp bluurghe\314\201 Applications/ [ch...@zeus]~% ssh amnesiac.bayofrum.net 'ls |grep blu' bluurghé [ch...@zeus]~% What?? Why does it work OK with nfs and scp, but not samba? Really wouldn't bother unless you spend your time exclusively in GUI environments, just seems a real hassle. Chris -- R $h ! $- ! $+ $@ $2 @ $1 .UUCP. (sendmail.cf) ___ freebsd-questions@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org
Re: accents in file names
On Feb 12, 2009, at 2:50 PM, Wojciech Puchar wrote: accented letter to my freebsd box, the accented letter simply disappear. UFS supports 8-bit characters except for / and \0, but you also need to run a terminal with UTF8 support and use a correct font to view such things. why? i use ISO-8859-2 You've answered why when you state that you set up a locale which supports ISO Latin-X charset. If you are running in the default C/ POSIX locale, using the US-ASCII character set and a font that only knows about 7-bit ASCII glyphs, then you won't get accented characters. UFS doesn't deal with encoding at all, just store what you give That's right, which means you need to use filenames encoded in UTF8 rather than in arbitrary Unicode. People in Asia tend to want UTF-16 or UTF-32 encoding (although historical encodings like Big5, Shift- JIS, and now GB18030 for China are still rather popular, and those are multibyte encodings), and things like gcc's implementation of widechars or Python are standardizing on UTF-32. Regards, -- -Chuck ___ freebsd-questions@freebsd.org mailing list http://lists.freebsd.org/mailman/listinfo/freebsd-questions To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org