Re: accents in file names

2009-02-17 Thread Chris Rees
2009/2/16 Mihai Donțu mihai.do...@gmail.com:
 On Friday 13 February 2009, Chuck Swiger wrote:
 On Feb 12, 2009, at 2:50 PM, Wojciech Puchar wrote:
  accented letter to my freebsd box, the accented letter simply
  disappear.
 
  UFS supports 8-bit characters except for / and \0, but you also
  need to run a terminal with UTF8 support and use a correct font to
  view such things.
 
  why? i use ISO-8859-2

 You've answered why when you state that you set up a locale which
 supports ISO Latin-X charset.  If you are running in the default C/
 POSIX locale, using the US-ASCII character set and a font that only
 knows about 7-bit ASCII glyphs, then you won't get accented characters.

  UFS doesn't deal with encoding at all, just store what you give

 That's right, which means you need to use filenames encoded in UTF8
 rather than in arbitrary Unicode.

 UTF-8 is what we prefer these days, but the filesystem can handle anything
 that is ASCII compatible (like you said: Shift_JIS, EUC-JP etc.).

 Now, I assume Daniel was copying filé.txt from a non-UFS (Windows box,
 FAT32, NTFS etc) filesystem to UFS, because this is the only case I can think
 of and in which such a problem might appear.


I assume this is why scp and nfs worked on my example above, but samba didn't?

-- 
R $h !  $- ! $+  $@ $2  @ $1 .UUCP.  (sendmail.cf)
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: accents in file names

2009-02-16 Thread Mel
On Thursday 12 February 2009 13:44:40 Chuck Swiger wrote:

 Perhaps I'm biased, but I've long been of the opinion that the Mac
 platform with HFS+ has very good internationalization support.

Then maybe this is your cue to take over the HFS support in FreeBSD. I've only 
seen this support degrading over the years. ;)

-- 
Mel

Problem with today's modular software: they start with the modules
and never get to the software part.
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: accents in file names

2009-02-16 Thread Mihai Donțu
On Friday 13 February 2009, Chuck Swiger wrote:
 On Feb 12, 2009, at 2:50 PM, Wojciech Puchar wrote:
  accented letter to my freebsd box, the accented letter simply
  disappear.
 
  UFS supports 8-bit characters except for / and \0, but you also
  need to run a terminal with UTF8 support and use a correct font to
  view such things.
 
  why? i use ISO-8859-2

 You've answered why when you state that you set up a locale which
 supports ISO Latin-X charset.  If you are running in the default C/
 POSIX locale, using the US-ASCII character set and a font that only
 knows about 7-bit ASCII glyphs, then you won't get accented characters.

  UFS doesn't deal with encoding at all, just store what you give

 That's right, which means you need to use filenames encoded in UTF8
 rather than in arbitrary Unicode.

UTF-8 is what we prefer these days, but the filesystem can handle anything 
that is ASCII compatible (like you said: Shift_JIS, EUC-JP etc.).

Now, I assume Daniel was copying filé.txt from a non-UFS (Windows box, 
FAT32, NTFS etc) filesystem to UFS, because this is the only case I can think 
of and in which such a problem might appear.

 People in Asia tend to want UTF-16 
 or UTF-32 encoding (although historical encodings like Big5, Shift-
 JIS, and now GB18030 for China are still rather popular, and those are
 multibyte encodings), and things like gcc's implementation of
 widechars or Python are standardizing on UTF-32.

-- 
Mihai Donțu
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: accents in file names

2009-02-16 Thread Daniel Leal

Yes, that's right. I copied the files from win4bsd system.


Mihai Donțu wrote:

On Friday 13 February 2009, Chuck Swiger wrote:
  

On Feb 12, 2009, at 2:50 PM, Wojciech Puchar wrote:


accented letter to my freebsd box, the accented letter simply
disappear.
  

UFS supports 8-bit characters except for / and \0, but you also
need to run a terminal with UTF8 support and use a correct font to
view such things.


why? i use ISO-8859-2
  

You've answered why when you state that you set up a locale which
supports ISO Latin-X charset.  If you are running in the default C/
POSIX locale, using the US-ASCII character set and a font that only
knows about 7-bit ASCII glyphs, then you won't get accented characters.



UFS doesn't deal with encoding at all, just store what you give
  

That's right, which means you need to use filenames encoded in UTF8
rather than in arbitrary Unicode.



UTF-8 is what we prefer these days, but the filesystem can handle anything 
that is ASCII compatible (like you said: Shift_JIS, EUC-JP etc.).


Now, I assume Daniel was copying filé.txt from a non-UFS (Windows box, 
FAT32, NTFS etc) filesystem to UFS, because this is the only case I can think 
of and in which such a problem might appear.


  
People in Asia tend to want UTF-16 
or UTF-32 encoding (although historical encodings like Big5, Shift-

JIS, and now GB18030 for China are still rather popular, and those are
multibyte encodings), and things like gcc's implementation of
widechars or Python are standardizing on UTF-32.



  


___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: accents in file names

2009-02-13 Thread Daniel Leal

Ok, Thanks a lot... I will read carefully these articles...

bye.

daniel



Chuck Swiger wrote:

On Feb 12, 2009, at 1:46 PM, Daniel Leal wrote:
is there a way to have a freebsd system with file names with accented 
words. Like filé.txt instead of file.txt. Now if I copy a file 
with an accented letter to my freebsd box, the accented letter simply 
disappear.


UFS supports 8-bit characters except for / and \0, but you also 
need to run a terminal with UTF8 support and use a correct font to 
view such things.  Perhaps this might give you some insight:


  http://www.cl.cam.ac.uk/~mgk25/unicode.html

Note that other file systems have more comprehensive Unicode support:

  http://en.wikipedia.org/wiki/Comparison_of_file_systems#Limits

Perhaps I'm biased, but I've long been of the opinion that the Mac 
platform with HFS+ has very good internationalization support.


Regards,


___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: accents in file names

2009-02-12 Thread Chuck Swiger

On Feb 12, 2009, at 1:46 PM, Daniel Leal wrote:
is there a way to have a freebsd system with file names with  
accented words. Like filé.txt instead of file.txt. Now if I copy  
a file with an accented letter to my freebsd box, the accented  
letter simply disappear.


UFS supports 8-bit characters except for / and \0, but you also  
need to run a terminal with UTF8 support and use a correct font to  
view such things.  Perhaps this might give you some insight:


  http://www.cl.cam.ac.uk/~mgk25/unicode.html

Note that other file systems have more comprehensive Unicode support:

  http://en.wikipedia.org/wiki/Comparison_of_file_systems#Limits

Perhaps I'm biased, but I've long been of the opinion that the Mac  
platform with HFS+ has very good internationalization support.


Regards,
--
-Chuck

___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: accents in file names

2009-02-12 Thread Wojciech Puchar

how and from what do you copy.

UFS generally doesn't have any limits for filename characters.

i do have files with polish letters on my disk - no problem

On Thu, 12 Feb 2009, Daniel Leal wrote:


Hi.

is there a way to have a freebsd system with file names with accented words. 
Like filé.txt instead of file.txt. Now if I copy a file with an accented 
letter to my freebsd box, the accented letter simply disappear.


thanks,

daniel
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org

___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org

Re: accents in file names

2009-02-12 Thread Wojciech Puchar

accented letter to my freebsd box, the accented letter simply disappear.


UFS supports 8-bit characters except for / and \0, but you also need to 
run a terminal with UTF8 support and use a correct font to view such things.


why? i use ISO-8859-2

UFS doesn't deal with encoding at all, just store what you give
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: accents in file names

2009-02-12 Thread Chris Rees
2009/2/12 Chuck Swiger cswi...@mac.com:
 On Feb 12, 2009, at 1:46 PM, Daniel Leal wrote:

 is there a way to have a freebsd system with file names with accented
 words. Like filé.txt instead of file.txt. Now if I copy a file with an
 accented letter to my freebsd box, the accented letter simply disappear.

 UFS supports 8-bit characters except for / and \0, but you also need to
 run a terminal with UTF8 support and use a correct font to view such things.
  Perhaps this might give you some insight:

  http://www.cl.cam.ac.uk/~mgk25/unicode.html

 Note that other file systems have more comprehensive Unicode support:

  http://en.wikipedia.org/wiki/Comparison_of_file_systems#Limits

 Perhaps I'm biased, but I've long been of the opinion that the Mac platform
 with HFS+ has very good internationalization support.

 Regards,
 --
 -Chuck

 ___
 freebsd-questions@freebsd.org mailing list
 http://lists.freebsd.org/mailman/listinfo/freebsd-questions
 To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Yeah, I love the way you're allowed a / in filenames on the Mac. Makes
me snigger...

How are you copying the files over?

On my Mac:

[ch...@zeus]~% touch bluurgh\303\251#\303\251 is what comes up
when I make an é
[ch...@zeus]~% ls
5500plugin.tar  NetBeansProjects/   hist200.txt
Applications/   Pictures/   hist300.txt
Desktop/Public/ hist600.txt
Documents/  Rips/   hist900.txt
Library/Sites/  public_html@
Movies/ bluurghe??  xcodeJava/
Music/  drop/
[ch...@zeus]~% ls |grep bluu
bluurghé
[ch...@zeus]~%

Look! grep supports that character, but ls doesn't show it properly...

so scp works fine...

[ch...@zeus]~% scp bluurghe\314\201 amnesiac.bayofrum.net:.
bluurghé100%0 0.0KB/s   00:00
[ch...@zeus]~% ssh amnesiac.bayofrum.net 'ls |grep blu'
bluurghé
[ch...@zeus]~% ssh amnesiac.bayofrum.net 'rm blu*'

Samba seems to do strange things though; copied it over with samba

[ch...@zeus]~% ssh amnesiac.bayofrum.net 'ls |grep blu'
bluurghé*
[ch...@zeus]~%

What's with the *?

and after nfs:


[ch...@zeus]~% sudo mount -t nfs amnesiac.bayofrum.net:/usr/home/chris
Applications
[ch...@zeus]~% cp bluurghe\314\201 Applications/
[ch...@zeus]~% ssh amnesiac.bayofrum.net 'ls |grep blu'
bluurghé
[ch...@zeus]~%

What?? Why does it work OK with nfs and scp, but not samba?

Really wouldn't bother unless you spend your time exclusively in GUI
environments, just seems a real hassle.

Chris

-- 
R $h !  $- ! $+  $@ $2  @ $1 .UUCP.  (sendmail.cf)
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: accents in file names

2009-02-12 Thread Chuck Swiger

On Feb 12, 2009, at 2:50 PM, Wojciech Puchar wrote:
accented letter to my freebsd box, the accented letter simply  
disappear.


UFS supports 8-bit characters except for / and \0, but you also  
need to run a terminal with UTF8 support and use a correct font to  
view such things.


why? i use ISO-8859-2


You've answered why when you state that you set up a locale which  
supports ISO Latin-X charset.  If you are running in the default C/ 
POSIX locale, using the US-ASCII character set and a font that only  
knows about 7-bit ASCII glyphs, then you won't get accented characters.



UFS doesn't deal with encoding at all, just store what you give


That's right, which means you need to use filenames encoded in UTF8  
rather than in arbitrary Unicode.  People in Asia tend to want UTF-16  
or UTF-32 encoding (although historical encodings like Big5, Shift- 
JIS, and now GB18030 for China are still rather popular, and those are  
multibyte encodings), and things like gcc's implementation of  
widechars or Python are standardizing on UTF-32.


Regards,
--
-Chuck

___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org