Re: problem with german umlauts and gtk apps (maybe unicode issue)

2011-06-09 Thread Polytropon
On Wed, 8 Jun 2011 19:57:05 +, Alexander Best arun...@freebsd.org wrote:
 otaku% ls|grep html|hd 
   c3 84 c3 96 c3 9c c3 a4  c3 b6 c3 bc c3 9f 2e 68  |Ã.Ã.Ã.ÀöÌÃ..h|
 0010  74 6d 6c 0a   |tml.|
 0014
 [...]
 is gtk maybe switching to unicode when saving non-asciichars, instead to
 ISO8859-15?

Yes, it looks that way.

You _could_ try to use the traditional (non-european) settings:

setenv  LC_ALL  en_US.ISO8859-1
setenv  LC_MESSAGES en_US.ISO8859-1
setenv  LC_COLLATE  de_DE.ISO8859-1
setenv  LC_CTYPEde_DE.ISO8859-1
setenv  LC_MONETARY de_DE.ISO8859-1
setenv  LC_NUMERIC  de_DE.ISO8859-1
setenv  LC_TIME de_DE.ISO8859-1

(I set them per /ect/csh.cshrc systemwide, and don't set $LANG).
The downside is that there is no Euro symbol with this setting,
but nobody needs that. :-)



 however when it accesses a filename it can understand unicode as
 well as ISO8859-15?

I would assume that as ISO-Umlauts and UTF-Umlauts have
different byte representation, (iso)öäü.html and (utf)öäü.html
would be different file names, so files with the same
file name would be possible.



 can i instruct gtk to always use ISO8859-15 when saving filenames?

As I'm not a Gnome user, I can't be specific on that
question. Maybe there is a setting available through
the gconf tool?

A workaround, of course NOT a solution, is to NOT use
non-standard characters in file names. I have trained
my kids... erm users. Users! :-) to exactly do that, so
there won't be problems in file name representation.
And only lowercase. And no spaces. If you use, for
example, das_uebel_vom_fasz.html instead of Das Übel
vom Faß.html, you don't need to care for character
representation (as everything will always be ASCII).



-- 
Polytropon
Magdeburg, Germany
Happy FreeBSD user since 4.0
Andra moi ennepe, Mousa, ...
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: problem with german umlauts and gtk apps (maybe unicode issue)

2011-06-09 Thread Polytropon
On Wed, 08 Jun 2011 13:18:34 -0700, Chuck Swiger cswi...@mac.com wrote:
 Fortunately, ISO 8859-15 can be stored on FreeBSD's UFS
 filesystem without lossage-- other character sets can't
 since UFS doesn't do Unicode per se, just UTF-8.

Storing is one thing, correct displaying is the other.

As Alexander mentioned, display for 8851-1 and -15
even work in text mode console (using cons25l1) in
terms of Umlauts and Eszett. The speciality of
-15, the Euro symbol, won't be displayed if I remember
correctly. Maybe the currency symbol (squished
bug) will be used instead?



-- 
Polytropon
Magdeburg, Germany
Happy FreeBSD user since 4.0
Andra moi ennepe, Mousa, ...
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: problem with german umlauts and gtk apps (maybe unicode issue)

2011-06-09 Thread Bernt Hansson

2011-06-08 21:57, Alexander Best skrev:

hi there,


Hallo


for me the output of `locale -a` looks like this:

LANG=en_GB.ISO8859-15
LC_CTYPE=de_DE.ISO8859-15
LC_COLLATE=en_GB.ISO8859-15
LC_TIME=de_DE.ISO8859-15
LC_NUMERIC=de_DE.ISO8859-15
LC_MONETARY=de_DE.ISO8859-15
LC_MESSAGES=en_GB.ISO8859-15
LC_ALL=


What is the reason for setting that?

I only set this in the .login_conf;

me:\
:charset=iso-8859-1:\
:lang=sv_SE.ISO8859-1:

And all is fine and dandy, except gqview only understands utf-8
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: problem with german umlauts and gtk apps (maybe unicode issue)

2011-06-09 Thread Polytropon
On Thu, 09 Jun 2011 13:16:18 +0200, Bernt Hansson be...@bah.homeip.net wrote:
 2011-06-08 21:57, Alexander Best skrev:
  hi there,
 
 Hallo
 
  for me the output of `locale -a` looks like this:
 
  LANG=en_GB.ISO8859-15
  LC_CTYPE=de_DE.ISO8859-15
  LC_COLLATE=en_GB.ISO8859-15
  LC_TIME=de_DE.ISO8859-15
  LC_NUMERIC=de_DE.ISO8859-15
  LC_MONETARY=de_DE.ISO8859-15
  LC_MESSAGES=en_GB.ISO8859-15
  LC_ALL=
 
 What is the reason for setting that?

The reason is to have finer control over certain
language aspects. For example, if one wants to have
collation and date specific settings for the german
language, but english program messages, LC_* can be
used to address things individually. LC_ALL sets _one_
definition for all aspects, and LANG... I think LANG
will be used if LC_* aren't present... not fully sure.



 I only set this in the .login_conf;
 
 me:\
  :charset=iso-8859-1:\
  :lang=sv_SE.ISO8859-1:
 
 And all is fine and dandy, except gqview only understands utf-8

I assume this does define LC_* / LANG variables? I
know this setting is present in the system, but I
never really used login.conf. :-)


-- 
Polytropon
Magdeburg, Germany
Happy FreeBSD user since 4.0
Andra moi ennepe, Mousa, ...
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: problem with german umlauts and gtk apps (maybe unicode issue)

2011-06-09 Thread Bernt Hansson

2011-06-09 13:21, Polytropon skrev:

On Thu, 09 Jun 2011 13:16:18 +0200, Bernt Hanssonbe...@bah.homeip.net  wrote:

2011-06-08 21:57, Alexander Best skrev:

hi there,


Hallo


for me the output of `locale -a` looks like this:

LANG=en_GB.ISO8859-15
LC_CTYPE=de_DE.ISO8859-15
LC_COLLATE=en_GB.ISO8859-15
LC_TIME=de_DE.ISO8859-15
LC_NUMERIC=de_DE.ISO8859-15
LC_MONETARY=de_DE.ISO8859-15
LC_MESSAGES=en_GB.ISO8859-15
LC_ALL=


What is the reason for setting that?


The reason is to have finer control over certain
language aspects. For example, if one wants to have
collation and date specific settings for the german
language, but english program messages, LC_* can be
used to address things individually. LC_ALL sets _one_
definition for all aspects, and LANG... I think LANG
will be used if LC_* aren't present... not fully sure.




I only set this in the .login_conf;

me:\
  :charset=iso-8859-1:\
  :lang=sv_SE.ISO8859-1:

And all is fine and dandy, except gqview only understands utf-8


I assume this does define LC_* / LANG variables?


It defines LANG=sv_SE.ISO8859-1 no LC*


I know this setting is present in the system, but I
never really used login.conf. :-)


login.conf is system wide. .login_conf is not.

Example from xterm

%cal
 Juni 2011
Sö Må Ti On To Fr Lö
  1  2  3  4
 5  6  7  8  9 10 11
12 13 14 15 16 17 18
19 20 21 22 23 24 25
26 27 28 29 30

åäö
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: problem with german umlauts and gtk apps (maybe unicode issue)

2011-06-09 Thread Alexander Best
On Thu Jun  9 11, Polytropon wrote:
 On Wed, 8 Jun 2011 19:57:05 +, Alexander Best arun...@freebsd.org wrote:
  otaku% ls|grep html|hd 
    c3 84 c3 96 c3 9c c3 a4  c3 b6 c3 bc c3 9f 2e 68  
  |Ã.Ã.Ã.ÀöÌÃ..h|
  0010  74 6d 6c 0a   |tml.|
  0014
  [...]
  is gtk maybe switching to unicode when saving non-asciichars, instead to
  ISO8859-15?
 
 Yes, it looks that way.
 
 You _could_ try to use the traditional (non-european) settings:
 
 setenv  LC_ALL  en_US.ISO8859-1
 setenv  LC_MESSAGES en_US.ISO8859-1
 setenv  LC_COLLATE  de_DE.ISO8859-1
 setenv  LC_CTYPEde_DE.ISO8859-1
 setenv  LC_MONETARY de_DE.ISO8859-1
 setenv  LC_NUMERIC  de_DE.ISO8859-1
 setenv  LC_TIME de_DE.ISO8859-1
 
 (I set them per /ect/csh.cshrc systemwide, and don't set $LANG).
 The downside is that there is no Euro symbol with this setting,
 but nobody needs that. :-)
 
 
 
  however when it accesses a filename it can understand unicode as
  well as ISO8859-15?
 
 I would assume that as ISO-Umlauts and UTF-Umlauts have
 different byte representation, (iso)öäü.html and (utf)öäü.html
 would be different file names, so files with the same
 file name would be possible.
 
 
 
  can i instruct gtk to always use ISO8859-15 when saving filenames?
 
 As I'm not a Gnome user, I can't be specific on that
 question. Maybe there is a setting available through
 the gconf tool?
 
 A workaround, of course NOT a solution, is to NOT use
 non-standard characters in file names. I have trained
 my kids... erm users. Users! :-) to exactly do that, so
 there won't be problems in file name representation.
 And only lowercase. And no spaces. If you use, for
 example, das_uebel_vom_fasz.html instead of Das Übel
 vom Faß.html, you don't need to care for character
 representation (as everything will always be ASCII).

i think the problem is gtk. when i set LC_ALL=ISO8859-15, gimp complaints
about it:

(gimp:56139): Gtk-WARNING **: Locale not supported by C library.
Using the fallback 'C' locale.

when i set it to lets say LC_ALL=de_DE.ISO8859-15 or LC_ALL=de_DE.ISO8859-1, it
doesn't complain. so it recognises that locale. however for some reason it
switches to UTF-8 when using chars like ö or ä, although they are perfectly
valid chars for the locale i defined.

this seems like a gtk bug imho.

cheers.
alex

 
 
 
 -- 
 Polytropon
 Magdeburg, Germany
 Happy FreeBSD user since 4.0
 Andra moi ennepe, Mousa, ...

-- 
a13x
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


problem with german umlauts and gtk apps (maybe unicode issue)

2011-06-08 Thread Alexander Best
hi there,

for me the output of `locale -a` looks like this:

LANG=en_GB.ISO8859-15
LC_CTYPE=de_DE.ISO8859-15
LC_COLLATE=en_GB.ISO8859-15
LC_TIME=de_DE.ISO8859-15
LC_NUMERIC=de_DE.ISO8859-15
LC_MONETARY=de_DE.ISO8859-15
LC_MESSAGES=en_GB.ISO8859-15
LC_ALL=

when i do `touch ÄÖÜäöüß` this works great. i can see the correct characters
under the console, X term and in gtk file dialogs.

*however* when i save a file via the gtk save dialog (lets say from within
chromium), the filename only gets displayed correctly in the gtk open dialog.

saving a file ÄÖÜäöüß.html from chromium's gtk dialog returns the following
under the console or an X term:

otaku% ls|grep html|hd 
  c3 84 c3 96 c3 9c c3 a4  c3 b6 c3 bc c3 9f 2e 68  |Ã.Ã.Ã.ÀöÌÃ..h|
0010  74 6d 6c 0a   |tml.|
0014

when i start gnome and use the open dialog both the `touch`ed ÄÖÜäöüß file as
well as the ÄÖÜäöüß.html saved by chromium looks fine.

is gtk maybe switching to unicode when saving non-asciichars, instead to
ISO8859-15? however when it accesses a filename it can understand unicode as
well as ISO8859-15?

can i instruct gtk to always use ISO8859-15 when saving filenames?

cheers.
alex

-- 
a13x
___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org


Re: problem with german umlauts and gtk apps (maybe unicode issue)

2011-06-08 Thread Chuck Swiger
On Jun 8, 2011, at 12:57 PM, Alexander Best wrote:
 *however* when i save a file via the gtk save dialog (lets say from within
 chromium), the filename only gets displayed correctly in the gtk open dialog.
 
 saving a file ÄÖÜäöüß.html from chromium's gtk dialog returns the following
 under the console or an X term:
 
 otaku% ls|grep html|hd 
   c3 84 c3 96 c3 9c c3 a4  c3 b6 c3 bc c3 9f 2e 68  |Ã.Ã.Ã.ÀöÌÃ..h|
 0010  74 6d 6c 0a   |tml.|
 0014

That's a UTF-8 representation; c3 84 is Unicode U+00C4 aka LATIN CAPITAL 
LETTER A WITH DIAERESIS.  

 when i start gnome and use the open dialog both the `touch`ed ÄÖÜäöüß file 
 as
 well as the ÄÖÜäöüß.html saved by chromium looks fine.
 
 is gtk maybe switching to unicode when saving non-asciichars, instead to
 ISO8859-15? however when it accesses a filename it can understand unicode as
 well as ISO8859-15?

Yes, that appears to be the case.

 can i instruct gtk to always use ISO8859-15 when saving filenames?

Dunno.  Fortunately, ISO 8859-15 can be stored on FreeBSD's UFS filesystem 
without lossage-- other character sets can't since UFS doesn't do Unicode per 
se, just UTF-8.

Regards,
-- 
-Chuck

___
freebsd-questions@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-questions
To unsubscribe, send any mail to freebsd-questions-unsubscr...@freebsd.org