On Thu, Nov 30, 2006 at 11:54:51AM +0000, [EMAIL PROTECTED] wrote:
> On Wed, Nov 29, 2006 at 05:25:25PM -0800, Daniel Yek wrote:
> > I am attempting to handle raw filenames (which may be encoded differently 
> > than the character set used by the filesystem) gracefully.
> > 
> [...]
> > with a raw character outside of UTF-8 character set):
> > 
> > Character:  P  r  e  s  e  n  t  a  c  i  ó  n  ó     .  s  x  i
> > Hex code:   50 72 65 73 65 6e 74 61 63 69 f3 6e c3 b3 2e 73 78 69
> > 
> > To be converted to this:
> > Character:  P  r  e  s  e  n  t  a  c  i  %  f  3  n  ó     .  s  x  i
> > Hex code:   50 72 65 73 65 6e 74 61 63 69 25 66 33 6e c3 b3 2e 73 78 69
> 
> And how is the converter supposed to guess that this "raw character"
> (here 0xf3 and perhaps lots of following bytes) has to be interpreted as
> an iso-8859-1 (or iso-8859-2) encoded thing (what you seem to imply
> here)? This could be as well an "??" or an "??" (to cite some unibyte
> encodings...

I suppose the goal is to preserve information about the
bytes in a situation their interpretation (i.e. what
characters they represent) is already lost, and in that case
your question is void.  Whether or not this can be actually
helpful I will not judge.

OP: I doubt there is any function doing this, but UTF-8
validation is very simple so you can write the function
easily yourself.

Yeti


--
Whatever.
_______________________________________________
gtk-app-devel-list mailing list
gtk-app-devel-list@gnome.org
http://mail.gnome.org/mailman/listinfo/gtk-app-devel-list

Reply via email to