Re[2]: [Haskell-cafe] I/O and utf8

2006-01-12 Thread Bulat Ziganshin
Hello Einar, Wednesday, January 11, 2006, 6:14:44 PM, you wrote:
EK> Do you plan on supporting things like HTTP where the character set is only known in the middle of the parsing?
Yes, it is supported; see Examples/Encoding.hs in http://freearc.narod.ru/Binary.tar.gz : h -
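The Examples/Encoding.hs file referred to in this message is not reproduced in the archive. As a rough illustration of the general idea only (not the Binary library's API), the sketch below reads header lines one byte per character, then switches the handle's encoding once a charset is known. It assumes a GHC newer than this thread (6.12 or later), where `System.IO` provides `hSetEncoding`, `latin1` and `utf8`. The names `readMessage`, `readHeaders`, `charsetOf` and `encodingFromHeaders` are hypothetical, and only an unquoted `charset=utf-8` parameter is recognised.

```haskell
import Data.Char (toLower)
import Data.List (isPrefixOf)
import System.IO

-- Hypothetical helper: find the value of a "charset=" parameter in a
-- lower-cased header line, e.g. "content-type: text/plain; charset=utf-8".
charsetOf :: String -> Maybe String
charsetOf s
  | null s                    = Nothing
  | "charset=" `isPrefixOf` s = Just (takeWhile (`notElem` "; \r") (drop 8 s))
  | otherwise                 = charsetOf (tail s)

-- Assumption for this sketch: only "utf-8" is recognised; everything
-- else falls back to latin1 (one byte per Char).
encodingFromHeaders :: [String] -> TextEncoding
encodingFromHeaders hdrs =
  case [cs | h <- hdrs, Just cs <- [charsetOf (map toLower h)]] of
    ("utf-8" : _) -> utf8
    _             -> latin1

-- Read header lines up to the first empty line, dropping any '\r'.
-- hGetLine throws at end of file, so a missing blank line is an error here.
readHeaders :: Handle -> IO [String]
readHeaders h = do
  line <- hGetLine h
  let l = filter (/= '\r') line
  if null l then return [] else fmap (l :) (readHeaders h)

-- Read headers as latin1, then decode the body with the encoding
-- the headers announced.
readMessage :: FilePath -> IO ([String], String)
readMessage path = do
  h <- openFile path ReadMode
  hSetEncoding h latin1                      -- headers: bytes as Chars
  hdrs <- readHeaders h
  hSetEncoding h (encodingFromHeaders hdrs)  -- switch in mid-stream
  body <- hGetContents h
  length body `seq` hClose h                 -- force the lazy read, then close
  return (hdrs, body)
```

`readMessage` returns the header lines and the decoded body; a real HTTP client would also need to handle quoted charset values, other encodings, and transfer encodings.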

Re: [Haskell-cafe] I/O and utf8

2006-01-11 Thread Einar Karttunen
On 10.01 10:25, Bulat Ziganshin wrote:
> I have a question about this issue: I also want to provide an autodetection mechanism, which relies on the first bytes of text files to set the proper encoding. What are the standard rules to encode the utf8/utf16 encoding used for text in a file in these first bytes?
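For reference, the usual convention for the question quoted above is the Unicode byte order mark (BOM): EF BB BF for UTF-8, FE FF for big-endian UTF-16 and FF FE for little-endian UTF-16. A minimal sketch of such a check over a list of bytes follows; the `Detected` type and `detectBom` function are hypothetical names for illustration, and a file without a BOM may still be in any of these encodings.

```haskell
import Data.Word (Word8)

-- Result of looking at the first bytes of a file.
data Detected = Utf8 | Utf16BE | Utf16LE | NoBom
  deriving (Eq, Show)

-- Inspect the leading bytes; return the detected encoding and the
-- remaining bytes with the BOM (if any) stripped.
-- Note: FF FE 00 00 is also the UTF-32LE BOM; this sketch ignores UTF-32.
detectBom :: [Word8] -> (Detected, [Word8])
detectBom (0xEF : 0xBB : 0xBF : rest) = (Utf8, rest)
detectBom (0xFE : 0xFF : rest)        = (Utf16BE, rest)
detectBom (0xFF : 0xFE : rest)        = (Utf16LE, rest)
detectBom bytes                       = (NoBom, bytes)
```

When `detectBom` returns `NoBom`, the caller has to fall back on a default or on some other hint, since plain text carries no other standard marker.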

Re[2]: [Haskell-cafe] I/O and utf8

2006-01-10 Thread Bulat Ziganshin
Hello John, Tuesday, January 10, 2006, 2:08:44 AM, you wrote:
> I want to read a file encoded in utf8 and at a later time output portions of it on the console. Is there an easy way to do this in Haskell? Using the standard I/O functions I can read the file, but the output gives me \1071 ...

RE: Re[2]: [Haskell-cafe] I/O and utf8

2006-01-10 Thread Bayley, Alistair
From: [EMAIL PROTECTED] On Behalf Of Bulat Ziganshin
> I have a question about this issue: I also want to provide an autodetection mechanism, which relies on the first bytes of text files to set the proper encoding. What are the standard rules to encode utf8/utf16 encoding

Re: [Haskell-cafe] I/O and utf8

2006-01-09 Thread John Meacham
On Sun, Jan 08, 2006 at 11:26:05AM +, Andreas Kägi wrote:
> Hello, I want to read a file encoded in utf8 and at a later time output portions of it on the console. Is there an easy way to do this in Haskell? Using the standard I/O functions I can read the file, but the output gives me \1071

[Haskell-cafe] I/O and utf8

2006-01-08 Thread Andreas Kägi
Hello, I want to read a file encoded in utf8 and at a later time output portions of it on the console. Is there an easy way to do this in Haskell? Using the standard I/O functions I can read the file, but the output gives me \1071 ... instead of the Unicode characters.
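The `\1071` in the question looks like the escape that `show` (and therefore `print`) produces for the character U+042F, so one likely part of the fix is to write strings with `putStrLn` rather than `print`. A minimal sketch follows, assuming a GHC newer than this thread (6.12 or later), where `System.IO` provides `hSetEncoding` and `utf8`. The helper names `readUtf8File` and `putUtf8Ln` are hypothetical, not functions from any library mentioned in the thread.

```haskell
import System.IO

-- Read a whole file, decoding its bytes as UTF-8.
readUtf8File :: FilePath -> IO String
readUtf8File path = do
  h <- openFile path ReadMode
  hSetEncoding h utf8
  s <- hGetContents h
  length s `seq` hClose h   -- force the lazy contents before closing
  return s

-- Write a line to stdout encoded as UTF-8. putStrLn emits the
-- characters themselves; print/show would escape them as \1071 etc.
putUtf8Ln :: String -> IO ()
putUtf8Ln s = do
  hSetEncoding stdout utf8
  putStrLn s
```

A typical use would be `readUtf8File "input.txt" >>= putUtf8Ln . take 80`, where the file name is a placeholder; whether the characters display correctly still depends on the console's own encoding and font.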