Re: [julia-users] UTF8, how to procesed text data

2016-10-21 Thread Gregory Salvan
Hi,
there is a library that let you specify the encoding type when opening
files:

https://github.com/nalimilan/StringEncodings.jl

2016-10-20 19:32 GMT+02:00 :

> Julia ver 5 is OK , but is new problem with space after ś, ć . More in new
> post...
>
> Paul
>
> W dniu środa, 19 października 2016 15:04:13 UTC+2 użytkownik Milan
> Bouchet-Valat napisał:
>>
>> Le mercredi 19 octobre 2016 à 06:02 -0700, program...@gmail.com a
>> écrit :
>> > Version 0.3.12, udate to 5 ?
>> Yes. 0.3.x versions are unsupported for some time now.
>>
>>
>> Regards
>>
>> > > Le mercredi 19 octobre 2016 à 04:46 -0700, program...@gmail.com a
>> > > écrit :
>> > > > Data file is coding UTF8 but i cant procedsed this datain Julia
>> > > ?
>> > > > What wrong ?
>> > > >
>> > > > o=open("data.txt")
>> > > >
>> > > > julia> temp=readline(io)
>> > > > "3699778,13,2,gdbiehz jablej gupując szybgi Injehnej dg 26
>> > > > paździehniga,1\n"
>> > > >
>> > > > julia> temp[61:65]
>> > > > "aźdz"
>> > > >
>> > > > julia> findin(temp[61:65],"d")
>> > > > ERROR: invalid UTF-8 character index
>> > > >  in next at utf8.jl:64
>> > > >  in findin at array.jl:1179
>> > > You didn't say what version of Julia you're using. The bug seems
>> > > to
>> > > happen on 0.4.7, but not on 0.5.0, so I'd encourage you to
>> > > upgrade.
>> > >
>> > > (Note that in general you shouldn't index into strings with
>> > > arbitrary
>> > > integers: only values referring to the beginning of a Unicode code
>> > > point are valid.)
>> > >
>> > >
>> > > Regards
>> >
>>
>


Re: [julia-users] UTF8, how to procesed text data

2016-10-20 Thread programistawpf
Julia ver 5 is OK , but is new problem with space after ś, ć . More in new 
post...

Paul

W dniu środa, 19 października 2016 15:04:13 UTC+2 użytkownik Milan 
Bouchet-Valat napisał:
>
> Le mercredi 19 octobre 2016 à 06:02 -0700, program...@gmail.com 
>  a 
> écrit : 
> > Version 0.3.12, udate to 5 ? 
> Yes. 0.3.x versions are unsupported for some time now. 
>
>
> Regards 
>
> > > Le mercredi 19 octobre 2016 à 04:46 -0700, program...@gmail.com a  
> > > écrit :  
> > > > Data file is coding UTF8 but i cant procedsed this datain Julia 
> > > ?  
> > > > What wrong ?  
> > > >  
> > > > o=open("data.txt")  
> > > >  
> > > > julia> temp=readline(io)  
> > > > "3699778,13,2,gdbiehz jablej gupując szybgi Injehnej dg 26  
> > > > paździehniga,1\n"  
> > > >  
> > > > julia> temp[61:65]  
> > > > "aźdz"  
> > > >  
> > > > julia> findin(temp[61:65],"d")  
> > > > ERROR: invalid UTF-8 character index  
> > > >  in next at utf8.jl:64  
> > > >  in findin at array.jl:1179  
> > > You didn't say what version of Julia you're using. The bug seems 
> > > to  
> > > happen on 0.4.7, but not on 0.5.0, so I'd encourage you to 
> > > upgrade.  
> > > 
> > > (Note that in general you shouldn't index into strings with 
> > > arbitrary  
> > > integers: only values referring to the beginning of a Unicode code  
> > > point are valid.)  
> > > 
> > > 
> > > Regards  
> >   
>


Re: [julia-users] UTF8, how to procesed text data

2016-10-19 Thread Milan Bouchet-Valat
Le mercredi 19 octobre 2016 à 06:02 -0700, programista...@gmail.com a
écrit :
> Version 0.3.12, udate to 5 ?
Yes. 0.3.x versions are unsupported for some time now.


Regards

> > Le mercredi 19 octobre 2016 à 04:46 -0700, program...@gmail.com a 
> > écrit : 
> > > Data file is coding UTF8 but i cant procedsed this datain Julia
> > ? 
> > > What wrong ? 
> > > 
> > > o=open("data.txt") 
> > > 
> > > julia> temp=readline(io) 
> > > "3699778,13,2,gdbiehz jablej gupując szybgi Injehnej dg 26 
> > > paździehniga,1\n" 
> > > 
> > > julia> temp[61:65] 
> > > "aźdz" 
> > > 
> > > julia> findin(temp[61:65],"d") 
> > > ERROR: invalid UTF-8 character index 
> > >  in next at utf8.jl:64 
> > >  in findin at array.jl:1179 
> > You didn't say what version of Julia you're using. The bug seems
> > to 
> > happen on 0.4.7, but not on 0.5.0, so I'd encourage you to
> > upgrade. 
> > 
> > (Note that in general you shouldn't index into strings with
> > arbitrary 
> > integers: only values referring to the beginning of a Unicode code 
> > point are valid.) 
> > 
> > 
> > Regards 
>  


Re: [julia-users] UTF8, how to procesed text data

2016-10-19 Thread programistawpf
Version 0.3.12, udate to 5 ?

W dniu środa, 19 października 2016 14:56:15 UTC+2 użytkownik Milan 
Bouchet-Valat napisał:
>
> Le mercredi 19 octobre 2016 à 04:46 -0700, program...@gmail.com 
>  a 
> écrit : 
> > Data file is coding UTF8 but i cant procedsed this datain Julia ? 
> > What wrong ? 
> > 
> > o=open("data.txt") 
> > 
> > julia> temp=readline(io) 
> > "3699778,13,2,gdbiehz jablej gupując szybgi Injehnej dg 26 
> > paździehniga,1\n" 
> > 
> > julia> temp[61:65] 
> > "aźdz" 
> > 
> > julia> findin(temp[61:65],"d") 
> > ERROR: invalid UTF-8 character index 
> >  in next at utf8.jl:64 
> >  in findin at array.jl:1179 
> You didn't say what version of Julia you're using. The bug seems to 
> happen on 0.4.7, but not on 0.5.0, so I'd encourage you to upgrade. 
>
> (Note that in general you shouldn't index into strings with arbitrary 
> integers: only values referring to the beginning of a Unicode code 
> point are valid.) 
>
>
> Regards 
>
 


Re: [julia-users] UTF8, how to procesed text data

2016-10-19 Thread Milan Bouchet-Valat
Le mercredi 19 octobre 2016 à 04:46 -0700, programista...@gmail.com a
écrit :
> Data file is coding UTF8 but i cant procedsed this datain Julia ?
> What wrong ?
> 
> o=open("data.txt")
> 
> julia> temp=readline(io)
> "3699778,13,2,gdbiehz jablej gupując szybgi Injehnej dg 26
> paździehniga,1\n"
> 
> julia> temp[61:65]
> "aźdz"
> 
> julia> findin(temp[61:65],"d")
> ERROR: invalid UTF-8 character index
>  in next at utf8.jl:64
>  in findin at array.jl:1179
You didn't say what version of Julia you're using. The bug seems to
happen on 0.4.7, but not on 0.5.0, so I'd encourage you to upgrade.

(Note that in general you shouldn't index into strings with arbitrary
integers: only values referring to the beginning of a Unicode code
point are valid.)


Regards


Re: [julia-users] UTF8, how to procesed text data

2016-10-19 Thread Stefan Karpinski
http://docs.julialang.org/en/release-0.5/manual/strings/

On Wed, Oct 19, 2016 at 7:46 AM,  wrote:

> Data file is coding UTF8 but i cant procedsed this datain Julia ? What
> wrong ?
>
> o=open("data.txt")
>
> julia> temp=readline(io)
> "3699778,13,2,gdbiehz jablej gupując szybgi Injehnej dg 26
> paździehniga,1\n"
>
> julia> temp[61:65]
> "aźdz"
>
> julia> findin(temp[61:65],"d")
> ERROR: invalid UTF-8 character index
>  in next at utf8.jl:64
>  in findin at array.jl:1179
>
> Paul
>