Re: [julia-users] UTF8, how to procesed text data
Hi, there is a library that lets you specify the encoding when opening files: https://github.com/nalimilan/StringEncodings.jl

2016-10-20 19:32 GMT+02:00 :
> Julia 0.5 is OK, but there is a new problem with a space after ś, ć. More in a new post...
>
> Paul
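A minimal sketch of what the package is for, assuming StringEncodings.jl is installed and that the `encode` and `decode` functions behave as its README describes. The sample word and the choice of ISO-8859-2 are illustrative only. Julia reads UTF-8 natively, so a UTF-8 file like the one in this thread should not need the package; the README also covers the file-opening variants.

```julia
# Assumes the third-party package is installed:
#   import Pkg; Pkg.add("StringEncodings")
using StringEncodings

text = "październik"   # illustrative sample, not from the data file

# Encode the string into a non-UTF-8 byte sequence (ISO-8859-2 covers Polish letters).
bytes = encode(text, "ISO-8859-2")

# Decode those bytes back into an ordinary (UTF-8) Julia String.
roundtrip = decode(bytes, "ISO-8859-2")

println(roundtrip == text)
```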
Re: [julia-users] UTF8, how to procesed text data
Julia 0.5 is OK, but there is a new problem with a space after ś, ć. More in a new post...

Paul

On Wednesday, 19 October 2016 15:04:13 UTC+2, Milan Bouchet-Valat wrote:
> Yes. 0.3.x versions have been unsupported for some time now.
>
> Regards
Re: [julia-users] UTF8, how to procesed text data
On Wednesday, 19 October 2016 at 06:02 -0700, programista...@gmail.com wrote:
> Version 0.3.12, update to 0.5?

Yes. 0.3.x versions have been unsupported for some time now.

Regards
Re: [julia-users] UTF8, how to procesed text data
Version 0.3.12. Should I update to 0.5?

On Wednesday, 19 October 2016 14:56:15 UTC+2, Milan Bouchet-Valat wrote:
> You didn't say what version of Julia you're using. The bug seems to
> happen on 0.4.7, but not on 0.5.0, so I'd encourage you to upgrade.
>
> (Note that in general you shouldn't index into strings with arbitrary
> integers: only values referring to the beginning of a Unicode code
> point are valid.)
>
> Regards
Re: [julia-users] UTF8, how to procesed text data
On Wednesday, 19 October 2016 at 04:46 -0700, programista...@gmail.com wrote:
> The data file is encoded in UTF-8, but I can't process this data in Julia.
> What is wrong?
>
> io=open("data.txt")
>
> julia> temp=readline(io)
> "3699778,13,2,gdbiehz jablej gupując szybgi Injehnej dg 26 paździehniga,1\n"
>
> julia> temp[61:65]
> "aźdz"
>
> julia> findin(temp[61:65],"d")
> ERROR: invalid UTF-8 character index
>  in next at utf8.jl:64
>  in findin at array.jl:1179

You didn't say what version of Julia you're using. The bug seems to happen on 0.4.7, but not on 0.5.0, so I'd encourage you to upgrade.

(Note that in general you shouldn't index into strings with arbitrary integers: only values referring to the beginning of a Unicode code point are valid.)

Regards
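To illustrate the note about valid indices, here is a small sketch written for current Julia (1.x), not for the 0.3 to 0.5 releases discussed in this thread. It uses only the last word of the sample line. `findin` no longer exists in Julia 1.x, so the comprehension at the end is one possible stand-in, not the original function.

```julia
# Sketch for Julia 1.x; the sample is the last word of the line in the thread.
s = "paździehniga"

# String indices are byte offsets. 'ź' occupies two bytes in UTF-8,
# so the offset of its second byte is not a valid character index.
i = findfirst(isequal('ź'), s)   # index where 'ź' starts (3)
println(isvalid(s, i))           # true
println(isvalid(s, i + 1))       # false: points into the middle of 'ź'
println(nextind(s, i))           # 5: start of the next character

# eachindex yields only valid indices, so s[j] never throws here.
for j in eachindex(s)
    println(j, " => ", s[j])
end

# Search by character instead of guessing integer positions.
d = findfirst(isequal('d'), s)
println(s[d:end])

# All valid indices holding 'd' (a stand-in for the old findin call).
hits = [j for j in eachindex(s) if s[j] == 'd']
println(hits)

# For access by character position, work on a Vector{Char}.
chars = collect(s)
println(chars[3:6])
```

Converting to a `Vector{Char}` costs extra memory, but every integer from 1 to `length(chars)` is then a valid position, which is often the simplest choice for short records like these.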
Re: [julia-users] UTF8, how to procesed text data
http://docs.julialang.org/en/release-0.5/manual/strings/

On Wed, Oct 19, 2016 at 7:46 AM, wrote:
> The data file is encoded in UTF-8, but I can't process this data in Julia.
> What is wrong?
>
> io=open("data.txt")
>
> julia> temp=readline(io)
> "3699778,13,2,gdbiehz jablej gupując szybgi Injehnej dg 26 paździehniga,1\n"
>
> julia> temp[61:65]
> "aźdz"
>
> julia> findin(temp[61:65],"d")
> ERROR: invalid UTF-8 character index
>  in next at utf8.jl:64
>  in findin at array.jl:1179
>
> Paul
[julia-users] UTF8, how to procesed text data
The data file is encoded in UTF-8, but I can't process this data in Julia. What is wrong?

io=open("data.txt")

julia> temp=readline(io)
"3699778,13,2,gdbiehz jablej gupując szybgi Injehnej dg 26 paździehniga,1\n"

julia> temp[61:65]
"aźdz"

julia> findin(temp[61:65],"d")
ERROR: invalid UTF-8 character index
 in next at utf8.jl:64
 in findin at array.jl:1179

Paul