Yes, thanks anyway. I sent a trimmed down version to John Myles White. 
Seems the file might contain non-valid UTF-8 but then it should not be 
possible to read it in via readtable imho since writetable will then fail. 
Might be something else though.

Cheers,

Robert

Den söndag 14 juni 2015 kl. 22:03:49 UTC+2 skrev Milan Bouchet-Valat:
>
> Le dimanche 14 juin 2015 à 22:01 +0200, Milan Bouchet-Valat a écrit : 
> > Le dimanche 14 juin 2015 à 12:21 -0700, Robert Feldt a écrit : 
> > > I read in a csv file which has UTF-8 characters, filter it but when I 
> > > try to write it back to disk with writetable I get: 
> > > 
> > > invalid UTF-8 character index 
> > >  in next at 
> /Applications/Julia-0.3.8.app/Contents/Resources/julia/lib/julia/sys.dylib 
> > >  in need_full_hex at 
> /Applications/Julia-0.3.8.app/Contents/Resources/julia/lib/julia/sys.dylib 
> > >  in print_escaped at string.jl:868 
> > >  in escapedprint at 
> /Users/feldt/.julia/v0.3/DataFrames/src/abstractdataframe/io.jl:12 
> > >  in printtable at 
> /Users/feldt/.julia/v0.3/DataFrames/src/abstractdataframe/io.jl:41 
> > >  in anonymous at 
> /Users/feldt/.julia/v0.3/DataFrames/src/abstractdataframe/io.jl:108 
> > >  in open at iostream.jl:137 
> > >  in writetable at 
> /Users/feldt/.julia/v0.3/DataFrames/src/abstractdataframe/io.jl:107 
> > > 
> > > Not sure this is a DataFrames problem. Might be print_escaped? Any 
> > > advice or hints? It doesn't help to indicate encoding = :utf8 when 
> > > calling readtable (which should be the default anyway). 
> > Could you send a reduced version of the file to reproduce the bug? That 
> > would be very helpful. 
> Ah, just seen you file an issue in GitHub: 
> https://github.com/JuliaStats/DataFrames.jl/issues/813 
>

Reply via email to