[ 
https://issues.apache.org/jira/browse/ARROW-7018?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16962497#comment-16962497
 ] 

Neal Richardson commented on ARROW-7018:
----------------------------------------

Hi [~vidaringa], to help us address the issue, could you please provide a 
self-contained minimal code example that reproduces the issue, along with a 
printout of your R {{sessionInfo()}}? These encoding issues are notoriously 
difficult, so I want to be sure that we're seeing exactly what you're seeing. 
Thanks!

> Special characters as question mark in parquet files in R
> ---------------------------------------------------------
>
>                 Key: ARROW-7018
>                 URL: https://issues.apache.org/jira/browse/ARROW-7018
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: R
>    Affects Versions: 0.15.0
>         Environment: I'm running R on Windows 10
>            Reporter: Vidar Ingason
>            Priority: Major
>
> Hello.
> I'm new to the arrow package in R and I'm having a trouble regarding special 
> characters (Icelandic). I have a large data set and everything is fine until 
> I write the file to disk and read it in again (i.e. I use write_parquet() and 
> then read_parquet()). When I read the data back in to R special characters 
> turn into question mark. I.e. Veitingastaðir becomes Veitingasta�ir.
> This does not happen when I use .csv.
> Is there anything I can do when I write the .parquet file to disk or when I 
> read it in to prevent this?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to