[ 
https://issues.apache.org/jira/browse/ARROW-16768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17553546#comment-17553546
 ] 

Dewey Dunnington commented on ARROW-16768:
------------------------------------------

I don't know how common its usage is, but there's a base R function that will 
happily do this for you.

{code:R}
x <- c(1, 1, 2, 2, 3, NA)
addNA(x)
#> [1] 1    1    2    2    3    <NA>
#> Levels: 1 2 3 <NA>
{code}

> [R] Factor levels cannot contain NA
> -----------------------------------
>
>                 Key: ARROW-16768
>                 URL: https://issues.apache.org/jira/browse/ARROW-16768
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: R
>    Affects Versions: 7.0.0
>            Reporter: Kieran Martin
>            Priority: Minor
>
> If you try to write a data frame with a factor with a missing value to 
> parquet, you get the error: "Error: Invalid: Cannot insert dictionary values 
> containing nulls". 
> This seems likely due to how the metadata for factors is currently captured 
> in parquet files. Reprex follows:
>  
> library(arrow)
> bad_data <- data.frame(A = factor(1, 2, NA))
> write_parquet(bad_data, tempfile())
>  
>  



--
This message was sent by Atlassian Jira
(v8.20.7#820007)

Reply via email to