devinrkeane opened a new issue, #37034:
URL: https://github.com/apache/arrow/issues/37034

   ### Describe the bug, including details regarding any error messages, 
version, and platform.
   
   Hi - we use arrow in R and python for a wide range of projects at my company 
but are running into a datset issue with R-arrow I am not sure how to track 
down, it may be something with my system.  Recently updated to 12.0.1.1 on an 
M1 Max x86 running on Ventura 13.4  
   
   ```
   library(arrow)
   library(testthat)
   ```
   Running the following I'm able to read/write parquet file no problem:
   
   ```
   
   data <- ggplot2::diamonds
   tmpdir <- tempdir()
   
   # read parquet works fine
   write_parquet(data, file.path(tmpdir, "test.parquet"))
   data2 <- read_parquet(file.path(tmpdir, "test.parquet"))
   expect_equal(data$carat, data2$carat)
   
   ```
   
   But trying to write a dataset I get an immediate "Fatal Error" crash in 
Rstudio.  Running in R, This happens:
   
   ```
   # fatal error
   write_dataset(data, path = tmpdir, partitioning = "cut")
   ```
   
   I get what appears to be a prompt asking me to make a Selection (see image), 
with no options. I just put in "1" and get the 
   
   `zsh: illegal hardware instruction  R` error`
   
   Tracing the error i made it to `Schema__WithMetadata()` before it broke.
   
   <img width="439" alt="Screenshot 2023-08-07 at 11 59 06 AM" 
src="https://github.com/apache/arrow/assets/20322761/2fd0063e-0897-4fa6-8c2c-b5163b7c5ecd";>
   
   Some context:
   
   `open_dataset` also has the same result on some preexisting datasets we've 
created at my company, this is actually where I came across the error and then 
found `write_dataset` was doing something similar.
   
   I dont know why sessionInfo says Im running under Big Sur/Monterrey below, 
maybe that has something to do with it? My company has  software manager 
(kandji) and updated this mac to Ventura a while ago. Could just be me :/
   
   ```
   sessionInfo()
   
   R version 4.1.3 (2022-03-10)
   Platform: x86_64-apple-darwin17.0 (64-bit)
   Running under: macOS Big Sur/Monterey 10.16
   
   Matrix products: default
   BLAS:   
/Library/Frameworks/R.framework/Versions/4.1/Resources/lib/libRblas.0.dylib
   LAPACK: 
/Library/Frameworks/R.framework/Versions/4.1/Resources/lib/libRlapack.dylib
   
   locale:
   [1] en_US.UTF-8/en_US.UTF-8/en_US.UTF-8/C/en_US.UTF-8/en_US.UTF-8
   
   attached base packages:
   [1] stats     graphics  grDevices datasets  utils     methods   base     
   
   other attached packages:
   [1] ggplot2_3.3.6   testthat_3.1.10 arrow_12.0.1.1 
   
   loaded via a namespace (and not attached):
    [1] magrittr_2.0.1   tidyselect_1.1.0 bit_4.0.4        munsell_0.5.0   
    [5] colorspace_1.4-1 R6_2.4.1         rlang_1.1.1      fansi_0.4.1     
    [9] dplyr_1.0.7      tools_4.1.3      grid_4.1.3       gtable_0.3.0    
   [13] utf8_1.1.4       DBI_1.1.0        cli_3.6.1        withr_2.5.0     
   [17] ellipsis_0.3.2   yaml_2.2.1       bit64_4.0.5      assertthat_0.2.1
   [21] tibble_3.1.7     lifecycle_1.0.3  crayon_1.4.1     brio_1.1.2      
   [25] purrr_1.0.1      vctrs_0.6.2      glue_1.6.2       pillar_1.7.0    
   [29] compiler_4.1.3   generics_0.1.3   scales_1.1.1     renv_0.17.3     
   [33] pkgconfig_2.0.3 
   
   
   arrow_info()
   
   Arrow package version: 12.0.1.1
   
   Capabilities:
                  
   acero      TRUE
   dataset    TRUE
   substrait FALSE
   parquet    TRUE
   json       TRUE
   s3         TRUE
   gcs       FALSE
   utf8proc   TRUE
   re2        TRUE
   snappy     TRUE
   gzip       TRUE
   brotli     TRUE
   zstd       TRUE
   lz4        TRUE
   lz4_frame  TRUE
   lzo       FALSE
   bz2        TRUE
   jemalloc   TRUE
   mimalloc   TRUE
   
   Memory:
                     
   Allocator mimalloc
   Current    0 bytes
   Max        0 bytes
   
   Runtime:
                             
   SIMD Level          sse4_2
   Detected SIMD Level sse4_2
   
   Build:
                                       
   C++ Library Version           12.0.1
   C++ Compiler              AppleClang
   C++ Compiler Version 14.0.3.14030022
   
   ```
   
   - I have no problems running this test script on my other mac, which is an 
Mac Pro Intel chip on Ventura 13.5. Others have had no issue but they're all 
Intel.  We all run on the same `renv` project.
   
   - ran `brew install apache-arrow` as well, which updated/installed 
`12.0.1_4`, but my understanding is the mac binary for R-arrow comes with 
everything, I assume that version wouldnt be an issue anyway but could there be 
confusion there?
   
   
   
   ### Component(s)
   
   R


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to