sometimesabird opened a new issue, #35156:
URL: https://github.com/apache/arrow/issues/35156

   ### Describe the bug, including details regarding any error messages, 
version, and platform.
   
   An otherwise perfectly functioning arrow dataset does not finish the command 
`write_dataset` when passing on a hive structure, and I have to interrupt R. 
Looking at the folder structure, it seems to be writing files perfectly well 
until some point after which no new files are written -- but the job isn't 
finished. 
   
   The dataset also writes well into a single file (`write_dataset` without 
partitioning or grouping). It also writes well when I create less groups that I 
would like to. I haven't seen anyone complain about this, so I suspect that I 
am doing something so silly that no one has attempted before. Am I creating too 
many groups?
   
   Grouping that works: A, B, C, D, E, where all groups are binary.
   
   Grouping that doesn't work: A, B, C, D, X, where X has 90+ values (and not 
all values exist for each level of other variable. So, say, a combination A=1, 
B=1, C=1, D=1 might not have X=67.
   
   Grouping that *crashes*: X, A, B, C, D.
   
   I am on Garuda Linux (Arch-based) with R version 4.2.3.
   
   ### Component(s)
   
   R


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to