sometimesabird opened a new issue, #35156: URL: https://github.com/apache/arrow/issues/35156
### Describe the bug, including details regarding any error messages, version, and platform. An otherwise perfectly functioning arrow dataset does not finish the command `write_dataset` when passing on a hive structure, and I have to interrupt R. Looking at the folder structure, it seems to be writing files perfectly well until some point after which no new files are written -- but the job isn't finished. The dataset also writes well into a single file (`write_dataset` without partitioning or grouping). It also writes well when I create less groups that I would like to. I haven't seen anyone complain about this, so I suspect that I am doing something so silly that no one has attempted before. Am I creating too many groups? Grouping that works: A, B, C, D, E, where all groups are binary. Grouping that doesn't work: A, B, C, D, X, where X has 90+ values (and not all values exist for each level of other variable. So, say, a combination A=1, B=1, C=1, D=1 might not have X=67. Grouping that *crashes*: X, A, B, C, D. I am on Garuda Linux (Arch-based) with R version 4.2.3. ### Component(s) R -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
