tadeja opened a new issue, #49948: URL: https://github.com/apache/arrow/issues/49948
### Describe the bug, including details regarding any error messages, version, and platform.

Umbrella issue for follow-up to closed #49930/merged PR #49931 (a temporary workaround!).

The MSYS2 `gcc 15.2 -> 16.1` update after 2026 April 30 broke the MINGW64 MinGW job. #49931 pins `gcc-libs` to 15.2 to mask the breakage. This issue tracks the actual fixes so the pin step can be reverted (later, with #49945). Sub-issues will be opened for individual failures as triage progresses.

[First failure on main](https://github.com/apache/arrow/actions/runs/25213414021/job/73928546151#step:12:2644):
```
The following tests FAILED:
	 41 - arrow-async-utility-test (Exit code 0xc0000374) arrow-tests unittest
	 62 - arrow-dataset-dataset-writer-test (SEGFAULT) arrow_dataset unittest
	 65 - arrow-dataset-file-test (Failed) arrow_dataset unittest
Error: Process completed with exit code 8.
```
[Another CI run from PR 49462](https://github.com/apache/arrow/actions/runs/25458933029?pr=49462)

**Empirical isolation** (PR #49931 commit `fb4d7a0`, pins removed, MinIO bumped) - [MINGW64 CI run](https://github.com/apache/arrow/actions/runs/25442601074/job/74637722454#step:13:1327):
```
The following tests FAILED:
	 41 - arrow-async-utility-test (Exit code 0xc0000374) arrow-tests unittest
	 44 - arrow-threading-utility-test (Timeout) arrow-tests unittest
	 62 - arrow-dataset-dataset-writer-test (Failed) arrow_dataset unittest
	 65 - arrow-dataset-file-test (Failed) arrow_dataset unittest
	 76 - arrow-s3fs-test (Timeout) arrow-tests filesystem unittest
```
CLANG64 in the same run: `100% tests passed`. This confirms that gcc-16 is the trigger (not the gcc-libs cascade or aws-sdk-cpp/MinIO behavior).
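For readers unfamiliar with the `std::bad_weak_ptr` reported for `arrow-dataset-dataset-writer-test` in the per-test status, here is a minimal sketch of two generic ways `shared_from_this()` can throw it. This is not Arrow code and is not a diagnosis of the failure: `Writer`, `DtorProbe`, and the helper functions are hypothetical names invented for illustration. It only shows that the exception appears whenever no live `std::shared_ptr` owns the object at the time of the call (guaranteed since C++17), which is why a lifetime race is one plausible suspect.

```cpp
#include <memory>

// Hypothetical stand-in for a writer-like object; not an Arrow class.
struct Writer : std::enable_shared_from_this<Writer> {
  // Throws std::bad_weak_ptr (C++17 and later) if no shared_ptr owns *this.
  std::shared_ptr<Writer> Self() { return shared_from_this(); }
};

// Case 1: the object was never owned by a shared_ptr.
bool ThrowsWhenUnowned() {
  Writer w;  // stack object, no owning shared_ptr exists
  try {
    w.Self();
  } catch (const std::bad_weak_ptr&) {
    return true;
  }
  return false;
}

// Hypothetical probe that calls shared_from_this() from its destructor,
// i.e. after the last owning shared_ptr has already let go.
struct DtorProbe : std::enable_shared_from_this<DtorProbe> {
  explicit DtorProbe(bool* threw) : threw_(threw) {}
  ~DtorProbe() {
    try {
      (void)shared_from_this();  // use count is already zero here
    } catch (const std::bad_weak_ptr&) {
      *threw_ = true;  // caught inside the destructor, nothing escapes
    }
  }
  bool* threw_;
};

// Case 2: the call happens once ownership has expired.
bool ThrowsAfterOwnershipExpired() {
  bool threw = false;
  {
    auto probe = std::make_shared<DtorProbe>(&threw);
  }  // last owner released here; the destructor runs
  return threw;
}

// Control case: with a live owner, the call succeeds.
bool SucceedsWhenOwned() {
  auto w = std::make_shared<Writer>();
  return w->Self() == w;
}
```

Each helper returns `true` on a conforming C++17 implementation. In a concurrent setting, the second case could correspond to a callback that runs after another thread has dropped the last reference, though whether that is what happens under gcc-16 here is still open.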
---
**Per-test status:**

(always failing)
- `arrow-async-utility-test` : Heap corruption (0xc0000374)
  - Root cause unclear
- `arrow-threading-utility-test` : SegFault, then 300s timeout
  - Could be another `__emutls` site or a gcc-16 codegen issue
- `arrow-dataset-dataset-writer-test` : `std::bad_weak_ptr` in `BatchWriteConcurrent`
  - Possibly an Arrow `shared_from_this()` lifetime bug exposed by gcc-16
- `arrow-dataset-file-test` : SegFault
  - Possibly the same race as `dataset-writer-test`
- `arrow-s3fs-test` : `_fut.Wait()` timeout in `GetFileInfoGeneratorStress` (the s3fs subtest that hammers the thread pool). Same thread-pool class as the others; the SDK-side `MissingContentMD5` failure mode was separately resolved by #49931's MinIO bump.

(intermittent)
- `arrow-acero-hash-join-node-test` : `HashJoin.Random` (`parallel = true`) doesn't `emit the expected number of batches`. Example failing CI logs: [here1](https://github.com/apache/arrow/actions/runs/25484292531/job/74775821192?pr=49462#step:12:754) and [here2](https://github.com/apache/arrow/actions/runs/25365304357/job/74374523124)
  - Root cause unknown

(fixed)
- `arrow-json-test` : `MultipleChunksParallel` crash
  - Fixed by #49462

---
**This issue closes when**
- all currently unfixed tests pass on current upstream MSYS2 packages
- at that point, #49945 reverts #49931's temporary pin step (removes `- name: Pin MSYS2 packages`)

### Component(s)

Continuous Integration, C++

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
