paleolimbot commented on pull request #11730: URL: https://github.com/apache/arrow/pull/11730#issuecomment-994934312
Trying to track down the source of this, and one other theory is that it's and issue with the lifecycle of the underlying buffers (that they are getting freed out from under the `struct ArrowArray`). Again, just a theory! But it would explain the mix of intermittent success, corrupted data, and segfaults. The Array is created here: https://github.com/duckdb/duckdb/blob/048bce50e92b643e90cf7cc6bccda56a82378961/src/common/types/data_chunk.cpp#L684-L725 To my reading, there's nothing in the private data that guarantees that the underlying `DataChunk` won't get deleted: https://github.com/duckdb/duckdb/blob/048bce50e92b643e90cf7cc6bccda56a82378961/src/common/types/data_chunk.cpp#L272-L297 ...but perhaps I'm misreading the C++ here. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org