adamreeve commented on issue #44917: URL: https://github.com/apache/arrow/issues/44917#issuecomment-2515793810
Hi @mpoeter, you could try using the non-Arrow Parquet API, which is lower level and gives you more control, but can be more complex when dealing with nulls and nested data. The `TypedColumnReader::ReadBatch` method reads into a memory location provided by the user: https://github.com/apache/arrow/blob/50f2d6e04e8323119d4dd31506827ee398d6b8e4/cpp/src/parquet/column_reader.h#L219-L220 (Although it does look like there's an opportunity to optimise the Arrow based reader) -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
