wgtmac commented on issue #33652: URL: https://github.com/apache/arrow/issues/33652#issuecomment-1401673575
> I'm don't think you are missing any API, although I am not sure the use case for exposing it during the write. Why not call `Tell()` on the sink after flushing? I think `compressed_written_bytes` would be useful if we want to limit compressed size of row group if we buffer row group in the parquet writer. For now we can only get size of flushed row groups via `Tell()` and raw (uncompressed) size of buffered row group via `total_bytes_written()`. As we cannot precisely estimate the compression ratio, it is not easy to limit the actual row group size after compression. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
