wgtmac commented on issue #33652:
URL: https://github.com/apache/arrow/issues/33652#issuecomment-1401673575

   > I'm don't think you are missing any API, although I am not sure the use 
case for exposing it during the write. Why not call `Tell()` on the sink after 
flushing?
   
   I think `compressed_written_bytes` would be useful if we want to limit 
compressed size of row group if we buffer row group in the parquet writer.
   
   For now we can only get size of flushed row groups via `Tell()` and raw 
(uncompressed) size of buffered row group via `total_bytes_written()`. As we 
cannot precisely estimate the compression ratio, it is not easy to limit the 
actual row group size after compression.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to