[GitHub] [arrow-rs] pacman82 commented on issue #1691: Make current position available in `FileWriter`.

GitBox Thu, 02 Jun 2022 11:18:46 -0700


pacman82 commented on issue #1691:
URL: https://github.com/apache/arrow-rs/issues/1691#issuecomment-1145173167


   Hello @tustvold, thanks for your help here. Now I am a little bit confused. 
In order to implement <https://github.com/pacman82/odbc2parquet/issues/190> 
(tl;dr: I want to stop writing row groups as soon as the file size surpasses a 
user-defined threshold, and start writing the next row group into a new file), 
should I add up the `compressed_size` of the row groups or use the 
`total_byte_size` of the flushed groups? What would be the difference between 
the two? It shouldn't be hard to change on my end. If you do not advise otherwise, 
I'll run with the `compressed_size` and see if my users are happy about it.
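
   To make the idea concrete, here is a rough sketch of the rollover logic I have 
in mind. It does not use the actual `parquet` crate types: `FlushedRowGroup` and 
`FileSizeTracker` are hypothetical names made up for illustration. It also 
assumes, from my reading of the Parquet metadata and not confirmed in this 
thread, that `compressed_size` is the number of bytes the column data occupies 
in the file and `total_byte_size` is its size before compression. The running 
sum ignores footers and other metadata, so it only approximates the file size.

```rust
/// Hypothetical stand-in for the two size figures a flushed row group reports.
#[derive(Debug, Clone, Copy)]
pub struct FlushedRowGroup {
    /// Assumed: bytes of column data as stored in the file (after compression).
    pub compressed_size: u64,
    /// Assumed: bytes of column data before compression.
    pub total_byte_size: u64,
}

/// Hypothetical helper that tracks the approximate size of the current file.
#[derive(Debug)]
pub struct FileSizeTracker {
    threshold: u64,
    written: u64,
}

impl FileSizeTracker {
    pub fn new(threshold: u64) -> Self {
        Self { threshold, written: 0 }
    }

    /// Records a flushed row group. Returns `true` once the accumulated
    /// compressed size surpasses the threshold, meaning the next row group
    /// should be written into a new file.
    pub fn record(&mut self, row_group: FlushedRowGroup) -> bool {
        self.written += row_group.compressed_size;
        self.written > self.threshold
    }

    /// Resets the counter after switching to a new output file.
    pub fn start_new_file(&mut self) {
        self.written = 0;
    }

    /// Approximate number of bytes written to the current file so far.
    pub fn written(&self) -> u64 {
        self.written
    }
}
```

Swapping `compressed_size` for `total_byte_size` in `record` would make the 
threshold apply to the uncompressed data volume instead of the on-disk size.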


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]
