pacman82 commented on issue #1691: URL: https://github.com/apache/arrow-rs/issues/1691#issuecomment-1145173167
Hello @tustvold, thanks for your help here. Now I am a little confused. In order to implement <https://github.com/pacman82/odbc2parquet/issues/190> (tl;dr: I want to stop writing row groups as soon as the file size surpasses a user-defined threshold, and start writing the next row group into a new file), should I add up the `compressed_size` of the row groups or use the `total_byte_size` of the flushed groups? What would be the difference between the two? Either way, it shouldn't be hard to change on my end. Unless you advise otherwise, I'll run with `compressed_size` and see if my users are happy with it.
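For illustration, the file-splitting logic described above could be sketched roughly as follows. This is only a sketch under stated assumptions: `FlushedRowGroup` and `FileSizeTracker` are hypothetical names invented here, not part of the `parquet` crate or of odbc2parquet. It assumes the Parquet format's definition, in which `total_byte_size` counts the uncompressed column data while the compressed size counts the bytes of the column chunks as stored, so the compressed figure should track the on-disk file size more closely. It still ignores footer and metadata overhead, so the result is an approximation.

```rust
/// Hypothetical stand-in for the two sizes reported for a flushed row group.
/// These are NOT the parquet crate's types, just plain numbers for the sketch.
#[derive(Debug, Clone, Copy)]
pub struct FlushedRowGroup {
    /// Bytes of the column chunks as stored in the file (after compression).
    pub compressed_size: u64,
    /// Bytes of the column data before compression.
    pub total_byte_size: u64,
}

/// Hypothetical helper that tracks the approximate size of the current file.
pub struct FileSizeTracker {
    /// User-defined threshold in bytes.
    threshold: u64,
    /// Sum of the compressed sizes written to the current file so far.
    written: u64,
}

impl FileSizeTracker {
    pub fn new(threshold: u64) -> Self {
        FileSizeTracker { threshold, written: 0 }
    }

    /// Records a flushed row group. Returns `true` if the accumulated size
    /// now surpasses the threshold, meaning the next row group should go
    /// into a new file. The counter is reset in that case.
    pub fn record(&mut self, row_group: &FlushedRowGroup) -> bool {
        // Assumption: the compressed size approximates the on-disk bytes.
        self.written += row_group.compressed_size;
        if self.written > self.threshold {
            self.written = 0;
            true
        } else {
            false
        }
    }
}
```

With a threshold of 100 bytes and row groups of 60 compressed bytes each, the first call to `record` keeps the current file and the second call signals that a new file should be started. Summing `total_byte_size` instead would, under the assumption above, split files earlier than their actual size on disk warrants whenever compression is effective.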
