Re: [I] [C++] write_dataset: max_open_files does not close least recently used file [arrow]

via GitHub Tue, 17 Dec 2024 06:46:06 -0800


xWaita commented on issue #45038:
URL: https://github.com/apache/arrow/issues/45038#issuecomment-2548644983


   There is a secondary write_dataset parameter in `max_rows_per_file` that can 
be used to control how large each file will get. 
   
   For `max_open_files`, it seems the primary intention is to prevent too many 
open files from building up in memory, in which case it would make sense to 
choose a file that is most likely to be complete, i.e. least recently used


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Re: [I] [C++] write_dataset: max_open_files does not close least recently used file [arrow]

Reply via email to