Hi all, I start this discussion to comment on the change to disable the use of mmap by default, which represents a risk in non-local/pseudo file systems that can affect performance. Part of the solution would be to have a flag at the compilation level that allows you to activate or deactivate the use of mmap in arrow C++/pyarrow. Here in [1] an analysis on the use of mmap in Database Management System is presented
Thanks. [1] https://db.cs.cmu.edu/papers/2022/cidr2022-p13-crotty.pdf