[
https://issues.apache.org/jira/browse/ARROW-5502?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16861100#comment-16861100
]
Neal Richardson commented on ARROW-5502:
----------------------------------------
Memory mapping would make loading data into memory lazy before it is copied
to R, and it will be necessary for calls like {{read_parquet(f, col_select)}}
to avoid reading all columns into Arrow before copying to R.
Yes, I believe it is possible now, but that's not a friendly enough interface
for package users, IMO.
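
As a rough illustration of the lazy behaviour described above, and of falling
back when mapping is not supported, here is a minimal sketch using only
Python's standard {{mmap}} module. It is not the Arrow R API: {{open_lazy}},
{{read_slice}}, {{demo}} and the two-block file layout are hypothetical names
and assumptions made up for this example.

```python
import mmap
import os
import tempfile


def open_lazy(path):
    """Return a bytes-like view of the file, memory-mapped when possible.

    Hypothetical helper: falls back to a plain read if mapping fails
    (for example on an empty file or an unsupported file system).
    """
    with open(path, "rb") as f:
        try:
            # Length 0 maps the whole file; pages are loaded on first access.
            return mmap.mmap(f.fileno(), 0, access=mmap.ACCESS_READ)
        except (ValueError, OSError):
            # Fallback: eager read of the whole file into memory.
            return f.read()


def read_slice(buf, offset, length):
    """Copy out only the requested byte range."""
    return bytes(buf[offset:offset + length])


def demo():
    """Write two 4096-byte blocks, then read 4 bytes from the second one."""
    with tempfile.TemporaryDirectory() as d:
        path = os.path.join(d, "data.bin")
        with open(path, "wb") as f:
            f.write(b"A" * 4096 + b"B" * 4096)
        buf = open_lazy(path)
        try:
            return read_slice(buf, 4096, 4)
        finally:
            # Release the mapping before the temporary directory is removed.
            if isinstance(buf, mmap.mmap):
                buf.close()
```

With a mapped file, {{demo()}} only needs to touch the pages backing the second
block, which is the same idea as selecting a subset of columns without loading
the rest. The caller does not need to know whether the fallback path was taken.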
> [R] file readers should mmap
> ----------------------------
>
> Key: ARROW-5502
> URL: https://issues.apache.org/jira/browse/ARROW-5502
> Project: Apache Arrow
> Issue Type: Improvement
> Components: R
> Reporter: Neal Richardson
> Priority: Major
> Fix For: 0.14.0
>
>
> Arrow is supposed to let you work with datasets bigger than memory. Memory
> mapping is a big part of that. It should be the default way that files are
> read in the `read_*` functions. To disable memory mapping, we could use a
> global `option()`, or a function argument, but that might clutter the
> interface. Or we could not give a choice and only fall back to not memory
> mapping if the platform/file system doesn't support it.