[
https://issues.apache.org/jira/browse/ARROW-12681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17344847#comment-17344847
]
David Li commented on ARROW-12681:
----------------------------------
FWIW, maybe the better way would be to expose a {{columns}} option or similar
instead of exposing the IpcReadOptions struct wholesale? I'm thinking of
Parquet, where we don't expose (Arrow)ReaderProperties but rather various
keyword arguments.
(Though of course, as discussed in ARROW-12683, this won't reduce network
transfer either way; we always read the entire record batch.)
> [Python] Expose IpcReadOptions to ipc facility
> ----------------------------------------------
>
> Key: ARROW-12681
> URL: https://issues.apache.org/jira/browse/ARROW-12681
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Python
> Reporter: Francois Saint-Jacques
> Priority: Minor
>
> I would like to be able to read only a subset of columns from a given IPC
> file. To do this, we need to expose the EXPERIMENTAL (is it still?)
> IpcReadOptions.included_fields option. The reason is that the file is on
> remote storage and can't be mmapped, so I want to minimize network transfer.
> I do not know the best way to "pythonize" IpcReaderOptions and would need
> help on this.