[
https://issues.apache.org/jira/browse/ARROW-12036?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Neal Richardson resolved ARROW-12036.
-------------------------------------
Resolution: Duplicate
Thanks, this will be handled in ARROW-9657
> [R] dataset by a single parquet file
> ------------------------------------
>
> Key: ARROW-12036
> URL: https://issues.apache.org/jira/browse/ARROW-12036
> Project: Apache Arrow
> Issue Type: Wish
> Reporter: Zsolt Kegyes-Brassai
> Priority: Minor
>
> I like using the {{dplyr}} in conjunction with
> [datasets|https://arrow.apache.org/docs/r/articles/dataset.html], it results
> in a clean code.
> There are times, when I would like to use the same workflow just for a single
> (larger) parquet file and in most of those cases it doesn’t make sense to
> create a separate folder for just one file.
> (the {{read_parquet()}} provides options only for selecting the columns, no
> filtering and grouping)
> Is it possible/does it make sense to extend the {{open_dataset()}} with an
> option to specify just a single file?
--
This message was sent by Atlassian Jira
(v8.3.4#803005)