[
https://issues.apache.org/jira/browse/ARROW-14612?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17448251#comment-17448251
]
David Li commented on ARROW-14612:
----------------------------------
The current APIs trim the filenames before they're handed to partitioning, but
assuming we can change that, we add or update the partitioning schemes to allow
for this as well without too much trouble, I think. (If the filenames weren't
trimmed, then it could already be done - at least in C++ - via a
FunctionPartitioning.)
> [C++] Support for filename-based partitioning
> ---------------------------------------------
>
> Key: ARROW-14612
> URL: https://issues.apache.org/jira/browse/ARROW-14612
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: Nicola Crane
> Priority: Major
>
> Directory-based partitioning is a feature of Arrow, but could we support
> filename-based partitioning?
> e.g. I have a series of CSV files here all called something like
> {{foo_month_year.csv}} and it'd be nice to be able to read them in and then
> the month/year bits of the filenames then appear as fields I can filter on
> etc.
>
--
This message was sent by Atlassian Jira
(v8.20.1#820001)