[
https://issues.apache.org/jira/browse/ARROW-17089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17571094#comment-17571094
]
Aldrin Montana commented on ARROW-17089:
----------------------------------------
I linked ARROW-17092, because I think it would be nice if it was addressed in
the same release as this issue.
relevant mailing list posts:
https://lists.apache.org/thread/5kglmy2q9wk17whpr8p4oploxlsdgwyz
> [Python] Use `.arrow` as extension for IPC file dataset
> -------------------------------------------------------
>
> Key: ARROW-17089
> URL: https://issues.apache.org/jira/browse/ARROW-17089
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Python
> Affects Versions: 8.0.0
> Reporter: SHIMA Tatsuya
> Assignee: SHIMA Tatsuya
> Priority: Major
> Labels: pull-request-available
> Time Spent: 10m
> Remaining Estimate: 0h
>
> Same as ARROW-17088
> As noted in the following document, the recommended extension for IPC files
> is now `.arrow`.
> > We recommend the “.arrow” extension for files created with this format.
> https://arrow.apache.org/docs/format/Columnar.html#ipc-file-format
> However, currently when writing a dataset with the
> {{pyarrow.dataset.write_dataset}} function, the default extension is
> {{.feather}} when {{arrow}} or {{ipc}} or {{feather}} is selected.
> https://github.com/apache/arrow/blob/b8067151db9bfc04860285fdd8b5e73703346037/python/pyarrow/_dataset.pyx#L1149-L1151
--
This message was sent by Atlassian Jira
(v8.20.10#820010)