Weston Pace created ARROW-13224:
-----------------------------------
Summary: [Python][Doc] Documentation missing for
pyarrow.dataset.write_dataset
Key: ARROW-13224
URL: https://issues.apache.org/jira/browse/ARROW-13224
Project: Apache Arrow
Issue Type: Improvement
Components: Documentation, Python
Reporter: Weston Pace
I don't believe this is meant to be internal. pyarrow.parquet.write_to_dataset
uses this (if use_legacy_dataset=False) but the parquet API doesn't expose the
same features. A new example should also probably be added to the Tabular
Datasets section of the docs explaining why write_dataset can take in a scanner
(e.g. memory preserving, ability to write a dataset from flight or any record
batch source, etc.)
--
This message was sent by Atlassian Jira
(v8.3.4#803005)