Yaron Gvili created ARROW-16582:
-----------------------------------
Summary: Include DATASET in list of components in PyArrow's dev
page
Key: ARROW-16582
URL: https://issues.apache.org/jira/browse/ARROW-16582
Project: Apache Arrow
Issue Type: Improvement
Reporter: Yaron Gvili
PyArrow's dev page has a [build-and-test
section|https://arrow.apache.org/docs/developers/python.html#build-and-test]
that currently does not list DATASET as a component. Using a recent Arrow
version (commit e5e490), I observed DATASET was mandatory for the successful
completion of the test suite run by `python -m pytest pyarrow/`, as
recommended on the page. Without `export PYARROW_WITH_DATASET=1`, I observed
errors in `test_dataset.py`, `test_exec_plan.py`, and a couple of others.
Since DATASET is intended to be an optional component, it should be listed in
this section. In addition, the documented test suite command should be updated
to one that does not fail when the DATASET component is not selected (or else
the test suite itself should be fixed).
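As a rough illustration of the second option, here is a minimal sketch of how a build could be probed for an optional component before running tests that depend on it. The helper name `component_available` is hypothetical and not part of PyArrow; the sketch only assumes that importing `pyarrow.dataset` raises `ImportError` when the component was not built.

```python
import importlib


def component_available(module_name: str) -> bool:
    """Return True if the module imports cleanly, False on ImportError.

    Hypothetical helper for illustration; not part of PyArrow.
    """
    try:
        importlib.import_module(module_name)
    except ImportError:
        # Covers a missing package as well as a missing compiled extension.
        return False
    return True


if __name__ == "__main__":
    # Report which of these modules are importable in the current build.
    for name in ("pyarrow", "pyarrow.dataset"):
        status = "available" if component_available(name) else "missing"
        print(f"{name}: {status}")
```

A test module could plausibly achieve the same effect with `pytest.importorskip("pyarrow.dataset")`, so that it is skipped rather than reported as an error when the component is absent. Whether that fits the existing test suite layout is an open question.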
--
This message was sent by Atlassian Jira
(v8.20.7#820007)