Yaron Gvili created ARROW-16582:
-----------------------------------

             Summary: Include DATASET in list of components in PyArrow's dev page
                 Key: ARROW-16582
                 URL: https://issues.apache.org/jira/browse/ARROW-16582
             Project: Apache Arrow
          Issue Type: Improvement
            Reporter: Yaron Gvili


PyArrow's dev page has a [build-and-test 
section|https://arrow.apache.org/docs/developers/python.html#build-and-test] 
that currently does not list DATASET as a component. Using a recent Arrow 
version (commit e5e490), I observed that DATASET was mandatory for the 
successful completion of the test suite run by `python -m pytest pyarrow/`, 
as recommended on the page. Without `export PYARROW_WITH_DATASET=1`, I 
observed errors in `test_dataset.py`, `test_exec_plan.py`, and a couple of 
others.

Since DATASET is intended to be an optional component, it should be listed in 
this section. In addition, the documented test suite command should be updated 
to one that doesn't fail when the DATASET component is not selected (or else 
the test suite itself should be fixed).
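
To illustrate the second option (fixing the test suite), here is a minimal sketch of how a test module could skip itself when an optional component is missing instead of erroring. It is not PyArrow's actual test code: `component_available`, `HAVE_DATASET`, and `DatasetTests` are hypothetical names, and the sketch assumes that a build without DATASET fails to import `pyarrow.dataset`. It uses the standard library's `unittest` only to stay dependency-free.

```python
import importlib
import unittest


def component_available(module_name):
    """Return True if the module imports cleanly, False otherwise."""
    try:
        importlib.import_module(module_name)
    except ImportError:
        # Covers a missing package as well as a missing compiled extension.
        return False
    return True


# Assumption: without PYARROW_WITH_DATASET=1 at build time, importing
# pyarrow.dataset raises ImportError.
HAVE_DATASET = component_available("pyarrow.dataset")


@unittest.skipUnless(HAVE_DATASET, "PyArrow was built without DATASET")
class DatasetTests(unittest.TestCase):  # hypothetical test class
    def test_module_imports(self):
        import pyarrow.dataset as ds

        self.assertTrue(hasattr(ds, "dataset"))
```

In a pytest-based suite, calling `pytest.importorskip("pyarrow.dataset")` at the top of a test module would give a comparable skip. Whether that fits the affected files would need to be checked against how they are currently marked.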



--
This message was sent by Atlassian Jira
(v8.20.7#820007)
