[
https://issues.apache.org/jira/browse/ARROW-16582?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche resolved ARROW-16582.
-------------------------------------------
Resolution: Fixed
Issue resolved by pull request 13187
[https://github.com/apache/arrow/pull/13187]
> [Python] Include DATASET in list of components in PyArrow's dev page
> --------------------------------------------------------------------
>
> Key: ARROW-16582
> URL: https://issues.apache.org/jira/browse/ARROW-16582
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Documentation, Python
> Reporter: Yaron Gvili
> Assignee: Raúl Cumplido
> Priority: Major
> Labels: pull-request-available
> Fix For: 9.0.0
>
> Time Spent: 40m
> Remaining Estimate: 0h
>
> PyArrow's dev page has a [build-and-test
> section|https://arrow.apache.org/docs/developers/python.html#build-and-test]
> that currently does not list DATASET as a component. Using a recent Arrow
> version (commit e5e490), I observed DATASET was mandatory for the successful
> completion of the test suite ran by `{color:#201f1e}python -m pytest
> pyarrow/{color}`, as recommended on the page. Without `export
> PYARROW_WITH_DATASET=1`, I observed errors with `test_dataset.py`,
> `test_exec_plan.py`, and a couple others.
> Since DATASET is intended to be an optional component, it should be listed on
> this section. In addition, the documented test suite command should be
> updated to one that doesn't fail without the DATASET component being selected
> (or else the test suite itself should be fixed).
--
This message was sent by Atlassian Jira
(v8.20.7#820007)