Raúl Cumplido created ARROW-16548:
-------------------------------------
Summary: [Python] Fix test_parquet_invalid_version test when no
parquet or dataset are defined
Key: ARROW-16548
URL: https://issues.apache.org/jira/browse/ARROW-16548
Project: Apache Arrow
Issue Type: Bug
Components: Python
Affects Versions: 8.0.0
Reporter: Raúl Cumplido
Assignee: Raúl Cumplido
Fix For: 9.0.0
Working on the minimal_builds I have found that when building arrow and pyarrow
without DATASET nor PARQUET the following error happens:
{code:java}
def test_parquet_invalid_version(tempdir):
table = pa.table({'a': [1, 2, 3]})
with pytest.raises(ValueError, match="Unsupported Parquet format
version"):
> _write_table(table, tempdir / 'test_version.parquet', version="2.2")
E NameError: name '_write_table' is not defined
pyarrow/tests/parquet/test_basic.py:52: NameError{code}
This is the first test on the parquet/test_basic.py module.
I have done some investigation and if I change the test order (move that test
on the file), the test that fails is a different one, always the first test on
the module.
If I import (but not use) from pyarrow.tests.parquet.common import pytestmark
no test failures happen.
There seems to be a bug on how the pytestmark is injected. I am creating a
ticket because it seems it might require more than a MINOR fix.
--
This message was sent by Atlassian Jira
(v8.20.7#820007)