[
https://issues.apache.org/jira/browse/ARROW-9577?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Apache Arrow JIRA Bot reassigned ARROW-9577:
--------------------------------------------
Assignee: Antoine Pitrou (was: Apache Arrow JIRA Bot)
> [Python][C++] posix_madvise error on Debian in pyarrow 1.0.0
> ------------------------------------------------------------
>
> Key: ARROW-9577
> URL: https://issues.apache.org/jira/browse/ARROW-9577
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 1.0.0
> Environment: Installed with Miniconda (for Debian; used pip for the
> Ubuntu test)
> Reporter: Jim Pivarski
> Assignee: Antoine Pitrou
> Priority: Major
> Labels: pull-request-available
> Fix For: 1.0.1, 2.0.0
>
> Time Spent: 10m
> Remaining Estimate: 0h
>
> The following writes and reads back from a Parquet file in both pyarrow
> 0.17.0 and 1.0.0 on Ubuntu 18.04:
>
> {code:java}
> >>> import pyarrow.parquet
> >>> a = pyarrow.array([[1.1, 2.2, 3.3], [], [4.4, 5.5]])
> >>> t = pyarrow.Table.from_batches([pyarrow.RecordBatch.from_arrays([a],
> ...     ["stuff"])])
> >>> pyarrow.parquet.write_table(t, "stuff.parquet")
> >>> t2 = pyarrow.parquet.read_table("stuff.parquet")
> {code}
>
> However, the same thing raises the following exception on Debian 9 (stretch)
> in pyarrow 1.0.0 but not in pyarrow 0.17.0:
> {code:java}
> Traceback (most recent call last):
>   File "<stdin>", line 1, in <module>
>   File "/home/jpivarski/miniconda3/lib/python3.7/site-packages/pyarrow/parquet.py", line 1564, in read_table
>     filters=filters,
>   File "/home/jpivarski/miniconda3/lib/python3.7/site-packages/pyarrow/parquet.py", line 1433, in __init__
>     partitioning=partitioning)
>   File "/home/jpivarski/miniconda3/lib/python3.7/site-packages/pyarrow/dataset.py", line 667, in dataset
>     return _filesystem_dataset(source, **kwargs)
>   File "/home/jpivarski/miniconda3/lib/python3.7/site-packages/pyarrow/dataset.py", line 434, in _filesystem_dataset
>     return factory.finish(schema)
>   File "pyarrow/_dataset.pyx", line 1451, in pyarrow._dataset.DatasetFactory.finish
>   File "pyarrow/error.pxi", line 122, in pyarrow.lib.pyarrow_internal_check_status
>   File "pyarrow/error.pxi", line 99, in pyarrow.lib.check_status
> OSError: posix_madvise failed. Detail: [errno 0] Success
> {code}
> It's a little odd that the error says that it failed with "detail: success".
> That suggests to me that an "if" predicate is backward (missing "not"), which
> might only be triggered on some OS/distributions.
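
One other way a message like "[errno 0] Success" could arise, offered only as a possibility: POSIX specifies that posix_madvise returns the error number directly rather than setting errno, so code that detects the failure from the return value but then formats errno may report a stale 0. Whether that is what happens in Arrow is an assumption, not something the report confirms. The sketch below is a pure-Python simulation; fake_madvise, GLOBAL_ERRNO, describe_wrong and describe_right are hypothetical names, not Arrow or pyarrow APIs.

```python
import errno
import os

# Simulated "errno" that the fake call below never touches, mimicking a C
# function that reports errors through its return value only.
GLOBAL_ERRNO = 0


def fake_madvise(fail: bool) -> int:
    """Hypothetical stand-in: return 0 on success, else an error number."""
    return errno.EINVAL if fail else 0


def describe_wrong(ret: int) -> str:
    """Detects failure from ret, but formats the untouched GLOBAL_ERRNO."""
    if ret != 0:
        return (f"posix_madvise failed. Detail: "
                f"[errno {GLOBAL_ERRNO}] {os.strerror(GLOBAL_ERRNO)}")
    return "ok"


def describe_right(ret: int) -> str:
    """Treats the return value itself as the error number."""
    if ret != 0:
        return (f"posix_madvise failed. Detail: "
                f"[errno {ret}] {os.strerror(ret)}")
    return "ok"


if __name__ == "__main__":
    ret = fake_madvise(fail=True)
    print(describe_wrong(ret))   # reports errno 0 despite the failure
    print(describe_right(ret))   # reports the error number that was returned
```

The exact text printed after the error number comes from os.strerror and depends on the platform's C library.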
--
This message was sent by Atlassian Jira
(v8.3.4#803005)