jacques created ARROW-2598: ------------------------------ Summary: [Python] table.to_pandas segfault Key: ARROW-2598 URL: https://issues.apache.org/jira/browse/ARROW-2598 Project: Apache Arrow Issue Type: Bug Components: Python Reporter: jacques
Here is a small snippet which produce a segfault: {noformat} In [1]: import pyarrow as pa In [2]: import pyarrow.parquet as pq In [3]: pa_ar = pa.array([[], []]) In [4]: pq.write_table( ...: table=pa.Table.from_arrays([pa_ar],["test"]), ...: where="test5.parquet", ...: compression="snappy", ...: flavor="spark" ...: ) In [5]: pq.read_table("test5.parquet") Out[5]: pyarrow.Table test: list<item: null> child 0, item: null In [6]: pq.read_table("test5.parquet").to_pydict() Out[6]: OrderedDict([(u'test', [None, None])]) In [7]: pq.read_table("test5.parquet").to_pandas() Segmentation fault {noformat} -- This message was sent by Atlassian JIRA (v7.6.3#76005)