[ 
https://issues.apache.org/jira/browse/ARROW-2598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney updated ARROW-2598:
--------------------------------
    Labels: parquet  (was: )

> [Python]  table.to_pandas segfault
> ----------------------------------
>
>                 Key: ARROW-2598
>                 URL: https://issues.apache.org/jira/browse/ARROW-2598
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>            Reporter: jacques
>            Priority: Major
>              Labels: parquet
>
> Here is a small snippet which produces a segfault:
> {noformat}
> In [1]: import pyarrow as pa
> In [2]: import pyarrow.parquet as pq
> In [3]: pa_ar = pa.array([[], []])
> In [4]: pq.write_table(
>    ...:     table=pa.Table.from_arrays([pa_ar],["test"]),
>    ...:     where="test5.parquet",
>    ...:     compression="snappy",
>    ...:     flavor="spark"
>    ...: )
> In [5]: pq.read_table("test5.parquet")
> Out[5]: 
> pyarrow.Table
> test: list<item: null>
>   child 0, item: null
> In [6]: pq.read_table("test5.parquet").to_pydict()
> Out[6]: OrderedDict([(u'test', [None, None])])
> In [7]: pq.read_table("test5.parquet").to_pandas()
> Segmentation fault
> {noformat}
> I thank you in advance for having this fixed.
> Best, 
> Jacques



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to