[ https://issues.apache.org/jira/browse/ARROW-2598?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

jacques updated ARROW-2598:
---------------------------
    Description: 
Here is a small snippet which produces a segfault:
{noformat}
In [1]: import pyarrow as pa

In [2]: import pyarrow.parquet as pq

In [3]: pa_ar = pa.array([[], []])

In [4]: pq.write_table(
   ...:     table=pa.Table.from_arrays([pa_ar],["test"]),
   ...:     where="test5.parquet",
   ...:     compression="snappy",
   ...:     flavor="spark"
   ...: )

In [5]: pq.read_table("test5.parquet")
Out[5]: 
pyarrow.Table
test: list<item: null>
  child 0, item: null

In [6]: pq.read_table("test5.parquet").to_pydict()
Out[6]: OrderedDict([(u'test', [None, None])])

In [7]: pq.read_table("test5.parquet").to_pandas()
Segmentation fault

{noformat}

I thank you in advance for having this fixed.

Best, 

Jacques

  was:
Here is a small snippet which produce a segfault:

{noformat}

In [1]: import pyarrow as pa

In [2]: import pyarrow.parquet as pq

In [3]: pa_ar = pa.array([[], []])

In [4]: pq.write_table(
   ...:     table=pa.Table.from_arrays([pa_ar],["test"]),
   ...:     where="test5.parquet",
   ...:     compression="snappy",
   ...:     flavor="spark"
   ...: )

In [5]: pq.read_table("test5.parquet")
Out[5]: 
pyarrow.Table
test: list<item: null>
  child 0, item: null

In [6]: pq.read_table("test5.parquet").to_pydict()
Out[6]: OrderedDict([(u'test', [None, None])])

In [7]: pq.read_table("test5.parquet").to_pandas()
Segmentation fault

 

{noformat}


> [Python]  table.to_pandas segfault
> ----------------------------------
>
>                 Key: ARROW-2598
>                 URL: https://issues.apache.org/jira/browse/ARROW-2598
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: Python
>            Reporter: jacques
>            Priority: Major
>
> Here is a small snippet which produces a segfault:
> {noformat}
> In [1]: import pyarrow as pa
> In [2]: import pyarrow.parquet as pq
> In [3]: pa_ar = pa.array([[], []])
> In [4]: pq.write_table(
>    ...:     table=pa.Table.from_arrays([pa_ar],["test"]),
>    ...:     where="test5.parquet",
>    ...:     compression="snappy",
>    ...:     flavor="spark"
>    ...: )
> In [5]: pq.read_table("test5.parquet")
> Out[5]: 
> pyarrow.Table
> test: list<item: null>
>   child 0, item: null
> In [6]: pq.read_table("test5.parquet").to_pydict()
> Out[6]: OrderedDict([(u'test', [None, None])])
> In [7]: pq.read_table("test5.parquet").to_pandas()
> Segmentation fault
> {noformat}
> I thank you in advance for having this fixed.
> Best, 
> Jacques



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
