[
https://issues.apache.org/jira/browse/ARROW-6568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Benjamin Kietzman reassigned ARROW-6568:
----------------------------------------
Assignee: Benjamin Kietzman
> [C++][Python][Parquet] pyarrow.parquet crash writing zero-chunk
> dictionary-type column
> --------------------------------------------------------------------------------------
>
> Key: ARROW-6568
> URL: https://issues.apache.org/jira/browse/ARROW-6568
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 0.14.1
> Environment: Pyarrow v0.14.1, manylinux1
> Reporter: Adam Hooper
> Assignee: Benjamin Kietzman
> Priority: Major
> Fix For: 0.15.0
>
>
> Trying to write a zero-RecordBatch file to parquet:
> {code:python}
> import pyarrow
> import pyarrow.parquet
> table = pyarrow.Table.from_batches([], pyarrow.schema([('A',
> pyarrow.dictionary(pyarrow.int32(), pyarrow.string()))]))
> pyarrow.parquet.write_table(table, 'x.parquet')
> {code}
> ... I receive an error and Python exits with exit code {{139}}:
> {noformat}
> WARNING: Logging before InitGoogleLogging() is written to STDERR
> F0915 18:37:23.099939 1 table.cc:64] Check failed: (chunks.size()) > (0)
> cannot construct ChunkedArray from empty vector and omitted type
> *** Check failure stack trace: ***
> {noformat}
--
This message was sent by Atlassian Jira
(v8.3.2#803003)