Victor Jimenez created ARROW-1998:
-------------------------------------
Summary: [Python] Table.from_pandas crashes when data frame is empty
Key: ARROW-1998
URL: https://issues.apache.org/jira/browse/ARROW-1998
Project: Apache Arrow
Issue Type: Bug
Components: Python
Affects Versions: 0.8.0
Environment: Windows 10 Build 15063.850
Reporter: Victor Jimenez
Loading an empty CSV file and then attempting to create a PyArrow Table from
the resulting data frame crashes the application. The following code should
reproduce the issue:
{code}
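import numpy as np
import pandas as pd
import pyarrow as pa

FIELDS = ['id', 'name']
NUMPY_TYPES = {
    'id': np.int64,
    # The original report used np.unicode, an alias of str on Python 3
    # that newer NumPy releases have removed.
    'name': str,
}
PYARROW_SCHEMA = pa.schema([
    pa.field('id', pa.int64()),
    pa.field('name', pa.string()),
])

# Create an empty CSV file.
with open('input.csv', 'w'):
    pass

# Read the empty file using the declared column names and dtypes.
df = pd.read_csv(
    'input.csv',
    header=None,
    names=FIELDS,
    dtype=NUMPY_TYPES,
    engine='c',
)

# This call crashes the application (reported against 0.8.0).
pa.Table.from_pandas(df, schema=PYARROW_SCHEMA)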
{code}