Victor Jimenez created ARROW-1998:
-------------------------------------

             Summary: [Python] Table.from_pandas crashes when data frame is 
empty
                 Key: ARROW-1998
                 URL: https://issues.apache.org/jira/browse/ARROW-1998
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 0.8.0
         Environment: Windows 10 Build 15063.850
            Reporter: Victor Jimenez


Loading an empty CSV file, and then attempting to create a PyArrow Table from 
it makes the application crash. The following code should be able to reproduce 
the issue:
{code}
import numpy as np
import pandas as pd
import pyarrow as pa

FIELDS = ['id', 'name']
NUMPY_TYPES = {
    'id': np.int64,
    'name': np.unicode
}
PYARROW_SCHEMA = pa.schema([
    pa.field('id', pa.int64()),
    pa.field('name', pa.string())
])

file = open('input.csv', 'w')
file.close()

df = pd.read_csv(
    'input.csv',
    header=None,
    names=FIELDS,
    dtype=NUMPY_TYPES,
    engine='c',
)

pa.Table.from_pandas(df, schema=PYARROW_SCHEMA)
{code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to