Victor Jimenez created ARROW-1998:
-------------------------------------

             Summary: [Python] Table.from_pandas crashes when data frame is empty
                 Key: ARROW-1998
                 URL: https://issues.apache.org/jira/browse/ARROW-1998
             Project: Apache Arrow
          Issue Type: Bug
          Components: Python
    Affects Versions: 0.8.0
         Environment: Windows 10 Build 15063.850
            Reporter: Victor Jimenez

Loading an empty CSV file and then attempting to create a PyArrow Table from it makes the application crash. The following code should reproduce the issue:

{code}
import numpy as np
import pandas as pd
import pyarrow as pa

FIELDS = ['id', 'name']

NUMPY_TYPES = {
    'id': np.int64,
    'name': np.unicode
}

PYARROW_SCHEMA = pa.schema([
    pa.field('id', pa.int64()),
    pa.field('name', pa.string())
])

file = open('input.csv', 'w')
file.close()

df = pd.read_csv(
    'input.csv',
    header=None,
    names=FIELDS,
    dtype=NUMPY_TYPES,
    engine='c',
)

pa.Table.from_pandas(df, schema=PYARROW_SCHEMA)
{code}

--
This message was sent by Atlassian JIRA
(v7.6.3#76005)