Li Jin created ARROW-1291: ----------------------------- Summary: [Python] pa.RecordBatch.from_pandas doesn't accept DataFrame with numeric column names Key: ARROW-1291 URL: https://issues.apache.org/jira/browse/ARROW-1291 Project: Apache Arrow Issue Type: Bug Affects Versions: 0.5.0 Reporter: Li Jin
{code} import pyarrow as pa import pandas as pd df = pd.DataFrame([1]) pa.RecordBatch.from_pandas(df) {code} Exception: {code} TypeError Traceback (most recent call last) <ipython-input-5-670ba4a2ddb2> in <module>() 3 4 df = pd.DataFrame([1]) ----> 5 pa.RecordBatch.from_pandas(df) table.pxi in pyarrow.lib.RecordBatch.from_pandas() table.pxi in pyarrow.lib._dataframe_to_arrays() /home/icexelloss/miniconda3/envs/spark-dev/lib/python3.5/site-packages/pyarrow/pandas_compat.py in construct_metadata(df, index_levels, preserve_index, types) 187 arrow_type=arrow_type 188 ) --> 189 for name, arrow_type in zip(df.columns, df_types) 190 ] + ( 191 [ /home/icexelloss/miniconda3/envs/spark-dev/lib/python3.5/site-packages/pyarrow/pandas_compat.py in <listcomp>(.0) 187 arrow_type=arrow_type 188 ) --> 189 for name, arrow_type in zip(df.columns, df_types) 190 ] + ( 191 [ /home/icexelloss/miniconda3/envs/spark-dev/lib/python3.5/site-packages/pyarrow/pandas_compat.py in get_column_metadata(column, name, arrow_type) 125 raise TypeError( 126 'Column name must be a string. Got column {} of type {}'.format( --> 127 name, type(name).__name__ 128 ) 129 ) TypeError: Column name must be a string. Got column 0 of type int64 {code} -- This message was sent by Atlassian JIRA (v6.4.14#64029)