Wes McKinney created ARROW-4832:
-----------------------------------
Summary: [Python] pandas Index metadata for RangeIndex is incorrect
Key: ARROW-4832
URL: https://issues.apache.org/jira/browse/ARROW-4832
Project: Apache Arrow
Issue Type: Bug
Components: Python
Reporter: Wes McKinney
Fix For: 0.14.0
I'm looking at ARROW-1639 to optimize storage and loading of RangeIndex, but in
the meantime I wanted to report this oddness:
{code}
In [9]: df = pd.DataFrame({'a': [1, 2, 3]})
In [10]: json.loads(pa.Table.from_pandas(df).schema.metadata[b'pandas'])
Out[10]:
{'index_columns': ['__index_level_0__'],
'column_indexes': [{'name': None,
'field_name': None,
'pandas_type': 'unicode',
'numpy_type': 'object',
'metadata': {'encoding': 'UTF-8'}}],
'columns': [{'name': 'a',
'field_name': 'a',
'pandas_type': 'int64',
'numpy_type': 'int64',
'metadata': None},
{'name': None,
'field_name': '__index_level_0__',
'pandas_type': 'int64',
'numpy_type': 'int64',
'metadata': None}],
'pandas_version': '0.23.4'}
{code}
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)