[
https://issues.apache.org/jira/browse/ARROW-9040?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
cmsxbc updated ARROW-9040:
--------------------------
Description:
Loading a parquet file write by pandas with default index.
When call _ParquetDatasetV2.read(columns=['column'], use_pandas_metadata=True),
"TypeError: unhashable type 'dict'" will be raised from
{code:java}
index_columns = set(_get_pandas_index_columns(metadata)){code}
Is it because of pandas default index?
was:
When call _ParquetDatasetV2.read(columns=['column'], use_pandas_metadata=True),
"TypeError: unhashable type 'dict'" will be raised from
{code:java}
index_columns = set(_get_pandas_index_columns(metadata))
{code}
> [Python][Parquet]"_ParquetDatasetV2" fail to read with columns and
> use_pandas_metadata=True
> -------------------------------------------------------------------------------------------
>
> Key: ARROW-9040
> URL: https://issues.apache.org/jira/browse/ARROW-9040
> Project: Apache Arrow
> Issue Type: Bug
> Components: Python
> Affects Versions: 0.17.1
> Reporter: cmsxbc
> Priority: Major
>
> Loading a parquet file write by pandas with default index.
> When call _ParquetDatasetV2.read(columns=['column'],
> use_pandas_metadata=True),
> "TypeError: unhashable type 'dict'" will be raised from
> {code:java}
> index_columns = set(_get_pandas_index_columns(metadata)){code}
> Is it because of pandas default index?
>
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)