pranav kohli created ARROW-2728: ----------------------------------- Summary: Pyarrow not adding partition columns when given a glob path Key: ARROW-2728 URL: https://issues.apache.org/jira/browse/ARROW-2728 Project: Apache Arrow Issue Type: Bug Components: Python Affects Versions: 0.9.0 Environment: pyarrow : 0.9.0.post1 dask : 0.17.1 Mac OS Reporter: pranav kohli
I am saving a dask dataframe to parquet with two partition columns using the pyarrow engine. The problem arises in scanning the partition columns. When I scan using the directory path, I get the partition columns in the output dataframe, whereas if I scan using the glob path, I dont get these columns -- This message was sent by Atlassian JIRA (v7.6.3#76005)