[
https://issues.apache.org/jira/browse/ARROW-3388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17231553#comment-17231553
]
Joris Van den Bossche commented on ARROW-3388:
----------------------------------------------
I am fine with requiring a schema from the user to get boolean data.
bq. Partition columns are inferred with dictionary type by default
Not in C++, I think? (it's an option to do it?)
> [C++][Dataset] Automatically detect boolean partition columns
> -------------------------------------------------------------
>
> Key: ARROW-3388
> URL: https://issues.apache.org/jira/browse/ARROW-3388
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Python
> Reporter: Uwe Korn
> Priority: Major
> Labels: dataset, dataset-parquet-read, parquet
>
> Saving a {{ParquetDataset}} using a boolean column as a partitioning column
> will store {{True/False}} as the values in the path. On reload these columns
> will then be string columns with the values {{'True'}} and {{'False'}}.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)