[
https://issues.apache.org/jira/browse/ARROW-10347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-10347:
------------------------------------------
Description:
See https://www.mail-archive.com/[email protected]/msg00680.html, and my
answer to it (experimentation in
https://nbviewer.jupyter.org/gist/jorisvandenbossche/9382de2eb96db5db2ef801f63a359082).
It seems we support that the partition field is also present in the actual
data, but it's probably good to add some explicit tests to ensure the expected
behaviour.
> [Python][Dataset] Test behaviour in case of duplicate partition field / data
> column
> -----------------------------------------------------------------------------------
>
> Key: ARROW-10347
> URL: https://issues.apache.org/jira/browse/ARROW-10347
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Python
> Reporter: Joris Van den Bossche
> Priority: Major
>
> See https://www.mail-archive.com/[email protected]/msg00680.html, and my
> answer to it (experimentation in
> https://nbviewer.jupyter.org/gist/jorisvandenbossche/9382de2eb96db5db2ef801f63a359082).
>
> It seems we support that the partition field is also present in the actual
> data, but it's probably good to add some explicit tests to ensure the
> expected behaviour.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)