[ 
https://issues.apache.org/jira/browse/ARROW-10347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joris Van den Bossche updated ARROW-10347:
------------------------------------------
    Description: 
See https://www.mail-archive.com/[email protected]/msg00680.html, and my 
answer to it (experimentation in 
https://nbviewer.jupyter.org/gist/jorisvandenbossche/9382de2eb96db5db2ef801f63a359082).
 
It seems we support the case where the partition field is also present as a 
column in the actual data, but it would be good to add explicit tests that pin 
down the expected behaviour.

> [Python][Dataset] Test behaviour in case of duplicate partition field / data 
> column
> -----------------------------------------------------------------------------------
>
>                 Key: ARROW-10347
>                 URL: https://issues.apache.org/jira/browse/ARROW-10347
>             Project: Apache Arrow
>          Issue Type: Improvement
>          Components: Python
>            Reporter: Joris Van den Bossche
>            Priority: Major


