[
https://issues.apache.org/jira/browse/ARROW-11206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17381571#comment-17381571
]
Weston Pace commented on ARROW-11206:
-------------------------------------
I think MakeStruct is good and much clearer. This caused plenty of confusion
for me as well. I doubt many python uses are using pyarrow.compute.project
anyways since they probably specify it via the scanner or dataset options.
> [C++][Dataset][Python] Consider hiding/renaming "project"
> ---------------------------------------------------------
>
> Key: ARROW-11206
> URL: https://issues.apache.org/jira/browse/ARROW-11206
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Affects Versions: 2.0.0
> Reporter: Ben Kietzman
> Assignee: Ben Kietzman
> Priority: Major
> Labels: compute, dataset, pull-request-available
> Fix For: 5.0.0
>
> Time Spent: 20m
> Remaining Estimate: 0h
>
> The "project" compute Function is necessary for ARROW-11174. However it is
> not intended for direct use outside an Expression ([where the correspondence
> to projection is not immediately
> obvious|https://github.com/apache/arrow/pull/9131#issuecomment-757764173]) so
> it may be preferable to do one/more of:
> * rename the function to "wrap_struct" or similar so it does make sense
> outside Expressions
> * ensure the function is not exposed to python/R bindings except through
> Expressions
> * remove the function from the default registry
--
This message was sent by Atlassian Jira
(v8.3.4#803005)