[
https://issues.apache.org/jira/browse/ARROW-12736?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
ASF GitHub Bot updated ARROW-12736:
-----------------------------------
Labels: pull-request-available (was: )
> [C++] Eliminate unnecessary copy in FieldPath::Get()
> ----------------------------------------------------
>
> Key: ARROW-12736
> URL: https://issues.apache.org/jira/browse/ARROW-12736
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: David Li
> Assignee: David Li
> Priority: Major
> Labels: pull-request-available
> Fix For: 5.0.0
>
> Time Spent: 10m
> Remaining Estimate: 0h
>
> FieldPath::Get() uses RecordBatch::column_data which returns a
> vector<shared_ptr<>> by _value_ instead of by _reference_ - so with wide
> schemas, we end up copying and then destroying a lot of shared_ptrs. When
> done in a tight loop - as with dataset projection - this is a major
> performance pessimization.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)