[
https://issues.apache.org/jira/browse/ARROW-13737?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Neal Richardson resolved ARROW-13737.
-------------------------------------
Resolution: Fixed
Issue resolved by pull request 10994
[https://github.com/apache/arrow/pull/10994]
> [C++] Support scalar columns in hash aggregations (was: hash_sum on scalar
> column segfaults)
> --------------------------------------------------------------------------------------------
>
> Key: ARROW-13737
> URL: https://issues.apache.org/jira/browse/ARROW-13737
> Project: Apache Arrow
> Issue Type: Improvement
> Components: C++
> Reporter: Neal Richardson
> Assignee: David Li
> Priority: Major
> Labels: kernel, pull-request-available, query-engine
> Fix For: 6.0.0
>
> Time Spent: 2h
> Remaining Estimate: 0h
>
> Reproducer in R. Works fine without grouping, but segfaults if you group:
> {code}
> tab <- Table$create(a=1:4, b=6:9)
> tab %>%
> mutate(counter = 1) %>%
> summarize(count = sum(counter))
> Table
> 1 rows x 1 columns
> $count <double>
> tab %>%
> mutate(counter = 1) %>%
> group_by(a) %>%
> summarize(count = sum(counter))
> *** caught segfault ***
> address 0x0, cause 'invalid permissions'
> Traceback:
> 1: .Call(`_arrow_ExecPlan_run`, plan, final_node)
> 2: ExecPlan_run(self, node)
> 3: plan$Run(final_node)
> 4: do_exec_plan(.data, group_vars = dplyr::group_vars(.data))
> 5: do_arrow_summarize(.data, ...)
> ...
> {code}
--
This message was sent by Atlassian Jira
(v8.3.4#803005)