[ 
https://issues.apache.org/jira/browse/ARROW-17783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Yifei Yang updated ARROW-17783:
-------------------------------
    Description: 
When Arrow's aggregate kernel is run on a table received over Arrow Flight 
(DoGet), it may crash in arrow::util::CheckAlignment(). The same aggregation 
works on the original data. It also works if I first serialize the received 
table into bytes and then recreate an Arrow table from those bytes.

The attached "flight-alignment-test" is a minimal test that reproduces the 
issue. It computes "sum(total_revenue) group by l_suppkey" on the table 
returned by "DoGet()". ("DummyNode" only serves as the producer of the 
aggregate node, because a producer is required to create the aggregate node.)
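
The report does not say why re-serializing the table avoids the crash. One 
possible reading, which is an assumption and not confirmed here, is that some 
buffers received via DoGet() start at addresses that are not aligned for the 
column's value type, while recreating the table copies the data into freshly 
allocated, aligned memory. The standalone C++ sketch below illustrates only 
that idea. IsAligned, CopyAligned and SumAfterRealign are hypothetical helpers, 
not Arrow APIs, and the 8-byte alignment for int64 values is likewise assumed.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstring>
#include <vector>

// Assumed alignment for 64-bit values (hypothetical, not taken from Arrow).
constexpr std::size_t kInt64Alignment = 8;

// Hypothetical helper: true if the address of `ptr` is a multiple of `alignment`.
bool IsAligned(const void* ptr, std::size_t alignment) {
  return reinterpret_cast<std::uintptr_t>(ptr) % alignment == 0;
}

// Hypothetical helper: copy `size` bytes into new storage that is suitably
// aligned for int64_t. This mimics "serialize, then recreate the table".
std::vector<std::int64_t> CopyAligned(const std::uint8_t* src, std::size_t size) {
  const std::size_t width = sizeof(std::int64_t);
  std::vector<std::int64_t> out((size + width - 1) / width);
  std::memcpy(out.data(), src, size);
  return out;
}

// Hypothetical helper: sum `count` int64 values from a possibly misaligned
// payload by realigning first, then reading through a typed pointer.
std::int64_t SumAfterRealign(const std::uint8_t* payload, std::size_t count) {
  std::vector<std::int64_t> aligned =
      CopyAligned(payload, count * sizeof(std::int64_t));
  assert(IsAligned(aligned.data(), kInt64Alignment));
  std::int64_t sum = 0;
  for (std::size_t i = 0; i < count; ++i) sum += aligned[i];
  return sum;
}
```

If the assumption holds, a payload that starts 4 bytes into an 8-byte-aligned 
message would fail IsAligned(payload, 8), while the copy returned by 
CopyAligned would pass it on typical 64-bit platforms.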

  was:
When using arrow's aggregate kernel with table transferred from arrow flight 
(DoGet), it may crash at arrow::util::CheckAlignment(). However using original 
data it works well, also if I first serialize the transferred table into bytes 
then recreate an arrow table using the bytes, it works well.

I'm still trying to reproduce the crash stably in a minimal test case...


> Using table from flight in aggregate kernel may not pass alignment check
> ------------------------------------------------------------------------
>
>                 Key: ARROW-17783
>                 URL: https://issues.apache.org/jira/browse/ARROW-17783
>             Project: Apache Arrow
>          Issue Type: Bug
>          Components: C++, FlightRPC
>    Affects Versions: 6.0.0, 8.0.0
>            Reporter: Yifei Yang
>            Priority: Major
>         Attachments: flight-alignment-test.zip
>
>
> When Arrow's aggregate kernel is run on a table received over Arrow Flight 
> (DoGet), it may crash in arrow::util::CheckAlignment(). The same aggregation 
> works on the original data. It also works if I first serialize the received 
> table into bytes and then recreate an Arrow table from those bytes.
> The attached "flight-alignment-test" is a minimal test that reproduces the 
> issue. It computes "sum(total_revenue) group by l_suppkey" on the table 
> returned by "DoGet()". ("DummyNode" only serves as the producer of the 
> aggregate node, because a producer is required to create the aggregate node.)



--
This message was sent by Atlassian Jira
(v8.20.10#820010)