viirya opened a new issue, #167:
URL: https://github.com/apache/arrow-datafusion-comet/issues/167

   ### Describe the bug
   
   When I work on SortMergeJoin, there are some TPCDS query failures happened 
with errors like:
   
   ```
   - q38 *** FAILED ***
     java.lang.Exception: Expected "struct<[count(1):bigint]>", but got 
"struct<[]>" Schema did not match
   SELECT count(*)
   FROM (
          SELECT DISTINCT
            c_last_name,
            c_first_name,
            d_date
          FROM store_sales, date_dim, customer
          WHERE store_sales.ss_sold_date_sk = date_dim.d_date_sk
            AND store_sales.ss_customer_sk = customer.c_customer_sk
            AND d_month_seq BETWEEN 1200 AND 1200 + 11
          INTERSECT
          SELECT DISTINCT
            c_last_name,
            c_first_name,
            d_date
          FROM catalog_sales, date_dim, customer
          WHERE catalog_sales.cs_sold_date_sk = date_dim.d_date_sk
            AND catalog_sales.cs_bill_customer_sk = customer.c_customer_sk
            AND d_month_seq BETWEEN 1200 AND 1200 + 11
          INTERSECT
          SELECT DISTINCT
            c_last_name,
            c_first_name,
            d_date
          FROM web_sales, date_dim, customer
          WHERE web_sales.ws_sold_date_sk = date_dim.d_date_sk
            AND web_sales.ws_bill_customer_sk = customer.c_customer_sk
            AND d_month_seq BETWEEN 1200 AND 1200 + 11
        ) hot_cust
   LIMIT 100
   ...
   om.apple.boson.BosonNativeException
   Arrow error: Invalid argument error: RowConverter column schema mismatch, 
expected Utf8 got Date32
   ```
   
   It is because DataFusion `coalesce` function returns a `Date32` array from 
`Date32` inputs (this is correct) but its return type is `Utf8`. The details 
are in https://github.com/apache/arrow-datafusion/issues/9458. The fix is at 
https://github.com/apache/arrow-datafusion/pull/9459.
   
   ### Steps to reproduce
   
   _No response_
   
   ### Expected behavior
   
   _No response_
   
   ### Additional context
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to