[I] [VL] table scan fallback gets wrong query plan [incubator-gluten]

via GitHub Sat, 27 Jul 2024 00:21:49 -0700


FelixYBW opened a new issue, #6612:
URL: https://github.com/apache/incubator-gluten/issues/6612


   ### Backend
   
   VL (Velox)
   
   ### Bug description
   
   A query has parquet scan with complex data type fallbacked. 
   If I set spark.sql.parquet.enableVectorizedReader = False and run the query 
twice, the first one is correct, but the second one wrongly add ColumnartoRow, 
caused error `UnsafeRow cannot be cast to 
org.apache.spark.sql.vectorized.ColumnarBatch`
   
   First query plan:
   ```
   +- ^ FilterExecTransformer (5)
      +- ^ InputIteratorTransformer (4)
         +- RowToVeloxColumnar (2)
            +- Scan parquet  (1)
   ```
   
   The second query plan:
   ```
   +- ^ InputIteratorTransformer (5)
      +- RowToVeloxColumnar (3)
         +- * ColumnarToRow (2)
             +- Scan parquet  (1)
   ```
   
   Similarly if I set spark.sql.parquet.enableVectorizedReader = True, the 
first run reports error and the second run successed.
   
   ### Spark version
   
   Spark-3.2.x
   
   ### Spark configurations
   
   _No response_
   
   ### System information
   
   _No response_
   
   ### Relevant logs
   
   _No response_


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[I] [VL] table scan fallback gets wrong query plan [incubator-gluten]

Reply via email to