FelixYBW opened a new issue, #6612:
URL: https://github.com/apache/incubator-gluten/issues/6612
### Backend
VL (Velox)
### Bug description
A query has parquet scan with complex data type fallbacked.
If I set spark.sql.parquet.enableVectorizedReader = False and run the query
twice, the first one is correct, but the second one wrongly add ColumnartoRow,
caused error `UnsafeRow cannot be cast to
org.apache.spark.sql.vectorized.ColumnarBatch`
First query plan:
```
+- ^ FilterExecTransformer (5)
+- ^ InputIteratorTransformer (4)
+- RowToVeloxColumnar (2)
+- Scan parquet (1)
```
The second query plan:
```
+- ^ InputIteratorTransformer (5)
+- RowToVeloxColumnar (3)
+- * ColumnarToRow (2)
+- Scan parquet (1)
```
Similarly if I set spark.sql.parquet.enableVectorizedReader = True, the
first run reports error and the second run successed.
### Spark version
Spark-3.2.x
### Spark configurations
_No response_
### System information
_No response_
### Relevant logs
_No response_
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]