Yohahaha commented on issue #7174: URL: https://github.com/apache/incubator-gluten/issues/7174#issuecomment-2413527607
root cause is Spark merge two parquet part file's schema when `spark.sql.parquet.mergeSchema=true`, file1 schema is `s struct<a int>`, file2 schema is `s struct<c int>`, merged schema is `s struct<a int, c int>`. filter `s.c = 1` generate GetStructFields(s, 1) expr, Velox will set missing column to null. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
