baibaichen opened a new pull request, #11580: URL: https://github.com/apache/incubator-gluten/pull/11580
## What changes are proposed in this pull request? Fixes #11550 (partial) - **Enable `GlutenXmlExpressionsSuite`** in VeloxTestSettings for both spark40 and spark41 (was TODO disabled for spark41) - **Fix mixin**: `GlutenTestsCommonTrait` → `GlutenTestsTrait`. The prior PR (#11512) added `GlutenXmlExpressionsSuite` with `GlutenTestsCommonTrait`, which does not enable Gluten execution for the test suite. - **Exclude `from_xml- invalid data`**: Gluten overrides `checkEvaluation` to execute expressions via DataFrame (`df.select().collect()`), which throws `SparkException` directly instead of wrapping it in `TestFailedException`. Same pattern as `from_json - invalid data`. - **Fix woodstox classpath conflict**: Exclude `hadoop-common` transitive dependency from `hive-llap-common` in both `gluten-ut/pom.xml` and spark-specific pom files. Hadoop ships a shaded woodstox (`org.apache.hadoop.shaded.com.ctc.wstx.*`) whose property names are incompatible with the non-shaded woodstox used by Spark XML, causing `IllegalArgumentException: Unrecognized property` in `to_xml` tests. ## How was this patch tested? Ran `GlutenXmlExpressionsSuite` on both spark40 and spark41: - spark40: 28 passed, 1 ignored (`from_xml- invalid data`) ✅ - spark41: 28 passed, 1 ignored (`from_xml- invalid data`) ✅ Compiled successfully with both `spark-4.0` and `spark-4.1` profiles. ## Was this patch authored or co-authored using generative AI tooling? Generated-by: GitHub Copilot CLI -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
