GitHub user GlutenPerfBot created a discussion: February 27, 2026: Weekly Status Update in Gluten
*This weekly update is generated by LLMs. You're welcome to join our [Github](https://github.com/apache/incubator-gluten/discussions) for in-depth discussions.* ## Overall Activity Summary The Apache Gluten project has been highly active over the past 7 days with 42 pull requests and 20+ issues, focusing on major infrastructure improvements, performance optimizations, and Spark 4.x compatibility. The community is preparing for the 1.6.0 release while advancing multiple backend enhancements. ## Key Ongoing Projects - **Build System Modernization**: @baibaichen delivered major improvements (#11560, #11595) reducing incremental build times from ~3 minutes to under 30 seconds through Ninja build system adoption and smart caching - **Bloop Integration**: @liuneng1994 added Bloop build server integration (#11645) achieving 35.9x speedup for incremental compilation - **Spark 4.x Compatibility**: Multiple contributors working on test suite stabilization (#11550, #11580) with 51 unique suites across Spark 4.0/4.1 versions - **Performance Optimizations**: @JkSelf implemented broadcast hash join optimization (#8931) showing 1.29x performance improvement in TPC-DS benchmarks - **Iceberg Integration**: @rui-mo and @jinchengchenghh working on enabling Iceberg tests (#11630, #11631, #11641) ## Priority Items - **GPU CI Infrastructure**: @zhouyuan needs help with GPU CI job restoration (#11611) - container updates required due to FBOS upgrade - **Memory Management**: @wForget's RSS shuffle writer OOM issue (#11542) requires immediate attention for production stability - **Arm64 Build Issues**: @odidev and @huangshiyou reporting compilation failures on Azure Arm64 (#11633, #11639) - **Scala Compilation**: @baibaichen fixed incremental compilation mode (#11659) - critical for developer productivity ## Notable Discussions - #11603: Gluten 1.6.0 release coordination by @zhztheplayer - Spark 3.2 support deprecated, preview Spark 4.0 support included - #11585: @FelixYBW tracking useful Velox PRs not merged upstream - community coordination effort - #10929: @WangGuangxin proposing Bolt backend integration - Velox fork from ByteDance with production features ## Emerging Trends - **AI-Driven Development**: Multiple PRs explicitly mention AI tooling usage (Claude, GitHub Copilot) for development acceleration - **Developer Experience Focus**: Significant engineering effort on build performance and tooling improvements - **Production Readiness**: Shift from basic functionality to production-ready features like memory management and comprehensive testing - **Multi-Backend Strategy**: Growing interest in supporting multiple execution backends beyond Velox ## Good First Issues - #11513: Input_file_name() returns "" on iceberg tables - Well-defined bug with clear scope, good introduction to Gluten's data lake integration - #11501: Docker dependency caching - Straightforward infrastructure improvement to pre-install Java dependencies in CI Docker images - #10134: ANSI mode support - Comprehensive tracking issue with individual tasks that can be picked up independently, excellent for learning Spark SQL internals - #11400: Spark 4.1.x failed unit tests - Multiple test failures need investigation, good for understanding Gluten's test framework GitHub link: https://github.com/apache/incubator-gluten/discussions/11671 ---- This is an automatically sent email for [email protected]. To unsubscribe, please send an email to: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
