GitHub user GlutenPerfBot created a discussion: February 20, 2026: Weekly Status Update in Gluten
*This weekly update is generated by LLMs. You're welcome to join our [Github](https://github.com/apache/incubator-gluten/discussions) for in-depth discussions.* ## Overall Activity Summary The Apache Gluten project has been highly active over the past 7 days with 42 pull requests and 20+ issues, focusing on major infrastructure improvements, performance optimizations, and Spark 4.x compatibility. The community is preparing for the 1.6.0 release while advancing multiple backend enhancements. ## Key Ongoing Projects **Build System Modernization** - Gradle Build Support: @liuneng1994 is leading a comprehensive effort (#11576) to add Gradle as an alternative to Maven, featuring multi-version support, native C++ integration, and significant build performance improvements - Incremental Build Optimization: @baibaichen delivered major improvements (#11560, #11595) reducing incremental build times from ~3 minutes to under 30 seconds through Ninja build system adoption and smart caching **Performance & Memory Management** - Native Delta Statistics Writer: @zhztheplayer achieved remarkable 61% performance improvement (#11419) by eliminating C2R overhead through native Velox aggregation tasks - Broadcast Hash Join Optimization: @JkSelf implemented executor-level hash table caching (#8931) showing 1.29x performance improvement in TPC-DS benchmarks - Memory Management: Multiple PRs addressing off-heap memory issues in shuffle operations (#11542, #11540) **Spark 4.x Compatibility** - Python 3.10 Migration: @ReemaAlzaid completed CI updates (#11481, #11519) to support Spark 4.1's Python requirements - Test Suite Stabilization: @baibaichen and team are systematically fixing disabled test suites (#11550, #11580) with 51 unique suites across Spark 4.0/4.1 versions ## Priority Items **Critical Infrastructure** - GPU CI Infrastructure: @zhouyuan temporarily disabled GPU CI (#11612) due to FBOS upgrade compatibility issues - needs container updates - S3 Integration Testing: @Mariamalmesfer enabled comprehensive S3 integration tests (#11516) closing a long-standing gap **Function Support Expansion** - ANSI Mode Implementation: @PHILO-HE is coordinating comprehensive ANSI SQL compliance (#10134) with multiple contributors working on type casting and arithmetic functions - Missing Spark Functions: @zhztheplayer added support for approx_count_distinct_for_intervals (#11599) essential for Spark CBO + histogram functionality ## Notable Discussions **Release Planning** - Gluten 1.6.0 Release: @zhztheplayer is coordinating the upcoming release (#11603) with version bump completed (#11592) **New Backend Introduction** - Bolt Backend Integration: @WangGuangxin initiated discussion (#10929) about integrating Bolt, a Velox fork from ByteDance with production-hardened features and LLVM-based JIT compilation ## Emerging Trends - **AI-Driven Development**: Multiple PRs explicitly mention AI tooling usage (Claude, GitHub Copilot) for development acceleration - **Production Optimization**: Focus shifting from basic functionality to production-ready features like memory management, performance tuning, and comprehensive testing - **Multi-Backend Strategy**: Growing interest in supporting multiple execution backends beyond Velox - **Build Performance**: Significant engineering effort on developer experience improvements ## Good First Issues **#10134: ANSI Mode Support** Skills needed: Scala, SQL, Type Systems Why it's good: Comprehensive tracking issue with individual tasks that can be picked up independently, excellent for learning Spark SQL internals **#11513: Input_file_name() returns "" on iceberg tables** Skills needed: Java/Scala, Iceberg integration Why it's good: Well-defined bug with clear scope, good introduction to Gluten's data lake integration **#11501: Docker Dependency Caching** Skills needed: Docker, CI/CD, Maven Why it's good: Straightforward infrastructure improvement with clear requirements to pre-install Java dependencies in CI Docker images for faster builds GitHub link: https://github.com/apache/incubator-gluten/discussions/11638 ---- This is an automatically sent email for [email protected]. To unsubscribe, please send an email to: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
