GitHub user GlutenPerfBot created a discussion: February 13, 2026: Weekly Status Update in Gluten
*This weekly update is generated by LLMs. You're welcome to join our [Github](https://github.com/apache/incubator-gluten/discussions) for in-depth discussions.* ## Overall Activity Summary The Apache Gluten project has been highly active over the past 7 days with 42 pull requests and 20+ issues, focusing on major infrastructure improvements, performance optimizations, and Spark 4.x compatibility. The community is preparing for the 1.6.0 release while advancing multiple backend enhancements. ## Key Ongoing Projects ### Build System Modernization - **Gradle Build Support**: @liuneng1994 is leading a comprehensive effort (#11576) to add Gradle as an alternative to Maven, featuring multi-version support, native C++ integration, and significant build performance improvements - **Incremental Build Optimization**: @baibaichen delivered major improvements (#11560, #11595) reducing incremental build times from ~3 minutes to under 30 seconds through Ninja build system adoption and smart caching ### Performance & Memory Management - **Native Delta Statistics Writer**: @zhztheplayer achieved remarkable 61% performance improvement (#11419) by eliminating C2R overhead through native Velox aggregation tasks - **Broadcast Hash Join Optimization**: @JkSelf implemented executor-level hash table caching (#8931) showing 1.29x performance improvement in TPC-DS benchmarks - **Memory Management**: Multiple PRs addressing off-heap memory issues in shuffle operations (#11542, #11540) ### Spark 4.x Compatibility - **Python 3.10 Migration**: @ReemaAlzaid completed CI updates (#11481, #11519) to support Spark 4.1's Python requirements - **Test Suite Stabilization**: @baibaichen and team are systematically fixing disabled test suites (#11550, #11580) with 51 unique suites across Spark 4.0/4.1 versions ## Priority Items ### Critical Infrastructure - **GPU CI Infrastructure**: @zhouyuan temporarily disabled GPU CI (#11612) due to FBOS upgrade compatibility issues - needs container updates - **S3 Integration Testing**: @Mariamalmesfer enabled comprehensive S3 integration tests (#11516) closing a long-standing gap ### Function Support Expansion - **ANSI Mode Implementation**: @PHILO-HE is coordinating comprehensive ANSI SQL compliance (#10134) with multiple contributors working on type casting and arithmetic functions - **Missing Spark Functions**: @zhztheplayer added support for `approx_count_distinct_for_intervals` (#11599) essential for Spark CBO + histogram functionality ## Notable Discussions ### Release Planning - **Gluten 1.6.0 Release**: @zhztheplayer is coordinating the upcoming release (#11603) with version bump completed (#11592) ### New Backend Introduction - **Bolt Backend Integration**: @WangGuangxin initiated discussion (#10929) about integrating Bolt, a Velox fork from ByteDance with production-hardened features and LLVM-based JIT compilation ## Emerging Trends 1. **AI-Driven Development**: Multiple PRs explicitly mention AI tooling usage (Claude, GitHub Copilot) for development acceleration 2. **Production Optimization**: Focus shifting from basic functionality to production-ready features like memory management, performance tuning, and comprehensive testing 3. **Multi-Backend Strategy**: Growing interest in supporting multiple execution backends beyond Velox 4. **Build Performance**: Significant engineering effort on developer experience improvements ## Good First Issues ### #11501: Docker Dependency Caching **Skills needed**: Docker, CI/CD, Maven **Why it's good**: Straightforward infrastructure improvement with clear requirements to pre-install Java dependencies in CI Docker images for faster builds ### #11400: Spark 4.1 Test Fixes **Skills needed**: Scala, Spark, Testing **Why it's good**: Well-documented issue with specific test failures listed, great for understanding Gluten's testing framework ### #10134: ANSI Mode Support **Skills needed**: Scala, SQL, Type Systems **Why it's good**: Comprehensive tracking issue with individual tasks that can be picked up independently, excellent for learning Spark SQL internals ### #11550: Disabled Test Suite Restoration **Skills needed**: Scala, Testing, Debugging **Why it's good**: Clear table of disabled tests with failure counts, perfect for contributors wanting to improve test coverage GitHub link: https://github.com/apache/incubator-gluten/discussions/11614 ---- This is an automatically sent email for [email protected]. To unsubscribe, please send an email to: [email protected] --------------------------------------------------------------------- To unsubscribe, e-mail: [email protected] For additional commands, e-mail: [email protected]
