GitHub user GlutenPerfBot created a discussion: February 20, 2026: Weekly 
Status Update in Gluten

*This weekly update is generated by LLMs. You're welcome to join our 
[Github](https://github.com/apache/incubator-gluten/discussions) for in-depth 
discussions.*

## Overall Activity Summary
The Apache Gluten project has been highly active over the past 7 days with 42 
pull requests and 20+ issues, focusing on major infrastructure improvements, 
performance optimizations, and Spark 4.x compatibility. The community is 
preparing for the 1.6.0 release while advancing multiple backend enhancements.

## Key Ongoing Projects

**Build System Modernization**
- Gradle Build Support: @liuneng1994 is leading a comprehensive effort (#11576) 
to add Gradle as an alternative to Maven, featuring multi-version support, 
native C++ integration, and significant build performance improvements
- Incremental Build Optimization: @baibaichen delivered major improvements 
(#11560, #11595) reducing incremental build times from ~3 minutes to under 30 
seconds through Ninja build system adoption and smart caching

**Performance & Memory Management**
- Native Delta Statistics Writer: @zhztheplayer achieved remarkable 61% 
performance improvement (#11419) by eliminating C2R overhead through native 
Velox aggregation tasks
- Broadcast Hash Join Optimization: @JkSelf implemented executor-level hash 
table caching (#8931) showing 1.29x performance improvement in TPC-DS benchmarks
- Memory Management: Multiple PRs addressing off-heap memory issues in shuffle 
operations (#11542, #11540)

**Spark 4.x Compatibility**
- Python 3.10 Migration: @ReemaAlzaid completed CI updates (#11481, #11519) to 
support Spark 4.1's Python requirements
- Test Suite Stabilization: @baibaichen and team are systematically fixing 
disabled test suites (#11550, #11580) with 51 unique suites across Spark 
4.0/4.1 versions

## Priority Items

**Critical Infrastructure**
- GPU CI Infrastructure: @zhouyuan temporarily disabled GPU CI (#11612) due to 
FBOS upgrade compatibility issues - needs container updates
- S3 Integration Testing: @Mariamalmesfer enabled comprehensive S3 integration 
tests (#11516) closing a long-standing gap

**Function Support Expansion**
- ANSI Mode Implementation: @PHILO-HE is coordinating comprehensive ANSI SQL 
compliance (#10134) with multiple contributors working on type casting and 
arithmetic functions
- Missing Spark Functions: @zhztheplayer added support for 
approx_count_distinct_for_intervals (#11599) essential for Spark CBO + 
histogram functionality

## Notable Discussions

**Release Planning**
- Gluten 1.6.0 Release: @zhztheplayer is coordinating the upcoming release 
(#11603) with version bump completed (#11592)

**New Backend Introduction**
- Bolt Backend Integration: @WangGuangxin initiated discussion (#10929) about 
integrating Bolt, a Velox fork from ByteDance with production-hardened features 
and LLVM-based JIT compilation

## Emerging Trends

- **AI-Driven Development**: Multiple PRs explicitly mention AI tooling usage 
(Claude, GitHub Copilot) for development acceleration
- **Production Optimization**: Focus shifting from basic functionality to 
production-ready features like memory management, performance tuning, and 
comprehensive testing
- **Multi-Backend Strategy**: Growing interest in supporting multiple execution 
backends beyond Velox
- **Build Performance**: Significant engineering effort on developer experience 
improvements

## Good First Issues

**#10134: ANSI Mode Support**
Skills needed: Scala, SQL, Type Systems
Why it's good: Comprehensive tracking issue with individual tasks that can be 
picked up independently, excellent for learning Spark SQL internals

**#11513: Input_file_name() returns "" on iceberg tables**
Skills needed: Java/Scala, Iceberg integration
Why it's good: Well-defined bug with clear scope, good introduction to Gluten's 
data lake integration

**#11501: Docker Dependency Caching**
Skills needed: Docker, CI/CD, Maven
Why it's good: Straightforward infrastructure improvement with clear 
requirements to pre-install Java dependencies in CI Docker images for faster 
builds

GitHub link: https://github.com/apache/incubator-gluten/discussions/11638

----
This is an automatically sent email for [email protected].
To unsubscribe, please send an email to: [email protected]


---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to