vamshikrishnakyatham opened a new pull request, #13736: URL: https://github.com/apache/hudi/pull/13736
### Change Logs

Implemented a generic SQL filter framework for Hudi procedures to enable server-side filtering capabilities. Added `HoodieProcedureFilterUtils` utility class that provides reusable SQL expression filtering functionality that can be applied to any procedure result set (added to Hudi clean procedures in the current test). The framework supports complex filter expressions and maintains backward compatibility with existing procedure calls.

### Impact

**Public API Changes:**
- Added optional `filter` parameter to procedure calls for SQL-like filtering
- New `HoodieProcedureFilterUtils` utility class for procedure developers
- Backward compatible - existing procedure calls continue to work without changes

**Performance Impact:**
- **Positive**: Reduces data transfer by filtering results at source instead of client-side
- **Positive**: Limits processing to only required records before applying filters

**User-Facing Features:**
- Users can now apply SQL expressions to filter procedure results
- Supports complex conditions (e.g., `plan_time > "20231201000000"`, `action = "clean"`)
- Extensible framework that can be adopted by other procedures

### Risk level: Low

**Verification:**
- Comprehensive unit tests added for filter utility functions
- Integration tests verify filter expressions work correctly with procedure results
- Backward compatibility verified - all existing tests pass without modification
- Performance tested with large datasets to ensure no regression

### Documentation Update

**Required Updates:**
- Update procedure documentation to include new optional `filter` parameter
- Add examples of filter expressions in procedure usage guides
- Document `HoodieProcedureFilterUtils` API for procedure developers
- Create Jira ticket for Hudi website documentation updates

**Config Changes:** None - no new configs added or defaults changed

### Contributor's checklist

- [x] Read through [contributor's guide](https://hudi.apache.org/contribute/how-to-contribute)
- [x] Change Logs and Impact were stated clearly
- [x] Adequate tests were added if applicable
- [ ] CI passed
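
To make the intended behavior of the optional `filter` parameter concrete, here is a minimal, self-contained Python sketch of filtering a procedure-style result set with a single comparison expression. It is an illustration only and not the code in this PR: `parse_filter`, `filter_rows`, and the sample rows are hypothetical names and data, the real `HoodieProcedureFilterUtils` API is not shown in this description, and the sketch handles only one `column op "value"` condition rather than full SQL expressions.

```python
import operator
import re

# Hypothetical illustration only: this is NOT Hudi's HoodieProcedureFilterUtils.
# It supports a single condition of the form: <column> <op> "<value>".
_OPS = {
    "=": operator.eq,
    "!=": operator.ne,
    ">": operator.gt,
    ">=": operator.ge,
    "<": operator.lt,
    "<=": operator.le,
}

# Two-character operators are listed first so ">=" is not read as ">".
_PATTERN = re.compile(r"""^\s*(\w+)\s*(>=|<=|!=|=|>|<)\s*(["'])(.*)\3\s*$""")


def parse_filter(expr):
    """Split a simple filter expression into (column, comparison, value)."""
    match = _PATTERN.match(expr)
    if match is None:
        raise ValueError(f"unsupported filter expression: {expr!r}")
    column, op, _quote, value = match.groups()
    return column, _OPS[op], value


def filter_rows(rows, expr=None):
    """Return the rows that satisfy expr; with no filter, return all rows."""
    if not expr:
        # Mirrors the backward-compatible case: no filter means no change.
        return list(rows)
    column, compare, value = parse_filter(expr)
    # Values are compared as strings, which orders fixed-width timestamps
    # such as "20231201000000" correctly.
    return [
        row for row in rows
        if column in row and compare(str(row[column]), value)
    ]


# Made-up sample rows standing in for a procedure result set.
rows = [
    {"plan_time": "20231115093000", "action": "clean"},
    {"plan_time": "20231205120000", "action": "clean"},
    {"plan_time": "20231210080000", "action": "rollback"},
]

recent = filter_rows(rows, 'plan_time > "20231201000000"')
cleans = filter_rows(rows, 'action = "clean"')
```

Here `recent` holds the two rows planned after `20231201000000` and `cleans` holds the two rows whose action is `clean`. Calling `filter_rows(rows)` without an expression returns every row, which corresponds to the backward-compatible path for existing procedure calls.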
