vamshikrishnakyatham opened a new pull request, #13736:
URL: https://github.com/apache/hudi/pull/13736

   ### Change Logs
   
   Implemented a generic SQL filter framework for Hudi procedures that filters 
results on the server side. Added a `HoodieProcedureFilterUtils` utility class 
that provides reusable SQL expression filtering for any procedure result set; 
in this PR it is applied to, and tested with, the Hudi clean procedures. The 
framework supports complex filter expressions and maintains backward 
compatibility with existing procedure calls.
   
   ### Impact
   
   **Public API Changes:**
   - Added optional `filter` parameter to procedure calls for SQL-like filtering
   - New `HoodieProcedureFilterUtils` utility class for procedure developers
   - Backward compatible - existing procedure calls continue to work without 
changes
   
   **Performance Impact:**
   - **Positive**: Reduces data transfer by filtering results at the source 
instead of on the client
   - **Positive**: Limits processing to only required records before applying 
filters
   
   **User-Facing Features:**
   - Users can now apply SQL expressions to filter procedure results
   - Supports complex conditions (e.g., `plan_time > "20231201000000"`, 
`action = "clean"`)
   - Extensible framework that can be adopted by other procedures
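
To make the filter semantics above concrete, here is a minimal, self-contained Python sketch of an optional filter applied to procedure result rows. It is an illustration only, not the `HoodieProcedureFilterUtils` implementation: the names `parse_filter` and `filter_rows`, the row layout, and the single-comparison grammar are all assumptions made for this example, and the actual framework is described as supporting more complex expressions.

```python
import operator
import re

# Hypothetical illustration only: none of these names come from Hudi.
_OPS = {
    "=": operator.eq,
    "!=": operator.ne,
    ">=": operator.ge,
    "<=": operator.le,
    ">": operator.gt,
    "<": operator.lt,
}

# Matches `<column> <op> <literal>`. Two-character operators are listed
# first so that ">=" is not read as ">" followed by "=".
_PATTERN = re.compile(
    r"""^\s*(\w+)\s*(!=|>=|<=|=|>|<)\s*(?:"([^"]*)"|'([^']*)'|(-?\d+))\s*$"""
)


def parse_filter(expression):
    """Turn a single comparison into a predicate over a row dict."""
    match = _PATTERN.match(expression)
    if match is None:
        raise ValueError(f"unsupported filter expression: {expression!r}")
    column, op, double_quoted, single_quoted, number = match.groups()
    if number is not None:
        literal = int(number)
    else:
        literal = double_quoted if double_quoted is not None else single_quoted
    compare = _OPS[op]
    return lambda row: compare(row[column], literal)


def filter_rows(rows, expression=None):
    """Return every row when no filter is given, else only matching rows."""
    if not expression:
        # Backward-compatible path: callers that pass no filter see all rows.
        return list(rows)
    predicate = parse_filter(expression)
    return [row for row in rows if predicate(row)]


# Assumed sample rows; the column names mirror the examples in this PR.
rows = [
    {"plan_time": "20231130120000", "action": "clean"},
    {"plan_time": "20231215093000", "action": "clean"},
    {"plan_time": "20240102000000", "action": "rollback"},
]

late = filter_rows(rows, 'plan_time > "20231201000000"')
cleans = filter_rows(rows, 'action = "clean"')
```

In this sketch the `plan_time` comparison is a plain string comparison, which matches chronological order only because the sample timestamps are fixed-width digit strings. Omitting the expression returns the rows unchanged, which is the backward-compatible behavior described above.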
   
   ### Risk level: Low
   
   **Verification:**
   - Comprehensive unit tests added for filter utility functions
   - Integration tests verify filter expressions work correctly with procedure 
results
   - Backward compatibility verified - all existing tests pass without 
modification
   - Performance tested with large datasets to ensure no regression
   
   ### Documentation Update
   
   **Required Updates:**
   - Update procedure documentation to include new optional `filter` parameter
   - Add examples of filter expressions in procedure usage guides
   - Document `HoodieProcedureFilterUtils` API for procedure developers
   - Create Jira ticket for Hudi website documentation updates
   
   **Config Changes:** None - no new configs added or defaults changed
   
   ### Contributor's checklist
   
   - [x] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [x] Change Logs and Impact were stated clearly
   - [x] Adequate tests were added if applicable
   - [ ] CI passed

