vamshikrishnakyatham opened a new pull request, #13816:
URL: https://github.com/apache/hudi/pull/13816

   ## Change Logs
   
   This PR introduces the `ShowMetadataTableHistoryProcedure`, a new Spark SQL 
procedure that displays timeline information for the data table and the 
metadata table side by side, so that metadata table synchronization and 
evolution can be analyzed against data table operations.
   
   ### Key Features:
   - **Unified Timeline View**: Shows data table and metadata table timelines 
in a single result
   - **Side-by-Side Format**: Metadata table columns first, then data table 
columns
   - **Time Formatting**: Requested, inflight, and completed times are 
formatted as `MM-dd HH:mm:ss`
   - **Archive Support**: Includes archived timeline via `showArchived` 
parameter
   - **Filtering**: SQL-based filtering and time range support 
(`startTime`/`endTime`)
   - **Error Handling**: Graceful handling when the metadata table doesn't exist
   
   ## Impact
   
   **New Public API**: 
   - `show_metadata_table_history` procedure with parameters: `table`, `path`, 
`limit`, `showArchived`, `filter`, `startTime`, `endTime`
   - 11-column output schema showing metadata table and data table timeline 
information
   
   **User Benefits**:
   - Analyze metadata table synchronization with data operations
   - Debug metadata table issues and performance bottlenecks
   - Monitor metadata table bootstrap and maintenance operations
   
   **Performance Impact**: Minimal - read-only procedure with efficient 
timeline scanning.
   
   ## Risk level (write none, low, medium or high below)
   **Low** - New read-only procedure that doesn't modify data or table state.
   
   ## Documentation Update
   Update the Hudi Spark SQL procedures documentation to include 
`show_metadata_table_history` usage examples and parameter descriptions.
   
   ## Contributor's checklist
   - [x] Read through [contributor's 
guide](https://hudi.apache.org/contribute/how-to-contribute)
   - [x] Change Logs and Impact were stated clearly
   - [x] Adequate tests were added if applicable
   - [x] CI passed
   
   ### Usage Example:
   ```sql
   -- Basic usage
   CALL show_metadata_table_history(table => 'my_table');
   
   -- With archives and filtering
   CALL show_metadata_table_history(
     table => 'my_table', 
     showArchived => true,
     filter => "metadata_table_action = 'commit'"
   );
   ```
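   
   A further sketch covering the `startTime`/`endTime`, `limit`, and `path` 
parameters listed under Impact. The timestamp values assume Hudi's instant-time 
format (`yyyyMMddHHmmssSSS`), the base path is a placeholder, and using `path` 
in place of `table` is an assumption; none of these is confirmed by this 
description.
   ```sql
   -- Restrict the timeline to a time window and cap the number of rows
   -- (the timestamp format is an assumption, not taken from the PR)
   CALL show_metadata_table_history(
     table => 'my_table',
     startTime => '20250101000000000',
     endTime => '20250131235959999',
     limit => 20
   );
   
   -- Address the table by base path instead of by name
   -- ('/tmp/hudi/my_table' is a hypothetical location)
   CALL show_metadata_table_history(path => '/tmp/hudi/my_table');
   ```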


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
