prashantwason opened a new pull request, #17451:
URL: https://github.com/apache/hudi/pull/17451

   ## Summary
   Added `clearJobStatus()` calls across the codebase so that the job status 
is cleared once each parallel operation completes.
   
   This PR ports the fix from commit `0c262346f51e9d6602ffdbd65320b43eecaf07b5` 
and extends it to all locations where `setJobStatus()` is called without a 
corresponding cleanup.
   
   ## Changes
   This PR adds `clearJobStatus()` calls in **10 files** with a total of **29 
new calls**:
   
   ### High Priority (Parallel operations):
   - `HoodieTableMetadataUtil.java` - 2 locations (reading record keys from 
base files and file slices)
   - `HoodieBackedTableMetadataWriter.java` - 6 locations (listing partitions, 
creating metadata records, bulk insert/upsert)
   - `FSUtils.java` - 1 location (parallel listing paths)
   - `HadoopFSUtils.java` - 1 location (parallel listing paths)
   - `SecondaryIndexRecordGenerationUtils.java` - 1 location (reading secondary 
keys)
   
   ### Medium Priority (Write/commit operations):
   - `BaseSparkCommitActionExecutor.java` - 4 locations (clustering updates, 
partitioning/writing, building workload profile, commit status collect)
   - `CleanPlanActionExecutor.java` - 2 locations (obtaining partitions to 
clean, generating file slices to clean)
   - `BaseWriteHelper.java` - 1 location (tagging operation)
   - `HoodieBloomIndex.java` - 2 locations (loading key ranges for file slices 
and meta index)
   - `BaseHoodieCompactionPlanGenerator.java` - 1 location (looking for files 
to compact)
   
   ## Pattern
   For each `setJobStatus()` call, added a corresponding `clearJobStatus()` 
call after the parallel operation completes:
   
   ```java
   engineContext.setJobStatus(...);
   // parallel operation using engineContext.map/parallelize/flatMap/etc
   engineContext.clearJobStatus();  // <-- ADDED
   ```
   
   This prevents job status from remaining set indefinitely after operations 
complete.
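
For readers without the codebase at hand, the pattern can be sketched in isolation. This is a minimal, self-contained illustration and not Hudi code: `FakeEngineContext`, its `currentJobStatus()` accessor, and the `pathLengths` helper are hypothetical stand-ins invented for this example, and the two-argument `setJobStatus` only mimics the shape of the real call.

```java
import java.util.List;
import java.util.function.Function;
import java.util.stream.Collectors;

class JobStatusSketch {

    /** Hypothetical stand-in for an engine context; NOT Hudi's real class. */
    static class FakeEngineContext {
        private String jobStatus; // null means no status is currently set

        void setJobStatus(String module, String description) {
            this.jobStatus = module + ": " + description;
        }

        void clearJobStatus() {
            this.jobStatus = null;
        }

        // Assumed accessor, added only so the example can observe the status.
        String currentJobStatus() {
            return jobStatus;
        }

        // Toy parallel map; a real engine would distribute this work.
        <I, O> List<O> map(List<I> data, Function<I, O> fn) {
            return data.parallelStream().map(fn).collect(Collectors.toList());
        }
    }

    /** Sets a status, runs the parallel operation, then clears the status. */
    static List<Integer> pathLengths(FakeEngineContext ctx, List<String> paths) {
        ctx.setJobStatus("JobStatusSketch", "Computing path lengths");
        List<Integer> result = ctx.map(paths, String::length);
        ctx.clearJobStatus(); // without this, the status would outlive the operation
        return result;
    }

    public static void main(String[] args) {
        FakeEngineContext ctx = new FakeEngineContext();
        List<Integer> lengths = pathLengths(ctx, List.of("/a", "/bb/c"));
        System.out.println("lengths = " + lengths);
        System.out.println("status after operation = " + ctx.currentJobStatus());
    }
}
```

In this sketch the clear call sits directly after the operation, mirroring the pattern above. Wrapping the operation in `try`/`finally` would also clear the status when the operation throws, but that is a variation and not something this PR claims to do.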
   
   ## Test Plan
   - Existing unit and integration tests should pass
   - No functional changes, only cleanup operations added
   
   🤖 Generated with [Claude Code](https://claude.com/claude-code)


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
