LuciferYang opened a new pull request #30026:
URL: https://github.com/apache/spark/pull/30026


   ### What changes were proposed in this pull request?
   
   The purpose of this pr is to resolve SPARK-32978.
   
   The main reason of bad case describe in SPARK-32978 is the 
`BasicWriteTaskStatsTracker` directly reports the new added partition number of 
each task, which makes it impossible to remove duplicate data in driver side.
   
   The main of this pr is change to report partitionValues to driver and remove 
duplicate data at driver side to make sure the number of dynamic part metric is 
correct.
    
   ### Why are the changes needed?
   The the number of dynamic part metric we display on the UI should be correct.
   
   ### Does this PR introduce _any_ user-facing change?
   No
   
   
   ### How was this patch tested?
   Add a new test case refer to described in SPARK-32978
   


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to