leixm opened a new pull request, #3185: URL: https://github.com/apache/celeborn/pull/3185
### What changes were proposed in this pull request? Fix data lost when push merged data of replica and hard split happen. ### Why are the changes needed? There is a problem with replicate rpc callback. The code should satisfy the following conditions: when comparing the status returned by primary and replica data, the status on the left should be used as the final status for the client, FAILURE > HARD_SPLIT > CONGESTION > SUCCESS. The status on the right cannot cover the status on the left. There are two problems with the code now 1. CONGESTION can cover HARD_SPLIT, which will cause data loss 2. HARD_SPLIT can cover FAILURE, which will affect the exclude worker logic, and there may be some problems ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Existing UTs. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: [email protected] For queries about this service, please contact Infrastructure at: [email protected]
