RexXiong commented on PR #3597:
URL: https://github.com/apache/celeborn/pull/3597#issuecomment-3949965716

   > @SteNicholas So my changes are explained in the PR description pretty well 
I think. While @eolivelli was running his tests, he noticed that the current 
faulty behaviour presents a problem when the auto-scaling spawns new nodes. In 
this moment, the system is under pressure, and yet the newly-spawned nodes 
don't report remote disks, leading to performance degredation and the need to 
spawn more nodes.
   > 
   > In hindsight though, I should have limited the PR to just this. It's just 
that while I was trying to understand the code, I made the other changes to 
make it more readable for myself. In the end I decided to include them too 
since we will be working on this part of the project a lot in the future.
   > 
   > Also, don't forget to help me with writing a test!
   
   for PR overall LGTM, Additionally, you can use `MiniClusterFeature` to 
create a virtual cluster and validate the worker's registration with the master.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to