akpatnam25 commented on PR #2700:
URL: https://github.com/apache/celeborn/pull/2700#issuecomment-2303887040
> I think this change can make retrospective debugging hard as we will have
no insight into which workers the shuffle data was written to and what was the
behaviour of those workers during that time.
>
> Instead of making it completely a debug log IMO we should at least output
some basic info about the worker host selected for shuffle this will reduce the
noise but also leave some space for debugging issues.
>
> Ex –
>
> ```
> Host: 10.152.63.41
> WorkerStatus: WorkerStatus{state=Normal, stateStartTime=1724244049248}
> NumOfSlotsAllocated: 100
> ```
I think this will still be noisy in a large fleet, especially in the driver
logs where users might be looking for other logs/errors. I feel that if truly
needed for debugging, the user can then enable debug logging for it.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]