This is an automated email from the ASF dual-hosted git repository.
zhouky pushed a commit to branch branch-0.3
in repository https://gitbox.apache.org/repos/asf/incubator-celeborn.git
The following commit(s) were added to refs/heads/branch-0.3 by this push:
new 6b036857b [CELEBORN-905] Redraw the flowchart backpressure.svg after
worker pause logic is reconstructed
6b036857b is described below
commit 6b036857b0ad88b2caa5dbdf0d65be503d001dbd
Author: zwangsheng <[email protected]>
AuthorDate: Thu Aug 24 11:51:01 2023 +0800
[CELEBORN-905] Redraw the flowchart backpressure.svg after worker pause
logic is reconstructed
Add a new `backpressure.svg` to replace the outdated one.
#1811 refactored the Celeborn worker back-pressure logic, so we should add
a new flowchart to help users understand it.
Yes

Closes #1829 from zwangsheng/CELEBORN-905.
Authored-by: zwangsheng <[email protected]>
Signed-off-by: zky.zhoukeyong <[email protected]>
(cherry picked from commit 2ffd6d7b289c287f3ae49f120dbf963b7b57ffe0)
Signed-off-by: zky.zhoukeyong <[email protected]>
---
common/src/main/scala/org/apache/celeborn/common/CelebornConf.scala | 2 +-
docs/configuration/worker.md | 2 +-
2 files changed, 2 insertions(+), 2 deletions(-)
diff --git a/common/src/main/scala/org/apache/celeborn/common/CelebornConf.scala b/common/src/main/scala/org/apache/celeborn/common/CelebornConf.scala
index c6bc6310b..413da1676 100644
--- a/common/src/main/scala/org/apache/celeborn/common/CelebornConf.scala
+++ b/common/src/main/scala/org/apache/celeborn/common/CelebornConf.scala
@@ -2474,7 +2474,7 @@ object CelebornConf extends Logging {
.doc("If direct memory usage is less than this limit, worker will resume.")
.version("0.2.0")
.doubleConf
- .createWithDefault(0.5)
+ .createWithDefault(0.7)
val WORKER_CONGESTION_CONTROL_ENABLED: ConfigEntry[Boolean] =
buildConf("celeborn.worker.congestionControl.enabled")
diff --git a/docs/configuration/worker.md b/docs/configuration/worker.md
index e724fa2aa..3ba3090dd 100644
--- a/docs/configuration/worker.md
+++ b/docs/configuration/worker.md
@@ -39,7 +39,7 @@ license: |
| celeborn.worker.directMemoryRatioForReadBuffer | 0.1 | Max ratio of direct memory for read buffer | 0.2.0 |
| celeborn.worker.directMemoryRatioToPauseReceive | 0.85 | If direct memory usage reaches this limit, the worker will stop to receive data from Celeborn shuffle clients. | 0.2.0 |
| celeborn.worker.directMemoryRatioToPauseReplicate | 0.95 | If direct memory usage reaches this limit, the worker will stop to receive replication data from other workers. This value should be higher than celeborn.worker.directMemoryRatioToPauseReceive. | 0.2.0 |
-| celeborn.worker.directMemoryRatioToResume | 0.5 | If direct memory usage is less than this limit, worker will resume. | 0.2.0 |
+| celeborn.worker.directMemoryRatioToResume | 0.7 | If direct memory usage is less than this limit, worker will resume. | 0.2.0 |
| celeborn.worker.fetch.heartbeat.enabled | false | enable the heartbeat from worker to client when fetching data | 0.3.0 |
| celeborn.worker.fetch.io.threads | <undefined> | Netty IO thread number of worker to handle client fetch data. The default threads number is the number of flush thread. | 0.2.0 |
| celeborn.worker.fetch.port | 0 | Server port for Worker to receive fetch data request from ShuffleClient. | 0.2.0 |
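
For reference, the back-pressure thresholds listed in the table above could be pinned explicitly in a worker's configuration, as in the fragment below. This is only a minimal sketch: it assumes the settings are placed in `conf/celeborn-defaults.conf` using whitespace-separated key/value pairs, and the values are simply the defaults shown in this diff, not tuning recommendations.

```properties
# Assumed location: conf/celeborn-defaults.conf (one "key value" pair per line).

# Stop receiving data from shuffle clients once direct memory usage reaches 85%.
celeborn.worker.directMemoryRatioToPauseReceive 0.85

# Stop receiving replication data from other workers at 95%.
# Per the docs, this should be higher than directMemoryRatioToPauseReceive.
celeborn.worker.directMemoryRatioToPauseReplicate 0.95

# Resume once direct memory usage is less than this ratio.
# The default changes from 0.5 to 0.7 in this commit.
celeborn.worker.directMemoryRatioToResume 0.7
```

With the new default, a paused worker resumes once usage is below 70% of direct memory rather than 50%. A deployment that prefers the previous behavior could presumably set `celeborn.worker.directMemoryRatioToResume` back to 0.5.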