Weichen Xu created SPARK-44909:
----------------------------------
Summary: Skip starting torch distributor log streaming server when
it is not available
Key: SPARK-44909
URL: https://issues.apache.org/jira/browse/SPARK-44909
Project: Spark
Issue Type: Improvement
Components: ML
Affects Versions: 3.5
Reporter: Weichen Xu
Skip starting torch distributor log streaming server when it is not available.
In some cases, e.g., in a databricks connect cluster, there is some network
limitation that casues starting log streaming server failure, but, this does
not need to break torch distributor training routine.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]