Sam Rohde created BEAM-6777:
-------------------------------
Summary: SDK Harness Resilience
Key: BEAM-6777
URL: https://issues.apache.org/jira/browse/BEAM-6777
Project: Beam
Issue Type: Improvement
Components: runner-dataflow
Reporter: Sam Rohde
Assignee: Sam Rohde
If the Python SDK Harness crashes in any way (user code exception, OOM, etc)
the job will hang and waste resources. The fix is to add a daemon in the SDK
Harness and Runner Harness to communicate with Dataflow to restart the VM when
stuckness is detected.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)