ASF GitHub Bot logged work on BEAM-8539:

                Author: ASF GitHub Bot
            Created on: 09/Nov/19 21:44
            Start Date: 09/Nov/19 21:44
    Worklog Time Spent: 10m 
      Work Description: chadrik commented on pull request #9965: [BEAM-8539] 
Make job state transitions in python-based runners consistent with java-based 
URL: https://github.com/apache/beam/pull/9965#discussion_r344463156

 File path: model/job-management/src/main/proto/beam_job_api.proto
 @@ -213,16 +213,37 @@ message JobMessagesResponse {
 // without needing to pass through STARTING.
 message JobState {
   enum Enum {
+    // The job state reported by a runner cannot be interpreted by the SDK.
+    // The job has been paused, or has not yet started.
     STOPPED = 1;
+    // The job is currently running. (terminal)
 Review comment:
   whoops! fixed.
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
For queries about this service, please contact Infrastructure at:

Issue Time Tracking

    Worklog Id:     (was: 340966)
    Time Spent: 4h 50m  (was: 4h 40m)

> Clearly define the valid job state transitions
> ----------------------------------------------
>                 Key: BEAM-8539
>                 URL: https://issues.apache.org/jira/browse/BEAM-8539
>             Project: Beam
>          Issue Type: Improvement
>          Components: beam-model, runner-core, sdk-java-core, sdk-py-core
>            Reporter: Chad Dombrova
>            Priority: Major
>          Time Spent: 4h 50m
>  Remaining Estimate: 0h
> The Beam job state transitions are ill-defined, which is big problem for 
> anything that relies on the values coming from JobAPI.GetStateStream.
> I was hoping to find something like a state transition diagram in the docs so 
> that I could determine the start state, the terminal states, and the valid 
> transitions, but I could not find this. The code reveals that the SDKs differ 
> on the fundamentals:
> Java InMemoryJobService:
>  * start state: *STOPPED*
>  * run - about to submit to executor:  STARTING
>  * run - actually running on executor:  RUNNING
>  * terminal states: DONE, FAILED, CANCELLED, DRAINED
> Python AbstractJobServiceServicer / LocalJobServicer:
>  * start state: STARTING
>  * terminal states: DONE, FAILED, CANCELLED, *STOPPED*
> I think it would be good to make python work like Java, so that there is a 
> difference in state between a job that has been prepared and one that has 
> additionally been run.
> It's hard to tell how far this problem has spread within the various runners. 
>  I think a simple thing that can be done to help standardize behavior is to 
> implement the terminal states as an enum in the beam_job_api.proto, or create 
> a utility function in each language for checking if a state is terminal, so 
> that it's not left up to each runner to reimplement this logic.

This message was sent by Atlassian Jira

Reply via email to