Sam Whittle created BEAM-10808:
----------------------------------

             Summary: StreamingDataflowWorker streaming rpcs due not always 
observe stream failures, until timeout
                 Key: BEAM-10808
                 URL: https://issues.apache.org/jira/browse/BEAM-10808
             Project: Beam
          Issue Type: Bug
          Components: runner-dataflow
            Reporter: Sam Whittle
            Assignee: Sam Whittle


The StreamingDataflow worker communicates to Streaming Engine with grpc 
streaming rpcs. Occasionaly with service failures, the error is not observed by 
the streaming rpc until the deadline is exceeded.  As the deadline is multiple 
minutes, this adds unnecessary latency to pipelines that are otherwise keeping 
up with seconds delay.

Testing has shown that grpc channel health checks are not sufficient but that 
application level health checks ensure that streaming rpc health issues are 
detected earlier than the stream timeout.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to