Hi, I'm trying to follow this tutorial 
(https://beam.apache.org/documentation/runners/flink/) for a portable runner 
with gosdk.

My setup is as follows:
- flink cluster on a linux host by `./bin/start-cluster.sh`
- JobService on the same host by: `docker run --net=host
    apache/beam_flink1.13_job_server:latest
    --flink-master=localhost:8081`
- run some pipeline with `./go_example_bin -runner=universal
    -endpoint=localhost:8099`

The job always fails with an exception like
    `java.lang.IllegalStateException: No container running for id
    3c831fb28521b7575bdf78d50141a77140674e7c3b2a21786d46d9e47b2bea18`.

It seems that the worker container exits unexpectedly, and the worker
    boot log is as follows:

2021/10/30 03:24:54 Provision info:
pipeline_options:{fields:{key:"beam:option:app_name:v1"  
value:{string_value:"go-job-1-1635564280252965730"}}  
fields:{key:"beam:option:experiments:v1"  
value:{list_value:{values:{string_value:"beam_fn_api"}}}}  
fields:{key:"beam:option:flink_master:v1"  
value:{string_value:"localhost:8281"}}  fields:{key:"beam:option:go_options:v1" 
 value:{struct_value:{fields:{key:"options"  
value:{struct_value:{fields:{key:"hooks"  value:{string_value:"{}"}}}}}}}}  
fields:{key:"beam:option:job_name:v1"  
value:{string_value:"go0job0101635564280252965730-root-1030032442-454e2e1d"}}  
fields:{key:"beam:option:options_id:v1"  value:{number_value:5}}  
fields:{key:"beam:option:output_executable_path:v1"  
value:{null_value:NULL_VALUE}}  fields:{key:"beam:option:parallelism:v1"  
value:{number_value:-1}}  fields:{key:"beam:option:retain_docker_containers:v1" 
 value:{bool_value:false}}  fields:{key:"beam:option:runner:v1"  
value:{null_value:NULL_VALUE}}}  logging_endpoint:{url:"localhost:14431"}  
artifact_endpoint:{url:"localhost:29083"}  
control_endpoint:{url:"localhost:12061"}  
dependencies:{type_urn:"beam:artifact:type:file:v1"  
type_payload:"\ng/tmp/beam-artifact-staging/05eadb2d287a6baaade937ec1d1d2c1d460cbf33658368b55612325600a6d9f4/1-go-worker"
  role_urn:"beam:artifact:role:staging_to:v1"  role_payload:"\n\x06worker"}
2021/10/30 03:24:54 Initializing Go harness: /opt/apache/beam/boot --id=1-1 
--provision_endpoint=localhost:14493
2021/10/30 03:25:02 Failed to retrieve staged files: failed to retrieve 
/tmp/staged in 3 attempts: failed to retrieve chunk for /tmp/staged/worker
        caused by:
rpc error: code = Unknown desc = ; failed to retrieve chunk for 
/tmp/staged/worker
        caused by:
rpc error: code = Unknown desc = ; failed to retrieve chunk for 
/tmp/staged/worker
        caused by:
rpc error: code = Unknown desc = ; failed to retrieve chunk for 
/tmp/staged/worker
        caused by:
rpc error: code = Unknown desc =
    

BTW, the staged artifact can be found at
    
`/tmp/beam-artifact-staging/05eadb2d287a6baaade937ec1d1d2c1d460cbf33658368b55612325600a6d9f4/1-go-worker`
    in the container of jobservice. But I cannot figure out why the
    streaming rpc failed to getArtifact from the
    ArtifactRetrievalService.


Many thanks
wzjmit
    

Reply via email to