[ 
https://issues.apache.org/jira/browse/BEAM-8660?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anonymous updated BEAM-8660:
----------------------------
    Status: Triage Needed  (was: Resolved)

> Override returned artifact staging endpoint
> -------------------------------------------
>
>                 Key: BEAM-8660
>                 URL: https://issues.apache.org/jira/browse/BEAM-8660
>             Project: Beam
>          Issue Type: Improvement
>          Components: runner-flink
>            Reporter: Kyle Weaver
>            Priority: P3
>              Labels: portability-flink
>             Fix For: Not applicable
>
>          Time Spent: 4.5h
>  Remaining Estimate: 0h
>
> When running Beam Python pipelines on Flink/Spark/etc, we connect the SDK to 
> the job server using the job_endpoint option. The job server then returns the 
> address of the artifact staging endpoint to the SDK.
> This is problematic when running the job server in network environments where 
> the job server is not aware of its external hostname, for example Kubernetes. 
> In this case, the job server will return something like localhost:8098, which 
> might not be correct. While we do have a --job-host option, this is used both 
> internally and externally, and the internal and external host names may not 
> be the same.
> One solution would be to configure two separate host names in the job server. 
> However I do not prefer this option because of the complexity it adds.
> The more straightforward solution is to add an option to Python that 
> overrides the artifact staging endpoint returned by the server.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to