[ 
https://issues.apache.org/jira/browse/BEAM-13760?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brian Hulette updated BEAM-13760:
---------------------------------
    Fix Version/s: 2.37.0
       Resolution: Fixed
           Status: Resolved  (was: Open)

> Add randomness to default Dataflow job name in Python sdk
> ---------------------------------------------------------
>
>                 Key: BEAM-13760
>                 URL: https://issues.apache.org/jira/browse/BEAM-13760
>             Project: Beam
>          Issue Type: Improvement
>          Components: runner-dataflow
>            Reporter: Will Nicholson
>            Assignee: Will Nicholson
>            Priority: P2
>             Fix For: 2.37.0
>
>   Original Estimate: 48h
>          Time Spent: 2h 10m
>  Remaining Estimate: 45h 50m
>
> Currently, when a Dataflow job is created with the default name in python, 
> the name is a concatenation of the word "beamapp", the username, and the time 
> in microseconds, as seen 
> [here|https://github.com/apache/beam/blob/master/sdks/python/apache_beam/runners/dataflow/internal/apiclient.py#L415-L428].
> Therefore, when two jobs are created by the same user at the same time, the 
> jobs names collide and the second job fails. 
> However, the Java SDK has already solved this problem, by appending a random 
> hex string to the job name, seen 
> [here|https://github.com/apache/beam/blob/master/sdks/java/core/src/main/java/org/apache/beam/sdk/options/PipelineOptions.java#L338-L351].
> The objective of this issue is to align the python sdk with the java sdk, by 
> appending a random string to the default job name. 



--
This message was sent by Atlassian Jira
(v8.20.1#820001)

Reply via email to