Will Nicholson created BEAM-13760:
-------------------------------------
Summary: Add randomness to default Dataflow job name in Python sdk
Key: BEAM-13760
URL: https://issues.apache.org/jira/browse/BEAM-13760
Project: Beam
Issue Type: Improvement
Components: runner-dataflow
Reporter: Will Nicholson
Currently, when a Dataflow job is created with the default name in Python, the
name is a concatenation of the word "beamapp", the username, and the current
time to microsecond precision, as seen
[here|https://github.com/apache/beam/blob/master/sdks/python/apache_beam/runners/dataflow/internal/apiclient.py#L415-L428].
Therefore, when two jobs are created by the same user with identical
timestamps, the job names collide and the second job fails.
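
The collision can be illustrated with a minimal sketch. The helper name
sketch_default_job_name and the exact timestamp format are assumptions made
for illustration; this is not the code in apiclient.py, only a function that
follows the naming scheme described above.

```python
from datetime import datetime


def sketch_default_job_name(user_name, now):
    """Hypothetical stand-in for the default naming scheme described above.

    Builds "beamapp-<user>-<timestamp>", where the timestamp includes
    microseconds. The format string is an assumption for illustration.
    """
    date_component = now.strftime('%m%d%H%M%S-%f')
    return 'beamapp-{}-{}'.format(user_name.lower(), date_component)


# Two submissions by the same user that observe the same clock value.
now = datetime(2022, 1, 27, 12, 0, 0, 123456)
first = sketch_default_job_name('alice', now)
second = sketch_default_job_name('alice', now)

# Nothing in the name distinguishes the two jobs, so the names are equal.
print(first)
print(first == second)
```

Because every component of the name is derived from the user and the clock,
two submissions that see the same clock value necessarily get the same name.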
The Java SDK already avoids this problem by appending a random hex string to
the job name, as seen
[here|https://github.com/apache/beam/blob/master/sdks/java/core/src/main/java/org/apache/beam/sdk/options/PipelineOptions.java#L338-L351].
The objective of this issue is to align the Python SDK with the Java SDK by
appending a random string to the default job name.
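
One possible shape for the change, as a sketch only: the helper name
sketch_random_job_name, the suffix_bytes parameter, the timestamp format, and
the use of secrets.token_hex are all assumptions for illustration, not the
Beam implementation and not necessarily how the Java SDK generates its suffix.

```python
import secrets
from datetime import datetime


def sketch_random_job_name(user_name, now, suffix_bytes=4):
    """Hypothetical default job name with a random hex suffix.

    Builds "beamapp-<user>-<timestamp>-<random hex>". With suffix_bytes=4
    the suffix is 8 hex characters. All names here are illustrative.
    """
    date_component = now.strftime('%m%d%H%M%S-%f')
    random_component = secrets.token_hex(suffix_bytes)
    return 'beamapp-{}-{}-{}'.format(
        user_name.lower(), date_component, random_component)


# Same user and same clock value, but the random suffix separates the names.
now = datetime(2022, 1, 27, 12, 0, 0, 123456)
name_a = sketch_random_job_name('alice', now)
name_b = sketch_random_job_name('alice', now)
print(name_a)
print(name_b)
```

A 4-byte suffix does not make collisions impossible, only very unlikely for
jobs that share a user and a timestamp; a longer suffix lowers the odds
further at the cost of a longer job name.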