damccorm opened a new issue, #21123:
URL: https://github.com/apache/beam/issues/21123
I'm running TFX pipelines on a Flink cluster using Beam in k8s. However,
extra python packages passed to the Flink runner (or rather beam worker
side-car) are only installed once per deployment cycle. Example:
- Flink is deployed and is up and running
- A TFX pipeline starts, submits a job to Flink along with a python whl of
custom code and beam ops.
- The beam worker installs the package and the pipeline finishes
succesfully.
- A new TFX pipeline is build where a new beam fn is introduced, the
pipline is started and the new whl is submitted as in step 2).
- This time, the new package is not being installed in the beam worker
causing the job to fail due to a reference which does not exist in the beam
worker, since it didn't install the new package.
I started using Flink from beam version 2.27 and it has been an issue all
the time.
Imported from Jira
[BEAM-12792](https://issues.apache.org/jira/browse/BEAM-12792). Original Jira
may contain additional context.
Reported by: ConverJens.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]