This page should help you get python and non python deps installed:
https://beam.apache.org/documentation/sdks/python-pipeline-dependencies/

On Wed, Nov 27, 2019 at 12:38 AM Carl Thomé <[email protected]> wrote:

> Hi,
>
> I have a Beam pipeline written in the Python SDK that decodes audio files
> into TFRecord:s. I'd like to run it on DataFlow but I'm missing libsndfile1
> in the workers.
>
> Is there any way of configuring the base image for the DataFlow workers
> (e.g. Dockerfile + apt install) to get audio decoding working?
>
> On a similar note, when it comes to Python dependencies in the DataFlow
> runtime (like librosa), is there a wish list somewhere on which we can
> upvote missing Python libraries?
>
> Cheers,
> Carl Thomé
>

Reply via email to