Hi,

I have a Beam pipeline written in the Python SDK that decodes audio files
into TFRecord:s. I'd like to run it on DataFlow but I'm missing libsndfile1
in the workers.

Is there any way of configuring the base image for the DataFlow workers
(e.g. Dockerfile + apt install) to get audio decoding working?

On a similar note, when it comes to Python dependencies in the DataFlow
runtime (like librosa), is there a wish list somewhere on which we can
upvote missing Python libraries?

Cheers,
Carl Thomé

Reply via email to