You can also configure your own Docker images if you like, instructions
here: https://beam.apache.org/documentation/runtime/environments/

On Wed, Nov 27, 2019 at 12:38 AM Carl Thomé <[email protected]> wrote:

> Hi,
>
> I have a Beam pipeline written in the Python SDK that decodes audio files
> into TFRecord:s. I'd like to run it on DataFlow but I'm missing libsndfile1
> in the workers.
>
> Is there any way of configuring the base image for the DataFlow workers
> (e.g. Dockerfile + apt install) to get audio decoding working?
>
> On a similar note, when it comes to Python dependencies in the DataFlow
> runtime (like librosa), is there a wish list somewhere on which we can
> upvote missing Python libraries?
>
> Cheers,
> Carl Thomé
>

Reply via email to