You can also configure your own Docker images if you like, instructions here: https://beam.apache.org/documentation/runtime/environments/
On Wed, Nov 27, 2019 at 12:38 AM Carl Thomé <[email protected]> wrote: > Hi, > > I have a Beam pipeline written in the Python SDK that decodes audio files > into TFRecord:s. I'd like to run it on DataFlow but I'm missing libsndfile1 > in the workers. > > Is there any way of configuring the base image for the DataFlow workers > (e.g. Dockerfile + apt install) to get audio decoding working? > > On a similar note, when it comes to Python dependencies in the DataFlow > runtime (like librosa), is there a wish list somewhere on which we can > upvote missing Python libraries? > > Cheers, > Carl Thomé >
