Hi, I have a Beam pipeline written with the Python SDK that decodes audio files into TFRecords. I'd like to run it on Dataflow, but libsndfile1 is missing on the workers.
Is there any way to configure the base image for the Dataflow workers (e.g. a Dockerfile with apt install) to get audio decoding working?

On a similar note, when it comes to Python dependencies in the Dataflow runtime (like librosa), is there a wish list somewhere where we can upvote missing Python libraries?

Cheers,
Carl Thomé
