Hi,
In June 2017, Google introduced server-based shuffle for Datatflow
pipeline, which can result in 5x performance improvement. However, at the
time of announcement this feature was only available for Cloud Dataflow SDK
for Java version 1. What is the status for Dataflow SDK for Python? Is it
Related question:
How can we tell if the docker image of our binary contains the cython
optimized beam or the slower codepath?
The image was built on Google cloud (using *gcloud container builds submit*
).
On Mon, Feb 12, 2018 at 9:32 PM, Ahmet Altay wrote:
> +1 to wheels.