Hi all,

Yesterday the TensorFlow community announced the new tf.Transform library [1]. It lets users define preprocessing pipelines and run them on large-scale data processing frameworks, and it is specifically designed to work with Apache Beam. It is great to see the Python SDK gaining a larger ecosystem and increased usage.
It is also worth mentioning that PMC member Robert Bradshaw was one of the contributors to this new library.

Thank you,
Ahmet

[1] https://research.googleblog.com/2017/02/preprocessing-for-machine-learning-with.html