Support with an AutoValue coder error

2021-08-30 Thread Juan Pablo Feliciano Báez
Hi everyone. I'm trying to understand how to resolve an issue I get when I run a pipeline locally after adapting a Java POJO class to an Apache Beam Schema using the AutoValue approach. Below is the code of the transformation where the exception is thrown: public class

Beam Connector for Reading Data from Delta Lake

2021-08-30 Thread Michael Benenson
Hi folks, I have open-sourced a Beam connector for reading data from Delta Lake. Please see https://github.com/mbenenso/beam-deltalake for the connector and https://github.com/mbenenso/beam-deltalake-example for a usage example. -- Thanks -- Michael

[ANNOUNCE] Beam 2.32.0 Released

2021-08-30 Thread Ankur Goenka
The Apache Beam team is pleased to announce the release of version 2.32.0. Apache Beam is an open source unified programming model to define and execute data processing pipelines, including ETL, batch and stream (continuous) processing. See https://beam.apache.org You can download the release

Re: KinesisIO - support for enhanced fan-out

2021-08-30 Thread Alexey Romanenko
There is already an open Jira for this feature [1]. Pavel, feel free to add your thoughts on this there or on dev@. I left my comment on related PR [2] and I’d be happy to help with moving this feature forward. It would be great to re-use the work that was already done. — Alexey [1]

Re: Kafka IO using Python 3.8, Beam 2.31, and Flink 1.13; NoClassDefFoundError HpackDecoder

2021-08-30 Thread Jeremy Lewi
Hi Jan, I should clarify that I'm using the portable job runner, not the Flink job runner. > a) why do you need to specify a default environment for the JobServer? That should be done via the PipelineOptions of the SDK that you use for building your Pipeline, or is there any other reason for that?

Freeze time to test a python pipeline with TestPipeline and DirectRunner

2021-08-30 Thread Quentin Sommer
Hi, I'm working on a Python streaming pipeline and I'm trying to simulate different execution times to test business logic that depends on the current time and the record data. When I freeze the time using freezegun, the TestPipeline never finishes. I'm guessing it is

Re: Kafka IO using Python 3.8, Beam 2.31, and Flink 1.13; NoClassDefFoundError HpackDecoder

2021-08-30 Thread Jan Lukavský
Hi Jeremy, +dev, as this might be interesting for the dev mailing list as well. A couple of questions: a) why do you need to specify a default environment for the JobServer? That should be done via the PipelineOptions of the SDK that you use for building your