Re: Using Dataflow with Pubsub input connector in batch mode

2024-01-21 Thread Alex Van Boxel
. Use the export to Avro (the schema is in the second link), and then your batch beam pipeline input is a bounded input. _/ _/ Alex Van Boxel On Fri, Jan 19, 2024 at 12:18 AM Reuven Lax via user wrote: > Some comments here: >1. All messages in a PubSub topic is not a well-defined sta

Submit a topic for "Future of Beam" community session @ Beam Summit

2023-06-06 Thread Alex Van Boxel
will take notes and share them back with the community! If you are unsure what kind of topic you could submit, I'll give one of *my examples*: "Beam is missing a simple Kubernetes-based runner and production ready, but it doesn't need to have all the features." _/ _/ Alex Van Boxel

Re: Beam pipeline parallel steps

2020-04-29 Thread Alex Van Boxel
Yes, this is not a problem. We do it regularly in our streaming pipelines as a single stream doesn't have enough load for ruining on a Dataflow. So we run different streams in parallel in a single Beam pipelines. Data wise the streams have nothing to do with each other, but transformation wise

Re: Distributed Tracing in Apache Beam

2020-04-17 Thread Alex Van Boxel
Can you explain a bit more of what you want to achieve here? Do you want to trace how your elements go to the pipeline or do you want to see how every ParDo interacts with external systems? On Fri, Apr 17, 2020, 17:38 Rion Williams wrote: > Hi all, > > I'm reaching out today to inquire if

Re: Beam Katas YouTube

2020-03-27 Thread Alex Van Boxel
That's nicely done! Congrats, going to share this immediately. And I actually didn't know where the name Beam came from, now I know :-) _/ _/ Alex Van Boxel On Fri, Mar 27, 2020 at 4:32 AM Henry Suryawirawan wrote: > Hello, > > Just would like to share that recently the Apache B

Re: Hello Beam Community!

2020-03-13 Thread Alex Van Boxel
Welcome Brittany.. _/ _/ Alex Van Boxel On Fri, Mar 13, 2020 at 2:31 AM Brittany Hermann wrote: > Hello Beam Community! > > My name is Brittany Hermann and I recently joined the Open Source team in > Data Analytics at Google. As a Program Manager, I will be focusing on > commu

Re: Missing permissions when running beam on Dataflow Runner

2020-01-16 Thread Alex Van Boxel
You can find all the information about permissions on this page: https://cloud.google.com/dataflow/docs/concepts/security-and-permissions On Thu, Jan 16, 2020, 22:46 Marco Mistroni wrote: > hi all > apologies for this basic question... > i have a simple workflow that i have been running

Re: Feedback on how we use Apache Beam in my company

2019-10-11 Thread Alex Van Boxel
Great writeup. You can add an additional benefit of docker vs templates: Dynamically reconfigure/rebuild your pipelines from external parameters (example arguments), iso only using the Value placeholders. _/ _/ Alex Van Boxel On Wed, Oct 9, 2019 at 4:55 PM Pierre Vanacker < pierre.va

Beam meetup Belgium - Friday November 22

2019-09-23 Thread Alex Van Boxel
Van Boxel

Re: Streaming inserts BQ with Java SDK Beam

2019-05-07 Thread Alex Van Boxel
a shorter time. I know you can force a batch mode in streaming mode, I don't know for the other way around. _/ _/ Alex Van Boxel On Tue, May 7, 2019 at 6:58 PM Andres Angel wrote: > Hello everyone, > > I need to use BigQuery inserts within my beam pipeline, hence I know well > t

Re: First steps: Beam SQL over ProtoBuf

2019-02-02 Thread Alex Van Boxel
nk you, > Anton > > On Fri, Feb 1, 2019 at 9:59 AM Alex Van Boxel wrote: > >> Hi all, >> >> I got my first test pipeline running with *Beam SQL over ProtoBuf*. I >> was so excited I need to shout this to the world ;-) >> >> >> https://github.c

First steps: Beam SQL over ProtoBuf

2019-02-01 Thread Alex Van Boxel
but this is the first step. If you're invested in ProtoBuf and use Beam, follow the repo: https://github.com/anemos-io/proto-beam <https://github.com/anemos-io/proto-beam/blob/master/transform/src/test/java/io/anemos/protobeam/transform/beamsql/BeamSqlPipelineTest.java> _/ _/ Alex Van Boxel

Beam courses

2019-01-13 Thread Alex Van Boxel
Hey all, Our team had the luxury of growing with Beam, we where Dataflow users before it was GA. But now our team has grown, due to a merger. As we will continue using Beam, but then over different sites I'm thinking about training. The question is... Should I create trainings myself. Or do

Beam Runners: What about Batch to Streaming transition

2019-01-11 Thread Alex Van Boxel
? Is it something the community is thinking about? _/ _/ Alex Van Boxel