Hi Ana,

In general, worker nodes do not share any state and cannot decide for themselves which work to accept and which to reject. How work is distributed to downstream processing is defined by the runner, not by the Beam model. On the other hand, what you are asking for might be achievable with a grouping operation: either a GroupByKey or a stateful DoFn might help. Can you describe your intent in more detail?
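
To make the grouping idea a bit more concrete, here is a minimal plain-Python sketch of the logic. It is not Beam API, only a model of what a filter followed by a group-by-key step would compute. The names FAVOURITE_WORDS, group_by_key and count_favourite_words are hypothetical and chosen for illustration. The point is that the "opinion" lives in the pipeline logic, so any worker that receives an element applies the same rule:

```python
from collections import defaultdict

# Hypothetical set of "favourite" words. In a real pipeline this would be
# configuration of the transform, not something an individual worker picks.
FAVOURITE_WORDS = {"beam", "runner"}


def group_by_key(pairs):
    """Model of a grouping step: collect all values that share a key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return dict(grouped)


def count_favourite_words(lines, favourites=FAVOURITE_WORDS):
    """Count occurrences of favourite words only, ignoring all other words."""
    # The filter is part of the processing logic, so it behaves the same
    # no matter which worker happens to process a given line.
    pairs = [
        (word, 1)
        for line in lines
        for word in line.lower().split()
        if word in favourites
    ]
    # After grouping, each key (word) carries all of its 1s, which are summed.
    return {word: sum(ones) for word, ones in group_by_key(pairs).items()}


counts = count_favourite_words(
    ["Beam runs on a runner", "the runner schedules work", "beam beam"]
)
print(counts)
```

In Beam itself the same shape would presumably be a filtering step followed by a GroupByKey or a per-key combine, with the runner still deciding which worker handles which key.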

Best,

 Jan

On 9/7/21 12:32 PM, Ana Markovic wrote:
To whom this may concern,

I've been looking into polyglot data processing frameworks recently, and I read Beam's documentation as well as developed a few examples to get some hands-on experience. I've been wondering, and I haven't found this in the documentation, is there a way to set up worker nodes so they are "opinionated" or "smart" in a sense that they can decide for themselves which jobs they will perform? For example, in a word count example, an opinionated worker node could only decide to monitor occurrences of a specific word if it's among the node's favourite words.

I hope I explained it well, but please let me know if more details are needed to answer this question.

Thankful in advance,
Ana
