Re: how to convert a PCollection to an Array in local memory?

2017-07-10 Thread Kenneth Knowles
Hi bluejoe, Assuming you know that you have a very small PCollection, the way you can do this is by reading it as a side input. See https://beam.apache.org/documentation/programming-guide/#transforms-sideio Here's a snippet as a teaser to read the docs I link to: PCollection

Re: API to query the state of a running dataflow job?

2017-07-10 Thread Randal Moore
Thanks. I will create a JIRA ticket to try to explain. I am planning a service running in kubernetes that will submit dataflow jobs. It will need to know the status of jobs [across service restarts]. Alternatives might be to do some sort of GBK at the end of the job and post the result to