Hi guys, We are glad to announce that the spark runner POC that was re-written from scratch using the structured-streaming framework and the dataset API can now run WordCount !
It is still embryonic. For now it only runs in batch mode and there is no fancy stuff like state, timer, SDF, metrics, ... but it is still a major step forward ! Streaming support work has just started. You can find the branch here: https://github.com/apache/beam/tree/spark-runner_structured-streaming Enjoy, Etienne