well written. I would rewrite the first sentence of the "What is Flink?" section to something like:
"Apache Flink is a general-purpose data processing engine for clusters. It runs on Hadoop components such as HDFS and YARN, as well as stand-alone." Rational: - Flink isn't new (and immature). The code is out for several years. - Compute engine might also be understood as something like an MPI framework. - I would emphasize that we do not require HDFS and YARN, but play nice with it. Do we want to highlight the iterative data flows? 2014-08-26 16:02 GMT+02:00 Stephan Ewen <[email protected]>: > I like it, too >
