Flink Streaming : PartitionBy vs GroupBy differences

2015-07-03 Thread tambunanw
Hi All, I'm trying to digest what's the difference between this two. From my experience in Spark GroupBy will cause shuffling on the network. Is that the same case in Flink ? I've watch videos and read a couple docs about Flink that's actually Flink will compile the user code into it's own opti

Open method is not called with custom implementation RichWindowMapFunction

2015-07-03 Thread tambunanw
Hi All, I'm trying to create some experiment with rich windowing function and operator state. I modify the streaming stock prices from https://github.com/mbalassi/flink/blob/stockprices/flink-staging/flink-streaming/flink-streaming-examples/src/main/scala/org/apache/flink/streaming/scala/exampl

Batch Processing as Streaming

2015-07-02 Thread tambunanw
Hi All, I see that the way batch processing works in Flink is quite different with Spark. It's all about using streaming engine in Flink. I have a couple of question 1. Is there any support on Checkpointing on batch processing also ? Or that's only for streaming 2. I want to ask about operat

Data Source from Cassandra

2015-07-02 Thread tambunanw
Hi All, I want to if there's a custom data source available for Cassandra ? >From my observation seems that we need to implement that by extending InputFormat. Is there any guide on how to do this robustly ? Cheers -- View this message in context: http://apache-flink-user-mailing-list-arc