Hello, We have some in house real time streaming jobs written for Storm and want to see the possibility to migrate to Spark Streaming in the future as our team all think Spark is a very promising technologies (one platform to execute both realtime & interactive jobs) and with excellent documentations.
1. If we focus on the streaming capabilities, what are the main pros/cons at the current moment, is Spark streaming suitable for production use now? 2. In term of message reliability and transaction support, I assume both need to rely on zookeeper, right? 3. In Storm, we are using Topology/Spout/Bolt as the data model, how to translate them to Spark streaming if we want to rewrite our system? Are there any migration guide? 4. Can Spark do distributed RPC like Storm? Thanks for any idea.
