Hi All, i am new to Storm and Kafka and working on POC.
my requirement is get a message from Kafka in json format, spout reading that message and firts bolt converting the json message to different format like csv and the second bolt saving it to hadoop. now i came up with initial design where i can use kafkaspout to read kafka topics and bolt converting it to csv file and next bolt saving in hadoop. i have following questions can the first bold which coverts the message to csv file can omit it? the file would be saving on disk. can a file which is saved on disk can be omitted. how does the second bolt read the file which is saved on disk by first bolt? do we need to serialize message ommitted by spout and/or bolt? sorry if the questions sound silly, this is my first topology with minimum knowledge of storm. if you guys think of proper design how to implement the my requirement can you please let me know thanks in advance -Praveen
