Hi, Well that depends on how you pull data into your storm cluster. If you are pulling it from a queue, you will need to know how the queuing system allows you to retrieve the data. You may have standards such as JMS or even specific implementations as in kafka. Some people even pull data from file systems and databases. Please take a call based on type of data source, acceptable latency in fetching data and the cost of having such a system.
Regards, Richards Peter.
