Hi I have set up a Storm cluster and also a MongoDB cluster using a common set of 10 nodes. My topology involves a spout that connects with a bolt using fields grouping. The bolt contains the logic to put received data into MongoDB. The degree of parallelism for spout is 1 while that for the bolt is set to10.
I need to ensure that the bolt runs with one instance on each of the 10 nodes, to involve all machines. However by default, I see that single spout and the 10 bolt instances all run on a single node. The rest 9 nodes are left idle. Can we somehow have Storm to run the 10 bolt instances on individual machines on my cluster rather than scheduling all of them on a single node? Please suggest. Regards, Raman
