Sachin Goyal created STORM-1686:
-----------------------------------
Summary: Make local-grouping an independent option rather than
just LOCAL_OR_SHUFFLE
Key: STORM-1686
URL: https://issues.apache.org/jira/browse/STORM-1686
Project: Apache Storm
Issue Type: Improvement
Reporter: Sachin Goyal
Currently, Storm distributes tuples to bolts irrespective of their locality.
So if 1 spout and 5 bolts are running in the same process, Storm does not give
any preference to those 5 local bolts (except with LOCAL_OR_SHUFFLE grouping).
As a result, there is a lot of inter-machine/inter-process communication for
sending out the tuples.
It would be really good if local-mode could be made an independent option
(*preferLocalBolts*) rather than LOCAL_OR_SHUFFLE being the only grouping
that can distribute locally.
At least for fields-grouping, the local-mode would make a lot of sense.
Currently, the alternatives to local-mode-with-fields-grouping are:
# Run several topologies, each with 1 worker so as to limit the spouts/bolts to
one machine only.
# Have another bolt between a spout and bolt that can be used for directing the
tuple traffic by directed streams.
But both of these seem to be a lot of work for something that should be made an
independent option - *preferLocalBolts*.
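To make the request concrete, here is a minimal sketch of what fields-grouping with *preferLocalBolts* might do when picking a target task. It is plain Java with no Storm dependency; the class name LocalPreferringFieldsChooser and its methods are hypothetical and are not part of Storm. The assumption is that the caller already knows the task ids of the downstream bolt and which task ids live in the current worker process.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

/**
 * Hypothetical sketch, not part of Storm: fields-style task selection that
 * prefers target tasks running in the emitting worker.
 */
final class LocalPreferringFieldsChooser {
    private final List<Integer> candidates;

    /**
     * @param targetTasks all task ids of the downstream bolt (must not be empty)
     * @param localTasks  task ids hosted by the current worker process
     */
    LocalPreferringFieldsChooser(List<Integer> targetTasks, Set<Integer> localTasks) {
        if (targetTasks.isEmpty()) {
            throw new IllegalArgumentException("targetTasks must not be empty");
        }
        List<Integer> local = new ArrayList<>();
        for (Integer task : targetTasks) {
            if (localTasks.contains(task)) {
                local.add(task);
            }
        }
        // Fall back to every target task when none of them is local.
        this.candidates = local.isEmpty() ? new ArrayList<>(targetTasks) : local;
    }

    /** Maps a grouping key to one candidate task; equal keys map to the same task. */
    int chooseTask(Object key) {
        int hash = (key == null) ? 0 : key.hashCode();
        // floorMod keeps the index non-negative for negative hash codes.
        return candidates.get(Math.floorMod(hash, candidates.size()));
    }
}
```

In a real topology this logic would presumably be plugged in through Storm's CustomStreamGrouping interface, with the local task ids taken from the worker context; that wiring is an assumption and is not shown here. Note that with this approach the same key can land on different tasks in different workers, so it only suits cases where per-worker (not global) key affinity is enough.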
Also see the following SO content:
# [way-to-apply-multiple-groupings-in-storm|http://stackoverflow.com/questions/36368224/is-there-a-way-to-apply-multiple-groupings-in-storm/36374837#36374837]
# [fields-grouping-for-same-machine|http://stackoverflow.com/questions/35132136/storm-fields-grouping-for-same-machine?lq=1]
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)