Yuzhao Chen created STORM-2286: ---------------------------------- Summary: Strom Rebalance command should support arbitrary component parallelism Key: STORM-2286 URL: https://issues.apache.org/jira/browse/STORM-2286 Project: Apache Storm Issue Type: Bug Components: storm-core Affects Versions: 0.9.3, 0.9.6, 0.10.1, 1.0.1 Reporter: Yuzhao Chen Fix For: 1.x
For legacy reasons, config TOPOLOGY-TASKS is considered first when schedule a topology, for a component, if user don’t specify TOPOLOGY-TASKS, storm just override it to be equal to component parallelism hint, and schedule based on TOPOLOGY-TASKS later on. This works for the most cases, but not Rebalance command. Now, when do Rebalance, the StormBase :component->executors attribute will be overridden in Zookeeper which is used to partition component tasks into executors, as we said above, the TOPOLOGY-TASKS is considered here as the real tasks number for components, something goes weird here: If we override a bigger executor numbers for a component when do rebalance, it just don’t work because smaller TOPOLOGY-TASKS [ not changed since first submitted at all ]is partitioned into bigger number of executors which read from ZooKeeper overridden by Rebalance command, but for smaller task, it works fine. I see that storm support a command like this now: [storm rebalance topology-name [-w wait-time-secs] [-n new-num-workers] [-e component=parallelism]*] which indicate that user can override a component parallelism freely, i think it’s more sensible to support this and it's meaningless to have a restriction like before. -- This message was sent by Atlassian JIRA (v6.3.4#6332)