On Mon, Sep 8, 2014 at 12:59 AM, Jiang Jacky <[email protected]> wrote:
> Hi, Guys > I just read the document about the apache storm, my question is the number > of workers should be equal to the number of supervisor node, right? because > workers control all treads across the topology, so we d better evenly > distribute the number of threads in each node. > > The following article explains the how storm process in parallel. Number of workers are not equal to the number of supervisor nodes. In each supervisor node you can configure how may workers to run on that node. http://www.michael-noll.com/blog/2012/10/16/understanding-the-parallelism-of-a-storm-topology/ > My another question is fieldsgrouping for thousands of IDs, if I want the > same ID always to be emitted to one task, then how can I declare output > fields? how can declareoutputfileds to work with fieldsgrpouping? > > You can declare an ID field and emit the id of the tuple from a bolt/spout. Then you can subscribe using the fieldsgrouping and tuples with the same id will go to the same task. Thanks, Supun.. > Thanks > > > > -- Supun Kamburugamuva Member, Apache Software Foundation; http://www.apache.org E-mail: [email protected]; Mobile: +1 812 369 6762 Blog: http://supunk.blogspot.com
