I have a topology where each datum undergoes a series of processes. I have currently implemented the different processes in individual bolts. In my topology, Bolt 1 consumes from kafka and processes all data points. Bolt 2 processes entire output of bolt 1. Bolt 3 processes only certain data types output by bolt 1 (which amounts to about half of the data output by bolt 1).
What is the overhead on bolt 1 to emit two separate streams for bolts 2 and 3? Which really would help me decide whether 1. Should bolt 1 to emit two separate streams for bolt 2 and bolt 3 OR 2. should bolt 1 emit a single stream which bolts 2 and 3 consume and put an IF-ELSE inside bolt 3 to process only the appropriate data types? - R
