Hello

In storm topology we r receiving tuples in millions from Kafka and we have
to perform some calculations in bolt. Parallely we have bolt that starts
writing into hdfs ,now we have parallelism hint for writing the file is 8.
So 8 files will be there.
Actually problem is once the snapshot data is enriched Nd written to
multiple file nd completed,we have to trigger the other job that will copy
the records from files into database.
How can we find with multiple files created Nd bolt writing paraalely in
files which is the last record written so that we can trigger nextjob.Any
ideas?

Reply via email to