Hello
In storm topology we r receiving tuples in millions from Kafka and we have to perform some calculations in bolt. Parallely we have bolt that starts writing into hdfs ,now we have parallelism hint for writing the file is 8. So 8 files will be there. Actually problem is once the snapshot data is enriched Nd written to multiple file nd completed,we have to trigger the other job that will copy the records from files into database. How can we find with multiple files created Nd bolt writing paraalely in files which is the last record written so that we can trigger nextjob.Any ideas?
