> I have a question here, as I post several threads before, I am using storm-rdbms to write into postgresqlDB, data was collected from kafkaSpout, it works. Since it insert into DB once I get a tuple, per row/insert operation. I have concern that if this type of consuming is fast enough and will potentially cost the overhead?
That obviously depends on how many events you expect to process per second and how you set your parallelism. A single-row insert per tuple pays a network round trip to the database (and, with autocommit on, a commit) for every event. In general, you're probably using Kafka and Storm because you DO expect to process quite a few events per second, so you'd usually want to buffer/batch those inserts if you can. There are some good examples of this in the open source Storm code Hortonworks has published. -TPP
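To make the buffer/batch idea concrete, here is a minimal sketch of a size-and-age-bounded buffer. It is not taken from storm-rdbms or from the Hortonworks examples; `BatchBuffer`, `maxSize`, `maxAgeMillis` and `sink` are hypothetical names made up for illustration. The assumption is that a bolt would call `add(...)` from `execute()`, call `flushIfStale(...)` on a periodic signal such as a tick tuple, and that `sink` would wrap the real write (for example JDBC's `PreparedStatement.addBatch()` followed by `executeBatch()`), acking the buffered tuples only after that write succeeds.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

// Hypothetical helper for illustration; not part of Storm or storm-rdbms.
final class BatchBuffer<T> {
    private final int maxSize;              // flush once this many rows are pending
    private final long maxAgeMillis;        // or once the oldest pending row is this old
    private final Consumer<List<T>> sink;   // performs the actual batched write
    private final List<T> pending = new ArrayList<>();
    private long oldestMillis;              // arrival time of the oldest pending row

    BatchBuffer(int maxSize, long maxAgeMillis, Consumer<List<T>> sink) {
        if (maxSize < 1) {
            throw new IllegalArgumentException("maxSize must be >= 1");
        }
        this.maxSize = maxSize;
        this.maxAgeMillis = maxAgeMillis;
        this.sink = sink;
    }

    // Buffer one row; flushes immediately when the size limit is reached.
    void add(T row, long nowMillis) {
        if (pending.isEmpty()) {
            oldestMillis = nowMillis;
        }
        pending.add(row);
        if (pending.size() >= maxSize) {
            flush();
        }
    }

    // Call periodically so a slow stream does not leave rows unwritten.
    void flushIfStale(long nowMillis) {
        if (!pending.isEmpty() && nowMillis - oldestMillis >= maxAgeMillis) {
            flush();
        }
    }

    // Hand a copy of the pending rows to the sink. If the sink throws,
    // the rows stay buffered so the caller can retry or fail the tuples.
    void flush() {
        if (pending.isEmpty()) {
            return;
        }
        sink.accept(new ArrayList<>(pending));
        pending.clear();
    }

    int size() {
        return pending.size();
    }
}
```

With something like `new BatchBuffer<>(500, 1000L, rows -> writeBatch(rows))`, where `writeBatch` is your own (hypothetical) method, the database sees one round trip per 500 rows or per second, whichever comes first. The right numbers depend on your event rate and on how much latency you can tolerate before a row shows up in the table.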
