Re: storm-rdbms consume data from kafka spout fast enough?

Tero Paananen Tue, 09 Dec 2014 13:55:37 -0800

> I have a question here, as I post several threads before, I am using
> storm-rdbms to write into postgresqlDB, data was collected from kafkaSpout,
> it works. Since it insert into DB once I get a tuple, per row/insert
> operation. I have concern that if this type of consuming is fast enough and
> will potentially cost the overhead?


That obviously depends on how many events do you expect to process per
second, and how you set your parallelism.

In general, you're probably using Kafka and Storm, because you DO
expect to process quite a few events per second, so you'd usually want
to buffer/batch those inserts, if you can. There are some good
examples of this in the open source Storm code Hortonworks has
published.

-TPP

Re: storm-rdbms consume data from kafka spout fast enough?

Reply via email to