I am struggling with some core design concepts and I was hoping someone could explain how they use Kafka in their production environment for event processing. For example, I've read that LinkedIn collects and aggregates over 60 metrics, e.g. page views, clicks, etc. I clearly grasp the concept of logging a page view event to Kafka, but I'm missing the last part: how does one go about aggregating this data and using it in any way other than as a simple data sink?
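To make that concrete, the most naive thing I can picture is a consumer that keeps running counts in memory. Here is a rough sketch using a Java consumer client; the broker address, group id, and the assumption that the message key is the page URL are all placeholders of mine, not anything I've seen documented:

import java.time.Duration;
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.Properties;
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;

public class PageViewAggregator {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092");  // placeholder broker
        props.put("group.id", "page-view-aggregator");     // hypothetical group
        props.put("key.deserializer",
                  "org.apache.kafka.common.serialization.StringDeserializer");
        props.put("value.deserializer",
                  "org.apache.kafka.common.serialization.StringDeserializer");

        // Running count of views per page URL, kept in memory for illustration.
        Map<String, Long> viewsPerPage = new HashMap<>();

        try (KafkaConsumer<String, String> consumer = new KafkaConsumer<>(props)) {
            consumer.subscribe(Collections.singletonList("page_view"));
            while (true) {
                ConsumerRecords<String, String> records =
                        consumer.poll(Duration.ofMillis(500));
                for (ConsumerRecord<String, String> record : records) {
                    // Assuming the message key is the page URL.
                    viewsPerPage.merge(record.key(), 1L, Long::sum);
                }
            }
        }
    }
}

But surely in-memory counts aren't how people do this at scale, which is the part I'm missing.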

Taking the "page_view" example further: what is the preferred way of logging and consuming this event? Would you have a consumer that just consumes page views? If so, how do you go about making sure you don't reconsume the same message in the event of a consumer restart? Also, for analytical/reporting needs, how do you deal with timeframes? Say my consumer is subscribed to the "page_view" topic and I want all messages from 8am-9am. Would I read all messages and filter out any that don't have a timestamp in that range, or would I create a separate topic for each hour, e.g. "page_view/08:00"? The same question applies to importing all of yesterday's page views into Hadoop.
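To make the timeframe question concrete, here is the kind of thing I'm picturing for the 8am-9am case, assuming the client can seek to an offset by timestamp (along the lines of the Java client's offsetsForTimes). The topic, partition, dates, and config are all placeholders of mine, so treat this as a sketch rather than a recommendation:

import java.time.Duration;
import java.time.Instant;
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.Properties;
import org.apache.kafka.clients.consumer.ConsumerRecord;
import org.apache.kafka.clients.consumer.ConsumerRecords;
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.clients.consumer.OffsetAndTimestamp;
import org.apache.kafka.common.TopicPartition;

public class HourlyPageViews {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092");  // placeholder broker
        props.put("group.id", "hourly-report");            // hypothetical group
        props.put("key.deserializer",
                  "org.apache.kafka.common.serialization.StringDeserializer");
        props.put("value.deserializer",
                  "org.apache.kafka.common.serialization.StringDeserializer");

        // Placeholder window: 8am-9am on an arbitrary day, as epoch millis.
        long windowStart = Instant.parse("2024-01-01T08:00:00Z").toEpochMilli();
        long windowEnd   = Instant.parse("2024-01-01T09:00:00Z").toEpochMilli();

        try (KafkaConsumer<String, String> consumer = new KafkaConsumer<>(props)) {
            TopicPartition partition = new TopicPartition("page_view", 0);
            consumer.assign(Collections.singletonList(partition));

            // Ask the broker for the first offset at or after the window start,
            // so we don't have to scan the whole topic from the beginning.
            Map<TopicPartition, Long> search = new HashMap<>();
            search.put(partition, windowStart);
            OffsetAndTimestamp start = consumer.offsetsForTimes(search).get(partition);
            if (start == null) return;  // no messages at or after windowStart
            consumer.seek(partition, start.offset());

            boolean done = false;
            while (!done) {
                ConsumerRecords<String, String> records =
                        consumer.poll(Duration.ofMillis(500));
                for (ConsumerRecord<String, String> record : records) {
                    if (record.timestamp() >= windowEnd) { done = true; break; }
                    System.out.println(record.value());  // page view inside the window
                }
            }
        }
    }
}

Is something like this reasonable, or do people really partition by time at the topic level instead?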

I know Kafka is a new project and I'm sure everyone's time is constrained, but I think it would be helpful if some high-level examples/use cases and best practices were added to the wiki. This could help gain adoption and hopefully bring in more willing contributors :)

Thanks for your help


