Hi Chris,

Nice presentation -- 2 questions:
1. I had wondered about the references to Kafka broker colocation I'd seen around the place. So, for example, in the 18-node cluster you mention, you'd have 18 Kafka brokers running there, 1 per host? Do you actually get any data locality benefit from this? Is there a way to ensure that the Samza container on host x is processing the partitions of each topic held on the colocated Kafka broker? Or am I missing the intent?

2. I was interested in your mention of using something like Samza for processing monitoring and metric-type data; it's something we've been talking about internally. Has anything been published on what you are doing in that space?

Thanks!
Garry

-----Original Message-----
From: Chris Riccomini [mailto:[email protected]]
Sent: 17 October 2013 21:54
To: [email protected]
Subject: Re: Special Bay Area HUG: Tajo and Samza

Hey Guys,

On a related note, my talk from the YARN meet up at LinkedIn is now online:

https://www.youtube.com/watch?v=7YBmUKjzg7c

If you're not too familiar with Samza, this is a great place to start. Also, feedback welcome on presentation content, style, etc.

Cheers,
Chris

On 10/17/13 11:08 AM, "Jakob Homan" <[email protected]> wrote:

>Hey everybody-
> Join us at LinkedIn Nov. 5 for a special HUG dedicated to two new
>awesome Incubator projects, Tajo, a low-latency SQL query engine atop
>YARN and Samza.
>
>http://www.meetup.com/hadoop/events/146077932/
>
>-Jakob
