Re: Load Balancing Kafka

2015-07-16 Thread Dana Powers
I think the answer here is that the Kafka protocol includes a broker metadata api. The client uses the broker host(s) you provide to discover the full list of brokers in the cluster (and the topics+partitions each manages/leads). The java client has a similar interface via metadata.brokers.list / b

Re: Load Balancing Kafka

2015-07-15 Thread Jiangjie Qin
AhÅ  It seems you are more focusing on producer side workload balanceÅ  If that is the case, please ignore my previous comments. Jiangjie (Becket) Qin On 7/15/15, 6:01 PM, "Jiangjie Qin" wrote: >If you have pretty balanced traffic on each partition and have set >auto.leader.rebalance.enabled to t

Re: Load Balancing Kafka

2015-07-15 Thread Jiangjie Qin
If you have pretty balanced traffic on each partition and have set auto.leader.rebalance.enabled to true or false, you might not need to do further workload balance. However, in most cases you probably still need to do some sort of load balancing based on the traffic and disk utilization of each b

Re: Load Balancing Kafka

2015-07-15 Thread Terry Bates
Greetings Sandy, Folks smarter than me can correct me if I am wrong. Using Python client you don't have to connect to Zookeeper, so just specifying one of the brokers should be sufficient. In terms of what happens to your messages as your client produces them, they should be randomly assigned to a

Load Balancing Kafka

2015-07-15 Thread Sandy Waters
Hi all, Do I need to load balance against the brokers? I am using the python driver and it seems to only want a single kafka broker host. However, in a situation where I have 10 brokers, is it still fine to just give it one host. Does zookeeper and kafka handle the load balancing and redirect m