[ https://issues.apache.org/jira/browse/GEARPUMP-40?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Manu Zhang updated GEARPUMP-40: ------------------------------- Description: As per [https://engineering.linkedin.com/kafka/benchmarking-apache-kafka-2-million-writes-second-three-cheap-machines], a single thread Kafka consumer could consume at 89.7MB/s from 6x partition 3x replica topic whose data are evenly distributed on a 3-node GbE cluster. I carried out a similar experiment with KafkaSource and found that the throughput is only at 10MB/s. was: As per [https://engineering.linkedin.com/kafka/benchmarking-apache-kafka-2-million-writes-second-three-cheap-machines], a single thread Kafka consumer could consume at 89.7MB/s from 6x partition 3x replica topic. I carried out a similar experiment with KafkaSource and found that the throughput is only at 10MB/s. > KafkaSource poor performance > ---------------------------- > > Key: GEARPUMP-40 > URL: https://issues.apache.org/jira/browse/GEARPUMP-40 > Project: Apache Gearpump > Issue Type: Improvement > Components: kafka > Affects Versions: 0.8.0 > Reporter: Manu Zhang > Assignee: Manu Zhang > > As per > [https://engineering.linkedin.com/kafka/benchmarking-apache-kafka-2-million-writes-second-three-cheap-machines], > a single thread Kafka consumer could consume at 89.7MB/s from 6x partition > 3x replica topic whose data are evenly distributed on a 3-node GbE cluster. I > carried out a similar experiment with KafkaSource and found that the > throughput is only at 10MB/s. -- This message was sent by Atlassian JIRA (v6.3.4#6332)