spyzzz commented on issue #2175:
URL: https://github.com/apache/hudi/issues/2175#issuecomment-709293656


   It seems that pulling from Kafka is very slow ...
   
   ```
   20/10/15 12:33:48 INFO memory.MemoryStore: Block broadcast_17 stored as values in memory (estimated size 29.4 KB, free 3.0 GB)
   20/10/15 12:33:49 INFO consumer.KafkaConsumer: [Consumer clientId=consumer-spark-kafka-source-6d40b7a1-e6f9-4f86-95c3-3f875be5984f-1049685671-executor-1, groupId=spark-kafka-source-6d40b7a1-e6f9-4f86-95c3-3f875be5984f-1049685671-executor] Seeking to offset 16750 for partition xxx
   20/10/15 12:33:53 INFO consumer.KafkaConsumer: [Consumer clientId=consumer-spark-kafka-source-6d40b7a1-e6f9-4f86-95c3-3f875be5984f-1049685671-executor-1, groupId=spark-kafka-source-6d40b7a1-e6f9-4f86-95c3-3f875be5984f-1049685671-executor] Seeking to offset 17250 for partition xxx
   20/10/15 12:33:57 INFO consumer.KafkaConsumer: [Consumer clientId=consumer-spark-kafka-source-6d40b7a1-e6f9-4f86-95c3-3f875be5984f-1049685671-executor-1, groupId=spark-kafka-source-6d40b7a1-e6f9-4f86-95c3-3f875be5984f-1049685671-executor] Seeking to offset 17750 for partition xxx
   20/10/15 12:34:01 INFO consumer.KafkaConsumer: [Consumer clientId=consumer-spark-kafka-source-6d40b7a1-e6f9-4f86-95c3-3f875be5984f-1049685671-executor-1, groupId=spark-kafka-source-6d40b7a1-e6f9-4f86-95c3-3f875be5984f-1049685671-executor] Seeking to offset 18250 for partition xxx
   20/10/15 12:34:06 INFO consumer.KafkaConsumer: [Consumer clientId=consumer-spark-kafka-source-6d40b7a1-e6f9-4f86-95c3-3f875be5984f-1049685671-executor-1, groupId=spark-kafka-source-6d40b7a1-e6f9-4f86-95c3-3f875be5984f-1049685671-executor] Seeking to offset 18750 for partition xxx
   20/10/15 12:34:10 INFO consumer.KafkaConsumer: [Consumer clientId=consumer-spark-kafka-source-6d40b7a1-e6f9-4f86-95c3-3f875be5984f-1049685671-executor-1, groupId=spark-kafka-source-6d40b7a1-e6f9-4f86-95c3-3f875be5984f-1049685671-executor] Seeking to offset 19250 for partition xxx
   ```
   
   Only 500 messages every 5 to 10 seconds :s
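
To put a number on the rate, here is a rough sketch that works it out from the six "Seeking to offset" lines in the excerpt. The timestamps and offsets are copied from the log. The helper names (`seek_intervals`, `overall_rate`) are made up for illustration. It assumes each offset corresponds to one message, which may not hold for a compacted topic, and the log timestamps only have one-second resolution, so treat the result as approximate.

```python
from datetime import datetime

# (timestamp, offset) pairs copied from the "Seeking to offset" log lines.
SEEKS = [
    ("20/10/15 12:33:49", 16750),
    ("20/10/15 12:33:53", 17250),
    ("20/10/15 12:33:57", 17750),
    ("20/10/15 12:34:01", 18250),
    ("20/10/15 12:34:06", 18750),
    ("20/10/15 12:34:10", 19250),
]


def seek_intervals(seeks):
    """Return (seconds_elapsed, offsets_advanced) for each pair of consecutive seeks."""
    # The Spark log timestamp format is yy/MM/dd HH:mm:ss.
    parsed = [(datetime.strptime(ts, "%y/%m/%d %H:%M:%S"), off) for ts, off in seeks]
    return [
        (int((t1 - t0).total_seconds()), o1 - o0)
        for (t0, o0), (t1, o1) in zip(parsed, parsed[1:])
    ]


def overall_rate(seeks):
    """Average offsets per second between the first and last seek."""
    intervals = seek_intervals(seeks)
    total_seconds = sum(seconds for seconds, _ in intervals)
    total_offsets = sum(offsets for _, offsets in intervals)
    return total_offsets / total_seconds


if __name__ == "__main__":
    for seconds, offsets in seek_intervals(SEEKS):
        print(f"{offsets} offsets in {seconds}s")
    print(f"overall: {overall_rate(SEEKS):.0f} offsets/s")
```

For this excerpt each step advances 500 offsets in roughly 4 to 5 seconds, so a single executor is reading on the order of a hundred messages per second.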


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

