Re: Urgent: Mitigating Slow Consumer Impact and Seeking Open-Source Solutions in Apache Kafka Consumers

Gowtham S Fri, 15 Sep 2023 22:35:00 -0700

Hi Karthik

This appears to be a common challenge related to a slow-consuming
situation. Those with relevant experience in addressing such matters should
be capable of providing assistance.


Thanks and regards,
Gowtham S


On Fri, 15 Sept 2023 at 23:06, Giannis Polyzos <ipolyzos...@gmail.com>
wrote:

> Hi Karthick,
>
> on a high level seems like a data skew issue and some partitions have way
> more data than others?
> What is the number of your devices? how many messages are you processing?
> Most of the things you share above sound like you are looking for
> suggestions around load distribution for Kafka.  i.e number of partitions,
> how to distribute your device data etc.
> It would be good to also share what your flink job is doing as I don't see
> anything mentioned around that.. are you observing back pressure in the
> Flink UI?
>
> Best
>
> On Fri, Sep 15, 2023 at 3:46 PM Karthick <ibmkarthickma...@gmail.com>
> wrote:
>
>> Dear Apache Flink Community,
>>
>>
>>
>> I am writing to urgently address a critical challenge we've encountered
>> in our IoT platform that relies on Apache Kafka and real-time data
>> processing. We believe this issue is of paramount importance and may have
>> broad implications for the community.
>>
>>
>>
>> In our IoT ecosystem, we receive data streams from numerous devices, each
>> uniquely identified. To maintain data integrity and ordering, we've
>> meticulously configured a Kafka topic with ten partitions, ensuring that
>> each device's data is directed to its respective partition based on its
>> unique identifier. This architectural choice has proven effective in
>> maintaining data order, but it has also unveiled a significant problem:
>>
>>
>>
>> *One device's data processing slowness is interfering with other devices'
>> data, causing a detrimental ripple effect throughout our system.*
>>
>> To put it simply, when a single device experiences processing delays, it
>> acts as a bottleneck within the Kafka partition, leading to delays in
>> processing data from other devices sharing the same partition. This issue
>> undermines the efficiency and scalability of our entire data processing
>> pipeline.
>>
>> Additionally, I would like to highlight that we are currently using the
>> default partitioner for choosing the partition of each device's data. If
>> there are alternative partitioning strategies that can help alleviate this
>> problem, we are eager to explore them.
>>
>> We are in dire need of a high-scalability solution that not only ensures
>> each device's data processing is independent but also prevents any
>> interference or collisions between devices' data streams. Our primary
>> objectives are:
>>
>> 1. *Isolation and Independence:* We require a strategy that guarantees
>> one device's processing speed does not affect other devices in the same
>> Kafka partition. In other words, we need a solution that ensures the
>> independent processing of each device's data.
>>
>>
>> 2. *Open-Source Implementation:* We are actively seeking pointers to
>> open-source implementations or references to working solutions that address
>> this specific challenge within the Apache ecosystem or any existing
>> projects, libraries, or community-contributed solutions that align with our
>> requirements would be immensely valuable.
>>
>> We recognize that many Apache Flink users face similar issues and may
>> have already found innovative ways to tackle them. We implore you to share
>> your knowledge and experiences on this matter. Specifically, we are
>> interested in:
>>
>> *- Strategies or architectural patterns that ensure independent
>> processing of device data.*
>>
>> *- Insights into load balancing, scalability, and efficient data
>> processing across Kafka partitions.*
>>
>> *- Any existing open-source projects or implementations that address
>> similar challenges.*
>>
>>
>>
>> We are confident that your contributions will not only help us resolve
>> this critical issue but also assist the broader Apache Flink community
>> facing similar obstacles.
>>
>>
>>
>> Please respond to this thread with your expertise, solutions, or any
>> relevant resources. Your support will be invaluable to our team and the
>> entire Apache Flink community.
>>
>> Thank you for your prompt attention to this matter.
>>
>>
>> Thanks & Regards
>>
>> Karthick.
>>
>

Re: Urgent: Mitigating Slow Consumer Impact and Seeking Open-Source Solutions in Apache Kafka Consumers

Reply via email to