Re: Usual remedy for "Under Replicated" and "Offline Partitions"

2018-02-02 Thread Richard Rodseth
Thanks Jeff!
On Fri, Feb 2, 2018 at 11:58 AM, Jeff Widman wrote:
> This means either the brokers are not healthy (bad hardware) or that the
> replication fetchers can't keep up with the rate of incoming messages.
>
> If the latter, you need to figure out where the latency bottleneck is and
> wha

Re: Usual remedy for "Under Replicated" and "Offline Partitions"

2018-02-02 Thread Jeff Widman
This means either the brokers are not healthy (bad hardware) or that the replication fetchers can't keep up with the rate of incoming messages.

If the latter, you need to figure out where the latency bottleneck is and what your latency SLAs are.

Common sources of latency bottlenecks:
- network h

Usual remedy for "Under Replicated" and "Offline Partitions"

2018-02-02 Thread Richard Rodseth
We have a DataDog integration showing some metrics, and for one of our clusters the above two values are > 0 and highlighted in red. What's the usual remedy (Confluent Platform, OSS version)? Thanks
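
For readers unfamiliar with the two metrics in the subject line, here is a minimal sketch of what each one counts. It assumes the standard Kafka definitions: a partition is under-replicated when its in-sync replica set (ISR) is smaller than its assigned replica set, and offline when it has no active leader. The `PartitionState` class, its field names, and the sample data are hypothetical and for illustration only; this is not a Kafka or DataDog API.

```python
from dataclasses import dataclass
from typing import Dict, List, Optional


@dataclass
class PartitionState:
    """Hypothetical snapshot of one partition's replication state."""
    topic: str
    partition: int
    leader: Optional[int]   # broker id of the leader, None if there is no active leader
    replicas: List[int]     # broker ids assigned to hold this partition
    isr: List[int]          # broker ids currently in sync with the leader


def is_under_replicated(p: PartitionState) -> bool:
    # Fewer in-sync replicas than assigned replicas.
    return len(p.isr) < len(p.replicas)


def is_offline(p: PartitionState) -> bool:
    # No active leader, so the partition cannot serve reads or writes.
    return p.leader is None


def summarize(partitions: List[PartitionState]) -> Dict[str, int]:
    """Count partitions in each unhealthy state."""
    return {
        "under_replicated": sum(is_under_replicated(p) for p in partitions),
        "offline": sum(is_offline(p) for p in partitions),
    }


# Made-up sample data: one healthy, one lagging, one leaderless partition.
sample = [
    PartitionState("events", 0, leader=1, replicas=[1, 2, 3], isr=[1, 2, 3]),
    PartitionState("events", 1, leader=2, replicas=[2, 3, 1], isr=[2]),
    PartitionState("events", 2, leader=None, replicas=[3, 1, 2], isr=[]),
]
summary = summarize(sample)
```

In this sample the leaderless partition is counted in both totals, since its ISR is also smaller than its replica set. Listing which partitions fall into each state, and which brokers are missing from their ISRs, is usually the first step toward telling an unhealthy broker apart from replication that cannot keep up.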