For starters, can you elaborate on this a bit? "After a few months running every 15min on the servers, it just stopped working, like this" You had an automated 15-min test that was running fine for months and just stopped working...? Am I interpreting that correctly, and any upgrades in that time?
Also, what's your Kafka health look like? You should be able to get a glimpse via Ambari where you can at least see that the brokers are up and running. I'd also check the Kafka logs to make sure you're not running OOM or getting any sort of exceptions. If you want some additional GUI details, there's a free tool from Yahoo as well - https://github.com/yahoo/kafka-manager Best, Mike Miklavcic On Mon, Apr 8, 2019 at 10:39 AM Thiago Rahal Disposti < [email protected]> wrote: > Hello all, How's it going? > > We've been seeing an issue with load_tool.sh (which we use to collect our > topics EPS): > > After a few months running every 15min on the servers, it just > stopped working, like this: > > [image: image.png] > > It does not write anything else after those messages > > After a little bit of digging, we tried to check the kafka consumer group > metron.load.group with: > /usr/hdp/2.6.5.0-292/kafka/bin/kafka-consumer-groups.sh --bootstrap-server > $BROKERLIST --describe --group metron.load.group > > but it times out every time. > > Have you guys ever seen something like this? > > Thanks! > <http://www.kryptus.com.br> > *Thiago Rahal * > Cybersecurity > > +55 (19) 3112-5000 > [email protected] > > www.kryptus.com <http://www.kryptus.com.br> > > <http://www.kryptus.com.br> > >
