[ 
https://issues.apache.org/jira/browse/KAFKA-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14057570#comment-14057570
 ] 

Oleg commented on KAFKA-1530:
-----------------------------

Hello Guozhang,

We tried using "controlled.shutdown.enable", but we encountered this bug 
https://issues.apache.org/jira/browse/KAFKA-1342 which resulted in node not 
loading data from the leader after restart.

Furthermore, if "controlled.shutdown.enable" worked correctly, we couldn't use 
it because we periodically encounter these bugs: 
https://issues.apache.org/jira/browse/KAFKA-1382 and 
https://issues.apache.org/jira/browse/KAFKA-1407, when a node goes "crazy" and 
stops working properly. In this case it will take it forever to shutdown with 
"controlled.shutdown.enable". Which results in us being unable to determine if 
node is still trying to catch up with the leader (which it does if 
"controlled.shutdown.enable" is on) or it encountered a bug and has just hung. 
These situations happen quite often in our cluster and they render the use of 
"controlled.shutdown.enable" not possible.

So now we just restart Kafkas (sometimes with "kill -9") with the possible 
outcome of loosing some part of data.

Another situation, which in fact is not very regular, but happened, which is 
all Kafka cluster going down due to power shortage in the datacenter. In this 
case some of the Kafkas could have been behind the leader. And we should begin 
starting them in the proper order (from the leaders) for the data not to be 
truncated.

So, we are seeking the way to restart/update Kafka without loosing data.

Best regards,
Oleg

> howto update continuously
> -------------------------
>
>                 Key: KAFKA-1530
>                 URL: https://issues.apache.org/jira/browse/KAFKA-1530
>             Project: Kafka
>          Issue Type: Wish
>            Reporter: Stanislav Gilmulin
>            Priority: Minor
>              Labels: operating_manual, performance
>
> Hi,
>  
> Could I ask you a question about the Kafka update procedure?
> Is there a way to update software, which doesn't require service interruption 
> or lead to data losses?
> We can't stop message brokering during the update as we have a strict SLA.
>  
> Best regards
> Stanislav Gilmulin



--
This message was sent by Atlassian JIRA
(v6.2#6252)

Reply via email to