Felix, That's a pretty accurate explanation. Thanks,
Jun On Wed, Jan 11, 2012 at 3:16 PM, Felix GV <fe...@mate1inc.com> wrote: > As I understand it, you cannot use a mirrored Kafka cluster as a hot > fail-over. > > You could probably use it as a manual fail-over, but I don't know the > complexity involved in doing that. > > Also, if your source cluster fails while producers were putting data into > it, there will be an "unconsumed window" of data that is lost. This > corresponds to the data that the embedded consumer in the mirrored cluster > did not have time to consume from the source cluster. > > All in all, the mirrored cluster is akin to asynchronous replication, > without any hot fail-over capability. Thus, it provides data redundancy > (outside of the unconsumed window described above) but no extra > availability (unless you count manual interventions). > > KAFKA-50 <https://issues.apache.org/jira/browse/KAFKA-50>, on the other > hand, will provide both asynchronous AND synchronous replication (although > the latter will incur a latency penalty) and will be able to use the > replicas (data redundancy) as hot-fail overs. > > Depending on your personal definition of "highly reliable" (whether it > includes data redundancy and/or availability), I think that should probably > answer your question...? > > To all the Kafka experts: please correct me if the above explanations are > incorrect :) ! > > -- > Felix > > > > On Wed, Jan 11, 2012 at 5:53 PM, Jun Rao <jun...@gmail.com> wrote: > > > It's just that the mirroring logic depends on ZK to be available most of > > the time. > > > > Jun > > > > On Wed, Jan 11, 2012 at 2:35 PM, Christian Carollo <ccaro...@gmail.com > > >wrote: > > > > > I see. But if I used that configuration and then did the mirroring you > > > suggested would that be enough, in your opinion, to be considered > highly > > > reliable? > > > > > > Christian > > > > > > > > > On Jan 11, 2012, at 2:32 PM, Jun Rao wrote: > > > > > > >> For example, can I have one ZK instance and one broker on one > machine > > > and > > > > that is enough to define a ZK cluster and a Kafka Cluster? > > > > > > > > Yes, although you don't get the reliability of ZK now. > > > > > > > > Jun > > > > > > > > > > > > On Wed, Jan 11, 2012 at 2:06 PM, Christian Carollo < > ccaro...@gmail.com > > > >wrote: > > > > > > > >> Jun, > > > >> > > > >> I don't think I ask my question the right way. > > > >> > > > >> What I am trying to understand is what are the minimum constituent > > parts > > > >> of a kafka cluster? > > > >> > > > >> Based on your last email, I am now wondering what are the minimum > > > >> constituent parts of a ZK cluster as well as a Kafka cluster? > > > >> > > > >> For example, can I have one ZK instance and one broker on one > machine > > > and > > > >> that is enough to define a ZK cluster and a Kafka Cluster? > > > >> > > > >> Thanks, > > > >> Christian > > > >> > > > >> > > > >> On Jan 11, 2012, at 1:50 PM, Jun Rao <jun...@gmail.com> wrote: > > > >> > > > >>> Chrsitan, > > > >>> > > > >>> A Kafka cluster containers a ZK cluster and a list of brokers. > When a > > > >>> consumer subscribes to a topic in a kafka cluster, it consumes data > > > >> stored > > > >>> in all brokers in that cluster. > > > >>> > > > >>> Thanks, > > > >>> > > > >>> Jun > > > >>> > > > >>> On Tue, Jan 10, 2012 at 11:28 PM, Christian Carollo < > > > ccaro...@gmail.com > > > >>> wrote: > > > >>> > > > >>>> Thank you Jun that is quite helpful. I have a question about > Kafka > > > >>>> Clusters. What are the minimum number and types of services that > > must > > > >> be > > > >>>> running to make up a Kafka Cluster? > > > >>>> > > > >>>> I ask this because the diagrams (in the Kafka Mirroring document) > > > allude > > > >>>> to a multiple broker environment, however, since each broker does > > not > > > >>>> appear to provide redundancy (as of today) to any of the other > > brokers > > > >> in a > > > >>>> given zookeeper service, it seems like a Kafka Cluster is nothing > > more > > > >> than > > > >>>> a grouping of a single zookeeper instance with a single Kafka > > broker, > > > is > > > >>>> this the correct understanding? > > > >>>> > > > >>>> Thanks, > > > >>>> Christian > > > >>>> > > > >>>> On Jan 10, 2012, at 8:47 AM, Jun Rao wrote: > > > >>>> > > > >>>>> With 0.7, you can set up inter-cluster replication ( > > > >>>>> > https://cwiki.apache.org/confluence/display/KAFKA/Kafka+mirroring > > ). > > > >>>>> > > > >>>>> For the future 0.8 release, we are working on intra-cluster > > > replication > > > >>>>> support and details can be found at > > > >>>>> https://issues.apache.org/jira/browse/KAFKA-50 > > > >>>>> > > > >>>>> Thanks, > > > >>>>> > > > >>>>> Jun > > > >>>>> > > > >>>>> On Mon, Jan 9, 2012 at 9:52 PM, Christian Carollo < > > > ccaro...@gmail.com > > > >>>>> wrote: > > > >>>>> > > > >>>>>> I am looking to implement Kafka in a production environment, > > > however, > > > >> I > > > >>>>>> haven't found in documentation or examples that > > > >>>>>> discuss how to build a redundant implementation. Is there any > > > >>>>>> documentation out their (blogs, articles, etc.) that describes > > > >>>>>> how we can implement such a system with Kafka 0.6 or 0.7. > > > >>>>>> > > > >>>>>> Also, is there a timeframe the community is shooting for, to > > release > > > >>>> 0.8 w/ > > > >>>>>> replication? > > > >>>>>> > > > >>>>>> Thanks > > > >>>>>> Christian > > > >>>>>> > > > >>>> > > > >>>> > > > >> > > > > > > > > >