[DISCUSS] Release Plan for 1.1.1

2018-06-11 Thread Dong Lin
Hi all, I would like to start the process for doing a 1.1.1 bug fix release. 1.1.0 was released Mar 28, 2018, and about 2.5 months have passed and 25 bug fixes have accumulated so far. A few of the more important fixes that have been merged in 1.1 branch so far: KAFKA-6925

[jira] [Resolved] (KAFKA-6946) Keep the session id for incremental fetch when fetch responses are throttled

2018-06-11 Thread Dong Lin (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-6946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dong Lin resolved KAFKA-6946. - Resolution: Fixed > Keep the session id for incremental fetch when fetch responses are thrott

[jira] [Created] (KAFKA-7019) Reduction the contention between metadata update and metadata read operation

2018-06-07 Thread Dong Lin (JIRA)
Dong Lin created KAFKA-7019: --- Summary: Reduction the contention between metadata update and metadata read operation Key: KAFKA-7019 URL: https://issues.apache.org/jira/browse/KAFKA-7019 Project: Kafka

Re: [VOTE] KIP-232: Detect outdated metadata using leaderEpoch and partitionEpoch

2018-06-04 Thread Dong Lin
, Feb 7, 2018 at 11:42 PM, Dong Lin wrote: > Hey Jun, > > Sure, I will come up with a KIP this week. I think there is a way to allow > partition expansion to arbitrary number without introducing new concepts > such as read-only partition or repartition epoch. > > Thanks, >

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-06-04 Thread Dong Lin
ook a look at KIP-287. Producers writing using the new scheme > prior to processor catch up and cut-over makes sense. Thanks. > > On Sat, Apr 14, 2018 at 7:09 PM, Dong Lin wrote: > > > Hey Jeff, > > > > Thanks for the review. The scheme for expanding processors of the

Re: [VOTE] KIP-266: Add TimeoutException for KafkaConsumer#position

2018-05-31 Thread Dong Lin
Thanks for the KIP! I am in favor of the option 1. +1 as well. On Thu, May 31, 2018 at 6:00 PM, Jason Gustafson wrote: > Thanks everyone for the feedback. I've updated the KIP and added > KAFKA-6979. > > -Jason > > On Wed, May 30, 2018 at 3:50 PM, Guozhang Wang wrote: > > > Thanks Jason. I'm i

Re: [DISCUSS] KIP-293 Add new metrics for consumer/replication fetch requests

2018-05-18 Thread Dong Lin
t; bugs ( https://issues.apache.org/jira/browse/KAFKA-6438 ). > > [fwiw, b) also can be solved by having topic TTL (in a fashion similar > to e.g. RabbitMQ) - I will be submitting a relevant KIP soon] > > Yours sincerely, > Adam Kotwasinski > > 2018-05-18 19:49 GMT+01:00 D

Re: [DISCUSS] KIP-293 Add new metrics for consumer/replication fetch requests

2018-05-18 Thread Dong Lin
sh > --bootstrap-server localhost:9092 --topic idontexist` > > Best regards, > Adam > > 2018-05-18 18:49 GMT+01:00 Dong Lin : > > Hey Adam, > > > > Thanks for the KIP. We currently already have the per-topic byte-out-rate > > (not including replication tra

Re: [DISCUSS] KIP-293 Add new metrics for consumer/replication fetch requests

2018-05-18 Thread Dong Lin
Hey Adam, Thanks for the KIP. We currently already have the per-topic byte-out-rate (not including replication traffic) with MBean path kafka.server:name=BytesOutPerSec,type=BrokerTopicMetrics,topic=*. Though this is not the FetchRequest rate, it seems to address the motivation of the KIP by telli

[jira] [Resolved] (KAFKA-3473) KIP-237: More Controller Health Metrics

2018-05-14 Thread Dong Lin (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-3473?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dong Lin resolved KAFKA-3473. - Resolution: Fixed > KIP-237: More Controller Health Metr

Re: [VOTE] KIP-219 - Improve Quota Communication

2018-04-25 Thread Dong Lin
+Jon On Mon, 22 Jan 2018 at 10:38 AM Becket Qin wrote: > Thanks for the discussion and voting. > > KIP-219 has passed with +3 binding votes (Becket, Jun, Rajini). > > On Thu, Jan 18, 2018 at 1:32 AM, Rajini Sivaram > wrote: > > > Hi Becket, > > > > Thanks for the update. Yes, it does address my

Re: [DISCUSS] KIP-219 - Improve Quota Communication

2018-04-25 Thread Dong Lin
ers of Kafka clients > (librdkafka, kafka-python, etc.) for this KIP. > > Ismael > > On Thu, Apr 5, 2018 at 2:50 PM, Jonghyun Lee wrote: > > > Hi, > > > > I have been implementing KIP-219. I discussed the interface changes with > > Becket Qin and

[jira] [Resolved] (KAFKA-6705) producer.send() should not block due to metadata not available

2018-04-24 Thread Dong Lin (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-6705?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dong Lin resolved KAFKA-6705. - Resolution: Won't Fix > producer.send() should not block due to metadata not a

Re: [DISCUSS] KIP-286: producer.send() should not block on metadata update

2018-04-20 Thread Dong Lin
. After the first metadata most likely the producer will not have to wait for metadata to send message. On Fri, Apr 13, 2018 at 11:34 PM, Dong Lin wrote: > Hey Becket, > > Good point! Thanks for the comment. > > I have updated the KIP to move the compression to user thread in th

Re: [VOTE] KIP-279: Fix log divergence between leader and follower after fast leader fail over

2018-04-15 Thread Dong Lin
Thanks for the KIP! LGTM. +1 On Sat, Apr 14, 2018 at 5:31 PM, Ted Yu wrote: > +1 > > On Sat, Apr 14, 2018 at 3:54 PM, Anna Povzner wrote: > > > Hi All, > > > > > > I would like to start the vote on KIP-279: Fix log divergence between > > leader and follower after fast leader fail over. > > > >

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-04-14 Thread Dong Lin
ll be able to catch up in which it will start consuming using > the new scheme afterwards. > > Thanks, > Jeff Chao > > On Sat, Apr 14, 2018 at 6:44 AM, Dong Lin wrote: > > > Thanks for the notes by Jun and Ray. I have read through the notes. It > > seems that the

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-04-14 Thread Dong Lin
ption to validate the key > distribution? > - Jan concerned about how a consumer application would look with the > new "split partition" design. > - KIP introduced callback. Jan doesn't think is useful. Callback > for switching "betwe

[DISCUSS] KIP-287: Support partition and processor expansion for stateful processing jobs

2018-04-14 Thread Dong Lin
Hi all, I have created KIP-287 to provide high level design to support partition and processor expansion for stateful processing. See https://cwiki.apache.org/confluence/display/KAFKA/KIP-287%3A+Support+partition+and+processor+expansion+for+stateful+processing+jobs . Instead of planning to finish

Re: [DISCUSS] KIP-286: producer.send() should not block on metadata update

2018-04-13 Thread Dong Lin
focus on IO. With the proposed > changes, does that mean the producer sender thread will have to do all the > compression as well? Would this become a performance bottleneck? > > Thanks, > > Jiangjie (Becket) Qin > > On Thu, Apr 12, 2018 at 1:41 AM, Dong Lin wrote: > > &

Re: [DISCUSS] KIP-286: producer.send() should not block on metadata update

2018-04-12 Thread Dong Lin
gt; Thanks > > On Wed, Apr 11, 2018 at 8:50 PM, Dong Lin wrote: > > > Hi all, > > > > I have created KIP-286: producer.send() should not block on metadata > > update. See > > https://cwiki.apache.org/confluence/display/KAFKA/KIP- > > 286%3A+producer.se

[DISCUSS] KIP-286: producer.send() should not block on metadata update

2018-04-11 Thread Dong Lin
Hi all, I have created KIP-286: producer.send() should not block on metadata update. See https://cwiki.apache.org/confluence/display/KAFKA/KIP-286%3A+producer.send%28%29+should+not+block+on+metadata+update . The KIP intends to improve user-experience of producer.send() when metadata is not availa

Re: Kafka KIP meeting on Apr. 9 at 9:00am PDT

2018-04-09 Thread Dong Lin
I am really really sorry for missing the KIP meeting. I am on vacation in Shanghai. I drank too much and over slept yesterday night. Thanks much for the KIP meeting notes! I will read through that carefully. On Thu, Apr 5, 2018 at 10:31 PM, Jeff Chao wrote: > Hi Jun, same for me. jeffc...@me.co

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-04-06 Thread Dong Lin
lready produced. It handled it the first time around. Asking it to > > > > > re-produce it into a new partition layout is squarely within its > > scope > > > of > > > > > capabilities. Contrast this with the alternative, asking the > consumer > > > to > > &g

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-04-04 Thread Dong Lin
same even after it had > > grown or shrunk is important. So from my POV I have major concerns that > > this KIP is benefitial in its current state. > > > > What is it that makes everyone so addicted to the idea of linear hashing? > > not attractive at all for me. > &

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-04-03 Thread Dong Lin
think that the general approach of saying that > > "if you use a custom partitioner, you cannot do partition expansion" is > > very reasonable (but I don't think we need to go that far with the > current > > proposal). It's similar to my statement in my emai

Re: [ANNOUNCE] Apache Kafka 1.1.0 Released

2018-03-29 Thread Dong Lin
three key capabilities: > > > > > > > > > > With these APIs, Kafka can be used for two broad classes of application: > > > > ** Building real-time streaming data pipelines that reliably get data > > > > between systems or applications. >

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-03-27 Thread Dong Lin
nsumers to always consume messages with the same key from the same > > producer in the order they are produced. And we need to provide a way for > > stream use-case to be able to flush/load state when messages with the > same > > key are migrated between consumers. In additi

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-03-27 Thread Dong Lin
migrated between consumers. In addition to ensuring that this goal is correctly supported, we should do our best to keep the performance and organization overhead of this KIP as low as possible. On Tue, Mar 27, 2018 at 1:14 PM, Dong Lin wrote: > Hey John, > > Thanks much for the

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-03-27 Thread Dong Lin
making the second copy a local filesystem > operation. > > Even if you follow these two copies up with bootstrapping a new consumer, > it's still rare for this to occur, so you get to amortize these copies over > the lifetime of the topic, whereas a reshuffle just keeps making

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-03-27 Thread Dong Lin
On Tue, Mar 27, 2018 at 12:04 AM, Dong Lin wrote: > Hey Jan, > > Thanks for the enthusiasm in improving Kafka's design. Now that I have > read through your discussion with Jun, here are my thoughts: > > - The latest proposal should with log compacted topics by properly

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-03-27 Thread Dong Lin
;>>> the keys in one partition to no more than two partitions, which >>>>>>>> could >>>>>>>> help >>>>>>>> redistribute the state. >>>>>>>> >>>>>>>> You d

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-03-26 Thread Dong Lin
t;>>>>>>>>> avoid > >>>>>>>>>> copying existing data will be more efficient, but it doesn't > work > >>>>>>>>>> well > >>>>>>>>>> with > >>

[jira] [Created] (KAFKA-6705) producer.send() should be non-blocking

2018-03-22 Thread Dong Lin (JIRA)
Dong Lin created KAFKA-6705: --- Summary: producer.send() should be non-blocking Key: KAFKA-6705 URL: https://issues.apache.org/jira/browse/KAFKA-6705 Project: Kafka Issue Type: Improvement

Re: [DISCUSS] KIP-263: Allow broker to skip sanity check of inactive segments on broker startup

2018-03-21 Thread Dong Lin
at I meant by doing some > data driven analysis. Maybe a quick run with hprof would help determine the > root cause of why sanityCheck is slow? > > -Jay > > On Tue, Mar 20, 2018 at 12:13 AM Dong Lin wrote: > > > Hey Jay, > > > > Thanks for your comments! > &g

[jira] [Created] (KAFKA-6697) JBOD configured broker should not die if log directory is invalid

2018-03-20 Thread Dong Lin (JIRA)
Dong Lin created KAFKA-6697: --- Summary: JBOD configured broker should not die if log directory is invalid Key: KAFKA-6697 URL: https://issues.apache.org/jira/browse/KAFKA-6697 Project: Kafka Issue

Re: [DISCUSS] KIP-263: Allow broker to skip sanity check of inactive segments on broker startup

2018-03-20 Thread Dong Lin
t; + > len + > > " bytes which is not positive or not a multiple of 8.") > I'm pretty such file.getAbsolutePath is a system call and I assume that > happens whether or not you fail the in-memory check? > > -Jay > > > On Sun, Feb 25, 2018 at 10:27 P

[jira] [Created] (KAFKA-6640) Improve efficiency of KafkaAdminClient.describeTopics()

2018-03-12 Thread Dong Lin (JIRA)
Dong Lin created KAFKA-6640: --- Summary: Improve efficiency of KafkaAdminClient.describeTopics() Key: KAFKA-6640 URL: https://issues.apache.org/jira/browse/KAFKA-6640 Project: Kafka Issue Type

[jira] [Created] (KAFKA-6638) Controller should remove replica from ISR if the replica is removed from the replica set

2018-03-12 Thread Dong Lin (JIRA)
Dong Lin created KAFKA-6638: --- Summary: Controller should remove replica from ISR if the replica is removed from the replica set Key: KAFKA-6638 URL: https://issues.apache.org/jira/browse/KAFKA-6638 Project

[jira] [Created] (KAFKA-6636) ReplicaFetcherThread should not die if hw < 0

2018-03-12 Thread Dong Lin (JIRA)
Dong Lin created KAFKA-6636: --- Summary: ReplicaFetcherThread should not die if hw < 0 Key: KAFKA-6636 URL: https://issues.apache.org/jira/browse/KAFKA-6636 Project: Kafka Issue Type: Improvem

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-03-11 Thread Dong Lin
ge for me atm > > what > > >>>>> the impact would be, but it's something we should pay attention to. > > >>>>> > > >>>>> > > >>>>> -Matthias > > >>>>> > > >>>>> On 3/6/18 3:45 PM, Jun Rao w

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-03-08 Thread Dong Lin
If you are not willing to explain to me what I might be overlooking: that > is fine. > But I ask you to not reply to my emails then. Please understand my > frustration with this. > > Best Jan > > > > On 06.03.2018 19:38, Dong Lin wrote: > >> Hi everyone, >> >>

[jira] [Created] (KAFKA-6618) Prevent two controllers from updating znodes concurrently

2018-03-06 Thread Dong Lin (JIRA)
Dong Lin created KAFKA-6618: --- Summary: Prevent two controllers from updating znodes concurrently Key: KAFKA-6618 URL: https://issues.apache.org/jira/browse/KAFKA-6618 Project: Kafka Issue Type

[jira] [Resolved] (KAFKA-5626) Producer should be able to negotiate ProduceRequest version with broker

2018-03-06 Thread Dong Lin (JIRA)
[ https://issues.apache.org/jira/browse/KAFKA-5626?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dong Lin resolved KAFKA-5626. - Resolution: Duplicate Fixed in https://issues.apache.org/jira/browse/KAFKA-5960 > Producer should

[jira] [Created] (KAFKA-6617) Improve controller performance by batching reassignment znode write operation

2018-03-06 Thread Dong Lin (JIRA)
Dong Lin created KAFKA-6617: --- Summary: Improve controller performance by batching reassignment znode write operation Key: KAFKA-6617 URL: https://issues.apache.org/jira/browse/KAFKA-6617 Project: Kafka

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-03-06 Thread Dong Lin
lly embrace kafka as a "data plattform". The > > point I also want to make is that copying data this way is completely > > inline with the kafka architecture. it only consists of reading and > writing > > to topics. > > > > I hope it clarifies more why I th

Re: [DISCUSS] KIP-263: Allow broker to skip sanity check of inactive segments on broker startup

2018-03-03 Thread Dong Lin
of files (especially closed, immutable ones) that can be > > saved on disk and checked against at startup ? Wouldn't that help speed > up > > boot time, for all segments ? > > > > On 26 Feb. 2018 5:28 pm, "Dong Lin" wrote: > > > > > H

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-03-03 Thread Dong Lin
e userland > with a custom partitioner and a small feedbackloop from ProduceResponse => > Partitionier in coorporation with a change management system. > > Best Jan > > > > > > > > > On 28.02.2018 07:13, Dong Lin wrote: > >> Hey Jan, >> >> I

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-03-03 Thread Dong Lin
t a time to > > avoid needing to reshuffle much data. This approach has the benefit that > at > > any time you have a fixed number of partitions and all data is fully > > partitioned with whatever the partition count you choose is but also has > > the benefit that you c

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-03-01 Thread Dong Lin
d for other purposes, I have instead created ConsumerGroupPositionRequest and ConsumerGroupPositionResponse as suggested. > > Thanks, > Jason > > > On Tue, Feb 27, 2018 at 11:49 PM, Stephane Maarek < > steph...@simplemachines.com.au> wrote: > > > Sounds awes

[jira] [Created] (KAFKA-6604) ReplicaManager should not remove partitions on the log dirctory from high watermark checkpoint file

2018-03-01 Thread Dong Lin (JIRA)
Dong Lin created KAFKA-6604: --- Summary: ReplicaManager should not remove partitions on the log dirctory from high watermark checkpoint file Key: KAFKA-6604 URL: https://issues.apache.org/jira/browse/KAFKA-6604

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-02-27 Thread Dong Lin
stom partitioner that > handles the transition as needed. I would appreciate if someone could point > out what a custom partitioner couldn't handle in this case? > > With the above approach, shrinking a topic becomes the same steps. Without > loosing keys in the discontinued partitions

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-02-27 Thread Dong Lin
Hey Allen, Thanks for the comments. On Mon, Feb 26, 2018 at 9:27 PM, Allen Wang wrote: > Hi Dong, > > Please see my comments inline. > > Thanks, > Allen > > On Sun, Feb 25, 2018 at 3:33 PM, Dong Lin wrote: > > > Hey Allen, > > > > Thanks for your c

[DISCUSS] KIP-263: Allow broker to skip sanity check of inactive segments on broker startup

2018-02-25 Thread Dong Lin
Hi all, I have created KIP-263: Allow broker to skip sanity check of inactive segments on broker startup. See https://cwiki.apache.org/confluence/display/KAFKA/KIP-263%3A+Allow+broker+to+skip+sanity+check+of+inactive+segments+on+broker+startup . This KIP provides a way to significantly reduce tim

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-02-25 Thread Dong Lin
nd Connect? > > 2. How does this compare to a solution where we physically split > partitions > > using a linear hashing approach (the partition number is equivalent to > the > > hash bucket in a hash table)? https://en.wikipedia.org/wiki/ > Linear_hashing > > >

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-02-25 Thread Dong Lin
en.wikipedia.org/wiki/Linear_hashing > > -Jay > > On Sat, Feb 10, 2018 at 3:35 PM, Dong Lin wrote: > > > Hi all, > > > > I have created KIP-253: Support in-order message delivery with partition > > expansion. See > > https://cwiki.apache.org/confluen

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-02-25 Thread Dong Lin
> > partitions is a quick way of shifting some traffic to the newly added > > brokers. Once the traffic goes down, the newly added brokers will be > > reclaimed (potentially by moving replicas off those brokers). However, if > > one can only add partitions without removing

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-02-25 Thread Dong Lin
med (potentially by moving replicas off those brokers). However, if > one can only add partitions without removing, eventually, one will hit the > limit. > > Thanks, > > Jun > > On Wed, Feb 21, 2018 at 12:23 PM, Dong Lin wrote: > > > Hey Jun, > > > > Thank

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-02-25 Thread Dong Lin
d for consumers, producers, controller, > etc, etc. If we have the ability to add partitions when needed, it will be > good to also remove them when no longer needed. > > On Wed, Feb 21, 2018 at 12:24 PM Dong Lin wrote: > > > Hey Jun, > > > > Thanks much for your

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-02-23 Thread Dong Lin
18 5:19 PM, Jay Kreps wrote: > >> Hey Dong, > >> > >> Two questions: > >> 1. How will this work with Streams and Connect? > >> 2. How does this compare to a solution where we physically split > partitions > >> using a linear hashi

Re: [DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-02-21 Thread Dong Lin
portant to address though. > Personally I think it is not worth adding more complexity just to optimize this scenario. This imbalance should exist only for a short period of time. If it is important I can think more about how to handle it. > > Thanks, > > Jun > > > >

[jira] [Created] (KAFKA-6571) KafkaProducer.close(0) should be non-blocking

2018-02-18 Thread Dong Lin (JIRA)
Dong Lin created KAFKA-6571: --- Summary: KafkaProducer.close(0) should be non-blocking Key: KAFKA-6571 URL: https://issues.apache.org/jira/browse/KAFKA-6571 Project: Kafka Issue Type: Bug

[DISCUSS] KIP-253: Support in-order message delivery with partition expansion

2018-02-10 Thread Dong Lin
Hi all, I have created KIP-253: Support in-order message delivery with partition expansion. See https://cwiki.apache.org/confluence/display/KAFKA/KIP-253%3A+Support+in-order+message+delivery+with+partition+expansion . This KIP provides a way to allow messages of the same key from the same produce

Re: [DISCUSS] KIP-250 Add Support for Quorum-based Producer Acknowledgment

2018-02-07 Thread Dong Lin
p change must be delayed. As far > as I can tell, this strategy hurts availability without even increasing > consistency. > > best, > Colin > > > On Sat, Feb 3, 2018, at 10:03, Dong Lin wrote: > > Hey Guozhang, > > > > I don't have very detailed desi

Re: [VOTE] KIP-232: Detect outdated metadata using leaderEpoch and partitionEpoch

2018-02-07 Thread Dong Lin
nd see if > there is any overlap with KIP-232. Would you be interested in doing that? > If not, we can do that next week. > > Jun > > > On Tue, Feb 6, 2018 at 11:30 AM, Dong Lin wrote: > > > Hey Jun, > > > > Interestingly I am also planning to sketch a K

Re: [VOTE] KIP-232: Detect outdated metadata using leaderEpoch and partitionEpoch

2018-02-06 Thread Dong Lin
n probably be done > > separately. > > > > Thanks, > > > > Jiangjie (Becket) Qin > > > > On Mon, Jan 29, 2018 at 11:52 AM, Dong Lin wrote: > > > > > Hey Colin, > > > > > > On Mon, Jan 29, 2018 at 11:23 AM, Colin McCabe >

Re: [DISCUSS] KIP-250 Add Support for Quorum-based Producer Acknowledgment

2018-02-03 Thread Dong Lin
> Hi Dong, > > Could you elaborate a bit more how controller could affect leaders to > switch between all and quorum? > > > Guozhang > > > On Fri, Feb 2, 2018 at 10:12 PM, Dong Lin wrote: > >> Hey Guazhang, >> >> Got it. Thanks for the detailed

Re: [DISCUSS] KIP-250 Add Support for Quorum-based Producer Acknowledgment

2018-02-02 Thread Dong Lin
re already using "all". In other words, "quorum" > is trading a bit of failure tolerance that is strictly defined on min.isr > for better tail latency. > > > Guozhang > > > On Fri, Feb 2, 2018 at 6:25 PM, Dong Lin wrote: > >> Hey Guozhang, >

Re: [DISCUSS] KIP-250 Add Support for Quorum-based Producer Acknowledgment

2018-02-02 Thread Dong Lin
arios: > > a. ISR list has 3: "all" waits for all 3, "quorum" waits for 2 of them. > b. ISR list has 2: "all" and "quorum" waits for both 2 of them. > c. ISR list has 1: "all" waits for leader to return, while "quorum" would &g

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-31 Thread Dong Lin
Hey Jun, Jason, Thanks for all the comments. Could you see if you can give +1 for the KIP? I am open to make further improvements for the KIP. Thanks, Dong On Tue, Jan 23, 2018 at 3:44 PM, Dong Lin wrote: > Hey Jun, Jason, > > Thanks much for all the review! I will open the voti

Re: [VOTE] KIP-232: Detect outdated metadata using leaderEpoch and partitionEpoch

2018-01-29 Thread Dong Lin
Hey Colin, On Mon, Jan 29, 2018 at 11:23 AM, Colin McCabe wrote: > > On Mon, Jan 29, 2018 at 10:35 AM, Dong Lin wrote: > > > > > Hey Colin, > > > > > > I understand that the KIP will adds overhead by introducing > per-partition > > > partition

Re: [VOTE] KIP-232: Detect outdated metadata using leaderEpoch and partitionEpoch

2018-01-29 Thread Dong Lin
Mon, Jan 29, 2018 at 10:35 AM, Dong Lin wrote: > Hey Colin, > > I understand that the KIP will adds overhead by introducing per-partition > partitionEpoch. I am open to alternative solutions that does not incur > additional overhead. But I don't see a better way now. >

Re: [VOTE] KIP-232: Detect outdated metadata using leaderEpoch and partitionEpoch

2018-01-29 Thread Dong Lin
this KIP, maybe we can do the partition ID in a separate KIP? Thanks, Dong On Mon, Jan 29, 2018 at 10:18 AM, Colin McCabe wrote: > On Fri, Jan 26, 2018, at 12:17, Dong Lin wrote: > > Hey Colin, > > > > > > On Fri, Jan 26, 2018 at 10:16 AM, Colin McCabe &

Re: [VOTE] KIP-232: Detect outdated metadata using leaderEpoch and partitionEpoch

2018-01-26 Thread Dong Lin
Hey Colin, On Fri, Jan 26, 2018 at 10:16 AM, Colin McCabe wrote: > On Thu, Jan 25, 2018, at 16:47, Dong Lin wrote: > > Hey Colin, > > > > Thanks for the comment. > > > > On Thu, Jan 25, 2018 at 4:15 PM, Colin McCabe > wrote: > > > >

Re: [DISCUSS] KIP-250 Add Support for Quorum-based Producer Acknowledgment

2018-01-25 Thread Dong Lin
27;t > > > break any current promises and provide a separate option for our user. > > > In terms of KIP-250, I feel it is more like the "Semisynchronous > > > Replication" in the MySQL world, and yes it is something between acks=1 > > and > > > acks=i

Re: [VOTE] KIP-232: Detect outdated metadata using leaderEpoch and partitionEpoch

2018-01-25 Thread Dong Lin
Hey Colin, Thanks for the comment. On Thu, Jan 25, 2018 at 4:15 PM, Colin McCabe wrote: > On Wed, Jan 24, 2018, at 21:07, Dong Lin wrote: > > Hey Colin, > > > > Thanks for reviewing the KIP. > > > > If I understand you right, you maybe suggesting that we can us

[jira] [Created] (KAFKA-6488) Prevent log corruption in case of OOM

2018-01-25 Thread Dong Lin (JIRA)
Dong Lin created KAFKA-6488: --- Summary: Prevent log corruption in case of OOM Key: KAFKA-6488 URL: https://issues.apache.org/jira/browse/KAFKA-6488 Project: Kafka Issue Type: Bug

Re: [VOTE] KIP-232: Detect outdated metadata using leaderEpoch and partitionEpoch

2018-01-24 Thread Dong Lin
> X is the leader but Y is really the leader, the client will talk to X, and > X will point out its mistake by sending back a NOT_LEADER_FOR_PARTITION. > Then the client can update its metadata again and find the new leader, if > there is one. There is no need for an epoch to handle this

Re: [VOTE] KIP-232: Detect outdated metadata using leaderEpoch and partitionEpoch

2018-01-24 Thread Dong Lin
o > offset 8, and then call seek(16) to "jump" to a further position, then she > needs to be aware that OORE maybe thrown and she needs to handle it or rely > on reset policy which should not surprise her. > > > I'm +1 on the KIP. > > Guozhang > > >

Re: [VOTE] KIP-232: Detect outdated metadata using leaderEpoch and partitionEpoch

2018-01-24 Thread Dong Lin
16) > after construction and then poll without committed offset ever stored > before? Admittedly it is rare but we do not programmably disallow it. > > > Guozhang > > On Tue, Jan 23, 2018 at 10:42 PM, Dong Lin wrote: > > > Hey Guozhang, > > > > Thanks much for

Re: [VOTE] KIP-232: Detect outdated metadata using leaderEpoch and partitionEpoch

2018-01-23 Thread Dong Lin
h partition epoch 2 pointing to > leader broker A, while the actual up-to-date metadata has partition epoch 3 > whose leader is now broker B, the metadata refresh will still succeed and > the follow-up fetch request may still see OORE? > > > Guozhang > > > On Tue, Jan 23,

Re: KIP-250 Add Support for Quorum-based Producer Acknowledgment

2018-01-23 Thread Dong Lin
Hey Litao, Thanks for the KIP. I have one quick comment before you provide more detail on how to select the leader with the largest LEO. Do you think it would make sense to change the default behavior of acks=-1, such that broker will acknowledge the message once the message has been replicated t

[VOTE] KIP-232: Detect outdated metadata using leaderEpoch and partitionEpoch

2018-01-23 Thread Dong Lin
Hi all, I would like to start the voting process for KIP-232: https://cwiki.apache.org/confluence/display/KAFKA/KIP-232%3A+Detect+outdated+metadata+using+leaderEpoch+and+partitionEpoch The KIP will help fix a concurrency issue in Kafka which currently can cause message loss or message duplicatio

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-23 Thread Dong Lin
Hey Jun, Jason, Thanks much for all the review! I will open the voting thread. Regards, Dong On Tue, Jan 23, 2018 at 3:37 PM, Jun Rao wrote: > Hi, Dong, > > The current KIP looks good to me. > > Thanks, > > Jun > > On Tue, Jan 23, 2018 at 12:29 PM, Dong Lin wrote

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-23 Thread Dong Lin
Hey Jun, Do you think the current KIP looks OK? I am wondering if we can open the voting thread. Thanks! Dong On Fri, Jan 19, 2018 at 3:08 PM, Dong Lin wrote: > Hey Jun, > > I think we can probably have a static method in Util class to decode the > byte[]. Both KafkaConsumer im

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-19 Thread Dong Lin
h in > the AdminClient too. > > Thanks, > > Jun > > > On Thu, Jan 18, 2018 at 6:30 PM, Dong Lin wrote: > > > Hey Jun, > > > > I agree. I have updated the KIP to remove the class OffetEpoch and > replace > > OffsetEpoch with byte[] in APIs that use it.

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-18 Thread Dong Lin
now. The only thing is > for OffsetEpoch. > If we expose the individual fields in the class, we probably don't need the > encode/decode methods. If we want to hide the details of OffsetEpoch, we > probably don't want expose the individual fields. > > Jun > > On Wed

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-17 Thread Dong Lin
updated the KIP to use the suggested approach. On Wed, Jan 17, 2018 at 9:57 AM, Dong Lin wrote: > Hey Jun, > > Thanks much for the comments. Please see my comments inline. > > On Tue, Jan 16, 2018 at 4:38 PM, Jun Rao wrote: > >> Hi, Dong, >> >> Thanks for the u

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-17 Thread Dong Lin
h class from the KIP. I just added it back with the public methods. Could you take another look? > > Jun > > > On Thu, Jan 11, 2018 at 5:43 PM, Dong Lin wrote: > > > Hey Jun, > > > > Thanks much. I agree that we can not rely on committed offsets to be > alway

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-11 Thread Dong Lin
rner case problems in the future. I am not sure having just a > global_epoch can achieve these. global_epoch is useful to determine which > version of the metadata is newer, especially with topic deletion. > > Thanks, > > Jun > > On Tue, Jan 9, 2018 at 11:34 PM, Dong Lin wrote: > &

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-09 Thread Dong Lin
global epoch. On Tue, Jan 9, 2018 at 6:58 PM, Dong Lin wrote: > Hey Jun, > > Thanks so much. These comments very useful. Please see below my comments. > > On Mon, Jan 8, 2018 at 5:52 PM, Jun Rao wrote: > >> Hi, Dong, >> >> Thanks for the updated KIP. A few mor

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-09 Thread Dong Lin
want to hide them, perhaps we can have sth like the > following. The binary encoding is probably more efficient than JSON for > external storage. > > OffsetEpoch { > static OffsetEpoch decode(byte[]); > > public byte[] encode(); > > public String toString(); > } > T

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-08 Thread Dong Lin
adds much complexity and it makes the behavior consistent: every topic > mutation results in an epoch bump. > > Thanks, > Jason > > On Mon, Jan 8, 2018 at 3:14 PM, Dong Lin wrote: > > > Hey Ismael, > > > > I guess we actually need user to see this field so that

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-08 Thread Dong Lin
gt; > Ismael > > The corresponding seek() and position() APIs might look something like > this: > > > > void seek(TopicPartition partition, long offset, byte[] offsetMetadata); > > byte[] positionMetadata(TopicPartition partition); > > > > What do you think? > &g

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-08 Thread Dong Lin
xternal store. If user stores the object.toString() in the external store, how can user convert the string back to the object? And yes, this only matters to the consumer API. The current KIP continues to send leader_epoch and topic_epoch as separate fields in request/response and the offset topic s

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-05 Thread Dong Lin
following json format: { "version": 1, "topic_epoch": int, "leader_epoch": int. } In comparison to byte[], String has the benefit of being more readable and it is also the same type of the existing metadata field, which is used for a similar purpose by user. Do

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-04 Thread Dong Lin
Hey Jun, Jason, Thanks much for all the feedback. I have updated the KIP based on the latest discussion. Can you help check whether it looks good? Thanks, Dong On Thu, Jan 4, 2018 at 5:36 PM, Dong Lin wrote: > Hey Jun, > > Hmm... thinking about this more, I am not sure that the pro

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-04 Thread Dong Lin
(partition_epoch, > leader_epoch). > > Thanks, > > Jun > > > On Thu, Jan 4, 2018 at 4:22 PM, Dong Lin wrote: > > > Hey Jun, > > > > Thanks much. I like the the new API that you proposed. I am not sure what > > you exactly mean by offset_epoch. I suppose th

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-04 Thread Dong Lin
api works in a similar way as how offset management works now and is > probably what we want in the long term. > 2. It can reset offsets better when there is data loss due to unclean > leader election or correlated replica failure. > 3. It can reset offsets better when topic is rec

Re: [DISCUSS] KIP-232: Detect outdated metadata by adding ControllerMetadataEpoch field

2018-01-04 Thread Dong Lin
he offset, on a consumer restart, it's not clear how > we can ensure the metadata in the consumer is high enough since there is no > metadata version to compare with. > > Thanks, > > Jun > > > On Wed, Jan 3, 2018 at 6:43 PM, Dong Lin wrote: > > > Hey Jun, &

Re: [DISCUSS] KIP-237: More Controller Health Metrics

2018-01-04 Thread Dong Lin
implementation and makes it harder to refactor. That was the reason for the > suggestion. > > Ismael > > On Wed, Dec 13, 2017 at 1:51 AM, Dong Lin wrote: > > > Hey Ismael, > > > > Thanks for your comments. Yeah the set of metrics in the KIP includes the > > t

<    1   2   3   4   5   6   7   8   9   10   >