Re: [DISCUSS] KIP-1008: ParKa - the Marriage of Parquet and Kafka
> if we can produce the segment with Parquet, which is the native format in a data lake, the consumer application (e.g., Spark jobs for ingestion) can directly dump the segments as raw byte buffers into the data lake without unwrapping each record individually and then writing to the Parquet file one by one, with the expensive steps of encoding and compression again.

This sounds like an interesting idea. I have one concern, though. Data lake/table formats (like Delta Lake, Hudi, Iceberg) have column-level statistics, which are important for query performance. How would column stats be handled in this proposal?

On Tue, Nov 21, 2023 at 9:21 AM Xinli shang wrote:
> Hi, all
>
> Can I ask for a discussion on the just-created KIP-1008: ParKa - the
> Marriage of Parquet and Kafka
> <https://cwiki.apache.org/confluence/display/KAFKA/KIP-1008%3A+ParKa+-+the+Marriage+of+Parquet+and+Kafka>
> ?
>
> --
> Xinli Shang
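For context on the column-stats concern: Parquet records min/max/null-count statistics per column chunk in the file footer, and table formats lift them into their manifests so query engines can prune row groups. The toy, self-contained sketch below (a hypothetical helper, not part of the KIP's design) illustrates the kind of statistics a producer-side Parquet segment writer would have to compute itself:

```java
import java.util.Arrays;
import java.util.List;

// Toy sketch: per-column min/max/null-count statistics of the kind a
// Parquet writer records for each column chunk. A broker- or
// producer-side Parquet segment writer would need to compute these so
// the dumped files keep their row-group pruning benefits.
public class ColumnStatsSketch {

    public static final class ColumnStats {
        public final long min;
        public final long max;
        public final long nullCount;

        ColumnStats(long min, long max, long nullCount) {
            this.min = min;
            this.max = max;
            this.nullCount = nullCount;
        }
    }

    // Collect stats over one column chunk's values; nulls are counted
    // separately and excluded from min/max. (If all values are null,
    // min/max remain at their sentinel values.)
    public static ColumnStats collect(List<Long> values) {
        long min = Long.MAX_VALUE, max = Long.MIN_VALUE, nulls = 0;
        for (Long v : values) {
            if (v == null) { nulls++; continue; }
            min = Math.min(min, v);
            max = Math.max(max, v);
        }
        return new ColumnStats(min, max, nulls);
    }

    public static void main(String[] args) {
        ColumnStats s = collect(Arrays.asList(3L, null, 7L, 1L));
        System.out.println(s.min + " " + s.max + " " + s.nullCount);
    }
}
```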
Re: Review Request 33760: Patch for KAFKA-2121
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33760/ ---

(Updated May 1, 2015, 10:42 p.m.)

Review request for kafka.

Bugs: KAFKA-2121
    https://issues.apache.org/jira/browse/KAFKA-2121

Repository: kafka

Description (updated)
---
Override the java.io.Closeable#close method in the Serializer and Deserializer interfaces without throwing the checked IOException; this is to avoid breaking source compatibility. Add a test checking that the Serializer is closed during KafkaProducer#close. Add the copyright header missing from the previous check-in. Removed "throws Exception" from test methods.

Diffs (updated)
---
- clients/src/main/java/org/apache/kafka/common/serialization/Deserializer.java 9a57579f87cb19cb6affe6d157ff8446c23e3551
- clients/src/main/java/org/apache/kafka/common/serialization/Serializer.java c44054038066f0d0829d05f082b2ee42b34cded7
- clients/src/test/java/org/apache/kafka/clients/consumer/KafkaConsumerTest.java eea2c28450736d1668c68828f77a49470a82c3d0
- clients/src/test/java/org/apache/kafka/clients/producer/KafkaProducerTest.java 49f1427bcbe43c773920a25aa69a71d0329296b7
- clients/src/test/java/org/apache/kafka/test/MockMetricsReporter.java 6f948f240c906029a0f972bf770f288f390ea714
- clients/src/test/java/org/apache/kafka/test/MockSerializer.java PRE-CREATION

Diff: https://reviews.apache.org/r/33760/diff/

Testing
---

Thanks,
Steven Wu
Review Request 33760: Patch for KAFKA-2121
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33760/ ---

Review request for kafka.

Bugs: KAFKA-2121
    https://issues.apache.org/jira/browse/KAFKA-2121

Repository: kafka

Description
---
Override the java.io.Closeable#close method in the Serializer and Deserializer interfaces without throwing the checked IOException; this is to avoid breaking source compatibility. Add a test checking that the Serializer is closed during KafkaProducer#close.

Diffs
---
- clients/src/main/java/org/apache/kafka/common/serialization/Deserializer.java 9a57579f87cb19cb6affe6d157ff8446c23e3551
- clients/src/main/java/org/apache/kafka/common/serialization/Serializer.java c44054038066f0d0829d05f082b2ee42b34cded7
- clients/src/test/java/org/apache/kafka/clients/consumer/KafkaConsumerTest.java eea2c28450736d1668c68828f77a49470a82c3d0
- clients/src/test/java/org/apache/kafka/clients/producer/KafkaProducerTest.java 49f1427bcbe43c773920a25aa69a71d0329296b7
- clients/src/test/java/org/apache/kafka/test/MockMetricsReporter.java 6f948f240c906029a0f972bf770f288f390ea714
- clients/src/test/java/org/apache/kafka/test/MockSerializer.java PRE-CREATION

Diff: https://reviews.apache.org/r/33760/diff/

Testing
---

Thanks,
Steven Wu
Re: [DISCUSSION] java.io.Closeable in KAFKA-2121
On Tue, Apr 28, 2015 at 1:03 PM, Ewen Cheslack-Postava <e...@confluent.io> wrote:

Good point Jay. More specifically, since we were already implementing it without the checked exception, we'd need to override close() in the Serializer and Deserializer interfaces and omit the throws clause. That definitely makes them source compatible. Not sure about binary compatibility; I couldn't find a quick answer, but I think it's probably still compatible.

-Ewen

On Tue, Apr 28, 2015 at 12:30 PM, Jay Kreps <jay.kr...@gmail.com> wrote:

Hey guys,

You can implement Closeable without the checked exception. Having close() methods throw checked exceptions isn't very useful unless there is a way for the caller to recover. In this case there really isn't, right?

-Jay

On Mon, Apr 27, 2015 at 5:51 PM, Guozhang Wang <wangg...@gmail.com> wrote:

Folks,

In a recent commit I made regarding KAFKA-2121, there is an overlooked API change which makes Serializer / Deserializer extend Closeable, whose close() call can throw IOException by declaration. Hence a scenario like:

    Serializer<T> keySerializer = ...
    Serializer<T> valueSerializer = ...
    KafkaProducer producer = new KafkaProducer(config, keySerializer, valueSerializer)
    // ...
    keySerializer.close()
    valueSerializer.close()

will need to catch IOException now. I want to bring this up for people's attention, and ask your opinion on whether we should revert this change.

-- Guozhang

--
Thanks,
Ewen
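Jay's suggestion works because Java lets an overriding method declare fewer checked exceptions than the method it overrides. A minimal, self-contained sketch (the Serializer body here is simplified, not the real Kafka interface): the interface still extends Closeable, but callers no longer need a try/catch around close().

```java
import java.io.Closeable;
import java.nio.charset.StandardCharsets;

public class CloseableNarrowing {
    // Extend Closeable but override close() without "throws IOException".
    // Narrowing the throws clause in an override is legal Java, so code
    // calling close() through this interface needs no checked-exception
    // handling, while the type remains usable in try-with-resources.
    public interface Serializer<T> extends Closeable {
        byte[] serialize(String topic, T data);

        @Override
        void close(); // no checked IOException
    }

    public static final class StringSerializer implements Serializer<String> {
        public byte[] serialize(String topic, String data) {
            return data.getBytes(StandardCharsets.UTF_8);
        }
        public void close() { /* nothing to release */ }
    }

    public static void main(String[] args) {
        Serializer<String> s = new StringSerializer();
        byte[] bytes = s.serialize("topic", "hi");
        s.close(); // compiles without catching IOException
        System.out.println(bytes.length);
    }
}
```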
Re: [DISCUSSION] java.io.Closeable in KAFKA-2121
Sorry for the previous empty msg. Jay's idea should work: basically, we override the close() method in the Serializer interface.

    public interface Serializer<T> extends Closeable {
        @Override
        void close();
    }
Review Request 33654: Patch for KAFKA-2121
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33654/ ---

Review request for kafka.

Bugs: KAFKA-2121
    https://issues.apache.org/jira/browse/KAFKA-2121

Repository: kafka

Description
---
Override the java.io.Closeable#close method in the Serializer and Deserializer interfaces without throwing the checked IOException; this is to avoid breaking source compatibility.

Diffs
---
- clients/src/main/java/org/apache/kafka/common/serialization/Deserializer.java 9a57579f87cb19cb6affe6d157ff8446c23e3551
- clients/src/main/java/org/apache/kafka/common/serialization/Serializer.java c44054038066f0d0829d05f082b2ee42b34cded7
- clients/src/test/java/org/apache/kafka/clients/consumer/KafkaConsumerTest.java eea2c28450736d1668c68828f77a49470a82c3d0
- clients/src/test/java/org/apache/kafka/clients/producer/KafkaProducerTest.java 49f1427bcbe43c773920a25aa69a71d0329296b7
- clients/src/test/java/org/apache/kafka/test/MockMetricsReporter.java 6f948f240c906029a0f972bf770f288f390ea714

Diff: https://reviews.apache.org/r/33654/diff/

Testing
---

Thanks,
Steven Wu
Review Request 33574: Patch for KAFKA-2151
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33574/ ---

Review request for kafka.

Bugs: KAFKA-2151
    https://issues.apache.org/jira/browse/KAFKA-2151

Repository: kafka

Description
---
Make MockMetricsReporter a little more generic.

Diffs
---
- clients/src/test/java/org/apache/kafka/clients/consumer/KafkaConsumerTest.java eea2c28450736d1668c68828f77a49470a82c3d0
- clients/src/test/java/org/apache/kafka/clients/producer/KafkaProducerTest.java 49f1427bcbe43c773920a25aa69a71d0329296b7
- clients/src/test/java/org/apache/kafka/test/MockMetricsReporter.java 6f948f240c906029a0f972bf770f288f390ea714

Diff: https://reviews.apache.org/r/33574/diff/

Testing
---

Thanks,
Steven Wu
Re: Review Request 33242: Patch for KAFKA-2121
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/ ---

(Updated April 21, 2015, 5:48 a.m.)

Review request for kafka.

Bugs: KAFKA-2121
    https://issues.apache.org/jira/browse/KAFKA-2121

Repository: kafka

Description (updated)
---
Move MockMetricsReporter into clients/src/test/java/org/apache/kafka/test.

Diffs (updated)
---
- clients/src/main/java/org/apache/kafka/clients/ClientUtils.java d0da5d7a08a0c3e67e0fe14bb0b0e7c73380f416
- clients/src/main/java/org/apache/kafka/clients/KafkaClient.java 96ac6d0cca990eebe90707465d7d8091c069a4b2
- clients/src/main/java/org/apache/kafka/clients/consumer/KafkaConsumer.java 21243345311a106f0802ce96c026ba6e815ccf99
- clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java b91e2c52ed0acb1faa85915097d97bafa28c413a
- clients/src/main/java/org/apache/kafka/clients/producer/internals/Sender.java 70954cadb5c7e9a4c326afcf9d9a07db230e7db2
- clients/src/main/java/org/apache/kafka/common/metrics/Metrics.java b3d3d7c56acb445be16a3fbe00f05eaba659be46
- clients/src/main/java/org/apache/kafka/common/serialization/Deserializer.java 13be6a38cb356d55e25151776328a3c38c573db4
- clients/src/main/java/org/apache/kafka/common/serialization/Serializer.java c2fdc23239bd2196cd912c3d121b591f21393eab
- clients/src/test/java/org/apache/kafka/clients/consumer/KafkaConsumerTest.java PRE-CREATION
- clients/src/test/java/org/apache/kafka/clients/producer/KafkaProducerTest.java PRE-CREATION
- clients/src/test/java/org/apache/kafka/test/MockMetricsReporter.java PRE-CREATION

Diff: https://reviews.apache.org/r/33242/diff/

Testing
---

Thanks,
Steven Wu
Re: Review Request 33242: Patch for KAFKA-2121
On April 21, 2015, 3:08 a.m., Guozhang Wang wrote:

> LGTM, besides one minor suggestion: could you move MockMetricsReporter to clients/src/test/java/org/apache/kafka/test?

Done. Moved MockMetricsReporter to clients/src/test/java/org/apache/kafka/test.

- Steven

--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/#review80892 ---
Re: Review Request 33242: Patch for KAFKA-2121
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/ ---

(Updated April 20, 2015, 4:51 p.m.)

Review request for kafka.

Bugs: KAFKA-2121
    https://issues.apache.org/jira/browse/KAFKA-2121

Repository: kafka

Description
---
Fix a potential resource leak when the KafkaProducer constructor fails in the middle.

Diffs (updated)
---
- clients/src/main/java/org/apache/kafka/clients/ClientUtils.java d0da5d7a08a0c3e67e0fe14bb0b0e7c73380f416
- clients/src/main/java/org/apache/kafka/clients/KafkaClient.java 96ac6d0cca990eebe90707465d7d8091c069a4b2
- clients/src/main/java/org/apache/kafka/clients/consumer/KafkaConsumer.java 21243345311a106f0802ce96c026ba6e815ccf99
- clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java b91e2c52ed0acb1faa85915097d97bafa28c413a
- clients/src/main/java/org/apache/kafka/clients/producer/internals/Sender.java 70954cadb5c7e9a4c326afcf9d9a07db230e7db2
- clients/src/main/java/org/apache/kafka/common/metrics/Metrics.java b3d3d7c56acb445be16a3fbe00f05eaba659be46
- clients/src/main/java/org/apache/kafka/common/serialization/Deserializer.java 13be6a38cb356d55e25151776328a3c38c573db4
- clients/src/main/java/org/apache/kafka/common/serialization/Serializer.java c2fdc23239bd2196cd912c3d121b591f21393eab
- clients/src/test/java/org/apache/kafka/clients/consumer/KafkaConsumerTest.java PRE-CREATION
- clients/src/test/java/org/apache/kafka/clients/producer/KafkaProducerTest.java PRE-CREATION

Diff: https://reviews.apache.org/r/33242/diff/

Testing
---

Thanks,
Steven Wu
Re: Review Request 33242: Patch for KAFKA-2121
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/ ---

(Updated April 20, 2015, 4:57 p.m.)

Review request for kafka.

Bugs: KAFKA-2121
    https://issues.apache.org/jira/browse/KAFKA-2121

Repository: kafka

Description
---
Fix a potential resource leak when the KafkaProducer constructor fails in the middle.

Diffs (updated)
---
- clients/src/main/java/org/apache/kafka/clients/ClientUtils.java d0da5d7a08a0c3e67e0fe14bb0b0e7c73380f416
- clients/src/main/java/org/apache/kafka/clients/KafkaClient.java 96ac6d0cca990eebe90707465d7d8091c069a4b2
- clients/src/main/java/org/apache/kafka/clients/consumer/KafkaConsumer.java 21243345311a106f0802ce96c026ba6e815ccf99
- clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java b91e2c52ed0acb1faa85915097d97bafa28c413a
- clients/src/main/java/org/apache/kafka/clients/producer/internals/Sender.java 70954cadb5c7e9a4c326afcf9d9a07db230e7db2
- clients/src/main/java/org/apache/kafka/common/metrics/Metrics.java b3d3d7c56acb445be16a3fbe00f05eaba659be46
- clients/src/main/java/org/apache/kafka/common/serialization/Deserializer.java 13be6a38cb356d55e25151776328a3c38c573db4
- clients/src/main/java/org/apache/kafka/common/serialization/Serializer.java c2fdc23239bd2196cd912c3d121b591f21393eab
- clients/src/test/java/org/apache/kafka/clients/consumer/KafkaConsumerTest.java PRE-CREATION
- clients/src/test/java/org/apache/kafka/clients/producer/KafkaProducerTest.java PRE-CREATION

Diff: https://reviews.apache.org/r/33242/diff/

Testing
---

Thanks,
Steven Wu
Re: Review Request 33242: Patch for KAFKA-2121
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/ ---

(Updated April 20, 2015, 4:52 p.m.)

Review request for kafka.

Bugs: KAFKA-2121
    https://issues.apache.org/jira/browse/KAFKA-2121

Repository: kafka

Description
---
Fix a potential resource leak when the KafkaProducer constructor fails in the middle.

Diffs (updated)
---
- clients/src/main/java/org/apache/kafka/clients/ClientUtils.java d0da5d7a08a0c3e67e0fe14bb0b0e7c73380f416
- clients/src/main/java/org/apache/kafka/clients/KafkaClient.java 96ac6d0cca990eebe90707465d7d8091c069a4b2
- clients/src/main/java/org/apache/kafka/clients/consumer/KafkaConsumer.java 21243345311a106f0802ce96c026ba6e815ccf99
- clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java b91e2c52ed0acb1faa85915097d97bafa28c413a
- clients/src/main/java/org/apache/kafka/clients/producer/internals/Sender.java 70954cadb5c7e9a4c326afcf9d9a07db230e7db2
- clients/src/main/java/org/apache/kafka/common/metrics/Metrics.java b3d3d7c56acb445be16a3fbe00f05eaba659be46
- clients/src/main/java/org/apache/kafka/common/serialization/Deserializer.java 13be6a38cb356d55e25151776328a3c38c573db4
- clients/src/main/java/org/apache/kafka/common/serialization/Serializer.java c2fdc23239bd2196cd912c3d121b591f21393eab
- clients/src/test/java/org/apache/kafka/clients/consumer/KafkaConsumerTest.java PRE-CREATION
- clients/src/test/java/org/apache/kafka/clients/producer/KafkaProducerTest.java PRE-CREATION

Diff: https://reviews.apache.org/r/33242/diff/

Testing
---

Thanks,
Steven Wu
Re: Review Request 33242: Patch for KAFKA-2121
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/ ---

(Updated April 20, 2015, 3:08 a.m.)

Review request for kafka.

Bugs: KAFKA-2121
    https://issues.apache.org/jira/browse/KAFKA-2121

Repository: kafka

Description (updated)
---
Applied the same fix to KafkaConsumer.

Diffs (updated)
---
- clients/src/main/java/org/apache/kafka/clients/ClientUtils.java d0da5d7a08a0c3e67e0fe14bb0b0e7c73380f416
- clients/src/main/java/org/apache/kafka/clients/KafkaClient.java 96ac6d0cca990eebe90707465d7d8091c069a4b2
- clients/src/main/java/org/apache/kafka/clients/consumer/KafkaConsumer.java 21243345311a106f0802ce96c026ba6e815ccf99
- clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java b91e2c52ed0acb1faa85915097d97bafa28c413a
- clients/src/main/java/org/apache/kafka/common/metrics/Metrics.java b3d3d7c56acb445be16a3fbe00f05eaba659be46
- clients/src/main/java/org/apache/kafka/common/serialization/Serializer.java c2fdc23239bd2196cd912c3d121b591f21393eab
- clients/src/test/java/org/apache/kafka/clients/producer/KafkaProducerTest.java PRE-CREATION

Diff: https://reviews.apache.org/r/33242/diff/

Testing
---

Thanks,
Steven Wu
Re: Review Request 33242: Patch for KAFKA-2121
On April 19, 2015, 11:11 p.m., Ewen Cheslack-Postava wrote:

> LGTM! If this gets merged as is, we should file a follow-up issue for the new consumer, which has the same issue.

OK. I applied the same fix to the new consumer, and also updated the JIRA title to reflect the expanded scope.

- Steven

--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/#review80642 ---
Re: Review Request 33242: Patch for KAFKA-2121
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/ ---

(Updated April 20, 2015, 3:30 a.m.)

Review request for kafka.

Bugs: KAFKA-2121
    https://issues.apache.org/jira/browse/KAFKA-2121

Repository: kafka

Description (updated)
---
Applied the same fix to KafkaConsumer. Added a test for KafkaConsumer.

Diffs (updated)
---
- clients/src/main/java/org/apache/kafka/clients/ClientUtils.java d0da5d7a08a0c3e67e0fe14bb0b0e7c73380f416
- clients/src/main/java/org/apache/kafka/clients/KafkaClient.java 96ac6d0cca990eebe90707465d7d8091c069a4b2
- clients/src/main/java/org/apache/kafka/clients/consumer/KafkaConsumer.java 21243345311a106f0802ce96c026ba6e815ccf99
- clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java b91e2c52ed0acb1faa85915097d97bafa28c413a
- clients/src/main/java/org/apache/kafka/clients/producer/internals/Sender.java 70954cadb5c7e9a4c326afcf9d9a07db230e7db2
- clients/src/main/java/org/apache/kafka/common/metrics/Metrics.java b3d3d7c56acb445be16a3fbe00f05eaba659be46
- clients/src/main/java/org/apache/kafka/common/serialization/Serializer.java c2fdc23239bd2196cd912c3d121b591f21393eab
- clients/src/test/java/org/apache/kafka/clients/consumer/KafkaConsumerTest.java PRE-CREATION
- clients/src/test/java/org/apache/kafka/clients/producer/KafkaProducerTest.java PRE-CREATION

Diff: https://reviews.apache.org/r/33242/diff/

Testing
---

Thanks,
Steven Wu
Re: Review Request 33242: Patch for KAFKA-2121
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/ ---

(Updated April 19, 2015, 3:09 a.m.)

Review request for kafka.

Bugs: KAFKA-2121
    https://issues.apache.org/jira/browse/KAFKA-2121

Repository: kafka

Description (updated)
---
Fix a potential resource leak when the KafkaProducer constructor fails in the middle.

Diffs (updated)
---
- clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java b91e2c52ed0acb1faa85915097d97bafa28c413a
- clients/src/main/java/org/apache/kafka/common/metrics/Metrics.java b3d3d7c56acb445be16a3fbe00f05eaba659be46
- clients/src/main/java/org/apache/kafka/common/serialization/Serializer.java c2fdc23239bd2196cd912c3d121b591f21393eab
- clients/src/test/java/org/apache/kafka/clients/producer/KafkaProducerTest.java PRE-CREATION

Diff: https://reviews.apache.org/r/33242/diff/

Testing
---

Thanks,
Steven Wu
Re: Review Request 33242: Patch for KAFKA-2121
On April 16, 2015, 5:29 p.m., Ewen Cheslack-Postava wrote:

> clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java, line 548
> https://reviews.apache.org/r/33242/diff/2/?file=931792#file931792line548
>
> One idea for making this less verbose and redundant: make all of these classes implement Closeable so we can just write one utility method for trying to close something and catching the exception.

Steven Wu wrote:

> Yes, I thought about it. It may break binary compatibility, e.g. for Serializer. The Sender and Metrics classes are probably only used internally. Let me know your thoughts.

Ewen Cheslack-Postava wrote:

> I'm pretty sure it's fine, based on this: "Changing the direct superclass or the set of direct superinterfaces of a class type will not break compatibility with pre-existing binaries, provided that the total set of superclasses or superinterfaces, respectively, of the class type loses no members." From https://docs.oracle.com/javase/specs/jls/se7/html/jls-13.html

OK, you could be correct. Here is another reference (easier to understand than the JLS doc): "Expand superinterface set (direct or inherited) - Binary compatible." From https://wiki.eclipse.org/Evolving_Java-based_APIs_2#Evolving_API_Interfaces

- Steven

--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/#review80346 ---
Re: Review Request 33242: Patch for KAFKA-2121
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/ ---

(Updated April 16, 2015, 4:55 p.m.)

Review request for kafka.

Bugs: KAFKA-2121
    https://issues.apache.org/jira/browse/KAFKA-2121

Repository: kafka

Description (updated)
---
Add a unit test file. Changes based on Ewen's review feedback.

Diffs (updated)
---
- clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java b91e2c52ed0acb1faa85915097d97bafa28c413a
- clients/src/test/java/org/apache/kafka/clients/producer/KafkaProducerTest.java PRE-CREATION

Diff: https://reviews.apache.org/r/33242/diff/

Testing
---

Thanks,
Steven Wu
Re: Review Request 33242: Patch for KAFKA-2121
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/ ---

(Updated April 16, 2015, 5:03 p.m.)

Review request for kafka.

Changes
---
Address Ewen's review feedback.

> I'm getting a bunch of checkstyle complaints when I try to test. These should all be easy to fix (and should be causing tests to fail before even running). The only rule that might not be obvious from the error message is that the static final field in MockMetricsReporter is expected to be all-caps since it looks like a constant to checkstyle.

Fixed.

> In the constructor, could we throw some subclass of KafkaException instead? The new clients try to stick to that exception hierarchy except in a few special cases. Alternatively, maybe if we caught Error and RuntimeException instead of Throwable then we could just rethrow the same exception?

I changed RuntimeException to KafkaException. I can't think of a good subclass name for this scenario (ProducerConstructException?), hence staying with the generic KafkaException.

> The new version of close() will swallow exceptions when called normally (i.e. not from the constructor). They'll be logged, but the caller won't see the exception anymore. Maybe we should save the first exception and rethrow it?

Refactored into a private close(boolean swallowException) method.

> Exception messages should be capitalized.

Fixed.

> In the test, we should probably have an assert outside the catch. And is there any reason the closeCount is being reset to 0?

Yes, we should have an assert outside the catch. I just reset CLOSE_COUNT in case another test method needs to check the count.

Bugs: KAFKA-2121
    https://issues.apache.org/jira/browse/KAFKA-2121

Repository: kafka

Description (updated)
---
Fix a potential resource leak when KafkaProducer throws an exception in the middle of construction.

Diffs
---
- clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java b91e2c52ed0acb1faa85915097d97bafa28c413a
- clients/src/test/java/org/apache/kafka/clients/producer/KafkaProducerTest.java PRE-CREATION

Diff: https://reviews.apache.org/r/33242/diff/

Testing
---

Thanks,
Steven Wu
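The close(boolean swallowException) refactoring discussed above can be sketched as follows. This is a simplified, hypothetical stand-in (Part, CloseSketch, and the plain RuntimeException in place of KafkaException are all inventions for illustration, not the actual patch): the public close() rethrows the first failure, while the constructor's cleanup path passes swallowException=true so a close() failure cannot mask the original construction error.

```java
import java.util.concurrent.atomic.AtomicReference;

public class CloseSketch {
    // Hypothetical stand-in for the producer's closeable parts
    // (metrics reporters, serializers, sender, ...).
    public interface Part { void close(); }

    private final Part metrics;
    private final Part serializer;

    public CloseSketch(Part metrics, Part serializer) {
        this.metrics = metrics;
        this.serializer = serializer;
    }

    // Normal close: the caller sees the first exception.
    public void close() {
        close(false);
    }

    // swallowException=true is for the constructor's catch block, where
    // rethrowing a close() failure would hide the real construction error.
    private void close(boolean swallowException) {
        AtomicReference<Throwable> firstException = new AtomicReference<>();
        closeQuietly(metrics, firstException);
        closeQuietly(serializer, firstException);
        if (!swallowException && firstException.get() != null)
            throw new RuntimeException("Failed to close sketch", firstException.get());
    }

    private static void closeQuietly(Part p, AtomicReference<Throwable> first) {
        try {
            p.close();
        } catch (Throwable t) {
            first.compareAndSet(null, t); // remember only the first failure
        }
    }

    public static void main(String[] args) {
        Part ok = () -> {};
        Part bad = () -> { throw new IllegalStateException("boom"); };
        try {
            new CloseSketch(bad, ok).close();
        } catch (RuntimeException e) {
            System.out.println(e.getCause().getMessage());
        }
    }
}
```

As noted elsewhere in the thread, the AtomicReference is not strictly needed in single-threaded code; a plain null check would record the first exception just as well.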
Re: Review Request 33242: Patch for KAFKA-2121
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/ ---

(Updated April 16, 2015, 5:44 p.m.)

Review request for kafka.

Bugs: KAFKA-2121
    https://issues.apache.org/jira/browse/KAFKA-2121

Repository: kafka

Description (updated)
---
Add a unit test file. Changes based on Ewen's review feedback. Fix capitalization in an error log.

Diffs (updated)
---
- clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java b91e2c52ed0acb1faa85915097d97bafa28c413a
- clients/src/test/java/org/apache/kafka/clients/producer/KafkaProducerTest.java PRE-CREATION

Diff: https://reviews.apache.org/r/33242/diff/

Testing
---

Thanks,
Steven Wu
Re: Review Request 33242: Patch for KAFKA-2121
On April 16, 2015, 5:29 p.m., Ewen Cheslack-Postava wrote:

> clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java, line 531
> https://reviews.apache.org/r/33242/diff/2/?file=931792#file931792line531
>
> This code is all single threaded, is the AtomicReference really necessary here?

Not really necessary; I was just trying to use compareAndSet. Otherwise, I need to do if (firstException == null) firstException = t. I can certainly change it; let me know.

On April 16, 2015, 5:29 p.m., Ewen Cheslack-Postava wrote:

> clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java, line 548
> https://reviews.apache.org/r/33242/diff/2/?file=931792#file931792line548
>
> One idea for making this less verbose and redundant: make all of these classes implement Closeable so we can just write one utility method for trying to close something and catching the exception.

Yes, I thought about it. It may break binary compatibility, e.g. for Serializer. The Sender and Metrics classes are probably only used internally. Let me know your thoughts.

- Steven

--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/#review80346 ---
Re: Review Request 33242: Patch for KAFKA-2121
On April 16, 2015, 5:29 p.m., Ewen Cheslack-Postava wrote:

> Looks good, left a few comments. KafkaConsumer suffers from this same problem. Patching that should be pretty much identical -- any chance you could extend this to cover that as well?

Sure, I can extend this to KafkaConsumer later.

- Steven

--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/#review80346 ---
Review Request 33242: Patch for KAFKA-2121
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/33242/ ---

Review request for kafka.

Bugs: KAFKA-2121
    https://issues.apache.org/jira/browse/KAFKA-2121

Repository: kafka

Description
---
Add a unit test file.

Diffs
---
- clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java b91e2c52ed0acb1faa85915097d97bafa28c413a
- clients/src/test/java/org/apache/kafka/clients/producer/KafkaProducerTest.java PRE-CREATION

Diff: https://reviews.apache.org/r/33242/diff/

Testing
---

Thanks,
Steven Wu
Re: [DISCUSS] error handling in java KafkaProducer
Thanks, Ewen and Guozhang! I will go with the try-catch option then. here is the jira. feel free to assign it to me. I will try to submit a patch this week. https://issues.apache.org/jira/browse/KAFKA-2121 On Mon, Apr 13, 2015 at 7:17 PM, Guozhang Wang wangg...@gmail.com wrote: It is a valid problem and we should correct it as soon as possible, I'm with Ewen regarding the solution. On Mon, Apr 13, 2015 at 5:05 PM, Ewen Cheslack-Postava e...@confluent.io wrote: Steven, Looks like there is even more that could potentially be leaked -- since key and value serializers are created and configured at the end, even the IO thread allocated by the producer could leak. Given that, I think 1 isn't a great option since, as you said, it doesn't really address the underlying issue. 3 strikes me as bad from a user experience perspective. It's true we might want to introduce additional constructors to make testing easier, but the more components I need to allocate myself and inject into the producer's constructor, the worse the default experience is. And since you would have to inject the dependencies to get correct, non-leaking behavior, it will always be more code than previously (and a backwards incompatible change). Additionally, the code creating the producer would have to be more complicated since it would have to deal with the cleanup carefully whereas it previously just had to deal with the exception. Besides, for testing specifically, you can avoid exposing more constructors just for testing by using something like PowerMock that lets you mock private methods. That requires a bit of code reorganization, but doesn't affect the public interface at all. So my take is that a variant of 2 is probably best. I'd probably do two things. First, make close() safe to call even if some fields haven't been initialized, which presumably just means checking for null fields.
(You might also want to figure out if all the methods close() calls are idempotent and decide whether some fields should be marked non-final and cleared to null when close() is called). Second, add the try/catch as you suggested, but just use close(). -Ewen On Mon, Apr 13, 2015 at 3:53 PM, Steven Wu stevenz...@gmail.com wrote: Here is the resource leak problem that we have encountered when the 0.8.2 java KafkaProducer failed in the constructor. here is the code snippet of KafkaProducer to illustrate the problem. --- public KafkaProducer(ProducerConfig config, Serializer<K> keySerializer, Serializer<V> valueSerializer) { // create metrics reporters via reflection List<MetricsReporter> reporters = config.getConfiguredInstances(ProducerConfig.METRIC_REPORTER_CLASSES_CONFIG, MetricsReporter.class); // validate bootstrap servers List<InetSocketAddress> addresses = ClientUtils.parseAndValidateAddresses(config.getList(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG)); } --- let's say MyMetricsReporter creates a thread in its constructor. if hostname validation threw an exception, the constructor won't call the close method of MyMetricsReporter to clean up the resource. as a result, we created a thread leak issue. this becomes worse when we try to auto-recover (i.e. keep creating KafkaProducer again - failing again - more thread leaks). there are multiple options for fixing this. 1) just move the hostname validation to the beginning. but this only fixes one symptom. it doesn't fix the fundamental problem. what if some other lines throw an exception. 2) use try-catch. in the catch section, try to call close methods for any non-null objects constructed so far. 3) explicitly declare the dependency in the constructor. this way, when KafkaProducer throws an exception, I can call the close method of the metrics reporters to release resources. KafkaProducer(..., List<MetricsReporter> reporters) we don't need a dependency injection framework. but generally hiding a dependency is a bad coding practice.
it is also hard to plug in mocks for dependencies. this is probably the most intrusive change. I am willing to submit a patch. but I'd like to hear your opinions on how we should fix the issue. Thanks, Steven -- Thanks, Ewen -- -- Guozhang
Re: [DISCUSS] error handling in java KafkaProducer
I submitted a patch attempt in the jira. On Tue, Apr 14, 2015 at 10:16 AM, Steven Wu stevenz...@gmail.com wrote: Thanks, Ewen and Guozhang! I will go with the try-catch option then. here is the jira. feel free to assign it to me. I will try to submit a patch this week. https://issues.apache.org/jira/browse/KAFKA-2121 On Mon, Apr 13, 2015 at 7:17 PM, Guozhang Wang wangg...@gmail.com wrote: It is a valid problem and we should correct it as soon as possible, I'm with Ewen regarding the solution. On Mon, Apr 13, 2015 at 5:05 PM, Ewen Cheslack-Postava e...@confluent.io wrote: Steven, Looks like there is even more that could potentially be leaked -- since key and value serializers are created and configured at the end, even the IO thread allocated by the producer could leak. Given that, I think 1 isn't a great option since, as you said, it doesn't really address the underlying issue. 3 strikes me as bad from a user experience perspective. It's true we might want to introduce additional constructors to make testing easier, but the more components I need to allocate myself and inject into the producer's constructor, the worse the default experience is. And since you would have to inject the dependencies to get correct, non-leaking behavior, it will always be more code than previously (and a backwards incompatible change). Additionally, the code creating the producer would have to be more complicated since it would have to deal with the cleanup carefully whereas it previously just had to deal with the exception. Besides, for testing specifically, you can avoid exposing more constructors just for testing by using something like PowerMock that lets you mock private methods. That requires a bit of code reorganization, but doesn't affect the public interface at all. So my take is that a variant of 2 is probably best. I'd probably do two things. First, make close() safe to call even if some fields haven't been initialized, which presumably just means checking for null fields.
(You might also want to figure out if all the methods close() calls are idempotent and decide whether some fields should be marked non-final and cleared to null when close() is called). Second, add the try/catch as you suggested, but just use close(). -Ewen On Mon, Apr 13, 2015 at 3:53 PM, Steven Wu stevenz...@gmail.com wrote: Here is the resource leak problem that we have encountered when the 0.8.2 java KafkaProducer failed in the constructor. here is the code snippet of KafkaProducer to illustrate the problem. --- public KafkaProducer(ProducerConfig config, Serializer<K> keySerializer, Serializer<V> valueSerializer) { // create metrics reporters via reflection List<MetricsReporter> reporters = config.getConfiguredInstances(ProducerConfig.METRIC_REPORTER_CLASSES_CONFIG, MetricsReporter.class); // validate bootstrap servers List<InetSocketAddress> addresses = ClientUtils.parseAndValidateAddresses(config.getList(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG)); } --- let's say MyMetricsReporter creates a thread in its constructor. if hostname validation threw an exception, the constructor won't call the close method of MyMetricsReporter to clean up the resource. as a result, we created a thread leak issue. this becomes worse when we try to auto-recover (i.e. keep creating KafkaProducer again - failing again - more thread leaks). there are multiple options for fixing this. 1) just move the hostname validation to the beginning. but this only fixes one symptom. it doesn't fix the fundamental problem. what if some other lines throw an exception. 2) use try-catch. in the catch section, try to call close methods for any non-null objects constructed so far. 3) explicitly declare the dependency in the constructor. this way, when KafkaProducer throws an exception, I can call the close method of the metrics reporters to release resources. KafkaProducer(..., List<MetricsReporter> reporters) we don't need a dependency injection framework. but generally hiding a dependency is a bad coding practice.
it is also hard to plug in mocks for dependencies. this is probably the most intrusive change. I am willing to submit a patch. but I'd like to hear your opinions on how we should fix the issue. Thanks, Steven -- Thanks, Ewen -- -- Guozhang
[DISCUSS] error handling in java KafkaProducer
Here is the resource leak problem that we have encountered when the 0.8.2 java KafkaProducer failed in the constructor. here is the code snippet of KafkaProducer to illustrate the problem. --- public KafkaProducer(ProducerConfig config, Serializer<K> keySerializer, Serializer<V> valueSerializer) { // create metrics reporters via reflection List<MetricsReporter> reporters = config.getConfiguredInstances(ProducerConfig.METRIC_REPORTER_CLASSES_CONFIG, MetricsReporter.class); // validate bootstrap servers List<InetSocketAddress> addresses = ClientUtils.parseAndValidateAddresses(config.getList(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG)); } --- let's say MyMetricsReporter creates a thread in its constructor. if hostname validation threw an exception, the constructor won't call the close method of MyMetricsReporter to clean up the resource. as a result, we created a thread leak issue. this becomes worse when we try to auto-recover (i.e. keep creating KafkaProducer again - failing again - more thread leaks). there are multiple options for fixing this. 1) just move the hostname validation to the beginning. but this only fixes one symptom. it doesn't fix the fundamental problem. what if some other lines throw an exception. 2) use try-catch. in the catch section, try to call close methods for any non-null objects constructed so far. 3) explicitly declare the dependency in the constructor. this way, when KafkaProducer throws an exception, I can call the close method of the metrics reporters to release resources. KafkaProducer(..., List<MetricsReporter> reporters) we don't need a dependency injection framework. but generally hiding a dependency is a bad coding practice. it is also hard to plug in mocks for dependencies. this is probably the most intrusive change. I am willing to submit a patch. but I'd like to hear your opinions on how we should fix the issue. Thanks, Steven
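Option 2 above can be shown in a minimal, self-contained sketch. All class names here are invented for illustration (the real fix landed inside KafkaProducer as part of KAFKA-2121); `CountingReporter` stands in for a metrics reporter that starts a thread in its constructor:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.atomic.AtomicInteger;

// Sketch of option 2: wrap the constructor body in try/catch and, on failure,
// close every resource created so far before rethrowing.
class LeakFreeClient implements AutoCloseable {
    static final AtomicInteger CLOSE_COUNT = new AtomicInteger();

    static class CountingReporter implements AutoCloseable {
        @Override public void close() { CLOSE_COUNT.incrementAndGet(); }
    }

    private final List<CountingReporter> reporters = new ArrayList<>();

    LeakFreeClient(boolean failValidation) {
        try {
            reporters.add(new CountingReporter());   // resource created early
            if (failValidation)                      // e.g. the bootstrap-server check
                throw new IllegalArgumentException("invalid bootstrap servers");
        } catch (RuntimeException e) {
            close();                                 // release partial state
            throw e;
        }
    }

    @Override public void close() {
        for (CountingReporter r : reporters) r.close();
    }
}
```

A test in this style (construct with a failure injected, then assert on a close counter) is the approach the KAFKA-2121 review thread describes for verifying the fix.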
Re: Metrics package discussion
My main concern is that if we don't do the migration in 0.8.3, we will be left with some metrics in YM format and some others in KM format (as we start sharing client code on the broker). This is probably a worse situation to be in. +1. I am not sure how our Servo adaptor would work if there are two formats for metrics, unless there is an easy way to check the format (YM/KM). On Tue, Mar 31, 2015 at 9:40 AM, Jun Rao j...@confluent.io wrote: (2) The metrics are clearly part of the client API and we are not changing that (at least for the new client). Arguably, the metrics are also part of the broker side API. However, since they affect fewer parties (mostly just the Kafka admins), it may be easier to make those changes. My main concern is that if we don't do the migration in 0.8.3, we will be left with some metrics in YM format and some others in KM format (as we start sharing client code on the broker). This is probably a worse situation to be in. Thanks, Jun On Tue, Mar 31, 2015 at 9:26 AM, Gwen Shapira gshap...@cloudera.com wrote: (2) I believe we agreed that our metrics are a public API. I believe we also agree we don't break APIs in minor releases. So, it seems obvious to me that we can't make breaking changes to metrics in minor releases. I'm not convinced "we did it in the past" is a good reason to do it again. Is there a strong reason to do it in the 0.8.3 time-frame? On Tue, Mar 31, 2015 at 7:59 AM, Jun Rao j...@confluent.io wrote: (2) Not sure why we can't do this in 0.8.3. We changed the metrics names in 0.8.2 already. Given that we need to share code between the client and the core, and we need to keep the metrics consistent on the broker, it seems that we have no choice but to migrate to KM. If so, it seems that the sooner we do this, the better. It is important to give people an easy path for migration. However, it may not be easy to keep the mbean names exactly the same. For example, YM has hardcoded attributes (e.g.
1-min-rate, 5-min-rate, 15-min-rate, etc for rates) that are not available in KM. One benefit out of this migration is that one can get the metrics in the client and the broker in the same way. Thanks, Jun On Mon, Mar 30, 2015 at 9:26 PM, Gwen Shapira gshap...@cloudera.com wrote: (1) It will be interesting to see what others use for monitoring integration, to see what is already covered with existing JMX integrations and what needs special support. (2) I think the migration story is more important - this is a non-compatible change, right? So we can't do it in 0.8.3 timeframe, it has to be in 0.9? And we need to figure out how will users migrate - do we just tell everyone please reconfigure all your monitors from scratch - don't worry, it is worth it? I know you keep saying we did it before and our users are used to it, but I think there are a lot more users now, and some of them have different compatibility expectations. We probably need to find: * A least painful way to migrate - can we keep the names of at least most of the metrics intact? * Good explanation of what users gain from this painful migration (i.e. more accurate statistics due to gazillion histograms) On Mon, Mar 30, 2015 at 6:29 PM, Jun Rao j...@confluent.io wrote: If we are committed to migrating the broker side metrics to KM for the next release, we will need to (1) have a story on supporting common reporters (as listed in KAFKA-1930), and (2) see if the current histogram support is good enough for measuring things like request time. Thanks, Jun On Mon, Mar 30, 2015 at 3:03 PM, Aditya Auradkar aaurad...@linkedin.com.invalid wrote: If we do plan to use the network code in client, I think that is a good reason in favor of migration. It will be unnecessary to have metrics from multiple libraries coexist since our users will have to start monitoring these new metrics anyway. 
I also agree with Jay that in multi-tenant clusters people care about detailed statistics for their own application over global numbers. Based on the arguments so far, I'm +1 for migrating to KM. Thanks, Aditya From: Jun Rao [j...@confluent.io] Sent: Sunday, March 29, 2015 9:44 AM To: dev@kafka.apache.org Subject: Re: Metrics package discussion There is another thing to consider. We plan to reuse the client components on the server side over time. For example, as part of the security work, we are looking into replacing the server side network code with the client network code (KAFKA-1928). However, the client network already has metrics based on KM. Thanks, Jun On Sat, Mar 28, 2015 at 1:34 PM, Jay Kreps jay.kr...@gmail.com wrote: I
Re: [KIP-DISCUSSION] KIP-13 Quotas
separately. That will also contain a section on quotas. 3. Dynamic Configuration management - Being discussed in KIP-5. Basically we need something that will model default quotas and allow per-client overrides. Is there something else that I'm missing? Thanks, Aditya From: Jay Kreps [jay.kr...@gmail.com] Sent: Wednesday, March 18, 2015 2:10 PM To: dev@kafka.apache.org Subject: Re: [KIP-DISCUSSION] KIP-13 Quotas Hey Steven, The current proposal is actually to enforce quotas at the client/application level, NOT the topic level. So if you have a service with a few dozen instances the quota is against all of those instances added up across all their topics. So actually the effect would be the same either way but throttling gives the producer the choice of either blocking or dropping. -Jay On Tue, Mar 17, 2015 at 10:08 AM, Steven Wu stevenz...@gmail.com wrote: Jay, let's say an app produces to 10 different topics. one of the topic is sent from a library. due to whatever condition/bug, this lib starts to send messages over the quota. if we go with the delayed response approach, it will cause the whole shared RecordAccumulator buffer to be filled up. that will penalize other 9 topics who are within the quota. that is the unfairness point that Ewen and I were trying to make. if broker just drop the msg and return an error/status code indicates the drop and why. then producer can just move on and accept the drop. shared buffer won't be saturated and other 9 topics won't be penalized. Thanks, Steven On Tue, Mar 17, 2015 at 9:44 AM, Jay Kreps jay.kr...@gmail.com wrote: Hey Steven, It is true that hitting the quota will cause back-pressure on the producer. But the solution is simple, a producer that wants to avoid this should stay under its quota. In other words this is a contract between the cluster and the client, with each side having something to uphold. 
Quite possibly the same thing will happen in the absence of a quota, a client that produces an unexpected amount of load will hit the limits of the server and experience backpressure. Quotas just allow you to set that same limit at something lower than 100% of all resources on the server, which is useful for a shared cluster. -Jay On Mon, Mar 16, 2015 at 11:34 PM, Steven Wu stevenz...@gmail.com wrote: wait. we create one kafka producer for each cluster. each cluster can have many topics. if producer buffer got filled up due to delayed response for one throttled topic, won't that penalize other topics unfairly? it seems to me that broker should just return error without delay. sorry that I am chatting to myself :) On Mon, Mar 16, 2015 at 11:29 PM, Steven Wu stevenz...@gmail.com wrote: I think I can answer my own question. delayed response will cause the producer buffer to be full, which then result in either thread blocking or message drop. On Mon, Mar 16, 2015 at 11:24 PM, Steven Wu stevenz...@gmail.com wrote: please correct me if I am missing sth here. I am not understanding how would throttle work without cooperation/back-off from producer. new Java producer supports non-blocking API. why would delayed response be able to slow down producer? producer will continue to fire async sends. On Mon, Mar 16, 2015 at 10:58 PM, Guozhang Wang wangg...@gmail.com wrote: I think we are really discussing two separate issues here: 1. Whether we should a) append-then-block-then-returnOKButThrottled or b) block-then-returnFailDuetoThrottled for quota actions on produce requests. Both these approaches assume some kind of well-behaveness of the clients: option a) assumes the client sets an proper timeout value while
Re: [KIP-DISCUSSION] KIP-13 Quotas
wait. we create one kafka producer for each cluster. each cluster can have many topics. if the producer buffer got filled up due to delayed responses for one throttled topic, won't that penalize the other topics unfairly? it seems to me that the broker should just return an error without delay. sorry that I am chatting to myself :) On Mon, Mar 16, 2015 at 11:29 PM, Steven Wu stevenz...@gmail.com wrote: I think I can answer my own question. a delayed response will cause the producer buffer to be full, which then results in either thread blocking or message drop. On Mon, Mar 16, 2015 at 11:24 PM, Steven Wu stevenz...@gmail.com wrote: please correct me if I am missing sth here. I am not understanding how throttling would work without cooperation/back-off from the producer. the new Java producer supports a non-blocking API. why would a delayed response be able to slow down the producer? the producer will continue to fire async sends. On Mon, Mar 16, 2015 at 10:58 PM, Guozhang Wang wangg...@gmail.com wrote: I think we are really discussing two separate issues here: 1. Whether we should a) append-then-block-then-returnOKButThrottled or b) block-then-returnFailDuetoThrottled for quota actions on produce requests. Both these approaches assume some kind of well-behavedness of the clients: option a) assumes the client sets a proper timeout value and can just ignore the OKButThrottled response, while option b) assumes the client handles the FailDuetoThrottled appropriately. For any malicious clients that, for example, just keep retrying either intentionally or not, neither of these approaches is actually effective. 2. For OKButThrottled and FailDuetoThrottled responses, shall we encode them as error codes or augment the protocol to use a separate field indicating status codes. Today we have already incorporated some status codes as error codes in the responses, e.g.
ReplicaNotAvailable in MetadataResponse, the pros of this is of course using a single field for response status like the HTTP status codes, while the cons is that it requires clients to handle the error codes carefully. I think maybe we can actually extend the single-code approach to overcome its drawbacks, that is, wrap the error codes semantics to the users so that users do not need to handle the codes one-by-one. More concretely, following Jay's example the client could write sth. like this: - if(error.isOK()) // status code is good or the code can be simply ignored for this request type, process the request else if(error.needsRetry()) // throttled, transient error, etc: retry else if(error.isFatal()) // non-retriable errors, etc: notify / terminate / other handling - Only when the clients really want to handle, for example FailDuetoThrottled status code specifically, it needs to: if(error.isOK()) // status code is good or the code can be simply ignored for this request type, process the request else if(error == FailDuetoThrottled ) // throttled: log it else if(error.needsRetry()) // transient error, etc: retry else if(error.isFatal()) // non-retriable errors, etc: notify / terminate / other handling - And for implementation we can probably group the codes accordingly like HTTP status codes such that we can do: boolean Error.isOK() { return code < 300 && code >= 200; } Guozhang On Mon, Mar 16, 2015 at 10:24 PM, Ewen Cheslack-Postava e...@confluent.io wrote: Agreed that trying to shoehorn non-error codes into the error field is a bad idea. It makes it *way* too easy to write code that looks (and should be) correct but is actually incorrect. If necessary, I think it's much better to spend a couple of extra bytes to encode that information separately (a status or warning section of the response). An indication that throttling is occurring is something I'd expect to be indicated by a bit flag in the response rather than as an error code.
Gwen - I think an error code makes sense when the request actually failed. Option B, which Jun was advocating, would have appended the messages successfully. If the rate-limiting case you're talking about had successfully committed the messages, I would say that's also a bad use of error codes. On Mon, Mar 16, 2015 at 10:16 PM, Gwen Shapira gshap...@cloudera.com wrote: We discussed an error code for rate-limiting (which I think made sense), isn't it a similar case? On Mon, Mar 16, 2015 at 10:10 PM, Jay Kreps jay.kr...@gmail.com wrote: My concern is that as soon as you start encoding non-error response information into error codes the next question is what to do if two such codes apply (i.e. you have a replica down and the response is quota'd). I think I am trying to argue that error should mean why we failed your request, for which there will really only be one reason, and any other useful
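Guozhang's HTTP-style grouping of status codes into ranges, mentioned in this thread, can be sketched as follows. The class name and the concrete numeric values are invented for illustration; Kafka's actual protocol never adopted range-based status codes:

```java
// Hypothetical HTTP-style status ranges: 2xx = success (possibly with a
// warning such as "throttled"), 3xx = retriable, 4xx = fatal. Clients branch
// on the range instead of handling each code one-by-one.
final class StatusCode {
    static final int OK                = 200;
    static final int OK_BUT_THROTTLED  = 201;  // assumed code, not a real Kafka error
    static final int RETRIABLE_ERROR   = 300;
    static final int FATAL_ERROR       = 400;

    static boolean isOK(int code)       { return code >= 200 && code < 300; }
    static boolean needsRetry(int code) { return code >= 300 && code < 400; }
    static boolean isFatal(int code)    { return code >= 400; }
}
```

With this grouping, a client that does not care about throttling specifically still treats OK_BUT_THROTTLED as success, which is the "users do not need to handle the codes one-by-one" property Guozhang is after.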
Re: [KIP-DISCUSSION] KIP-13 Quotas
I think I can answer my own question. a delayed response will cause the producer buffer to be full, which then results in either thread blocking or message drop. On Mon, Mar 16, 2015 at 11:24 PM, Steven Wu stevenz...@gmail.com wrote: please correct me if I am missing sth here. I am not understanding how throttling would work without cooperation/back-off from the producer. the new Java producer supports a non-blocking API. why would a delayed response be able to slow down the producer? the producer will continue to fire async sends. On Mon, Mar 16, 2015 at 10:58 PM, Guozhang Wang wangg...@gmail.com wrote: I think we are really discussing two separate issues here: 1. Whether we should a) append-then-block-then-returnOKButThrottled or b) block-then-returnFailDuetoThrottled for quota actions on produce requests. Both these approaches assume some kind of well-behavedness of the clients: option a) assumes the client sets a proper timeout value and can just ignore the OKButThrottled response, while option b) assumes the client handles the FailDuetoThrottled appropriately. For any malicious clients that, for example, just keep retrying either intentionally or not, neither of these approaches is actually effective. 2. For OKButThrottled and FailDuetoThrottled responses, shall we encode them as error codes or augment the protocol to use a separate field indicating status codes. Today we have already incorporated some status codes as error codes in the responses, e.g. ReplicaNotAvailable in MetadataResponse, the pros of this is of course using a single field for response status like the HTTP status codes, while the cons is that it requires clients to handle the error codes carefully. I think maybe we can actually extend the single-code approach to overcome its drawbacks, that is, wrap the error codes semantics to the users so that users do not need to handle the codes one-by-one. More concretely, following Jay's example the client could write sth.
like this: - if(error.isOK()) // status code is good or the code can be simply ignored for this request type, process the request else if(error.needsRetry()) // throttled, transient error, etc: retry else if(error.isFatal()) // non-retriable errors, etc: notify / terminate / other handling - Only when the clients really want to handle, for example FailDuetoThrottled status code specifically, it needs to: if(error.isOK()) // status code is good or the code can be simply ignored for this request type, process the request else if(error == FailDuetoThrottled ) // throttled: log it else if(error.needsRetry()) // transient error, etc: retry else if(error.isFatal()) // non-retriable errors, etc: notify / terminate / other handling - And for implementation we can probably group the codes accordingly like HTTP status codes such that we can do: boolean Error.isOK() { return code < 300 && code >= 200; } Guozhang On Mon, Mar 16, 2015 at 10:24 PM, Ewen Cheslack-Postava e...@confluent.io wrote: Agreed that trying to shoehorn non-error codes into the error field is a bad idea. It makes it *way* too easy to write code that looks (and should be) correct but is actually incorrect. If necessary, I think it's much better to spend a couple of extra bytes to encode that information separately (a status or warning section of the response). An indication that throttling is occurring is something I'd expect to be indicated by a bit flag in the response rather than as an error code. Gwen - I think an error code makes sense when the request actually failed. Option B, which Jun was advocating, would have appended the messages successfully. If the rate-limiting case you're talking about had successfully committed the messages, I would say that's also a bad use of error codes. On Mon, Mar 16, 2015 at 10:16 PM, Gwen Shapira gshap...@cloudera.com wrote: We discussed an error code for rate-limiting (which I think made sense), isn't it a similar case?
On Mon, Mar 16, 2015 at 10:10 PM, Jay Kreps jay.kr...@gmail.com wrote: My concern is that as soon as you start encoding non-error response information into error codes the next question is what to do if two such codes apply (i.e. you have a replica down and the response is quota'd). I think I am trying to argue that error should mean why we failed your request, for which there will really only be one reason, and any other useful information we want to send back is just another field in the response. -Jay On Mon, Mar 16, 2015 at 9:51 PM, Gwen Shapira gshap...@cloudera.com wrote: I think it's not too late to reserve a set of error codes (200-299?) for non-error codes. It won't be backward compatible (i.e. clients that currently do else throw will throw on non-errors), but perhaps
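Ewen's and Jay's alternative in this thread, keeping the error code to mean only "why we failed your request" and carrying throttling as a separate field or bit flag, might look like this. The field layout and bit assignment are invented for illustration, not the real Kafka wire format:

```java
// Throttling carried as a bit flag in a separate "status" field of the
// response, leaving the error code untouched. Multiple non-error conditions
// can then coexist (e.g. throttled AND a replica warning) without the
// "which error code wins" problem Jay describes.
final class ResponseFlags {
    static final int THROTTLED = 0x1;  // hypothetical bit assignment

    static boolean isThrottled(int flags) { return (flags & THROTTLED) != 0; }

    static int withThrottled(int flags)   { return flags | THROTTLED; }
}
```

A client ignorant of quotas simply never inspects the flags field and keeps working, which is the backward-compatibility property the thread is debating.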
Re: [KIP-DISCUSSION] KIP-13 Quotas
please correct me if I am missing sth here. I am not understanding how throttling would work without cooperation/back-off from the producer. the new Java producer supports a non-blocking API. why would a delayed response be able to slow down the producer? the producer will continue to fire async sends. On Mon, Mar 16, 2015 at 10:58 PM, Guozhang Wang wangg...@gmail.com wrote: I think we are really discussing two separate issues here: 1. Whether we should a) append-then-block-then-returnOKButThrottled or b) block-then-returnFailDuetoThrottled for quota actions on produce requests. Both these approaches assume some kind of well-behavedness of the clients: option a) assumes the client sets a proper timeout value and can just ignore the OKButThrottled response, while option b) assumes the client handles the FailDuetoThrottled appropriately. For any malicious clients that, for example, just keep retrying either intentionally or not, neither of these approaches is actually effective. 2. For OKButThrottled and FailDuetoThrottled responses, shall we encode them as error codes or augment the protocol to use a separate field indicating status codes. Today we have already incorporated some status codes as error codes in the responses, e.g. ReplicaNotAvailable in MetadataResponse, the pros of this is of course using a single field for response status like the HTTP status codes, while the cons is that it requires clients to handle the error codes carefully. I think maybe we can actually extend the single-code approach to overcome its drawbacks, that is, wrap the error codes semantics to the users so that users do not need to handle the codes one-by-one. More concretely, following Jay's example the client could write sth.
like this: - if(error.isOK()) // status code is good or the code can be simply ignored for this request type, process the request else if(error.needsRetry()) // throttled, transient error, etc: retry else if(error.isFatal()) // non-retriable errors, etc: notify / terminate / other handling - Only when the clients really want to handle, for example FailDuetoThrottled status code specifically, it needs to: if(error.isOK()) // status code is good or the code can be simply ignored for this request type, process the request else if(error == FailDuetoThrottled ) // throttled: log it else if(error.needsRetry()) // transient error, etc: retry else if(error.isFatal()) // non-retriable errors, etc: notify / terminate / other handling - And for implementation we can probably group the codes accordingly like HTTP status codes such that we can do: boolean Error.isOK() { return code < 300 && code >= 200; } Guozhang On Mon, Mar 16, 2015 at 10:24 PM, Ewen Cheslack-Postava e...@confluent.io wrote: Agreed that trying to shoehorn non-error codes into the error field is a bad idea. It makes it *way* too easy to write code that looks (and should be) correct but is actually incorrect. If necessary, I think it's much better to spend a couple of extra bytes to encode that information separately (a status or warning section of the response). An indication that throttling is occurring is something I'd expect to be indicated by a bit flag in the response rather than as an error code. Gwen - I think an error code makes sense when the request actually failed. Option B, which Jun was advocating, would have appended the messages successfully. If the rate-limiting case you're talking about had successfully committed the messages, I would say that's also a bad use of error codes. On Mon, Mar 16, 2015 at 10:16 PM, Gwen Shapira gshap...@cloudera.com wrote: We discussed an error code for rate-limiting (which I think made sense), isn't it a similar case?
On Mon, Mar 16, 2015 at 10:10 PM, Jay Kreps jay.kr...@gmail.com wrote: My concern is that as soon as you start encoding non-error response information into error codes the next question is what to do if two such codes apply (i.e. you have a replica down and the response is quota'd). I think I am trying to argue that error should mean why we failed your request, for which there will really only be one reason, and any other useful information we want to send back is just another field in the response. -Jay On Mon, Mar 16, 2015 at 9:51 PM, Gwen Shapira gshap...@cloudera.com wrote: I think it's not too late to reserve a set of error codes (200-299?) for non-error codes. It won't be backward compatible (i.e. clients that currently do else throw will throw on non-errors), but perhaps it's worthwhile. On Mon, Mar 16, 2015 at 9:42 PM, Jay Kreps jay.kr...@gmail.com wrote: Hey Jun, I'd really really really like to avoid that. Having just spent a bunch of time on the clients, using the error codes to
Re: [KIP-DISCUSSION] KIP-13 Quotas
Ewen, I see your point regarding the shared buffer. yes, a bad/slow broker could potentially consume the entire buffer. On the other hand, I do like the batching behavior of the shared RecordAccumulator buffer. On Tue, Mar 17, 2015 at 8:25 AM, Guozhang Wang wangg...@gmail.com wrote: Ewen, 1. I think we are on the same page as per malicious clients, that they should not be the target of either approach. I was just trying to separate that from the "what if the user just keeps retrying" discussion, and maybe I was not clear. 2. I was not advocating option A on the wiki; in my previous email I actually assumed that option is already dropped and we are only considering option B (which is my option b) in the email) and C (option a) in my email), and I think with some proper wrapping of status codes (today we still call them error codes) option B in the wiki may not necessarily require people who implement clients to handle each status code one-by-one. Guozhang On Tue, Mar 17, 2015 at 12:22 AM, Ewen Cheslack-Postava e...@confluent.io wrote: Steven - that's a reasonable concern. I think I've mentioned the same sort of issue in the issues about the new producer's RecordAccumulator not timing out sends, e.g. in https://issues.apache.org/jira/browse/KAFKA-1788 . The shared buffer causes problems if one broker isn't available for a while since messages to that broker end up consuming the entire buffer. You can end up with a similar problem here due to the effective rate limiting caused by delaying responses. Guozhang - I think only option A from the KIP is actually an error.
If we want to look to HTTP for examples, there's an RFC that defines the Too Many Requests response to handle rate limiting: http://tools.ietf.org/html/rfc6585#page-3 In this case, it actually is an error, specifically a client error since it's in the 400 range. The implication from the status code (429), the name of the response, and the example given is that it is an error and no real data is returned, which would correspond to option A from the KIP. Note that the protocol provides a mechanism for giving extra (optional) information about when you should retry (via headers). I'd guess that even despite that, most systems that encounter a 429 use some ad hoc backoff mechanism because they only try to detect anything in the 400 range... One additional point -- I think malicious clients shouldn't be our target here; they can do a lot worse than what's been addressed in this thread. But I do agree that any proposal should have a clear explanation of how existing clients that are ignorant of quotas would behave (which is why options b and c make a lot of sense -- they rate limit without requiring an update to normally-behaving clients). On Mon, Mar 16, 2015 at 11:34 PM, Steven Wu stevenz...@gmail.com wrote: wait. we create one kafka producer for each cluster. each cluster can have many topics. if the producer buffer got filled up due to delayed responses for one throttled topic, won't that penalize other topics unfairly? it seems to me that the broker should just return the error without delay. sorry that I am chatting to myself :) On Mon, Mar 16, 2015 at 11:29 PM, Steven Wu stevenz...@gmail.com wrote: I think I can answer my own question. a delayed response will cause the producer buffer to be full, which then results in either thread blocking or message drops. On Mon, Mar 16, 2015 at 11:24 PM, Steven Wu stevenz...@gmail.com wrote: please correct me if I am missing something here. I am not understanding how throttling would work without cooperation/back-off from the producer.
the new Java producer supports a non-blocking API. why would a delayed response be able to slow down the producer? the producer will continue to fire async sends. On Mon, Mar 16, 2015 at 10:58 PM, Guozhang Wang wangg...@gmail.com wrote: I think we are really discussing two separate issues here: 1. Whether we should a) append-then-block-then-returnOKButThrottled or b) block-then-returnFailDuetoThrottled for quota actions on produce requests. Both these approaches assume some kind of well-behavedness of the clients: option a) assumes the client sets a proper timeout value and can simply ignore the OKButThrottled response, while option b) assumes the client handles FailDuetoThrottled appropriately. For any malicious clients that, for example, just keep retrying either intentionally or not, neither of these approaches is actually effective. 2. For OKButThrottled and FailDuetoThrottled responses, shall we encode them as error codes or augment the protocol to use a separate field indicating status codes? Today we have already incorporated some status codes as error codes in the responses, e.g. ReplicaNotAvailable in MetadataResponse, the pros
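Ewen's RFC 6585 reference implies the usual client-side pattern: on a 429, honor the optional Retry-After header when present, otherwise fall back to capped exponential backoff. A hedged sketch in plain Java — the method name, defaults, and 30s cap are illustrative assumptions, and no real HTTP client is involved:

```java
// Hedged sketch of handling HTTP 429 (RFC 6585) on the client side.
public class RetryAfter {
    static long backoffMs(int statusCode, String retryAfterHeader,
                          int attempt, long baseMs) {
        if (statusCode != 429) return 0; // not throttled: no wait
        if (retryAfterHeader != null) {
            try {
                return Long.parseLong(retryAfterHeader.trim()) * 1000L; // server's hint wins
            } catch (NumberFormatException ignored) {
                // malformed header: fall through to backoff
            }
        }
        return Math.min(baseMs << attempt, 30_000L); // exponential, capped at 30s
    }

    public static void main(String[] args) {
        System.out.println(backoffMs(429, "2", 0, 100));  // 2000
        System.out.println(backoffMs(429, null, 3, 100)); // 800
        System.out.println(backoffMs(200, null, 0, 100)); // 0
    }
}
```

This is exactly the "ad hoc backoff" Ewen predicts most clients would use when they don't bother parsing the optional header.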
Re: [KIP-DISCUSSION] KIP-13 Quotas
Jay, let's say an app produces to 10 different topics. one of the topics is written to by a library. due to some condition/bug, this lib starts to send messages over the quota. if we go with the delayed-response approach, it will cause the whole shared RecordAccumulator buffer to be filled up. that will penalize the other 9 topics that are within the quota. that is the unfairness point that Ewen and I were trying to make. if the broker just drops the msg and returns an error/status code indicating the drop and why, then the producer can just move on and accept the drop. the shared buffer won't be saturated and the other 9 topics won't be penalized. Thanks, Steven On Tue, Mar 17, 2015 at 9:44 AM, Jay Kreps jay.kr...@gmail.com wrote: Hey Steven, It is true that hitting the quota will cause back-pressure on the producer. But the solution is simple: a producer that wants to avoid this should stay under its quota. In other words this is a contract between the cluster and the client, with each side having something to uphold. Quite possibly the same thing will happen in the absence of a quota: a client that produces an unexpected amount of load will hit the limits of the server and experience backpressure. Quotas just allow you to set that same limit at something lower than 100% of all resources on the server, which is useful for a shared cluster. -Jay On Mon, Mar 16, 2015 at 11:34 PM, Steven Wu stevenz...@gmail.com wrote: wait. we create one kafka producer for each cluster. each cluster can have many topics. if the producer buffer got filled up due to delayed responses for one throttled topic, won't that penalize other topics unfairly? it seems to me that the broker should just return the error without delay. sorry that I am chatting to myself :) On Mon, Mar 16, 2015 at 11:29 PM, Steven Wu stevenz...@gmail.com wrote: I think I can answer my own question. a delayed response will cause the producer buffer to be full, which then results in either thread blocking or message drops.
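One practical mitigation for the shared-buffer stall described above is to bound the accumulator and how long a send may block on it. A hedged config sketch — property names follow the new Java producer, and max.block.ms in particular is an assumption about the reader's client version (older clients used block.on.buffer.full instead):

```java
// Hedged sketch: cap the shared RecordAccumulator and the time send()
// may block, so one throttled destination can't stall callers forever.
import java.util.Properties;

public class ProducerBufferConfig {
    static Properties build() {
        Properties props = new Properties();
        props.put("bootstrap.servers", "broker-a:9092,broker-b:9092"); // placeholder hosts
        props.put("buffer.memory", "33554432"); // 32 MB shared accumulator
        props.put("max.block.ms", "5000");      // fail send() after 5s rather than block forever
        return props;
    }

    public static void main(String[] args) {
        System.out.println(build());
    }
}
```

With a bounded block time, a saturated buffer surfaces as a send failure the application can handle, instead of silently back-pressuring the other 9 topics.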
On Mon, Mar 16, 2015 at 11:24 PM, Steven Wu stevenz...@gmail.com wrote: please correct me if I am missing something here. I am not understanding how throttling would work without cooperation/back-off from the producer. the new Java producer supports a non-blocking API. why would a delayed response be able to slow down the producer? the producer will continue to fire async sends. On Mon, Mar 16, 2015 at 10:58 PM, Guozhang Wang wangg...@gmail.com wrote: I think we are really discussing two separate issues here: 1. Whether we should a) append-then-block-then-returnOKButThrottled or b) block-then-returnFailDuetoThrottled for quota actions on produce requests. Both these approaches assume some kind of well-behavedness of the clients: option a) assumes the client sets a proper timeout value and can simply ignore the OKButThrottled response, while option b) assumes the client handles FailDuetoThrottled appropriately. For any malicious clients that, for example, just keep retrying either intentionally or not, neither of these approaches is actually effective. 2. For OKButThrottled and FailDuetoThrottled responses, shall we encode them as error codes or augment the protocol to use a separate field indicating status codes? Today we have already incorporated some status codes as error codes in the responses, e.g. ReplicaNotAvailable in MetadataResponse. the pro of this is of course using a single field for response status like the HTTP status codes, while the con is that it requires clients to handle the error codes carefully. I think maybe we can actually extend the single-code approach to overcome its drawbacks, that is, wrap the error-code semantics for the users so that users do not need to handle the codes one-by-one. More concretely, following Jay's example the client could write something
like this:

    if (error.isOK())            // status code is good, or the code can simply be ignored for this request type: process the request
    else if (error.needsRetry()) // throttled, transient error, etc.: retry
    else if (error.isFatal())    // non-retriable errors, etc.: notify / terminate / other handling

Only when the client really wants to handle, for example, the FailDuetoThrottled status code specifically does it need to:

    if (error.isOK())                     // status code is good, or the code can simply be ignored for this request type: process the request
    else if (error == FailDuetoThrottled) // throttled: log it
    else if (error.needsRetry())          // transient error, etc.: retry
    else if (error.isFatal())             // non-retriable errors, etc.: notify / terminate / other handling

And for the implementation we can probably group the codes like HTTP status codes, such that we can do
Re: New consumer client
Jay, we have observed CRC corruption too, occasionally. I reported it in a thread and asked how we should handle some error conditions from the old high-level consumer. On Mon, Feb 9, 2015 at 11:36 PM, Bhavesh Mistry mistry.p.bhav...@gmail.com wrote: Hi Jay, 1) Sorry to get back to you so late. It is a CRC check error on any consumer thread regardless of the server. What happens is I have to catch this exception and skip the message now. There is no option to re-fetch this message. Is there any way to add behavior in the Java consumer to re-fetch this CRC-failed offset? 2) Secondly, can you please add default behavior to auto-set 'fetch.message.max.bytes' = broker's message.max.bytes. This will ensure smooth configuration for both the simple and high-level consumer. This will take the burden away from the Kafka user to configure this property. We had a lag issue due to this misconfiguration and dropped messages on the Camus side (Camus has a different setting for the simple consumer). It would be great to auto-configure this if the user did not supply it in the configuration. Let me know if you agree with #2. Thanks, Bhavesh On Mon, Jan 12, 2015 at 9:25 AM, Jay Kreps jay.kr...@gmail.com wrote: Hey Bhavesh, This seems like a serious issue and not one anyone else has reported. I don't know what you mean by corrupt message, are you saying the CRC check fails? If so, that check is done both by the broker (prior to appending to the log) and the consumer, so that implies either a bug in the broker or else disk corruption on the server. I do have an option to disable the CRC check in the consumer, though depending on the nature of the corruption that can just lead to more serious errors (depending on what is corrupted). -jay On Sun, Jan 11, 2015 at 11:00 PM, Bhavesh Mistry mistry.p.bhav...@gmail.com wrote: Hi Jay, One of the pain points of the existing consumer code is the occasional CORRUPT_MESSAGE. Right now, it is hard to pin-point the cause of a CORRUPT_MESSAGE, especially when it happens on the Mirror Maker side.
Is there any proposal to auto-skip corrupted messages and have reporting visibility into CRC errors (metrics, etc., or traceability to find the corruption) per topic, etc.? I am not sure if this is the correct email thread to address this; if not, please let me know. Will provide feedback about the new consumer API and changes. Thanks, Bhavesh On Sun, Jan 11, 2015 at 7:57 PM, Jay Kreps j...@confluent.io wrote: I uploaded an updated version of the new consumer client ( https://issues.apache.org/jira/browse/KAFKA-1760). This is now almost feature complete, and has pretty reasonable testing and metrics. I think it is ready for review and could be checked in once 0.8.2 is out. For those who haven't been following, this is meant to be a new consumer client, like the new producer in 0.8.2, and intended to replace the existing high-level and simple scala consumers. This still needs the server-side implementation of the partition assignment and group management to be fully functional. I have just stubbed this out in the server to allow the implementation and testing of the client, but actual usage will require it. However the client that exists now is actually a fully functional replacement for the simple consumer that is vastly easier to use correctly as it internally does all the discovery and failover. It would be great if people could take a look at this code, and particularly at the public apis which have several small changes from the original proposal. Summary What's there: 1. Simple consumer functionality 2. Offset commit and fetch 3. Ability to change position with seek 4. Ability to commit all or just some offsets 5. Controller discovery, failure detection, heartbeat, and fail-over 6. Controller partition assignment 7. Logging 8. Metrics 9. Integration tests including tests that simulate random broker failures 10. Integration into the consumer performance test Limitations: 1.
There could be some lingering bugs in the group management support; it is hard to test fully with just the stub support on the server, so we'll need to get the server working to do better, I think. 2. I haven't implemented wild-card subscriptions yet. 3. No integration with console consumer yet. Performance I did some performance comparison with the old consumer over localhost on my laptop. Usually localhost isn't good for testing, but in this case it is good because it has near-infinite bandwidth, so it does a good job at catching inefficiencies that would be hidden with a slower network. These numbers probably aren't representative of what you would get over a real network, but help bring out the relative efficiencies. Here
Re: [VOTE] 0.8.2.0 Candidate 3
At Netflix, we have been using a route53 DNS name as bootstrap servers in the AWS env. Basically, when a kafka broker starts, we add it to the route53 DNS name for the cluster. this is like the VIP that Jay suggested. But we are also moving toward using the Eureka service registry for bootstrapping. We are worried that if the DNS name happens to resolve to a bad broker, it might impact the bootstrap process/resiliency. We want to get a list of brokers from Eureka to pass in as bootstrap.servers. On Sun, Feb 1, 2015 at 5:30 AM, Jay Kreps jay.kr...@gmail.com wrote: You may already know this, but the producer doesn't require a complete list of brokers in its config, it just requires the connection info for one active broker, which it uses to discover the rest of the brokers. We allow you to specify multiple urls here for failover in cases where you aren't using a vip. So if you can put three brokers into the VIP for metadata bootstrapping, you can still scale up and down the rest of the cluster. -Jay On Sun, Feb 1, 2015 at 12:17 AM, Alex The Rocker alex.m3...@gmail.com wrote: Jun: You raise a very good question: let me explain why we use Broker.getConnectionString(), so maybe we'll get a supported way to answer our need. We use Broker.getConnectionString() because we deploy Kafka services in Amazon EC2 with the following architecture: * Three VMs dedicated to Zookeeper processes * At least two VMs with Kafka brokers, but depending on load it can be scaled to more broker VMs. Brokers self-register their address in Zookeeper by serializing Broker objects into Zk. The VMs with Zookeeper have Elastic IPs = stable public IPs. These public IPs are fed to the various Application services which rely on Kafka to stream their logs monitoring data to our central Hadoop system.
Using zkclient and the above-mentioned public zookeeper IPs, we get the list of brokers registered to a given Kafka service: this is where we deserialize Broker objects and then use getConnectionString() to discover the brokers' addresses. Then, broker addresses are used to initialize the Kafka producer(s). The whole trick is that we cannot use Elastic IPs (= stable IPs) for Kafka VMs, because of their 'elastic' nature: we want to be able to scale up / down the number of VMs with Kafka brokers. Now, we understand that using non-public Kafka API is bad: we've been broken when moving to 0.8.1.1, then again when moving to 0.8.2.0... So it's time to raise the right question: what would be the supported way to configure our producers given our dynamic-IP-for-brokers context? Thanks, Alex. 2015-02-01 8:55 GMT+01:00 VERMEERBERGEN Alexandre alexandre.vermeerber...@3ds.com: -Original Message- From: Jun Rao [mailto:j...@confluent.io] Sent: Sunday, February 01, 2015 3:03 To: us...@kafka.apache.org; kafka-clie...@googlegroups.com Cc: dev@kafka.apache.org Subject: Re: [VOTE] 0.8.2.0 Candidate 3 Hi, Alex, Thanks for testing RC3. Broker.connectionString() is actually not part of the public api for the producer. Is there a particular reason that you need to use this api? Thanks, Jun On Sat, Jan 31, 2015 at 1:53 PM, Alex The Rocker alex.m3...@gmail.com wrote: Hello, I have read the Broker.scala source code, and I found the answer: - With Kafka 0.8.1.1 we used Broker.getConnectionString() in our Java code. - With Kafka 0.8.2.0, this method has been replaced by a 0-arity method without the get prefix, so we have to change our Java code to call Broker.connectionString(). So despite binary compatibility being broken, we have a by-pass. I hope this will help other people relying on this API... and I'm going to continue tests with 0.8.2 rc3.
Alex 2015-01-31 21:23 GMT+01:00 Alex The Rocker alex.m3...@gmail.com: Hello, I ran my own tests made with kafka_2.10-0.8.1.1.tgz binaries with our application:

1st test:
=========
replace all kafka .jar files in our application on the consuming side (without recompiling anything) => tests passed, OK

2nd test:
=========
replace all kafka .jar files in our application on the producing side (without recompiling anything) => KO, we get this error:

2015-01-31 20:54:00,094 [Timer-2] ERROR c.d.i.t.StdOutErrRedirect - Exception in thread Timer-2
2015-01-31 20:54:00,111 [Timer-2] ERROR c.d.i.t.StdOutErrRedirect - java.lang.NoSuchMethodError: kafka.cluster.Broker.getConnectionString()Ljava/lang/String;

Which means that binary compatibility with the 0.8.1.1 version has been broken. We use getConnectionString() to get the brokers' addresses, see this answer from Neha: http://mail-archives.apache.org/mod_mbox/kafka-users/201404.mbox/%3CCA
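The supported alternative the thread converges on is to feed whatever discovery source you have (Eureka, a route53 lookup, a VIP) into the client's public bootstrap.servers setting instead of deserializing internal Broker objects. A minimal sketch — the helper and host names are illustrative placeholders, not a real Eureka integration:

```java
// Hedged sketch: build producer config from a dynamically discovered
// broker list rather than the internal Broker.getConnectionString() API.
import java.util.Arrays;
import java.util.List;
import java.util.Properties;

public class Bootstrap {
    static Properties producerConfig(List<String> discoveredBrokers) {
        Properties props = new Properties();
        // any subset of live brokers works; the client discovers the rest
        props.put("bootstrap.servers", String.join(",", discoveredBrokers));
        return props;
    }

    public static void main(String[] args) {
        List<String> fromRegistry = Arrays.asList("broker-a:9092", "broker-b:9092");
        System.out.println(producerConfig(fromRegistry).getProperty("bootstrap.servers"));
    }
}
```

Because bootstrap.servers is only used for the initial metadata fetch, the list can be stale or partial as long as at least one entry is reachable, which fits the dynamic-IP-for-brokers context described above.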
[DISCUSSION] generate explicit error/failing metrics
To illustrate my point, I will use the allTopicsOwnedPartitionsCount gauge from ZookeeperConsumerConnector as an example. It captures the number of partitions of a topic that have been assigned an owner in the consumer group. let's say that I have a topic with 9 partitions. this metric should normally report the value 9. I can set up an alert if allTopicsOwnedPartitionsCount < 9. here are the drawbacks of this kind of metric. 1) if our metrics report/aggregation system has data loss that causes the value to be reported as zero, we can't really distinguish whether it's a real error or data loss. so we can get false positives/alarms from data loss 2) if we change the number of partitions (e.g. from 9 to 18), we need to remember to change the alert rule to allTopicsOwnedPartitionsCount < 18. this kind of coupling is a maintenance nightmare. A more explicit metric is NoOwnerPartitionsCount. it should be zero normally. if it is not zero, we should be alerted. this way, we won't get false alarms from data loss. We don't have to change/fix this particular example since a new consumer is being worked on. But in the new consumer please consider more explicit error signals. Thanks, Steven
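The "explicit error metric" idea above can be sketched as follows: a NoOwnerPartitionsCount-style value that is 0 when healthy, so any non-zero reading means real trouble and a lost/zero reading is not a false alarm. Plain-Java stand-in; a real implementation would register this as a metrics-library Gauge, and the names here are assumptions:

```java
// Hedged sketch of an explicit, zero-when-healthy error metric.
import java.util.HashMap;
import java.util.Map;

public class OwnershipMetrics {
    // count partitions with no registered owner; alert when > 0,
    // independent of how many partitions the topic has
    static int noOwnerPartitionsCount(int totalPartitions,
                                      Map<Integer, String> ownerByPartition) {
        int unowned = 0;
        for (int p = 0; p < totalPartitions; p++)
            if (!ownerByPartition.containsKey(p)) unowned++;
        return unowned;
    }

    // demo helper: partitions 0..owned-1 are owned by a made-up consumer id
    static Map<Integer, String> owners(int owned) {
        Map<Integer, String> m = new HashMap<>();
        for (int p = 0; p < owned; p++) m.put(p, "consumer-1");
        return m;
    }

    public static void main(String[] args) {
        System.out.println(noOwnerPartitionsCount(9, owners(9))); // 0: healthy
        System.out.println(noOwnerPartitionsCount(9, owners(6))); // 3: alert
    }
}
```

The alert rule ("> 0") never has to change when the partition count grows from 9 to 18, which is the decoupling the post argues for.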
Re: Review Request 30158: Patch for KAFKA-1835
--- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/30158/#review69515 --- clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java https://reviews.apache.org/r/30158/#comment114212 I would: - for-loop to call Metadata.add(topic); this way we add all topics to Metadata - call Metadata#requestUpdate() to trigger the Sender thread to request an update for all listed topics clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java https://reviews.apache.org/r/30158/#comment114215 I would not use the initialized flag, as long as we fix KafkaProducer#waitOnMetadata to allow the value 0 for non-blocking. - Steven Wu On Jan. 22, 2015, 7:04 a.m., Paul Pearcy wrote: --- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/30158/ --- (Updated Jan. 22, 2015, 7:04 a.m.) Review request for kafka. Bugs: KAFKA-1835 https://issues.apache.org/jira/browse/KAFKA-1835 Repository: kafka Description --- KAFKA-1835 - New producer updates to make blocking behavior explicit Diffs - clients/src/main/java/org/apache/kafka/clients/producer/KafkaProducer.java fc71710dd5997576d3841a1c3b0f7e19a8c9698e clients/src/main/java/org/apache/kafka/clients/producer/ProducerConfig.java 8b3e565edd1ae04d8d34bd9f1a41e9fa8c880a75 core/src/test/scala/integration/kafka/api/ProducerBlockingTest.scala PRE-CREATION core/src/test/scala/unit/kafka/utils/TestUtils.scala ac15d34425795d5be20c51b01fa1108bdcd66583 Diff: https://reviews.apache.org/r/30158/diff/ Testing --- Thanks, Paul Pearcy
two questions regarding 0.8.2 producer code
I am checking out the source code of the 0.8.2 producer. I have two questions and hope to get some clarifications. 1) The Sender thread/Runnable has this run loop. what if the in-memory queue is mostly empty (i.e. the producer has very few msgs to send out)? will this become a simple tight loop that just wastes cpu?

    // main loop, runs until close is called
    while (running) {
        try {
            run(time.milliseconds());
        } catch (Exception e) {
            log.error("Uncaught error in kafka producer I/O thread: ", e);
        }
    }

2) Selector#poll() [line #200]

    if (transmissions.hasSend())
        throw new IllegalStateException("Attempt to begin a send operation with prior send operation still in progress.");

2.1) since it's an async API, back-to-back sends to the same broker seem very possible to me. should we throw an exception here? 2.2) if this happens, it seems that we will silently drop the msgs without executing the callbacks? Thanks, Steven
Re: two questions regarding 0.8.2 producer code
Jay, thanks a lot for the quick response. For #2, I do see some isSendable() checks in Sender.java and NetworkClient.java that are eventually mapped to the InFlightRequests#canSendMore() check. it wasn't immediately clear to me how that translates to the transmissions.hasSend() check. anyway, I will dig a little more. On Mon, Nov 10, 2014 at 2:59 PM, Jay Kreps jay.kr...@gmail.com wrote: 1. No, the send does a poll/select on all the connections that will block for a specified time waiting for data to read or write on any connection. 2. The api of the selector only allows you to send a request to a ready connection. The definition of ready is that it doesn't have another request in the process of being written (it can have other requests outstanding that were previously written). So if you hit this case it is a programming error, and it should never happen in the producer. The write path for the data is: it is written to the internal queue/buffer, and the sender grabs data from that for ready connections and writes to them. This slightly complex ready/send api is required to allow back-pressure in the producer to work. -Jay On Mon, Nov 10, 2014 at 2:44 PM, Steven Wu stevenz...@gmail.com wrote: I am checking out the source code of the 0.8.2 producer. I have two questions and hope to get some clarifications. 1) The Sender thread/Runnable has this run loop. what if the in-memory queue is mostly empty (i.e. the producer has very few msgs to send out)? will this become a simple tight loop that just wastes cpu?

    // main loop, runs until close is called
    while (running) {
        try {
            run(time.milliseconds());
        } catch (Exception e) {
            log.error("Uncaught error in kafka producer I/O thread: ", e);
        }
    }

2) Selector#poll() [line #200]

    if (transmissions.hasSend())
        throw new IllegalStateException("Attempt to begin a send operation with prior send operation still in progress.");

2.1) since it's an async API, back-to-back sends to the same broker seem very possible to me. should we throw an exception here?
2.2) if this happens, it seems that we will silently drop the msgs without executing callbacks? Thanks, Steven
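Jay's answer to #1 — that the loop is not a busy-wait because each iteration blocks inside the selector until a connection is ready or a timeout elapses — can be demonstrated with a standalone NIO snippet (this is a demo of the select() behavior, not Kafka's actual Sender code):

```java
// Hedged sketch: select(timeout) sleeps when nothing is ready, so a loop
// around it only "spins" as fast as data arrives or timeouts expire.
import java.io.IOException;
import java.nio.channels.Selector;

public class SenderLoopSketch {
    // block up to timeoutMs in select() and report how long we actually waited
    static long waitOnce(long timeoutMs) {
        try (Selector selector = Selector.open()) {
            long start = System.nanoTime();
            selector.select(timeoutMs); // no channels registered: sleeps until timeout
            return (System.nanoTime() - start) / 1_000_000;
        } catch (IOException e) {
            return -1;
        }
    }

    public static void main(String[] args) {
        // the idle loop costs one ~100ms sleep per iteration, not CPU
        for (int i = 0; i < 3; i++)
            System.out.println("waited ~" + waitOnce(100) + " ms");
    }
}
```

An idle producer therefore spends its run loop parked in the kernel's select, which is why the tight-loop concern in the original question doesn't apply.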
Re: How producer gets the acknowledgement back
It may be too late to change the Producer API now. I always find ListenableFuture very nice/usable. It essentially adds a callback to Future. It's a lot easier to chain/combine ListenableFutures. http://docs.guava-libraries.googlecode.com/git/javadoc/com/google/common/util/concurrent/ListenableFuture.html On Wed, Sep 24, 2014 at 6:05 PM, Guozhang Wang wangg...@gmail.com wrote: Hi, In the new (Java) producer, you can pass in a callback function in the Future<RecordMetadata> send(ProducerRecord record, Callback callback) call, which will be triggered when the ack is received. Alternatively, you can also call Future.get() on the returned future metadata, which will block until the ack is received (i.e., synced sending). Guozhang On Wed, Sep 24, 2014 at 6:10 AM, Sreenivasulu Nallapati sreenu.nallap...@gmail.com wrote: Hello, Can you please help me to get the acknowledgement in the producer? After setting the property *request.required.acks to 1*, how does the producer get the acknowledgement back? I am trying to get the acknowledgement in the java producer. Thanks Sreeni -- -- Guozhang
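The Future-plus-Callback pattern Guozhang describes (and the chaining that ListenableFuture adds on top of a plain Future) can be illustrated with the JDK's CompletableFuture. Note sendAsync below is a stand-in for a real producer send — it is not Kafka's API — and offset 42 is made up for the demo:

```java
// Hedged sketch: chaining an ack callback onto an async send, the
// behavior ListenableFuture/CompletableFuture add over plain Future.
import java.util.concurrent.CompletableFuture;

public class AckCallback {
    static CompletableFuture<Long> sendAsync(String record) {
        // pretend the broker acks the record at a fixed offset
        return CompletableFuture.supplyAsync(() -> 42L);
    }

    public static void main(String[] args) {
        sendAsync("hello")
            .whenComplete((offset, err) -> {
                if (err != null) System.err.println("send failed: " + err);
                else System.out.println("acked at offset " + offset);
            })
            .join(); // block only so the demo prints before exiting
    }
}
```

With Kafka's actual API the same two options exist side by side: pass a Callback to send() for the async path, or call get() on the returned Future for the blocking path.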