[jira] [Resolved] (KAFKA-5640) Look into making acks=all the default setting

2017-08-25 Thread Apurva Mehta (JIRA)

 [ https://issues.apache.org/jira/browse/KAFKA-5640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Apurva Mehta resolved KAFKA-5640.
-
Resolution: Duplicate

This is a dup of https://issues.apache.org/jira/browse/KAFKA-5796

> Look into making acks=all the default setting
> -
>
> Key: KAFKA-5640
> URL: https://issues.apache.org/jira/browse/KAFKA-5640
> Project: Kafka
>  Issue Type: Sub-task
>Reporter: Apurva Mehta
>Assignee: Apurva Mehta
> Fix For: 1.0.0
>
>
> KAFKA-5494 proposed dropping the requirement for 
> {{max.in.flight.requests.per.connection=1}} for the idempotent producer. 
> That is a stepping stone to enabling the idempotent producer by default 
> without sacrificing performance.
> A further step would be making {{acks=all}} the default setting as well. 
> Then, with {{enable.idempotence=true}}, 
> {{max.in.flight.requests.per.connection=5}}, {{acks=all}} and 
> {{retries=MAX_INT}}, we would have exactly-once semantics with strong 
> durability guarantees (a configuration sketch follows below). 
> This particular ticket is about investigating the performance degradation 
> caused by {{acks=all}}. How much does throughput degrade? If it is 
> significant, is there low-hanging fruit in terms of code or config changes 
> that would allow us to bridge most of the gap?
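
For reference, a minimal sketch of a producer configured with the settings discussed in the ticket, assuming a client version where KAFKA-5494 has landed; the bootstrap address, topic and serializers below are placeholders, not part of the ticket:

import java.util.Properties;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerRecord;

public class ExactlyOnceProducerSketch {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092");   // placeholder
        props.put("key.serializer", "org.apache.kafka.common.serialization.StringSerializer");
        props.put("value.serializer", "org.apache.kafka.common.serialization.StringSerializer");
        // The combination under discussion in this ticket:
        props.put("enable.idempotence", "true");
        props.put("acks", "all");
        props.put("max.in.flight.requests.per.connection", "5");
        props.put("retries", Integer.toString(Integer.MAX_VALUE));
        try (KafkaProducer<String, String> producer = new KafkaProducer<>(props)) {
            producer.send(new ProducerRecord<>("my-topic", "key", "value"));
        }
    }
}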



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (KAFKA-5796) Understand performance implications of acks=all and potential ways to reduce it

2017-08-25 Thread Apurva Mehta (JIRA)
Apurva Mehta created KAFKA-5796:
---

 Summary: Understand performance implications of acks=all and 
potential ways to reduce it
 Key: KAFKA-5796
 URL: https://issues.apache.org/jira/browse/KAFKA-5796
 Project: Kafka
  Issue Type: Sub-task
 Environment: To get exactly once semantics, we need acks=all. However, 
we know that there is a latency and throughput impact with acks=all when 
compared with acks=1. 

The impact is quantified here:
https://cwiki.apache.org/confluence/display/KAFKA/An+analysis+of+the+impact+of+max.in.flight.requests.per.connection+and+acks+on+Producer+performance

However, we can't explain some of that data, nor do we know the causes of some 
of the degradation. At a minimum, we would like to understand the performance of 
acks=all before making it the default producer setting.

Reporter: Apurva Mehta
Assignee: Apurva Mehta
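
To quantify the degradation in a rough way, a harness along the following lines can be run once with acks=1 and once with acks=all. This is only an illustrative sketch, not the methodology behind the wiki page above; the broker address, topic and record count are placeholders:

import java.util.Properties;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerRecord;

public class AcksLatencyProbe {
    public static void main(String[] args) throws Exception {
        String acks = args.length > 0 ? args[0] : "all";   // run with "1" and "all" to compare
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092");   // placeholder
        props.put("key.serializer", "org.apache.kafka.common.serialization.ByteArraySerializer");
        props.put("value.serializer", "org.apache.kafka.common.serialization.ByteArraySerializer");
        props.put("acks", acks);
        int records = 10000;
        byte[] payload = new byte[100];
        try (KafkaProducer<byte[], byte[]> producer = new KafkaProducer<>(props)) {
            long start = System.nanoTime();
            for (int i = 0; i < records; i++) {
                // Blocking on each send isolates acknowledgement latency; batching effects are ignored.
                producer.send(new ProducerRecord<>("acks-probe", payload)).get();
            }
            double avgMs = (System.nanoTime() - start) / 1_000_000.0 / records;
            System.out.printf("acks=%s: %d records, avg %.3f ms/record%n", acks, records, avgMs);
        }
    }
}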






--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (KAFKA-5795) Make the idempotent producer the default producer setting

2017-08-25 Thread Apurva Mehta (JIRA)
Apurva Mehta created KAFKA-5795:
---

 Summary: Make the idempotent producer the default producer setting
 Key: KAFKA-5795
 URL: https://issues.apache.org/jira/browse/KAFKA-5795
 Project: Kafka
  Issue Type: Improvement
Reporter: Apurva Mehta
Assignee: Apurva Mehta
 Fix For: 1.0.0


We would like to turn on idempotence by default. The KIP is here: 
https://cwiki.apache.org/confluence/display/KAFKA/KIP-185%3A+Make+exactly+once+in+order+delivery+per+partition+the+default+producer+setting



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (KAFKA-5794) Introduce new idempotence mode to gracefully deal with topics on the older message format

2017-08-25 Thread Apurva Mehta (JIRA)
Apurva Mehta created KAFKA-5794:
---

 Summary: Introduce new idempotence mode to gracefully deal with 
topics on the older message format
 Key: KAFKA-5794
 URL: https://issues.apache.org/jira/browse/KAFKA-5794
 Project: Kafka
  Issue Type: Bug
Affects Versions: 0.11.0.0
Reporter: Apurva Mehta
Assignee: Apurva Mehta
 Fix For: 1.0.0


In the discussion of KIP-185 (Make exactly once in order delivery per partition 
the default producer setting), it was realized that we don't have graceful 
handling when an idempotence-enabled producer is writing to a broker with a 
message format older than v2 (i.e. the 0.11.0 message format). 

In particular, if we enable idempotence, any produce requests to topics with an 
older message format will fail with an UnsupportedVersionException. Thus, if the 
idempotent producer were made the default, the out-of-the-box producer would 
fail to produce when used with clusters which haven't yet upgraded the message 
format.

This is particularly problematic since the recommended upgrade path is to 
upgrade broker code while keeping the message format at the older version, then 
upgrade all clients, and only finally upgrade the message format on the server. 
With the current behavior, the middle step is actually untenable if we enable 
idempotence as the default.

More details available at: 
https://cwiki.apache.org/confluence/display/KAFKA/Kafka+Exactly+Once+-+Dealing+with+older+message+formats+when+idempotence+is+enabled
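
To make the failure mode concrete, the sketch below shows roughly how an application would observe the error today when an idempotent producer writes to a topic that is still on a pre-v2 message format. The topic name and broker address are placeholders, and the exact point at which the exception surfaces to the callback is an assumption:

import java.util.Properties;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerRecord;
import org.apache.kafka.common.errors.UnsupportedVersionException;

public class OldFormatTopicSketch {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put("bootstrap.servers", "localhost:9092");   // placeholder
        props.put("key.serializer", "org.apache.kafka.common.serialization.StringSerializer");
        props.put("value.serializer", "org.apache.kafka.common.serialization.StringSerializer");
        props.put("enable.idempotence", "true");
        try (KafkaProducer<String, String> producer = new KafkaProducer<>(props)) {
            producer.send(new ProducerRecord<>("old-format-topic", "value"), (metadata, exception) -> {
                if (exception instanceof UnsupportedVersionException) {
                    // The topic is still on a pre-0.11 message format, so the idempotent produce
                    // request is rejected; the application has to decide how to degrade, which is
                    // exactly the gap this ticket wants to close with a more graceful mode.
                    System.err.println("Idempotent produce rejected: " + exception.getMessage());
                }
            });
            producer.flush();
        }
    }
}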



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (KAFKA-5793) Tighten up situations where OutOfOrderSequence may be returned

2017-08-25 Thread Apurva Mehta (JIRA)
Apurva Mehta created KAFKA-5793:
---

 Summary: Tighten up situations where OutOfOrderSequence may be 
returned
 Key: KAFKA-5793
 URL: https://issues.apache.org/jira/browse/KAFKA-5793
 Project: Kafka
  Issue Type: Bug
Affects Versions: 0.11.0.0
Reporter: Apurva Mehta
Assignee: Apurva Mehta
 Fix For: 1.0.0


Details of the problem are provided here: 
https://cwiki.apache.org/confluence/display/KAFKA/Kafka+Exactly+Once+-+Solving+the+problem+of+spurious+OutOfOrderSequence+errors

A quick summary follows:

In the discussion of KIP-185 (Make exactly once in order delivery per partition 
the default producer setting), the following point regarding the 
OutOfOrderSequenceException was raised:

1. The OutOfOrderSequenceException indicates that there has been data loss on 
the broker, i.e. a previously acknowledged message no longer exists. For the 
most part, this should only occur in rare situations (simultaneous power 
outages, multiple disk losses, software bugs resulting in data corruption, 
etc.).
2. However, there is another perfectly normal scenario where data is removed: 
in particular, data could be deleted because it is old and crosses the 
retention threshold. Hence, if a producer remains inactive for longer than a 
topic's retention period, we could get an OutOfOrderSequenceException which is 
a false positive: the data was removed through valid processes, and this isn't 
an error.
3. We would like to eliminate the possibility of getting spurious 
OutOfOrderSequenceExceptions; when you get one, it should always mean data loss 
and should be taken very seriously (see the sketch below for where such an 
error surfaces in an application). 
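
As an assumed illustration of where such an error surfaces in an application (this is not code from the ticket), a callback for an idempotent producer might look like:

import org.apache.kafka.clients.producer.Callback;
import org.apache.kafka.clients.producer.RecordMetadata;
import org.apache.kafka.common.errors.OutOfOrderSequenceException;

// Passed to producer.send(record, callback) on an idempotent producer.
public class FatalSequenceErrorCallback implements Callback {
    @Override
    public void onCompletion(RecordMetadata metadata, Exception exception) {
        if (exception instanceof OutOfOrderSequenceException) {
            // Today this can be a false positive (e.g. the producer was idle longer than the
            // topic's retention period); once the spurious cases described above are eliminated,
            // it should always be treated as loss of previously acknowledged data and escalated.
            throw new IllegalStateException("Previously acknowledged data may have been lost", exception);
        }
    }
}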




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


Re: [DISCUSS] KIP-91 Provide Intuitive User Timeouts in The Producer

2017-08-25 Thread Becket Qin
Hi Jason,

I see what you mean. That makes sense. So in the above case, after the
producer resets the PID, when it retries batch_0_tp1 the batch will still have
the old PID even if the producer has already got a new PID.

@Jun, do you mean max(remaining delivery.timeout.ms, request.timeout.ms)
instead of min(remaining delivery.timeout.ms, request.timeout.ms)?

Thanks,

Jiangjie (Becket) Qin
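
For readers following along, the bound under discussion can be written as a small helper; the names here are illustrative, not KIP-91 code:

// Jun's proposal: expire an in-flight request after
//   min(remaining delivery.timeout.ms, request.timeout.ms)
// Becket's question above is whether max(...) was intended instead.
long inFlightExpiryMs(long deliveryTimeoutMs, long batchCreateTimeMs, long nowMs, long requestTimeoutMs) {
    long remainingDeliveryMs = Math.max(0, deliveryTimeoutMs - (nowMs - batchCreateTimeMs));
    return Math.min(remainingDeliveryMs, requestTimeoutMs);   // or Math.max(...), per the open question
}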

On Fri, Aug 25, 2017 at 9:34 AM, Jun Rao  wrote:

> Hi, Becket,
>
> Good point on expiring inflight requests. Perhaps we can expire an inflight
> request after min(remaining delivery.timeout.ms, request.timeout.ms). This
> way, if a user sets a high delivery.timeout.ms, we can still recover from
> broker power outage sooner.
>
> Thanks,
>
> Jun
>
> On Thu, Aug 24, 2017 at 12:52 PM, Becket Qin  wrote:
>
> > Hi Jason,
> >
> > delivery.timeout.ms sounds good to me.
> >
> > I was referring to the case where we are resetting the PID/sequence after
> > expiring a batch. This is more about sending the batches after the
> > expired batch.
> >
> > The scenario being discussed is expiring one of the batches in an
> > in-flight request and retrying the other batches in that in-flight request. So
> > consider the following case:
> > 1. The producer sends request_0 with two batches (batch_0_tp0 and
> > batch_0_tp1).
> > 2. The broker receives the request and appends both batches to the log.
> > 3. Before the producer receives the response from the broker, batch_0_tp0
> > expires. The producer will expire batch_0_tp0 immediately, reset the PID,
> > then resend batch_0_tp1, and maybe send batch_1_tp0 (i.e. the next batch
> > after the expired batch) as well.
> >
> > For batch_0_tp1, it is OK to reuse the PID and sequence number. The
> > problem is for batch_1_tp0: if we reuse the same PID and the broker has
> > already appended batch_0_tp0, the broker will think batch_1_tp0 is a
> > duplicate with the same sequence number. As a result, the broker will drop
> > batch_1_tp0. That is why we have to either bump up the sequence number or
> > reset the PID. To avoid this complexity, I was suggesting not expiring the
> > in-flight batch immediately, but waiting for the produce response. If the
> > batch has been successfully appended, we do not expire it. Otherwise, we
> > expire it.
> >
> > Thanks,
> >
> > Jiangjie (Becket) Qin
> >
> >
> >
> > On Thu, Aug 24, 2017 at 11:26 AM, Jason Gustafson 
> > wrote:
> >
> > > @Becket
> > >
> > > Good point about unnecessarily resetting the PID in cases where we know
> > > the request has failed. Might be worth opening a JIRA to try and improve
> > > this.
> > >
> > > > So if we expire the batch prematurely and resend all
> > > > the other batches in the same request, chances are there will be
> > > > duplicates. If we wait for the response instead, it is less likely to
> > > > introduce duplicates, and we may not need to reset the PID.
> > >
> > >
> > > Not sure I follow this. Are you assuming that we change the batch
> > > PID/sequence of the retried batches after resetting the PID? I think we
> > > probably need to ensure that when we retry a batch, we always use the
> > > same PID/sequence.
> > >
> > > By the way, as far as naming, `max.message.delivery.wait.ms` is quite
> > > a mouthful. Could we shorten it? Perhaps `delivery.timeout.ms`?
> > >
> > > -Jason
> > >
> > > On Wed, Aug 23, 2017 at 8:51 PM, Becket Qin 
> > wrote:
> > >
> > > > Hi Jun,
> > > >
> > > > If TCP timeout is longer than request.timeout.ms, the producer will
> > > > always hit request.timeout.ms before hitting TCP timeout, right? That
> > > > is why we added request.timeout.ms in the first place.
> > > >
> > > > You are right. Currently we reset the PID and resend the batches to
> > > > avoid an OutOfOrderSequenceException when the expired batches are in
> > > > retry.
> > > >
> > > > This does not distinguish the reasons that caused the retry. There are
> > > > two cases:
> > > > 1. If the batch was in retry because it received an error response (e.g.
> > > > NotLeaderForPartition), we actually don't need to reset the PID in this
> > > > case because we know that the broker did not accept it.
> > > > 2. If the batch was in retry because it hit a timeout earlier, then we
> > > > should reset the PID (or optimistically send and only reset the PID when
> > > > we receive an OutOfOrderSequenceException?).
> > > > Case 1 is probably the most common case, so it looks like we are
> > > > resetting the PID more often than necessary. But because in case 1 the
> > > > broker does not have the batch, there isn't much impact from resetting
> > > > the PID and resending, other than the additional round trip.
> > > >
> > > > Now we are introducing another case:
> > > > 3. A batch is in retry because we expired an in-flight request before
> > > > it hits request.timeout.ms.
> > > >
> > > > The difference between 2 and 3 is that in case 3 likely the broker has
> > 

Re: [DISCUSS] KIP-189: Improve principal builder interface and add support for SASL

2017-08-25 Thread Don Bosco Durai
Jason, thanks for the clarification.

Bosco


On 8/25/17, 4:59 PM, "Jason Gustafson"  wrote:

Hey Don,

That is not actually part of the KIP. It was a (somewhat pedantic) example
used to illustrate how the kafka principal semantics could be applied to
authorizers which understood group-level ACLs. The key point is this:
although a principal is identified only by its type and name, the
KafkaPrincipal can be used to represent relations to other principals. In
this case, we have a user principal which is related to a group principal
through the UserPrincipalAndGroup object. A GroupAuthorizer could then
leverage this relation. As you suggest, a true implementation would allow
multiple groups.

I will add a note to the KIP to emphasize that this is just an example.

Thanks,
Jason

On Fri, Aug 25, 2017 at 4:37 PM, Don Bosco Durai  wrote:

> Jason, thanks for confirming that. Since there are existing custom
> plugins, we might have to give enough time for them to start using the
> newer interface.
>
> I quickly glanced over the KIP, it looks good. Here is one comment:
>
> ---
> In the future, we may add support for groups to Kafka. This was brought up
> in the KIP-111 discussion. To support this, we can provide a groupId()
> method in KafkaPrincipal which defaults to a null value or an empty 
string.
> Extensions can override this just as before. Also note that it is still
> possible for the Authorizer implementation to derive its own group
> information for enforcement.
> class UserPrincipalAndGroup extends KafkaPrincipal {
>   private final String userId;
>   private final String groupId;
> ---
>
> We should assume that users might belong to multiple groups.
>
> Also, not sure what the below method is really doing?
> ---
> class UserPrincipalAndGroup extends KafkaPrincipal {
>
> public KafkaPrincipal group() {
> return new KafkaPrincipal(KafkaPrincipal.GROUP_TYPE, groupId);
>   }
> ---
> Thanks
>
> Bosco
>
>
>
> On 8/25/17, 4:11 PM, "Jason Gustafson"  wrote:
>
> Hi Don,
>
> I don't think so. We are not making any changes to the Authorizer
> interface
> itself. The KafkaPrincipal object does not change either, though we 
now
> explicitly allow it to be extended. That means you have to exercise a
> little caution when combining a custom PrincipalBuilder with a custom
> Authorizer. For the default principal builder shipped with Kafka, it
> will
> work the same as it currently does. Old implementations of
> PrincipalBuilder
> will also continue to work exactly as they do now, but please note
> that I
> am proposing to deprecate this interface. It will still be supported 
in
> 1.0.0, but we may remove it in a future major release.
>
> -Jason
>
> On Fri, Aug 25, 2017 at 3:51 PM, Don Bosco Durai 
> wrote:
>
> > Jason
> >
> > Do you anticipate any backward compatibility issues with existing
> custom
> > implementation of the authorization interface/plugins?
> >
> > Thanks
> >
> > Bosco
> >
> >
> > On 8/25/17, 3:22 PM, "Jason Gustafson"  wrote:
> >
> > No problem. I'll add a note to the KIP to emphasize that we will
> use
> > the
> > same object built by the KafkaPrincipalBuilder in the Session
> object
> > passed
> > to the authorizer.
> >
> > -Jason
> >
> > On Fri, Aug 25, 2017 at 3:17 PM, Mayuresh Gharat <
> > gharatmayures...@gmail.com
> > > wrote:
> >
> > > Perfect.
> > > As long as there is a way we can access the originally created
> > Principal in
> > > the Authorizer, it would solve the KIP-111 issue.
> > >
> > > This is really helpful, thanks again.
> > >
> > > Thanks,
> > >
> > > Mayuresh
> > >
> > > On Fri, Aug 25, 2017 at 3:13 PM, Jason Gustafson <
> ja...@confluent.io
> > >
> > > wrote:
> > >
> > > > Hi Mayuresh,
> > > >
> > > > To clarify, the intention is to use the KafkaPrincipal 
object
> > built by
> > > the
> > > > KafkaPrincipalBuilder inside the Session. So we would remove
> the
> > logic to
> > > > construct a new KafkaPrincipal using only the name from the
> > Principal.
> > > Then
> > > > it should be possible to pass the 

Re: [DISCUSS] KIP-189: Improve principal builder interface and add support for SASL

2017-08-25 Thread Jason Gustafson
Hey Don,

That is not actually part of the KIP. It was a (somewhat pedantic) example
used to illustrate how the kafka principal semantics could be applied to
authorizers which understood group-level ACLs. The key point is this:
although a principal is identified only by its type and name, the
KafkaPrincipal can be used to represent relations to other principals. In
this case, we have a user principal which is related to a group principal
through the UserPrincipalAndGroup object. A GroupAuthorizer could then
leverage this relation. As you suggest, a true implementation would allow
multiple groups.
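
For illustration only, a multi-group variant of that example might look like the sketch below. This is not part of the KIP; the class name, the "Group" principal type string and the groups() accessor are made up here:

import java.util.Collections;
import java.util.HashSet;
import java.util.Set;
import org.apache.kafka.common.security.auth.KafkaPrincipal;

class UserPrincipalAndGroups extends KafkaPrincipal {
    private final Set<KafkaPrincipal> groups;

    UserPrincipalAndGroups(String userName, Set<String> groupNames) {
        super(KafkaPrincipal.USER_TYPE, userName);
        Set<KafkaPrincipal> groupPrincipals = new HashSet<>();
        for (String group : groupNames)
            groupPrincipals.add(new KafkaPrincipal("Group", group));
        this.groups = Collections.unmodifiableSet(groupPrincipals);
    }

    // A group-aware Authorizer could walk this relation when checking ACLs.
    public Set<KafkaPrincipal> groups() {
        return groups;
    }
}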

I will add a note to the KIP to emphasize that this is just an example.

Thanks,
Jason

On Fri, Aug 25, 2017 at 4:37 PM, Don Bosco Durai  wrote:

> Jason, thanks for confirming that. Since there are existing custom
> plugins, we might have to give enough time for them to start using the
> newer interface.
>
> I quickly glanced over the KIP, it looks good. Here is one comment:
>
> ---
> In the future, we may add support for groups to Kafka. This was brought up
> in the KIP-111 discussion. To support this, we can provide a groupId()
> method in KafkaPrincipal which defaults to a null value or an empty string.
> Extensions can override this just as before. Also note that it is still
> possible for the Authorizer implementation to derive its own group
> information for enforcement.
> class UserPrincipalAndGroup extends KafkaPrincipal {
>   private final String userId;
>   private final String groupId;
> ---
>
> We should assume that users might belong to multiple groups.
>
> Also, not sure what the below method is really doing?
> ---
> class UserPrincipalAndGroup extends KafkaPrincipal {
>
> public KafkaPrincipal group() {
> return new KafkaPrincipal(KafkaPrincipal.GROUP_TYPE, groupId);
>   }
> ---
> Thanks
>
> Bosco
>
>
>
> On 8/25/17, 4:11 PM, "Jason Gustafson"  wrote:
>
> Hi Don,
>
> I don't think so. We are not making any changes to the Authorizer
> interface
> itself. The KafkaPrincipal object does not change either, though we now
> explicitly allow it to be extended. That means you have to exercise a
> little caution when combining a custom PrincipalBuilder with a custom
> Authorizer. For the default principal builder shipped with Kafka, it
> will
> work the same as it currently does. Old implementations of
> PrincipalBuilder
> will also continue to work exactly as they do now, but please note
> that I
> am proposing to deprecate this interface. It will still be supported in
> 1.0.0, but we may remove it in a future major release.
>
> -Jason
>
> On Fri, Aug 25, 2017 at 3:51 PM, Don Bosco Durai 
> wrote:
>
> > Jason
> >
> > Do you anticipate any backward compatibility issues with existing
> custom
> > implementation of the authorization interface/plugins?
> >
> > Thanks
> >
> > Bosco
> >
> >
> > On 8/25/17, 3:22 PM, "Jason Gustafson"  wrote:
> >
> > No problem. I'll add a note to the KIP to emphasize that we will
> use
> > the
> > same object built by the KafkaPrincipalBuilder in the Session
> object
> > passed
> > to the authorizer.
> >
> > -Jason
> >
> > On Fri, Aug 25, 2017 at 3:17 PM, Mayuresh Gharat <
> > gharatmayures...@gmail.com
> > > wrote:
> >
> > > Perfect.
> > > As long as there is a way we can access the originally created
> > Principal in
> > > the Authorizer, it would solve the KIP-111 issue.
> > >
> > > This is really helpful, thanks again.
> > >
> > > Thanks,
> > >
> > > Mayuresh
> > >
> > > On Fri, Aug 25, 2017 at 3:13 PM, Jason Gustafson <
> ja...@confluent.io
> > >
> > > wrote:
> > >
> > > > Hi Mayuresh,
> > > >
> > > > To clarify, the intention is to use the KafkaPrincipal object
> > built by
> > > the
> > > > KafkaPrincipalBuilder inside the Session. So we would remove
> the
> > logic to
> > > > construct a new KafkaPrincipal using only the name from the
> > Principal.
> > > Then
> > > > it should be possible to pass the `AuthzPrincipal` to the
> > underlying
> > > > library through the `Extended_Plugged_In_Class` as you've
> suggested
> > > above.
> > > > Is that reasonable for this use case?
> > > >
> > > > Thanks,
> > > > Jason
> > > >
> > > >
> > > > On Fri, Aug 25, 2017 at 2:44 PM, Mayuresh Gharat <
> > > > gharatmayures...@gmail.com
> > > > > wrote:
> > > >
> > > > > Hi Jason,
> > > > >
> > > > > Thanks for the replies.
> > > > >
> > > > > I think it would be better to discuss with an example that
> we
> > 

Re: [DISCUSS] KIP-189: Improve principal builder interface and add support for SASL

2017-08-25 Thread Don Bosco Durai
Jason, thanks for confirming that. Since there are existing custom plugins, we 
might have to give enough time for them to start using the newer interface.

I quickly glanced over the KIP, it looks good. Here is one comment:

---
In the future, we may add support for groups to Kafka. This was brought up in 
the KIP-111 discussion. To support this, we can provide a groupId() method in 
KafkaPrincipal which defaults to a null value or an empty string. Extensions 
can override this just as before. Also note that it is still possible for the 
Authorizer implementation to derive its own group information for enforcement.
class UserPrincipalAndGroup extends KafkaPrincipal {
  private final String userId;
  private final String groupId;
---

We should assume that users might belong to multiple groups.

Also, not sure what the below method is really doing?
---
class UserPrincipalAndGroup extends KafkaPrincipal {

    public KafkaPrincipal group() {
        return new KafkaPrincipal(KafkaPrincipal.GROUP_TYPE, groupId);
    }
---
Thanks

Bosco



On 8/25/17, 4:11 PM, "Jason Gustafson"  wrote:

Hi Don,

I don't think so. We are not making any changes to the Authorizer interface
itself. The KafkaPrincipal object does not change either, though we now
explicitly allow it to be extended. That means you have to exercise a
little caution when combining a custom PrincipalBuilder with a custom
Authorizer. For the default principal builder shipped with Kafka, it will
work the same as it currently does. Old implementations of PrincipalBuilder
will also continue to work exactly as they do now, but please note that I
am proposing to deprecate this interface. It will still be supported in
1.0.0, but we may remove it in a future major release.

-Jason

On Fri, Aug 25, 2017 at 3:51 PM, Don Bosco Durai  wrote:

> Jason
>
> Do you anticipate any backward compatibility issues with existing custom
> implementation of the authorization interface/plugins?
>
> Thanks
>
> Bosco
>
>
> On 8/25/17, 3:22 PM, "Jason Gustafson"  wrote:
>
> No problem. I'll add a note to the KIP to emphasize that we will use
> the
> same object built by the KafkaPrincipalBuilder in the Session object
> passed
> to the authorizer.
>
> -Jason
>
> On Fri, Aug 25, 2017 at 3:17 PM, Mayuresh Gharat <
> gharatmayures...@gmail.com
> > wrote:
>
> > Perfect.
> > As long as there is a way we can access the originally created
> Principal in
> > the Authorizer, it would solve the KIP-111 issue.
> >
> > This is really helpful, thanks again.
> >
> > Thanks,
> >
> > Mayuresh
> >
> > On Fri, Aug 25, 2017 at 3:13 PM, Jason Gustafson  >
> > wrote:
> >
> > > Hi Mayuresh,
> > >
> > > To clarify, the intention is to use the KafkaPrincipal object
> built by
> > the
> > > KafkaPrincipalBuilder inside the Session. So we would remove the
> logic to
> > > construct a new KafkaPrincipal using only the name from the
> Principal.
> > Then
> > > it should be possible to pass the `AuthzPrincipal` to the
> underlying
> > > library through the `Extended_Plugged_In_Class` as you've 
suggested
> > above.
> > > Is that reasonable for this use case?
> > >
> > > Thanks,
> > > Jason
> > >
> > >
> > > On Fri, Aug 25, 2017 at 2:44 PM, Mayuresh Gharat <
> > > gharatmayures...@gmail.com
> > > > wrote:
> > >
> > > > Hi Jason,
> > > >
> > > > Thanks for the replies.
> > > >
> > > > I think it would be better to discuss with an example that we
> were
> > trying
> > > > to address with KIP-111 and see if the current mentioned
> solution would
> > > > address it.
> > > >
> > > > Let's consider a third party library called authz_lib that is
> provided
> > by
> > > > some Security team at  some company.
> > > >
> > > >- When we call authz_lib.createPrincipal(X509_cert), it would
> > return
> > > an
> > > >AuthzPrincipal that implements Java.Security.Principal.
> > > >
> > > >
> > > >- The authz_lib also provides an checkAccess() call that
> takes
> > in
> > > 3
> > > >parameters :
> > > >   - authz_principal
> > > >   - operation type ("Read", "Write"...)
> > > >   - resource (for simplicity lets consider it as a 
TopicName)
> > > >
> > > >
> > > >- The AuthzPrincipal looks like this :
> 

Re: [DISCUSS] KIP-189: Improve principal builder interface and add support for SASL

2017-08-25 Thread Don Bosco Durai
Jason

Do you anticipate any backward compatibility issues with existing custom 
implementation of the authorization interface/plugins?

Thanks

Bosco


On 8/25/17, 3:22 PM, "Jason Gustafson"  wrote:

No problem. I'll add a note to the KIP to emphasize that we will use the
same object built by the KafkaPrincipalBuilder in the Session object passed
to the authorizer.

-Jason

On Fri, Aug 25, 2017 at 3:17 PM, Mayuresh Gharat  wrote:

> Perfect.
> As long as there is a way we can access the originally created Principal 
in
> the Authorizer, it would solve the KIP-111 issue.
>
> This is really helpful, thanks again.
>
> Thanks,
>
> Mayuresh
>
> On Fri, Aug 25, 2017 at 3:13 PM, Jason Gustafson 
> wrote:
>
> > Hi Mayuresh,
> >
> > To clarify, the intention is to use the KafkaPrincipal object built by
> the
> > KafkaPrincipalBuilder inside the Session. So we would remove the logic 
to
> > construct a new KafkaPrincipal using only the name from the Principal.
> Then
> > it should be possible to pass the `AuthzPrincipal` to the underlying
> > library through the `Extended_Plugged_In_Class` as you've suggested
> above.
> > Is that reasonable for this use case?
> >
> > Thanks,
> > Jason
> >
> >
> > On Fri, Aug 25, 2017 at 2:44 PM, Mayuresh Gharat <
> > gharatmayures...@gmail.com
> > > wrote:
> >
> > > Hi Jason,
> > >
> > > Thanks for the replies.
> > >
> > > I think it would be better to discuss with an example that we were
> trying
> > > to address with KIP-111 and see if the current mentioned solution 
would
> > > address it.
> > >
> > > Let's consider a third party library called authz_lib that is provided
> by
> > > some Security team at  some company.
> > >
> > >- When we call authz_lib.createPrincipal(X509_cert), it would
> return
> > an
> > >AuthzPrincipal that implements Java.Security.Principal.
> > >
> > >
> > >- The authz_lib also provides an checkAccess() call that takes
> in
> > 3
> > >parameters :
> > >   - authz_principal
> > >   - operation type ("Read", "Write"...)
> > >   - resource (for simplicity lets consider it as a TopicName)
> > >
> > >
> > >- The AuthzPrincipal looks like this :
> > >
> > > class AuthzPrincipal implements java.security.Principal
> > > {
> > > String name;
> > > String field1;
> > > Object field2;
> > > Object field3;
> > > .//Some third party logic..
> > > }
> > >
> > >
> > >- In PrincipalBuilder.buildPrincipal() would return AuthzPrincipal
> as
> > >follows :
> > >
> > > public Principal buildPrincipal(...)
> > > {
> > > ..
> > > X509Certificate x509Cert = session.getCert(..);
> > > return authz_lib.createPrincipal(x509Cert);
> > > }
> > >
> > >
> > >- The custom Authorizer (lets call it CustomAuthzAuthorizer), we
> would
> > >use the checkAccess() function provided by the authz_lib as follows
> :
> > >
> > > public class CustomAuthzAuthorizer implements Authorizer
> > > {
> > > .
> > > public boolean authorize(.)
> > > {
> > >AuthzPrincipal authz_principal = (AuthzPrincipal)
> > > session.getPrincipal();
> > > return authz_lib.checkAccess(authz_principal, "Read", "topicX");
> > > }
> > > ..
> > > }
> > >
> > >
> > >- The issue with current implementation is that in
> > >processCompletedReceives() in SocketServer we create a
> KafkaPrincipal
> > >that just extracts the name from AuthzPrincipal as follows :
> > >
> > > session = RequestChannel.Session(new
> > > KafkaPrincipal(KafkaPrincipal.USER_TYPE,
> > > *openOrClosingChannel.principal.getName*),
> > > openOrClosingChannel.socketAddress)
> > >
> > > So the "AuthzPrincipal authz_principal = (AuthzPrincipal)
> > > session.getPrincipal()" call in the CustomAuthzAuthorizer would error
> > > out because we are trying to cast a KafkaPrincipal to AuthzPrincipal.
> > >
> > >
> > >
> > > In your reply when you said that :
> > >
> > > The KIP says that a user can have a class that extends KafkaPrincipal.
> > > Would this extended class be used when constructing the Session object
> > > in the SocketServer instead of constructing a new KafkaPrincipal?
> > >
> > > Yes, that's correct. We want to allow the authorizer to be able to
> > leverage
> > > > additional information from the authentication layer.
> > >
> > >
> > > Would it make sense to make this extended class pluggable and when
> > > 

Re: [DISCUSS] KIP-189: Improve principal builder interface and add support for SASL

2017-08-25 Thread Jason Gustafson
No problem. I'll add a note to the KIP to emphasize that we will use the
same object built by the KafkaPrincipalBuilder in the Session object passed
to the authorizer.

-Jason

On Fri, Aug 25, 2017 at 3:17 PM, Mayuresh Gharat  wrote:

> Perfect.
> As long as there is a way we can access the originally created Principal in
> the Authorizer, it would solve the KIP-111 issue.
>
> This is really helpful, thanks again.
>
> Thanks,
>
> Mayuresh
>
> On Fri, Aug 25, 2017 at 3:13 PM, Jason Gustafson 
> wrote:
>
> > Hi Mayuresh,
> >
> > To clarify, the intention is to use the KafkaPrincipal object built by
> the
> > KafkaPrincipalBuilder inside the Session. So we would remove the logic to
> > construct a new KafkaPrincipal using only the name from the Principal.
> Then
> > it should be possible to pass the `AuthzPrincipal` to the underlying
> > library through the `Extended_Plugged_In_Class` as you've suggested
> above.
> > Is that reasonable for this use case?
> >
> > Thanks,
> > Jason
> >
> >
> > On Fri, Aug 25, 2017 at 2:44 PM, Mayuresh Gharat <
> > gharatmayures...@gmail.com
> > > wrote:
> >
> > > Hi Jason,
> > >
> > > Thanks for the replies.
> > >
> > > I think it would be better to discuss with an example that we were
> trying
> > > to address with KIP-111 and see if the current mentioned solution would
> > > address it.
> > >
> > > Let's consider a third party library called authz_lib that is provided
> by
> > > some Security team at  some company.
> > >
> > >- When we call authz_lib.createPrincipal(X509_cert), it would
> return
> > an
> > >AuthzPrincipal that implements Java.Security.Principal.
> > >
> > >
> > >- The authz_lib also provides an checkAccess() call that takes
> in
> > 3
> > >parameters :
> > >   - authz_principal
> > >   - operation type ("Read", "Write"...)
> > >   - resource (for simplicity lets consider it as a TopicName)
> > >
> > >
> > >- The AuthzPrincipal looks like this :
> > >
> > > class AuthzPrincipal implements java.security.Principal
> > > {
> > > String name;
> > > String field1;
> > > Object field2;
> > > Object field3;
> > > .//Some third party logic..
> > > }
> > >
> > >
> > >- In PrincipalBuilder.buildPrincipal() would return AuthzPrincipal
> as
> > >follows :
> > >
> > > public Principal buildPrincipal(...)
> > > {
> > > ..
> > > X509Certificate x509Cert = session.getCert(..);
> > > return authz_lib.createPrincipal(x509Cert);
> > > }
> > >
> > >
> > >- The custom Authorizer (lets call it CustomAuthzAuthorizer), we
> would
> > >use the checkAccess() function provided by the authz_lib as follows
> :
> > >
> > > public class CustomAuthzAuthorizer implements Authorizer
> > > {
> > > .
> > > public boolean authorize(.)
> > > {
> > >AuthzPrincipal authz_principal = (AuthzPrincipal)
> > > session.getPrincipal();
> > > return authz_lib.checkAccess(authz_principal, "Read", "topicX");
> > > }
> > > ..
> > > }
> > >
> > >
> > >- The issue with current implementation is that in
> > >processCompletedReceives() in SocketServer we create a
> KafkaPrincipal
> > >that just extracts the name from AuthzPrincipal as follows :
> > >
> > > session = RequestChannel.Session(new
> > > KafkaPrincipal(KafkaPrincipal.USER_TYPE,
> > > *openOrClosingChannel.principal.getName*),
> > > openOrClosingChannel.socketAddress)
> > >
> > > So the "AuthzPrincipal authz_principal = (AuthzPrincipal)
> > > session.getPrincipal()" call in the CustomAuthzAuthorizer would error
> > > out because we are trying to cast a KafkaPrincipal to AuthzPrincipal.
> > >
> > >
> > >
> > > In your reply when you said that :
> > >
> > > The KIP says that a user can have a class that extends KafkaPrincipal.
> > > Would this extended class be used when constructing the Session object
> > > in the SocketServer instead of constructing a new KafkaPrincipal?
> > >
> > > Yes, that's correct. We want to allow the authorizer to be able to
> > leverage
> > > > additional information from the authentication layer.
> > >
> > >
> > > Would it make sense to make this extended class pluggable and when
> > > constructing the Session object in SocketServer check if a plugin is
> > > defined and use it and if not use the default KafkaPrincipal something
> > like
> > > :
> > >
> > > if (getConfig("principal.pluggedIn.class").isDefined())
> > > //"principal.pluggedIn.class"
> > > is just an example name for the config
> > > {
> > > session = RequestChannel.Session(*Extended_Plugged_In_Class*,
> > > openOrClosingChannel.socketAddress)
> > > }
> > > else
> > > {
> > > session = RequestChannel.Session(new KafkaPrincipal(KafkaPrincipal.
> > > USER_TYPE,
> > > *openOrClosingChannel.principal.getName*),
> > > openOrClosingChannel.socketAddress)
> > > }
> > >
> > > This would solve the issue above as follows :
> > >
> > > We can have something like :
> > > public class Extended_Plugged_In_Class extends 

Re: [DISCUSS] KIP-189: Improve principal builder interface and add support for SASL

2017-08-25 Thread Mayuresh Gharat
Perfect.
As long as there is a way we can access the originally created Principal in
the Authorizer, it would solve the KIP-111 issue.

This is really helpful, thanks again.

Thanks,

Mayuresh

On Fri, Aug 25, 2017 at 3:13 PM, Jason Gustafson  wrote:

> Hi Mayuresh,
>
> To clarify, the intention is to use the KafkaPrincipal object built by the
> KafkaPrincipalBuilder inside the Session. So we would remove the logic to
> construct a new KafkaPrincipal using only the name from the Principal. Then
> it should be possible to pass the `AuthzPrincipal` to the underlying
> library through the `Extended_Plugged_In_Class` as you've suggested above.
> Is that reasonable for this use case?
>
> Thanks,
> Jason
>
>
> On Fri, Aug 25, 2017 at 2:44 PM, Mayuresh Gharat <
> gharatmayures...@gmail.com
> > wrote:
>
> > Hi Jason,
> >
> > Thanks for the replies.
> >
> > I think it would be better to discuss with an example that we were trying
> > to address with KIP-111 and see if the current mentioned solution would
> > address it.
> >
> > Let's consider a third party library called authz_lib that is provided by
> > some Security team at  some company.
> >
> >- When we call authz_lib.createPrincipal(X509_cert), it would return
> an
> >AuthzPrincipal that implements Java.Security.Principal.
> >
> >
> >- The authz_lib also provides an checkAccess() call that takes in
> 3
> >parameters :
> >   - authz_principal
> >   - operation type ("Read", "Write"...)
> >   - resource (for simplicity lets consider it as a TopicName)
> >
> >
> >- The AuthzPrincipal looks like this :
> >
> > class AuthzPrincipal implements java.security.Principal
> > {
> > String name;
> > String field1;
> > Object field2;
> > Object field3;
> > .//Some third party logic..
> > }
> >
> >
> >- In PrincipalBuilder.buildPrincipal() would return AuthzPrincipal as
> >follows :
> >
> > public Principal buildPrincipal(...)
> > {
> > ..
> > X509Certificate x509Cert = session.getCert(..);
> > return authz_lib.createPrincipal(x509Cert);
> > }
> >
> >
> >- The custom Authorizer (lets call it CustomAuthzAuthorizer), we would
> >use the checkAccess() function provided by the authz_lib as follows :
> >
> > public class CustomAuthzAuthorizer implements Authorizer
> > {
> > .
> > public boolean authorize(.)
> > {
> >AuthzPrincipal authz_principal = (AuthzPrincipal)
> > session.getPrincipal();
> > return authz_lib.checkAccess(authz_principal, "Read", "topicX");
> > }
> > ..
> > }
> >
> >
> >- The issue with current implementation is that in
> >processCompletedReceives() in SocketServer we create a KafkaPrincipal
> >that just extracts the name from AuthzPrincipal as follows :
> >
> > session = RequestChannel.Session(new
> > KafkaPrincipal(KafkaPrincipal.USER_TYPE,
> > *openOrClosingChannel.principal.getName*),
> > openOrClosingChannel.socketAddress)
> >
> > So the "AuthzPrincipal authz_principal = (AuthzPrincipal)
> > session.getPrincipal()" call in the CustomAuthzAuthorizer would error
> > out because we are trying to cast a KafkaPrincipal to AuthzPrincipal.
> >
> >
> >
> > In your reply when you said that :
> >
> > The KIP says that a user can have a class that extends KafkaPrincipal.
> > Would this extended class be used when constructing the Session object
> > in the SocketServer instead of constructing a new KafkaPrincipal?
> >
> > Yes, that's correct. We want to allow the authorizer to be able to
> leverage
> > > additional information from the authentication layer.
> >
> >
> > Would it make sense to make this extended class pluggable and when
> > constructing the Session object in SocketServer check if a plugin is
> > defined and use it and if not use the default KafkaPrincipal something
> like
> > :
> >
> > if (getConfig("principal.pluggedIn.class").isDefined())
> > //"principal.pluggedIn.class"
> > is just an example name for the config
> > {
> > session = RequestChannel.Session(*Extended_Plugged_In_Class*,
> > openOrClosingChannel.socketAddress)
> > }
> > else
> > {
> > session = RequestChannel.Session(new KafkaPrincipal(KafkaPrincipal.
> > USER_TYPE,
> > *openOrClosingChannel.principal.getName*),
> > openOrClosingChannel.socketAddress)
> > }
> >
> > This would solve the issue above as follows :
> >
> > We can have something like :
> > public class Extended_Plugged_In_Class extends KafkaPrincipal
> > {
> > AuthzPrincipal authzPrincipal;
> >
> > public Extended_Plugged_In_Class(., AuthzPrincipal principal)
> > {
> > super(...);
> > authzPrincipal = principal;
> >
> > }
> >
> > ..
> >
> > public AuthzPrincipal getAuthzPrincipal()
> > {
> > return authzPrincipal;
> > }
> > }
> >
> > In the CustomAuthzAuthorizer we could do something like :
> >
> > public class CustomAuthzAuthorizer implements Authorizer
> > {
> > .
> > public boolean authorize(.)
> > {
> > Extended_Plugged_In_Class  extended_Kafka_Principal =
> > 

Re: [DISCUSS] KIP-189: Improve principal builder interface and add support for SASL

2017-08-25 Thread Jason Gustafson
Hi Mayuresh,

To clarify, the intention is to use the KafkaPrincipal object built by the
KafkaPrincipalBuilder inside the Session. So we would remove the logic to
construct a new KafkaPrincipal using only the name from the Principal. Then
it should be possible to pass the `AuthzPrincipal` to the underlying
library through the `Extended_Plugged_In_Class` as you've suggested above.
Is that reasonable for this use case?

Thanks,
Jason
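
As a rough sketch of how these pieces would fit together: KafkaPrincipalBuilder and the AuthenticationContext types here follow the KIP-189 draft as proposed, while AuthzPrincipal, authz_lib and Extended_Plugged_In_Class come from the example further down in this thread, and the constructor arguments used below are assumptions rather than final API.

import java.security.cert.X509Certificate;
import javax.net.ssl.SSLPeerUnverifiedException;
import org.apache.kafka.common.security.auth.AuthenticationContext;
import org.apache.kafka.common.security.auth.KafkaPrincipal;
import org.apache.kafka.common.security.auth.KafkaPrincipalBuilder;
import org.apache.kafka.common.security.auth.SslAuthenticationContext;

public class AuthzKafkaPrincipalBuilder implements KafkaPrincipalBuilder {
    @Override
    public KafkaPrincipal build(AuthenticationContext context) {
        if (context instanceof SslAuthenticationContext) {
            try {
                SslAuthenticationContext ssl = (SslAuthenticationContext) context;
                X509Certificate cert = (X509Certificate) ssl.session().getPeerCertificates()[0];
                // Wrap the third-party principal so it survives, unchanged, into the Session
                // that the custom Authorizer later receives.
                return new Extended_Plugged_In_Class(authz_lib.createPrincipal(cert));
            } catch (SSLPeerUnverifiedException e) {
                throw new RuntimeException("Client certificate required", e);
            }
        }
        // Non-SSL connections fall back to the anonymous principal in this sketch.
        return KafkaPrincipal.ANONYMOUS;
    }
}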


On Fri, Aug 25, 2017 at 2:44 PM, Mayuresh Gharat  wrote:

> Hi Jason,
>
> Thanks for the replies.
>
> I think it would be better to discuss with an example that we were trying
> to address with KIP-111 and see if the current mentioned solution would
> address it.
>
> Let's consider a third party library called authz_lib that is provided by
> some Security team at  some company.
>
>- When we call authz_lib.createPrincipal(X509_cert), it would return an
>AuthzPrincipal that implements Java.Security.Principal.
>
>
>- The authz_lib also provides an checkAccess() call that takes in 3
>parameters :
>   - authz_principal
>   - operation type ("Read", "Write"...)
>   - resource (for simplicity lets consider it as a TopicName)
>
>
>- The AuthzPrincipal looks like this :
>
> class AuthzPrincipal implements java.security.Principal
> {
> String name;
> String field1;
> Object field2;
> Object field3;
> .//Some third party logic..
> }
>
>
>- In PrincipalBuilder.buildPrincipal() would return AuthzPrincipal as
>follows :
>
> public Principal buildPrincipal(...)
> {
> ..
> X509Certificate x509Cert = session.getCert(..);
> return authz_lib.createPrincipal(x509Cert);
> }
>
>
>- The custom Authorizer (lets call it CustomAuthzAuthorizer), we would
>use the checkAccess() function provided by the authz_lib as follows :
>
> public class CustomAuthzAuthorizer implements Authorizer
> {
> .
> public boolean authorize(.)
> {
>AuthzPrincipal authz_principal = (AuthzPrincipal)
> session.getPrincipal();
> return authz_lib.checkAccess(authz_principal, "Read", "topicX");
> }
> ..
> }
>
>
>- The issue with current implementation is that in
>processCompletedReceives() in SocketServer we create a KafkaPrincipal
>that just extracts the name from AuthzPrincipal as follows :
>
> session = RequestChannel.Session(new
> KafkaPrincipal(KafkaPrincipal.USER_TYPE,
> *openOrClosingChannel.principal.getName*),
> openOrClosingChannel.socketAddress)
>
> So the "AuthzPrincipal authz_principal = (AuthzPrincipal)
> session.getPrincipal()" call in the CustomAuthzAuthorizer would error
> out because we are trying to cast a KafkaPrincipal to AuthzPrincipal.
>
>
>
> In your reply when you said that :
>
> The KIP says that a user can have a class that extends KafkaPrincipal.
> Would this extended class be used when constructing the Session object
> in the SocketServer instead of constructing a new KafkaPrincipal?
>
> Yes, that's correct. We want to allow the authorizer to be able to leverage
> > additional information from the authentication layer.
>
>
> Would it make sense to make this extended class pluggable and when
> constructing the Session object in SocketServer check if a plugin is
> defined and use it and if not use the default KafkaPrincipal something like
> :
>
> if (getConfig("principal.pluggedIn.class").isDefined())
> //"principal.pluggedIn.class"
> is just an example name for the config
> {
> session = RequestChannel.Session(*Extended_Plugged_In_Class*,
> openOrClosingChannel.socketAddress)
> }
> else
> {
> session = RequestChannel.Session(new KafkaPrincipal(KafkaPrincipal.
> USER_TYPE,
> *openOrClosingChannel.principal.getName*),
> openOrClosingChannel.socketAddress)
> }
>
> This would solve the issue above as follows :
>
> We can have something like :
> public class Extended_Plugged_In_Class extends KafkaPrincipal
> {
> AuthzPrincipal authzPrincipal;
>
> public Extended_Plugged_In_Class(., AuthzPrincipal principal)
> {
> super(...);
> authzPrincipal = principal;
>
> }
>
> ..
>
> public AuthzPrincipal getAuthzPrincipal()
> {
> return authzPrincipal;
> }
> }
>
> In the CustomAuthzAuthorizer we could do something like :
>
> public class CustomAuthzAuthorizer implements Authorizer
> {
> .
> public boolean authorize(.)
> {
> Extended_Plugged_In_Class  extended_Kafka_Principal =
> (Extended_Plugged_In_Class)
> session.getPrincipal();
>AuthzPrincipal authz_principal =
> extended_Kafka_Principal.getAuthzPrincipal();
> return authz_lib.checkAccess(authz_principal, "Read", "topicX");
> }
> ..
> }
>
>
> Thanks,
>
> Mayuresh
>
> On Fri, Aug 25, 2017 at 11:53 AM, Jason Gustafson 
> wrote:
>
> > Hey Mayuresh,
> >
> > Thanks for the comments.
> >
> >- The KIP says that a user can have a class that extends
> KafkaPrincipal.
> > >Would this extended class be used when constructing the Session
> object
> > > in
> > >the 

Re: [DISCUSS] KIP-189: Improve principal builder interface and add support for SASL

2017-08-25 Thread Mayuresh Gharat
Hi Jason,

Thanks for the replies.

I think it would be better to discuss with an example that we were trying
to address with KIP-111 and see if the current mentioned solution would
address it.

Let's consider a third-party library called authz_lib that is provided by
a security team at some company.

   - When we call authz_lib.createPrincipal(X509_cert), it would return an
   AuthzPrincipal that implements java.security.Principal.


   - The authz_lib also provides a checkAccess() call that takes 3
   parameters:
  - authz_principal
  - operation type ("Read", "Write"...)
  - resource (for simplicity let's consider it a TopicName)


   - The AuthzPrincipal looks like this:

class AuthzPrincipal implements java.security.Principal
{
    String name;
    String field1;
    Object field2;
    Object field3;
    // ...some third-party logic...
}


   - PrincipalBuilder.buildPrincipal() would return an AuthzPrincipal as
   follows:

public Principal buildPrincipal(...)
{
    ...
    X509Certificate x509Cert = session.getCert(...);
    return authz_lib.createPrincipal(x509Cert);
}


   - In the custom Authorizer (let's call it CustomAuthzAuthorizer), we would
   use the checkAccess() function provided by authz_lib as follows:

public class CustomAuthzAuthorizer implements Authorizer
{
    ...
    public boolean authorize(...)
    {
        AuthzPrincipal authz_principal = (AuthzPrincipal) session.getPrincipal();
        return authz_lib.checkAccess(authz_principal, "Read", "topicX");
    }
    ...
}


   - The issue with the current implementation is that in
   processCompletedReceives() in SocketServer we create a KafkaPrincipal
   that just extracts the name from the AuthzPrincipal, as follows:

session = RequestChannel.Session(
    new KafkaPrincipal(KafkaPrincipal.USER_TYPE, openOrClosingChannel.principal.getName),
    openOrClosingChannel.socketAddress)

So the "AuthzPrincipal authz_principal = (AuthzPrincipal)
session.getPrincipal()" call in the CustomAuthzAuthorizer would error
out because we are trying to cast a KafkaPrincipal to AuthzPrincipal.



In your reply when you said that :

The KIP says that a user can have a class that extends KafkaPrincipal.
Would this extended class be used when constructing the Session object
in the SocketServer instead of constructing a new KafkaPrincipal?

Yes, that's correct. We want to allow the authorizer to be able to leverage
> additional information from the authentication layer.


Would it make sense to make this extended class pluggable? When constructing
the Session object in SocketServer, we could check whether a plugin is defined
and use it, falling back to the default KafkaPrincipal otherwise, something
like:

// "principal.pluggedIn.class" is just an example name for the config
if (getConfig("principal.pluggedIn.class").isDefined())
{
    session = RequestChannel.Session(Extended_Plugged_In_Class,
        openOrClosingChannel.socketAddress)
}
else
{
    session = RequestChannel.Session(
        new KafkaPrincipal(KafkaPrincipal.USER_TYPE, openOrClosingChannel.principal.getName),
        openOrClosingChannel.socketAddress)
}

This would solve the issue above as follows:

We can have something like:

public class Extended_Plugged_In_Class extends KafkaPrincipal
{
    AuthzPrincipal authzPrincipal;

    public Extended_Plugged_In_Class(..., AuthzPrincipal principal)
    {
        super(...);
        authzPrincipal = principal;
    }

    ...

    public AuthzPrincipal getAuthzPrincipal()
    {
        return authzPrincipal;
    }
}

In the CustomAuthzAuthorizer we could do something like:

public class CustomAuthzAuthorizer implements Authorizer
{
    ...
    public boolean authorize(...)
    {
        Extended_Plugged_In_Class extended_Kafka_Principal =
            (Extended_Plugged_In_Class) session.getPrincipal();
        AuthzPrincipal authz_principal = extended_Kafka_Principal.getAuthzPrincipal();
        return authz_lib.checkAccess(authz_principal, "Read", "topicX");
    }
    ...
}


Thanks,

Mayuresh

On Fri, Aug 25, 2017 at 11:53 AM, Jason Gustafson 
wrote:

> Hey Mayuresh,
>
> Thanks for the comments.
>
>- The KIP says that a user can have a class that extends KafkaPrincipal.
> >Would this extended class be used when constructing the Session object
> > in
> >the SocketServer instead of constructing a new KafkaPrincipal?
>
>
> Yes, that's correct. We want to allow the authorizer to be able to leverage
> additional information from the authentication layer.
>
> - The KIP says "A principal is always identifiable by a principal type
> >and a name. Nothing else should ever be required." This might not be
> > true
> >always, right? For example, we might have a custom third party ACL
> > library
> >that creates a custom Principal from the passed in cert (this is done
> in
> >PrincipalBuilder/KafkaPrincipalBuilder) and the custom Authorizer
> might
> >use this third party library to authorize using this custom Principal
> >object. The developer who is implementing the Kafka Authorizer should
> >not be caring about what the custom Principal 

Build failed in Jenkins: kafka-trunk-jdk8 #1946

2017-08-25 Thread Apache Jenkins Server
See 


Changes:

[rajinisivaram] KAFKA-5776; Add the Trogdor fault injection daemon

--
[...truncated 2.02 MB...]

org.apache.kafka.common.security.scram.ScramMessagesTest > 
invalidServerFirstMessage STARTED

org.apache.kafka.common.security.scram.ScramMessagesTest > 
invalidServerFirstMessage PASSED

org.apache.kafka.common.security.scram.ScramMessagesTest > 
validServerFinalMessage STARTED

org.apache.kafka.common.security.scram.ScramMessagesTest > 
validServerFinalMessage PASSED

org.apache.kafka.common.security.ssl.SslFactoryTest > 
testSslFactoryWithoutPasswordConfiguration STARTED

org.apache.kafka.common.security.ssl.SslFactoryTest > 
testSslFactoryWithoutPasswordConfiguration PASSED

org.apache.kafka.common.security.ssl.SslFactoryTest > testClientMode STARTED

org.apache.kafka.common.security.ssl.SslFactoryTest > testClientMode PASSED

org.apache.kafka.common.security.ssl.SslFactoryTest > 
testSslFactoryConfiguration STARTED

org.apache.kafka.common.security.ssl.SslFactoryTest > 
testSslFactoryConfiguration PASSED

org.apache.kafka.common.security.kerberos.KerberosNameTest > testParse STARTED

org.apache.kafka.common.security.kerberos.KerberosNameTest > testParse PASSED

org.apache.kafka.common.security.auth.KafkaPrincipalTest > 
testPrincipalNameCanContainSeparator STARTED

org.apache.kafka.common.security.auth.KafkaPrincipalTest > 
testPrincipalNameCanContainSeparator PASSED

org.apache.kafka.common.security.auth.KafkaPrincipalTest > 
testEqualsAndHashCode STARTED

org.apache.kafka.common.security.auth.KafkaPrincipalTest > 
testEqualsAndHashCode PASSED

org.apache.kafka.common.security.JaasContextTest > 
testLoadForServerWithListenerNameOverride STARTED

org.apache.kafka.common.security.JaasContextTest > 
testLoadForServerWithListenerNameOverride PASSED

org.apache.kafka.common.security.JaasContextTest > testMissingOptionValue 
STARTED

org.apache.kafka.common.security.JaasContextTest > testMissingOptionValue PASSED

org.apache.kafka.common.security.JaasContextTest > testSingleOption STARTED

org.apache.kafka.common.security.JaasContextTest > testSingleOption PASSED

org.apache.kafka.common.security.JaasContextTest > 
testNumericOptionWithoutQuotes STARTED

org.apache.kafka.common.security.JaasContextTest > 
testNumericOptionWithoutQuotes PASSED

org.apache.kafka.common.security.JaasContextTest > testConfigNoOptions STARTED

org.apache.kafka.common.security.JaasContextTest > testConfigNoOptions PASSED

org.apache.kafka.common.security.JaasContextTest > 
testLoadForServerWithWrongListenerName STARTED

org.apache.kafka.common.security.JaasContextTest > 
testLoadForServerWithWrongListenerName PASSED

org.apache.kafka.common.security.JaasContextTest > testNumericOptionWithQuotes 
STARTED

org.apache.kafka.common.security.JaasContextTest > testNumericOptionWithQuotes 
PASSED

org.apache.kafka.common.security.JaasContextTest > testQuotedOptionValue STARTED

org.apache.kafka.common.security.JaasContextTest > testQuotedOptionValue PASSED

org.apache.kafka.common.security.JaasContextTest > testMissingLoginModule 
STARTED

org.apache.kafka.common.security.JaasContextTest > testMissingLoginModule PASSED

org.apache.kafka.common.security.JaasContextTest > testMissingSemicolon STARTED

org.apache.kafka.common.security.JaasContextTest > testMissingSemicolon PASSED

org.apache.kafka.common.security.JaasContextTest > testMultipleOptions STARTED

org.apache.kafka.common.security.JaasContextTest > testMultipleOptions PASSED

org.apache.kafka.common.security.JaasContextTest > 
testLoadForClientWithListenerName STARTED

org.apache.kafka.common.security.JaasContextTest > 
testLoadForClientWithListenerName PASSED

org.apache.kafka.common.security.JaasContextTest > testMultipleLoginModules 
STARTED

org.apache.kafka.common.security.JaasContextTest > testMultipleLoginModules 
PASSED

org.apache.kafka.common.security.JaasContextTest > testMissingControlFlag 
STARTED

org.apache.kafka.common.security.JaasContextTest > testMissingControlFlag PASSED

org.apache.kafka.common.security.JaasContextTest > 
testLoadForServerWithListenerNameAndFallback STARTED

org.apache.kafka.common.security.JaasContextTest > 
testLoadForServerWithListenerNameAndFallback PASSED

org.apache.kafka.common.security.JaasContextTest > testQuotedOptionName STARTED

org.apache.kafka.common.security.JaasContextTest > testQuotedOptionName PASSED

org.apache.kafka.common.security.JaasContextTest > testControlFlag STARTED

org.apache.kafka.common.security.JaasContextTest > testControlFlag PASSED

org.apache.kafka.common.security.authenticator.SaslAuthenticatorTest > 
testMissingUsernameSaslPlain STARTED

org.apache.kafka.common.security.authenticator.SaslAuthenticatorTest > 
testMissingUsernameSaslPlain PASSED

org.apache.kafka.common.security.authenticator.SaslAuthenticatorTest > 
testValidSaslScramMechanisms STARTED


[jira] [Created] (KAFKA-5792) Transient failure in KafkaAdminClientTest.testHandleTimeout

2017-08-25 Thread Apurva Mehta (JIRA)
Apurva Mehta created KAFKA-5792:
---

 Summary: Transient failure in 
KafkaAdminClientTest.testHandleTimeout
 Key: KAFKA-5792
 URL: https://issues.apache.org/jira/browse/KAFKA-5792
 Project: Kafka
  Issue Type: Bug
Reporter: Apurva Mehta


The {{KafkaAdminClientTest.testHandleTimeout}} test occasionally fails with the 
following:

{noformat}
java.util.concurrent.ExecutionException: 
org.apache.kafka.common.errors.TimeoutException: Timed out waiting for a node 
assignment.
at 
org.apache.kafka.common.internals.KafkaFutureImpl.wrapAndThrow(KafkaFutureImpl.java:45)
at 
org.apache.kafka.common.internals.KafkaFutureImpl.access$000(KafkaFutureImpl.java:32)
at 
org.apache.kafka.common.internals.KafkaFutureImpl$SingleWaiter.await(KafkaFutureImpl.java:89)
at 
org.apache.kafka.common.internals.KafkaFutureImpl.get(KafkaFutureImpl.java:213)
at 
org.apache.kafka.clients.admin.KafkaAdminClientTest.testHandleTimeout(KafkaAdminClientTest.java:356)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
at java.lang.reflect.Method.invoke(Method.java:606)
at 
org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
at 
org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
at 
org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
at 
org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
at 
org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:298)
at 
org.junit.internal.runners.statements.FailOnTimeout$CallableStatement.call(FailOnTimeout.java:292)
at java.util.concurrent.FutureTask.run(FutureTask.java:262)
at java.lang.Thread.run(Thread.java:745)
Caused by: org.apache.kafka.common.errors.TimeoutException: Timed out waiting 
for a node assignment.
{noformat}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


Jenkins build is back to normal : kafka-trunk-jdk8 #1945

2017-08-25 Thread Apache Jenkins Server
See 




[jira] [Resolved] (KAFKA-5452) Aggressive log compaction ratio appears to have no negative effect on log-compacted topics

2017-08-25 Thread Jeff Chao (JIRA)

 [ https://issues.apache.org/jira/browse/KAFKA-5452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Jeff Chao resolved KAFKA-5452.
--
Resolution: Resolved

Following up after a long while. After talking offline with [~wushujames], the 
original thought was to choose a sensible default in relation to disk I/O. I 
think it's best to leave the default as-is and avoid making assumptions about 
the underlying infrastructure. That way, operators are free to tune to their 
own expectations. Closing this.

> Aggressive log compaction ratio appears to have no negative effect on 
> log-compacted topics
> --
>
> Key: KAFKA-5452
> URL: https://issues.apache.org/jira/browse/KAFKA-5452
> Project: Kafka
>  Issue Type: Improvement
>  Components: config, core, log
>Affects Versions: 0.10.2.0, 0.10.2.1
> Environment: Ubuntu Trusty (14.04.5), Oracle JDK 8
>Reporter: Jeff Chao
>  Labels: performance
> Attachments: 200mbs-dirty0-dirty-1-dirty05.png, 
> flame-graph-200mbs-dirty0.png, flame-graph-200mbs-dirty0.svg
>
>
> Some of our users are seeing unintuitive/unexpected behavior with 
> log-compacted topics where they receive multiple records for the same key 
> when consuming. This is a result of low throughput on log-compacted topics 
> such that conditions ({{min.cleanable.dirty.ratio = 0.5}}, default) aren't 
> met for compaction to kick in.
> This prompted us to test and tune {{min.cleanable.dirty.ratio}} in our 
> clusters. It appears that having more aggressive log compaction ratios doesn't 
> have negative effects on CPU and memory utilization. If this is truly the 
> case, we should consider changing the default from {{0.5}} to something more 
> aggressive.
> Setup:
> # 8 brokers
> # 5 zk nodes
> # 32 partitions on a topic
> # replication factor 3
> # log roll 3 hours
> # log segment bytes 1 GB
> # log retention 24 hours
> # all messages to a single key
> # all messages to a unique key
> # all messages to a bounded key range [0, 999]
> # {{min.cleanable.dirty.ratio}} per topic = {{0}}, {{0.5}}, and {{1}}
> # 200 MB/s sustained, produce and consume traffic
> Observations:
> We were able to verify log cleaner threads were performing work by checking 
> the logs and verifying the {{cleaner-offset-checkpoint}} file for all topics. 
> We also observed the log cleaner's {{time-since-last-run-ms}} metric was 
> normal, never going above the default of 15 seconds.
> Under-replicated partitions stayed steady, same for replication lag.
> Here's an example test run where we try out {{min.cleanable.dirty.ratio = 
> 0}}, {{min.cleanable.dirty.ratio = 1}}, and {{min.cleanable.dirty.ratio = 
> 0.5}}. Troughs in between the peaks represent zero traffic and reconfiguring 
> of topics.
> (200mbs-dirty-0-dirty1-dirty05.png attached)
> !200mbs-dirty0-dirty-1-dirty05.png|thumbnail!
> Memory utilization is fine, but more interestingly, CPU doesn't appear to 
> have much difference.
> To get more detail, here is a flame graph (raw svg attached) of the run for 
> {{min.cleanable.dirty.ratio = 0}}. The conservative and default ratio flame 
> graphs are equivalent.
> (flame-graph-200mbs-dirty0.png attached)
> !flame-graph-200mbs-dirty0.png|thumbnail!
> Notice that the majority of CPU is coming from:
> # SSL operations (on reads/writes)
> # KafkaApis::handleFetchRequest (ReplicaManager::fetchMessages)
> # KafkaApis::handleOffsetFetchRequest
> We also have examples from small scale test runs which show similar behavior 
> but with scaled down CPU usage.
> It seems counterintuitive that there's no apparent difference in CPU whether 
> it be aggressive or conservative compaction ratios, so we'd like to get some 
> thoughts from the community.
> We're looking for feedback on whether or not anyone else has experienced this 
> behavior before as well or, if CPU isn't affected, has anyone seen something 
> related instead.
> If this is true, then we'd be happy to discuss further and provide a patch.
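
For reference, here is a hedged sketch (not from this ticket) of how an operator could apply a more aggressive per-topic ratio with the 0.11+ AdminClient; the topic name, bootstrap servers, and the {{0.1}} value are placeholders, not recommendations:

{code:java}
import java.util.Collections;
import java.util.Properties;
import org.apache.kafka.clients.admin.AdminClient;
import org.apache.kafka.clients.admin.AdminClientConfig;
import org.apache.kafka.clients.admin.Config;
import org.apache.kafka.clients.admin.ConfigEntry;
import org.apache.kafka.common.config.ConfigResource;

public class DirtyRatioExample {
    public static void main(String[] args) throws Exception {
        Properties props = new Properties();
        props.put(AdminClientConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");

        try (AdminClient admin = AdminClient.create(props)) {
            ConfigResource topic = new ConfigResource(ConfigResource.Type.TOPIC, "compacted-topic");
            Config config = new Config(Collections.singleton(
                    new ConfigEntry("min.cleanable.dirty.ratio", "0.1")));
            // Note: alterConfigs replaces the whole config set for the resource.
            admin.alterConfigs(Collections.singletonMap(topic, config)).all().get();
        }
    }
}
{code}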



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[GitHub] kafka pull request #3744: Add group-id to the metrics tags

2017-08-25 Thread EtaCassiopeia
GitHub user EtaCassiopeia opened a pull request:

https://github.com/apache/kafka/pull/3744

Add group-id to the metrics tags

It is better to have the group-id in the JMX metrics; it would improve the 
debuggability of systems:

```kafka_consumer_consumer_fetch_manager_metrics_test_0_records_lag{client_id="consumer-1",group_id="group-1",instance="service:8080",job="prometheus"}```
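
As a rough illustration of the idea (a hedged sketch against the client Metrics API, not the actual patch; names and values are made up), the extra tag would simply be added to the tag map used when registering the fetch-manager metrics:

```java
import java.util.HashMap;
import java.util.Map;
import org.apache.kafka.common.MetricName;
import org.apache.kafka.common.metrics.Metrics;
import org.apache.kafka.common.metrics.Sensor;
import org.apache.kafka.common.metrics.stats.Value;

public class GroupTaggedMetricsExample {
    public static void main(String[] args) {
        Metrics metrics = new Metrics();

        Map<String, String> tags = new HashMap<>();
        tags.put("client-id", "consumer-1");
        tags.put("group-id", "group-1"); // the additional tag proposed here

        MetricName recordsLag = metrics.metricName(
                "records-lag", "consumer-fetch-manager-metrics",
                "Current lag of the consumer, tagged with its group id", tags);

        Sensor lagSensor = metrics.sensor("records-lag");
        lagSensor.add(recordsLag, new Value());
        lagSensor.record(42.0);

        metrics.close();
    }
}
```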

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/EtaCassiopeia/kafka 0.11.0

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/kafka/pull/3744.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3744


commit 86dbbf73300e865b2a8f50f0f79272c3b64a94b9
Author: Mohsen Zainalpour 
Date:   2017-08-25T20:10:40Z

Add group-id to the metrics tags




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] kafka pull request #3397: KAFKA-5413: Port fix to 0.10.2 branch

2017-08-25 Thread kelvinrutt
Github user kelvinrutt closed the pull request at:

https://github.com/apache/kafka/pull/3397


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] kafka pull request #3728: Add group-id to the metrics tags

2017-08-25 Thread EtaCassiopeia
Github user EtaCassiopeia closed the pull request at:

https://github.com/apache/kafka/pull/3728


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] kafka pull request #3743: KAFKA-5494: enable idempotence with max.in.flight....

2017-08-25 Thread apurvam
GitHub user apurvam opened a pull request:

https://github.com/apache/kafka/pull/3743

KAFKA-5494: enable idempotence with max.in.flight.requests.per.connection > 
1

Here we introduce client and broker changes to support multiple inflight 
requests while still guaranteeing idempotence. Two major problems to be solved:

1. Sequence number management on the client when there are request 
failures. When a batch fails,  future inflight batches will also fail with 
`OutOfOrderSequenceException`. This must be handled on the client with 
intelligent sequence reassignment. We must also deal with the fatal failure of 
some batch: the future batches must get different sequence numbers when they 
come back.
2. On the broker, when we have multiple inflights, we can get duplicates of 
multiple old batches. With this patch, we retain the record metadata for 5 
older batches. 

I have added `TODO(reviewers)` comments for specific decisions in the code 
which are worth discussing.
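
For context, this is the producer configuration that the patch is meant to make safe (a minimal sketch under assumed topic and broker names, not part of the patch itself):

```java
import java.util.Properties;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerConfig;
import org.apache.kafka.clients.producer.ProducerRecord;
import org.apache.kafka.common.serialization.StringSerializer;

public class IdempotentProducerExample {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put(ProducerConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092");
        props.put(ProducerConfig.ENABLE_IDEMPOTENCE_CONFIG, true);
        // Multiple in-flight requests per connection, which this patch makes
        // compatible with idempotence.
        props.put(ProducerConfig.MAX_IN_FLIGHT_REQUESTS_PER_CONNECTION, 5);
        props.put(ProducerConfig.ACKS_CONFIG, "all");
        props.put(ProducerConfig.RETRIES_CONFIG, Integer.MAX_VALUE);
        props.put(ProducerConfig.KEY_SERIALIZER_CLASS_CONFIG, StringSerializer.class.getName());
        props.put(ProducerConfig.VALUE_SERIALIZER_CLASS_CONFIG, StringSerializer.class.getName());

        try (KafkaProducer<String, String> producer = new KafkaProducer<>(props)) {
            producer.send(new ProducerRecord<>("test-topic", "key", "value"));
            producer.flush();
        }
    }
}
```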

TODO: 
1. Add more unit tests, especially for loading different snapshot versions 
correctly, more client side unit tests, more broker side tests to validate that 
we are caching the correct number of batches (some of this is already there).
2. Update the system tests to check for ordering. 
3. Run a tight loop of system tests. 
4. Add comments about the assumptions made around the network client 
semantics of send/receive.


You can merge this pull request into a Git repository by running:

$ git pull https://github.com/apurvam/kafka 
KAFKA-5494-increase-max-in-flight-for-idempotent-producer

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/kafka/pull/3743.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3743


commit 005eee527ab425d8e3d8678aad4b5305cde6ca08
Author: Apurva Mehta 
Date:   2017-08-12T00:25:06Z

Initial commit of client side changes with some tests

commit 63bf074a38ec3efef728863081805a36d9111038
Author: Apurva Mehta 
Date:   2017-08-17T23:49:10Z

Implemented broker side changes to cache extra metadata.

Todo:
  1) Write more unit tests.
  2) Handle deletion / retention / cleaning correctly.

commit 1ad49f30f03ff665f5657680cbcc5e045210ce45
Author: Apurva Mehta 
Date:   2017-08-23T00:10:39Z

Change the client side code so that the sequence numbers are assigned
and incremented during drain. If a batch is retried, it's sequence
number is unset during the completion handler. If the first inflight
batch returns an error, the next sequence to assign is reset to the last
ack'd sequence + 1.

commit d9b86b7cb8e7001a7d5fc42a2ec061ebd0332a6a
Author: Apurva Mehta 
Date:   2017-08-24T01:33:54Z

WIP

commit 9ff885fe6db7172d28ea8fe406972a7763c0a49d
Author: Apurva Mehta 
Date:   2017-08-25T06:23:50Z

Implemented log cleaning functionality with tests

commit 5508a194c74a8946a8451c01814324e6ba788cfe
Author: Apurva Mehta 
Date:   2017-08-25T19:27:03Z

Fix merge issues aftre rebasing onto trunk




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] kafka pull request #3699: Add the Trogdor fault injection daemon

2017-08-25 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/kafka/pull/3699


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Resolved] (KAFKA-5776) Add the Trogdor fault injection daemon

2017-08-25 Thread Rajini Sivaram (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-5776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rajini Sivaram resolved KAFKA-5776.
---
   Resolution: Fixed
Fix Version/s: 1.0.0

Issue resolved by pull request 3699
[https://github.com/apache/kafka/pull/3699]

> Add the Trogdor fault injection daemon
> --
>
> Key: KAFKA-5776
> URL: https://issues.apache.org/jira/browse/KAFKA-5776
> Project: Kafka
>  Issue Type: Sub-task
>  Components: system tests
>Reporter: Colin P. McCabe
> Fix For: 1.0.0
>
>




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


Re: [DISCUSS] KIP-189: Improve principal builder interface and add support for SASL

2017-08-25 Thread Jason Gustafson
Hey Mayuresh,

Thanks for the comments.

   - The KIP says that a user can have a class that extends KafkaPrincipal.
>Would this extended class be used when constructing the Session object
> in
>the SocketServer instead of constructing a new KafkaPrincipal?


Yes, that's correct. We want the authorizer to be able to leverage
additional information from the authentication layer.

- The KIP says "A principal is always identifiable by a principal type
>and a name. Nothing else should ever be required." This might not be
> true
>always, right? For example, we might have a custom third party ACL
> library
>that creates a custom Principal from the passed in cert (this is done in
>PrincipalBuilder/KafkaPrincipalBuilder) and the custom Authorizer might
>use this third party library to authorize using this custom Principal
>object. The developer who is implementing the Kafka Authorizer should
>not be caring about what the custom Principal would look like and its
>details, since it will just pass it to the third party library in Kafka
>Authorizer's authorize() call.


I'm not sure I understand this. Are you saying that the authorizer and
principal builder are implemented by separate individuals? If the
authorizer doesn't understand how to identify the principal, then it
wouldn't work, right? Maybe I'm missing something?

Let me explain how I see this. The simple ACL authorizer that Kafka ships
with understands user principals as consisting of a type and a name. Any
principal builder that follows this assumption will work with the
SimpleAclAuthorizer. In some cases, the principal builder may provide
additional metadata in a KafkaPrincipal extension such as user groups or
roles. This information is not needed to identify the user principal, so
the builder is still compatible with the SimpleAclAuthorizer. It would also
be compatible with a RoleBasedAuthorizer which understood how to use the
role metadata provided by the KafkaPrincipal extension. Basically what we
would have is a user principal which is related to one or more role
principals through the KafkaPrincipal extension. Both user and role
principals are identifiable with a type and a name, so the ACL command tool
can then be used (perhaps with a custom authorizer) to define permissions
in either case.
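
To make that concrete, here is a hedged sketch (illustrative names, not from the KIP) of a KafkaPrincipal extension that carries role metadata while still being identified only by its type and name:

```java
import java.util.Collections;
import java.util.Set;
import org.apache.kafka.common.security.auth.KafkaPrincipal;

public class RoleAwarePrincipal extends KafkaPrincipal {
    private final Set<String> roles;

    public RoleAwarePrincipal(String name, Set<String> roles) {
        super(KafkaPrincipal.USER_TYPE, name); // still identified by type + name
        this.roles = Collections.unmodifiableSet(roles);
    }

    // Extra metadata a role-based authorizer could use; an authorizer that only
    // matches on type and name can simply ignore it.
    public Set<String> roles() {
        return roles;
    }
}
```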

On the other hand, if a user principal is identified by more than just its
name, then it is not compatible with the SimpleAclAuthorizer. This doesn't
necessarily rule out this use case. As long as the authorizer and the
principal builder both agree on how user principals are identified, then
they can still be used together. But I am explicitly leaving out support in
the ACL command tool for this use case in this KIP. This is mostly about
clarifying what is compatible with the authorization system that Kafka
ships with. Of course we can always reconsider it in the future.

Thanks,
Jason

On Fri, Aug 25, 2017 at 10:48 AM, Mayuresh Gharat <
gharatmayures...@gmail.com> wrote:

> Hi Jason,
>
> Thanks a lot for the KIP and sorry for the delayed response.
>
> I had a few questions :
>
>
>- The KIP says that a user can have a class that extends KafkaPrincipal.
>Would this extended class be used when constructing the Session object
> in
>the SocketServer instead of constructing a new KafkaPrincipal?
>
>
>- The KIP says "A principal is always identifiable by a principal type
>and a name. Nothing else should ever be required." This might not be
> true
>always, right? For example, we might have a custom third party ACL
> library
>that creates a custom Principal from the passed in cert (this is done in
>PrincipalBuilder/KafkaPrincipalBuilder) and the custom Authorizer might
>use this third party library to authorize using this custom Principal
>object. The developer who is implementing the Kafka Authorizer should
>not be caring about what the custom Principal would look like and its
>details, since it will just pass it to the third party library in Kafka
>Authorizer's authorize() call.
>
>
> Thanks,
>
> Mayuresh
>
>
> On Thu, Aug 24, 2017 at 10:21 AM, Mayuresh Gharat <
> gharatmayures...@gmail.com> wrote:
>
> > Sure.
> >
> > Thanks,
> >
> > Mayuresh
> >
> > On Wed, Aug 23, 2017 at 5:07 PM, Jun Rao  wrote:
> >
> >> Hi, Mayuresh,
> >>
> >> Since this KIP covers the requirement in KIP-111, could you review it
> too?
> >>
> >> Thanks,
> >>
> >> Jun
> >>
> >>
> >> On Tue, Aug 22, 2017 at 3:04 PM, Jason Gustafson 
> >> wrote:
> >>
> >>> Bump. I'll open a vote in a few days if there are no comments.
> >>>
> >>> Thanks,
> >>> Jason
> >>>
> >>> On Sat, Aug 19, 2017 at 12:28 AM, Ismael Juma 
> wrote:
> >>>
> >>> > Thanks for the KIP Jason. It seems reasonable and cleans up some
> >>> > inconsistencies in that area. It would be great to get some feedback
> >>> from
> >>> > Mayuresh and others who worked on KIP-111.
> >>> 

[GitHub] kafka pull request #3738: KAFKA-5790: SocketServer.processNewResponses shoul...

2017-08-25 Thread ijuma
Github user ijuma closed the pull request at:

https://github.com/apache/kafka/pull/3738


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


Build failed in Jenkins: kafka-trunk-jdk8 #1944

2017-08-25 Thread Apache Jenkins Server
See 


Changes:

[jason] MINOR: Consolidate broker request/response handling

--
[...truncated 2.46 MB...]

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldNotAllowNullTopicOnTo PASSED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldAllowNullStoreInJoin STARTED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldAllowNullStoreInJoin PASSED

org.apache.kafka.streams.kstream.internals.KTableImplTest > testRepartition 
STARTED

org.apache.kafka.streams.kstream.internals.KTableImplTest > testRepartition 
PASSED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
testStateStoreLazyEval STARTED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
testStateStoreLazyEval PASSED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldNotAllowNullStoreSupplierInJoin STARTED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldNotAllowNullStoreSupplierInJoin PASSED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldNotAllowNullStoreSupplierInLeftJoin STARTED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldNotAllowNullStoreSupplierInLeftJoin PASSED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldNotAllowNullActionOnForEach STARTED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldNotAllowNullActionOnForEach PASSED

org.apache.kafka.streams.kstream.internals.KTableImplTest > testKTable STARTED

org.apache.kafka.streams.kstream.internals.KTableImplTest > testKTable PASSED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldNotAllowNullPredicateOnFilterNot STARTED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldNotAllowNullPredicateOnFilterNot PASSED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldAllowNullStoreInThrough STARTED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldAllowNullStoreInThrough PASSED

org.apache.kafka.streams.kstream.internals.KTableImplTest > testValueGetter 
STARTED

org.apache.kafka.streams.kstream.internals.KTableImplTest > testValueGetter 
PASSED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldNotAllowNullFilePathOnWriteAsText STARTED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldNotAllowNullFilePathOnWriteAsText PASSED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldAllowNullTopicInThrough STARTED

org.apache.kafka.streams.kstream.internals.KTableImplTest > 
shouldAllowNullTopicInThrough PASSED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldRemoveMergedSessionsFromStateStore STARTED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldRemoveMergedSessionsFromStateStore PASSED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldMergeSessions STARTED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldMergeSessions PASSED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldHandleMultipleSessionsAndMerging STARTED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldHandleMultipleSessionsAndMerging PASSED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldImmediatelyForwardNewSessionWhenNonCachedStore STARTED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldImmediatelyForwardNewSessionWhenNonCachedStore PASSED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldGetAggregatedValuesFromValueGetter STARTED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldGetAggregatedValuesFromValueGetter PASSED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldImmediatelyForwardRemovedSessionsWhenMerging STARTED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldImmediatelyForwardRemovedSessionsWhenMerging PASSED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldUpdateSessionIfTheSameTime STARTED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldUpdateSessionIfTheSameTime PASSED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldHaveMultipleSessionsForSameIdWhenTimestampApartBySessionGap STARTED

org.apache.kafka.streams.kstream.internals.KStreamSessionWindowAggregateProcessorTest
 > shouldHaveMultipleSessionsForSameIdWhenTimestampApartBySessionGap PASSED


Jenkins build is back to normal : kafka-trunk-jdk7 #2677

2017-08-25 Thread Apache Jenkins Server
See 




[jira] [Created] (KAFKA-5791) Add HTTPS support to the fault injector

2017-08-25 Thread Colin P. McCabe (JIRA)
Colin P. McCabe created KAFKA-5791:
--

 Summary: Add HTTPS support to the fault injector
 Key: KAFKA-5791
 URL: https://issues.apache.org/jira/browse/KAFKA-5791
 Project: Kafka
  Issue Type: Sub-task
Reporter: Colin P. McCabe


Add HTTPS support to the fault injector



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[GitHub] kafka pull request #2104: [KAFKA-4380] Remove cleanshutdownfile

2017-08-25 Thread holdenk
Github user holdenk closed the pull request at:

https://github.com/apache/kafka/pull/2104


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] kafka pull request #3742: [KAFKA-4380] Document the purpose of clean shutdow...

2017-08-25 Thread holdenk
GitHub user holdenk opened a pull request:

https://github.com/apache/kafka/pull/3742

[KAFKA-4380] Document the purpose of clean shutdown file in the code.

Remove the previous TODO to remove the clean shutdown file with some of the 
discussion from https://github.com/apache/kafka/pull/2104.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/holdenk/kafka 
KAFKA-4380-document-clean-shutdown-file

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/kafka/pull/3742.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3742


commit c43bc54b3195fdac575255e59f0e6e62dcb89dba
Author: Holden Karau 
Date:   2017-08-25T18:03:08Z

Replace the todo clearer description of why cleaning it up probably isn't a 
great idea




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


Re: [DISCUSS] KIP-189: Improve principal builder interface and add support for SASL

2017-08-25 Thread Mayuresh Gharat
Hi Jason,

Thanks a lot for the KIP and sorry for the delayed response.

I had a few questions :


   - The KIP says that a user can have a class that extends KafkaPrincipal.
   Would this extended class be used when constructing the Session object in
   the SocketServer instead of constructing a new KafkaPrincipal?


   - The KIP says "A principal is always identifiable by a principal type
   and a name. Nothing else should ever be required." This might not be true
   always, right? For example, we might have a custom third party ACL library
   that creates a custom Principal from the passed in cert (this is done in
   PrincipalBuilder/KafkaPrincipalBuilder) and the custom Authorizer might
   use this third party library to authorize using this custom Principal
   object. The developer who is implementing the Kafka Authorizer should
   not be caring about what the custom Principal would look like and its
   details, since it will just pass it to the third party library in Kafka
   Authorizer's authorize() call.


Thanks,

Mayuresh


On Thu, Aug 24, 2017 at 10:21 AM, Mayuresh Gharat <
gharatmayures...@gmail.com> wrote:

> Sure.
>
> Thanks,
>
> Mayuresh
>
> On Wed, Aug 23, 2017 at 5:07 PM, Jun Rao  wrote:
>
>> Hi, Mayuresh,
>>
>> Since this KIP covers the requirement in KIP-111, could you review it too?
>>
>> Thanks,
>>
>> Jun
>>
>>
>> On Tue, Aug 22, 2017 at 3:04 PM, Jason Gustafson 
>> wrote:
>>
>>> Bump. I'll open a vote in a few days if there are no comments.
>>>
>>> Thanks,
>>> Jason
>>>
>>> On Sat, Aug 19, 2017 at 12:28 AM, Ismael Juma  wrote:
>>>
>>> > Thanks for the KIP Jason. It seems reasonable and cleans up some
>>> > inconsistencies in that area. It would be great to get some feedback
>>> from
>>> > Mayuresh and others who worked on KIP-111.
>>> >
>>> > Ismael
>>> >
>>> > On Thu, Aug 17, 2017 at 1:21 AM, Jason Gustafson 
>>> > wrote:
>>> >
>>> > > Hi All,
>>> > >
>>> > > I've added a new KIP to improve and extend the principal building API
>>> > that
>>> > > Kafka exposes:
>>> > > https://cwiki.apache.org/confluence/display/KAFKA/KIP-
>>> > > 189%3A+Improve+principal+builder+interface+and+add+support+for+SASL
>>> > > .
>>> > >
>>> > > As always, feedback is appreciated.
>>> > >
>>> > > Thanks,
>>> > > Jason
>>> > >
>>> >
>>>
>>
>>
>
>
> --
> -Regards,
> Mayuresh R. Gharat
> (862) 250-7125
>



-- 
-Regards,
Mayuresh R. Gharat
(862) 250-7125


[jira] [Resolved] (KAFKA-4787) KafkaStreams close() is not reentrant

2017-08-25 Thread Guozhang Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-4787?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guozhang Wang resolved KAFKA-4787.
--
Resolution: Duplicate

> KafkaStreams close() is not reentrant
> -
>
> Key: KAFKA-4787
> URL: https://issues.apache.org/jira/browse/KAFKA-4787
> Project: Kafka
>  Issue Type: Improvement
>  Components: streams
>Affects Versions: 0.10.2.0
>Reporter: Steven Schlansker
>
> While building a simple application, I tried to implement a failure policy 
> where any uncaught exception terminates the application until an 
> administrator can evaluate and intervene:
> {code}
> /** Handle any uncaught exception by shutting down the program. */
> private void handleStreamException(Thread thread, Throwable t) {
> LOG.error("stream exception in thread {}", thread, t);
> streams.close();
> }
> streams.setUncaughtExceptionHandler(this::handleStreamException);
> streams.start();
> {code}
> Unfortunately, because the KafkaStreams#close() method takes a lock, this is 
> prone to what looks like a deadlock:
> {code}
> "StreamThread-1" #80 prio=5 os_prio=0 tid=0x7f56096f4000 nid=0x40c8 
> waiting for monitor entry [0x7f54f03ee000]
>java.lang.Thread.State: BLOCKED (on object monitor)
> at org.apache.kafka.streams.KafkaStreams.close(KafkaStreams.java)
> - waiting to lock <0xf171cda8> (a 
> org.apache.kafka.streams.KafkaStreams)
> at org.apache.kafka.streams.KafkaStreams.close(KafkaStreams.java:438)
> at 
> com.opentable.chat.service.ChatStorage$$Lambda$161/1940967023.close(Unknown 
> Source)
> at 
> com.opentable.chat.service.ChatStorage.closeLog(ChatStorage.java:212)
> at com.opentable.chat.service.ChatStorage.close(ChatStorage.java:207)
> at 
> com.opentable.chat.service.ChatStorage.handleStreamException(ChatStorage.java:541)
> at 
> com.opentable.chat.service.ChatStorage$$Lambda$123/149062221.uncaughtException(Unknown
>  Source)
> at java.lang.Thread.dispatchUncaughtException(Thread.java:1956)
> "main" #1 prio=5 os_prio=0 tid=0x7f5608011000 nid=0x3f76 in Object.wait() 
> [0x7f5610f04000]
>java.lang.Thread.State: WAITING (on object monitor)
> at java.lang.Object.wait(Native Method)
> at java.lang.Thread.join(Thread.java:1249)
> - locked <0xfd302bf0> (a java.lang.Thread)
> at org.apache.kafka.streams.KafkaStreams.close(KafkaStreams.java:494)
> - locked <0xf171cda8> (a 
> org.apache.kafka.streams.KafkaStreams)
> at org.apache.kafka.streams.KafkaStreams.close(KafkaStreams.java:438)
> at 
> com.opentable.chat.service.ChatStorage$$Lambda$161/1940967023.close(Unknown 
> Source)
> at 
> com.opentable.chat.service.ChatStorage.closeLog(ChatStorage.java:212)
> at com.opentable.chat.service.ChatStorage.close(ChatStorage.java:207)
> {code}
> Note how the main thread calls close(), which encounters an exception.  It 
> uses a StreamThread to dispatch to the handler, which calls close().  Once it 
> tries to take the monitor, we are left in a position where main is joined on 
> StreamThread-1, but StreamThread-1 is waiting for main to release that 
> monitor.
> Arguably it's a bit abusive to call close() in this way (it certainly wasn't 
> intentional) -- but to make Kafka Streams robust it should handle any 
> sequence of close() invocations gracefully.
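
A hedged workaround sketch (not from this ticket): have the handler hand close() off to a separate thread, so the dying StreamThread never blocks on the KafkaStreams monitor while the main thread is joined on it. It assumes the timed close(timeout, unit) overload is available; "streams" and "LOG" are the fields from the snippet above.

{code:java}
private void handleStreamException(Thread thread, Throwable t) {
    LOG.error("stream exception in thread {}", thread, t);
    // Dispatch close() to a dedicated thread so this (stream) thread can exit,
    // which lets the join() inside the other close() call complete.
    new Thread(() -> streams.close(30, java.util.concurrent.TimeUnit.SECONDS),
               "streams-close-on-error").start();
}
{code}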



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[GitHub] kafka pull request #3673: MINOR: Consolidate broker request/response handlin...

2017-08-25 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/kafka/pull/3673


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Resolved] (KAFKA-3826) Sampling on throughput / latency metrics recording in Streams

2017-08-25 Thread Guozhang Wang (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-3826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Guozhang Wang resolved KAFKA-3826.
--
Resolution: Fixed

This has been done as part of KAFKA-5152.

> Sampling on throughput / latency metrics recording in Streams
> -
>
> Key: KAFKA-3826
> URL: https://issues.apache.org/jira/browse/KAFKA-3826
> Project: Kafka
>  Issue Type: Bug
>  Components: streams
>Reporter: Guozhang Wang
>  Labels: architecture, performance
>
> In Kafka Streams we record throughput / latency metrics on EACH processed 
> record, causing a lot of recording overhead. Instead, we should consider 
> statistically sampling messages flowing through to measure latency and 
> throughput.
> This is based on our observations from KAFKA-3769 and KAFKA-3811.
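
A minimal sketch of the sampling idea (illustrative only; the rate and the sensor wiring are assumptions, not what Streams does today):

{code:java}
import java.util.concurrent.ThreadLocalRandom;

public class SampledLatencyRecorder {
    private final double sampleRate; // e.g. 0.01 to record ~1% of records

    public SampledLatencyRecorder(double sampleRate) {
        this.sampleRate = sampleRate;
    }

    public void maybeRecord(long startNs, long endNs) {
        // Only pay the metrics-recording cost for a random subset of records.
        if (ThreadLocalRandom.current().nextDouble() < sampleRate) {
            record((endNs - startNs) / 1_000_000.0);
        }
    }

    private void record(double latencyMs) {
        // A real implementation would feed a latency sensor here.
        System.out.printf("sampled latency: %.3f ms%n", latencyMs);
    }
}
{code}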



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


Re: [DISCUSS] KIP-184 Rename LogCleaner and related classes to LogCompactor

2017-08-25 Thread Pranav Maniar
Yes, I can take up the deprecation of "log.cleaner.enable".

Will it require a KIP?
As per my understanding, we will keep honoring the value set for
"log.cleaner.enable" for as long as it is around. For now, only a warning
message about the deprecation will be logged.

Or should we remove the config right away?


Thanks,
Pranav

On Wed, Aug 23, 2017 at 3:37 AM, Jason Gustafson  wrote:

> Hi Pranav,
>
> Yeah, I'd recommend closing it since the benefit is unclear and since no
> one has jumped in to offer stronger support for the change. Were you
> planning to do a KIP to deprecate `log.cleaner.enable`? I still think that
> makes sense.
>
> Thanks,
> Jason
>
> On Tue, Aug 22, 2017 at 1:47 PM, Colin McCabe  wrote:
>
> > Hmm.  There are a lot of configuration keys that involve "log cleaner."
> > It seems like if we rename this component, logically we'd have to rename
> > all of them and support the old versions as deprecated config keys:
> >
> >   val LogCleanupPolicyProp = "log.cleanup.policy"
> >   val LogCleanerThreadsProp = "log.cleaner.threads"
> >   val LogCleanerIoMaxBytesPerSecondProp =
> >   "log.cleaner.io.max.bytes.per.second"
> >   val LogCleanerDedupeBufferSizeProp = "log.cleaner.dedupe.buffer.size"
> >   val LogCleanerIoBufferSizeProp = "log.cleaner.io.buffer.size"
> >   val LogCleanerDedupeBufferLoadFactorProp =
> >   "log.cleaner.io.buffer.load.factor"
> >   val LogCleanerBackoffMsProp = "log.cleaner.backoff.ms"
> >   val LogCleanerMinCleanRatioProp = "log.cleaner.min.cleanable.ratio"
> >   val LogCleanerEnableProp = "log.cleaner.enable"
> >   val LogCleanerDeleteRetentionMsProp =
> >   "log.cleaner.delete.retention.ms"
> >   val LogCleanerMinCompactionLagMsProp =
> >   "log.cleaner.min.compaction.lag.ms"
> >
> > This seems like it would be quite painful to users, since they'd have to
> > deal with deprecation warnings and multiple names for the same
> > configuration.  In general I think Jason and Ismael's point is valid: do
> > we have evidence that "log cleaner" is causing confusion?  If not, it
> > may not be worth it to rename this at the moment.
> >
> > regards,
> > Colin
> >
> >
> > On Mon, Aug 21, 2017, at 05:19, Pranav Maniar wrote:
> > > Hi Jason,
> > >
> > > Haven't heard from other on this KIP. Should I close it ?
> > >
> > > ~Pranav
> > >
> > > On Thu, Aug 10, 2017 at 12:04 AM, Jason Gustafson 
> > > wrote:
> > >
> > > > Hey Pranav,
> > > >
> > > > Let's see what others think before closing the KIP. If there are
> strong
> > > > reasons for the renaming, I would reconsider.
> > > >
> > > > As far as deprecating `log.cleaner.enable`, I think it's a good idea
> > and
> > > > can be done in a separate KIP. Guozhang's suggestion seems
> reasonable,
> > but
> > > > I'd just turn it on always (it won't cause much harm if there are no
> > topics
> > > > enabled for compaction). This is an implementation detail which
> > probably
> > > > doesn't need to be included in the KIP.
> > > >
> > > > -Jason
> > > >
> > > > On Wed, Aug 9, 2017 at 10:47 AM, Pranav Maniar  >
> > > > wrote:
> > > >
> > > > > Thanks Ismael, Jason for the suggestion.
> > > > > My bad. I should have followed up on mail-list discussion before
> > starting
> > > > > KIP. Apologies.
> > > > >
> > > > > I am relatively new, so I do not know if any confusion was reported
> > in
> > > > past
> > > > > due to terminology. May be others can chime in.
> > > > > If the old naming is fine with majority then no changes will be
> > needed. I
> > > > > will mark JIRA as wont'fix and close the KIP !
> > > > >
> > > > > Ismael, Jason,
> > > > > There was another suggestion from Guozhang on deprecating and
> > eventually
> > > > > removing log.cleaner.enable property all together and always
> > enabling log
> > > > > cleaner if "log.cleanup.policy=compact".
> > > > > What are your suggestion on this ?
> > > > >
> > > > >
> > > > > Thanks,
> > > > > Pranav
> > > > >
> > > > > On Wed, Aug 9, 2017 at 10:27 PM, Jason Gustafson <
> ja...@confluent.io
> > >
> > > > > wrote:
> > > > >
> > > > > > Yes, as Ismael noted above, I am not fond of this renaming. Keep
> in
> > > > mind
> > > > > > that the LogCleaner does not only handle compaction. It is
> > possible to
> > > > > > configure a cleanup policy of "compact" and "delete," in which
> > case the
> > > > > > LogCleaner also handles removal of old segments. Hence the more
> > general
> > > > > > LogCleaner name is more appropriate in my opinion.
> > > > > >
> > > > > > -Jason
> > > > > >
> > > > > > On Wed, Aug 9, 2017 at 9:49 AM, Pranav Maniar <
> > pranav9...@gmail.com>
> > > > > > wrote:
> > > > > >
> > > > > > > Thanks Ewen for the suggestions.
> > > > > > > I have updated KIP-184. Updates done are :
> > > > > > >
> > > > > > > 1. If deprecated property is encountered currently, then its
> > value
> > > > will
> > > > > > be
> > > > > > > considered while enabling compactor.
> > > > > > > 2.  

[jira] [Resolved] (KAFKA-1944) Rename LogCleaner and related classes to LogCompactor

2017-08-25 Thread Pranav Maniar (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-1944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pranav Maniar resolved KAFKA-1944.
--
Resolution: Won't Fix

As discussed on the mailing list and in the KIP-184 discussion:
this rename would lead to many configuration name changes, which seems to be
unnecessary trouble for users. There is also no evidence of users getting confused
by the existing naming, so we have decided to stick with the existing names.

> Rename LogCleaner and related classes to LogCompactor
> -
>
> Key: KAFKA-1944
> URL: https://issues.apache.org/jira/browse/KAFKA-1944
> Project: Kafka
>  Issue Type: Bug
>Reporter: Gwen Shapira
>Assignee: Pranav Maniar
>  Labels: newbie
>
> Following a mailing list discussion:
> "the name LogCleaner is seriously misleading. Its more of a log compactor. 
> Deleting old logs happens elsewhere from what I've seen."
> Note that this may require renaming related classes, objects, configs and 
> metrics.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


Re: [VOTE] KIP-152 - Improve diagnostics for SASL authentication failures

2017-08-25 Thread Jun Rao
Hi, Rajini,

Thanks for the KIP. +1

Jun

On Thu, Aug 24, 2017 at 10:29 AM, Rajini Sivaram 
wrote:

> Hi all,
>
> I would like to start vote on KIP-152 to improve diagnostics of
> authentication failures and to update clients to treat authentication
> failures as fatal exceptions rather than transient errors:
> https://cwiki.apache.org/confluence/display/KAFKA/KIP-
> 152+-+Improve+diagnostics+for+SASL+authentication+failures
>
> Thank you...
>
> Rajini
>


Re: [DISCUSS] KIP-91 Provide Intuitive User Timeouts in The Producer

2017-08-25 Thread Jun Rao
Hi, Becket,

Good point on expiring inflight requests. Perhaps we can expire an inflight
request after min(remaining delivery.timeout.ms, request.timeout.ms). This
way, if a user sets a high delivery.timeout.ms, we can still recover from
broker power outage sooner.
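
In other words (an illustrative helper only; the variable names are hypothetical, not producer internals):

```java
public final class InflightExpiry {
    // Expire an in-flight request after min(remaining delivery.timeout.ms,
    // request.timeout.ms), clamping the remaining delivery time at zero.
    static long inFlightExpiryMs(long nowMs, long batchCreatedMs,
                                 long deliveryTimeoutMs, long requestTimeoutMs) {
        long remainingDeliveryMs = Math.max(deliveryTimeoutMs - (nowMs - batchCreatedMs), 0L);
        return Math.min(remainingDeliveryMs, requestTimeoutMs);
    }
}
```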

Thanks,

Jun

On Thu, Aug 24, 2017 at 12:52 PM, Becket Qin  wrote:

> Hi Jason,
>
> delivery.timeout.ms sounds good to me.
>
> I was referring to the case that we are resetting the PID/sequence after
> expire a batch. This is more about the sending the batches after the
> expired batch.
>
> The scenario being discussed is expiring one of the batches in an in-flight
> request and retrying the other batches in that in-flight request. So
> consider the following case:
> 1. Producer sends request_0 with two batches (batch_0_tp0 and batch_0_tp1).
> 2. Broker receives the request and appends it to the log.
> 3. Before the producer receives the response from the broker, batch_0_tp0
> expires. The producer will expire batch_0_tp0 immediately, reset the PID, and
> then resend batch_0_tp1, and maybe send batch_1_tp0 (i.e. the batch following
> the expired one) as well.
>
> For batch_0_tp1, it is OK to reuse the PID and sequence number. The problem
> is for batch_1_tp0: if we reuse the same PID and the broker has already
> appended batch_0_tp0, the broker will think batch_1_tp0 is a duplicate with
> the same sequence number. As a result the broker will drop batch_1_tp0. That is
> why we have to either bump up the sequence number or reset the PID. To avoid this
> complexity, I was suggesting not expiring the in-flight batch immediately,
> but waiting for the produce response. If the batch has been successfully
> appended, we do not expire it. Otherwise, we expire it.
>
> Thanks,
>
> Jiangjie (Becket) Qin
>
>
>
> On Thu, Aug 24, 2017 at 11:26 AM, Jason Gustafson 
> wrote:
>
> > @Becket
> >
> > Good point about unnecessarily resetting the PID in cases where we know
> the
> > request has failed. Might be worth opening a JIRA to try and improve
> this.
> >
> > So if we expire the batch prematurely and resend all
> > > the other batches in the same request, chances are there will be
> > > duplicates. If we wait for the response instead, it is less likely to
> > > introduce duplicates, and we may not need to reset the PID.
> >
> >
> > Not sure I follow this. Are you assuming that we change the batch
> > PID/sequence of the retried batches after resetting the PID? I think we
> > probably need to ensure that when we retry a batch, we always use the
> same
> > PID/sequence.
> >
> > By the way, as far as naming, `max.message.delivery.wait.ms` is quite a
> > mouthful. Could we shorten it? Perhaps `delivery.timeout.ms`?
> >
> > -Jason
> >
> > On Wed, Aug 23, 2017 at 8:51 PM, Becket Qin 
> wrote:
> >
> > > Hi Jun,
> > >
> > > If TCP timeout is longer than request.timeout.ms, the producer will
> > always
> > > hit request.timeout.ms before hitting TCP timeout, right? That is why
> we
> > > added request.timeout.ms in the first place.
> > >
> > > You are right. Currently we reset the PID and resend the batches to
> > > avoid OutOfOrderSequenceException when the expired batches are in
> retry.
> > >
> > > This does not distinguish the reasons that caused the retry. There are
> > two
> > > cases:
> > > 1. If the batch was in retry because it received an error response
> (e.g.
> > > NotLeaderForPartition), we actually don't need to reset PID in this
> case
> > > because we know that broker did not accept it.
> > > 2. If the batch was in retry because it hit a timeout earlier, then we
> > > should reset the PID (or optimistically send and only reset PID when
> > > receive OutOfOrderSequenceException?)
> > > Case 1 is probably the most common case, so it looks that we are
> > resetting
> > > the PID more often than necessary. But because in case 1 the broker
> does
> > > not have the batch, there isn't much impact on resetting the PID and resending
> > other
> > > than the additional round trip.
> > >
> > > Now we are introducing another case:
> > > 3. A batch is in retry because we expired an in-flight request before
> it
> > > hits request.timeout.ms.
> > >
> > > The difference between 2 and 3 is that in case 3 likely the broker has
> > > appended the messages. So if we expire the batch prematurely and resend
> > all
> > > the other batches in the same request, chances are there will be
> > > duplicates. If we wait for the response instead, it is less likely to
> > > introduce duplicates, and we may not need to reset the PID.
> > >
> > > That said, given that batch expiration is probably already rare enough,
> > so
> > > it may not be necessary to optimize for that.
> > >
> > > Thanks,
> > >
> > > Jiangjie (Becket) Qin
> > >
> > >
> > >
> > > On Wed, Aug 23, 2017 at 5:01 PM, Jun Rao  wrote:
> > >
> > > > Hi, Becket,
> > > >
> > > > If a message expires while it's in an inflight produce request, the
> > > > 

Re: [VOTE] KIP-182 - Reduce Streams DSL overloads and allow easier use of custom storage engines

2017-08-25 Thread Matthias J. Sax
Thanks Damian. Great KIP!

+1


-Matthias

On 8/25/17 6:45 AM, Damian Guy wrote:
> Hi,
> 
> I've just realised we need to add two methods to StateStoreBuilder or it
> isn't going to work:
> 
> Map<String, String> logConfig();
> boolean loggingEnabled();
> 
> These are needed when we are building the topology and determining
> changelog topic names and configs.
> 
> 
> I've also updated the KIP to add
> 
> StreamBuilder#stream(String topic)
> 
> StreamBuilder#stream(String topic, Consumed options)
> 
> 
> Thanks
> 
> 
> On Thu, 24 Aug 2017 at 22:11 Sriram Subramanian  wrote:
> 
>> +1
>>
>> On Thu, Aug 24, 2017 at 10:20 AM, Guozhang Wang 
>> wrote:
>>
>>> +1. Thanks Damian!
>>>
>>> On Thu, Aug 24, 2017 at 9:47 AM, Bill Bejeck  wrote:
>>>
 Thanks for the KIP!

 +1

 Thanks,
 Bill

 On Thu, Aug 24, 2017 at 12:25 PM, Damian Guy 
>>> wrote:

> Hi,
>
> I'd like to kick off the voting thread for KIP-182:
> https://cwiki.apache.org/confluence/display/KAFKA/KIP-
> 182%3A+Reduce+Streams+DSL+overloads+and+allow+easier+
> use+of+custom+storage+engines
>
> Thanks,
> Damian
>

>>>
>>>
>>>
>>> --
>>> -- Guozhang
>>>
>>
> 



signature.asc
Description: OpenPGP digital signature


Jenkins build is back to normal : kafka-trunk-jdk8 #1943

2017-08-25 Thread Apache Jenkins Server
See 




[GitHub] kafka pull request #3741: KAFKA-5762; LogContext used to capture the clientI...

2017-08-25 Thread Kamal15
GitHub user Kamal15 opened a pull request:

https://github.com/apache/kafka/pull/3741

KAFKA-5762; LogContext used to capture the clientId implicitly in the…

… logs.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/Kamal15/kafka kafka-5762

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/kafka/pull/3741.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3741


commit 274a9688a004f63b74a8997f84082456e45fe785
Author: Kamal Chandraprakash 
Date:   2017-08-25T14:46:57Z

KAFKA-5762; LogContext used to capture the clientId implicitly in the logs.




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


Build failed in Jenkins: kafka-trunk-jdk7 #2676

2017-08-25 Thread Apache Jenkins Server
See 

--
[...truncated 915.80 KB...]

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotResetEpochHistoryTailIfUndefinedPassed STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotResetEpochHistoryTailIfUndefinedPassed PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldReturnUnsupportedIfNoEpochRecorded STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldReturnUnsupportedIfNoEpochRecorded PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldRetainLatestEpochOnClearAllEarliest STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldRetainLatestEpochOnClearAllEarliest PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldPersistEpochsBetweenInstances STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldPersistEpochsBetweenInstances PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotClearAnythingIfOffsetToFirstOffset STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotClearAnythingIfOffsetToFirstOffset PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotLetOffsetsGoBackwardsEvenIfEpochsProgress STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotLetOffsetsGoBackwardsEvenIfEpochsProgress PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldGetFirstOffsetOfSubsequentEpochWhenOffsetRequestedForPreviousEpoch STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldGetFirstOffsetOfSubsequentEpochWhenOffsetRequestedForPreviousEpoch PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldUpdateOffsetBetweenEpochBoundariesOnClearEarliest2 STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldUpdateOffsetBetweenEpochBoundariesOnClearEarliest2 PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldClearEarliestOnEmptyCache 
STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldClearEarliestOnEmptyCache 
PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldPreserveResetOffsetOnClearEarliestIfOneExists STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldPreserveResetOffsetOnClearEarliestIfOneExists PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldUpdateOffsetBetweenEpochBoundariesOnClearEarliest STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldUpdateOffsetBetweenEpochBoundariesOnClearEarliest PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldReturnInvalidOffsetIfEpochIsRequestedWhichIsNotCurrentlyTracked STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldReturnInvalidOffsetIfEpochIsRequestedWhichIsNotCurrentlyTracked PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldFetchEndOffsetOfEmptyCache 
STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldFetchEndOffsetOfEmptyCache 
PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldRetainLatestEpochOnClearAllEarliestAndUpdateItsOffset STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldRetainLatestEpochOnClearAllEarliestAndUpdateItsOffset PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldClearAllEntries STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldClearAllEntries PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldClearLatestOnEmptyCache 
STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldClearLatestOnEmptyCache 
PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotResetEpochHistoryHeadIfUndefinedPassed STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotResetEpochHistoryHeadIfUndefinedPassed PASSED

kafka.server.epoch.LeaderEpochIntegrationTest > 
shouldIncreaseLeaderEpochBetweenLeaderRestarts STARTED

kafka.server.epoch.LeaderEpochIntegrationTest > 
shouldIncreaseLeaderEpochBetweenLeaderRestarts PASSED

kafka.server.epoch.LeaderEpochIntegrationTest > 
shouldAddCurrentLeaderEpochToMessagesAsTheyAreWrittenToLeader STARTED

kafka.server.epoch.LeaderEpochIntegrationTest > 
shouldAddCurrentLeaderEpochToMessagesAsTheyAreWrittenToLeader PASSED

kafka.server.epoch.LeaderEpochIntegrationTest > 
shouldSendLeaderEpochRequestAndGetAResponse STARTED

kafka.server.epoch.LeaderEpochIntegrationTest > 
shouldSendLeaderEpochRequestAndGetAResponse PASSED

kafka.server.epoch.OffsetsForLeaderEpochTest > shouldGetEpochsFromReplica 
STARTED

kafka.server.epoch.OffsetsForLeaderEpochTest > shouldGetEpochsFromReplica PASSED

kafka.server.epoch.OffsetsForLeaderEpochTest > 
shouldReturnUnknownTopicOrPartitionIfThrown STARTED

kafka.server.epoch.OffsetsForLeaderEpochTest > 
shouldReturnUnknownTopicOrPartitionIfThrown PASSED

kafka.server.epoch.OffsetsForLeaderEpochTest > 
shouldReturnNoLeaderForPartitionIfThrown STARTED

kafka.server.epoch.OffsetsForLeaderEpochTest > 
shouldReturnNoLeaderForPartitionIfThrown PASSED

kafka.server.epoch.EpochDrivenReplicationProtocolAcceptanceTest > 
shouldSurviveFastLeaderChange STARTED


Re: [VOTE] KIP-182 - Reduce Streams DSL overloads and allow easier use of custom storage engines

2017-08-25 Thread Damian Guy
Hi,

I've just realised we need to add two methods to StateStoreBuilder or it
isn't going to work:

Map<String, String> logConfig();
boolean loggingEnabled();

These are needed when we are building the topology and determining
changelog topic names and configs.


I've also updated the KIP to add

StreamBuilder#stream(String topic)

StreamBuilder#stream(String topic, Consumed options)
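
For illustration, the builder additions could look roughly like this (a sketch under my own naming assumptions, not the final KIP-182 interface):

```java
import java.util.Map;

public interface StateStoreBuilderSketch<T> {
    T build();

    // Configs to apply to the store's changelog topic when logging is enabled;
    // needed while building the topology to derive changelog topic configs.
    Map<String, String> logConfig();

    // Whether a changelog topic should be created for this store.
    boolean loggingEnabled();
}
```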


Thanks


On Thu, 24 Aug 2017 at 22:11 Sriram Subramanian  wrote:

> +1
>
> On Thu, Aug 24, 2017 at 10:20 AM, Guozhang Wang 
> wrote:
>
> > +1. Thanks Damian!
> >
> > On Thu, Aug 24, 2017 at 9:47 AM, Bill Bejeck  wrote:
> >
> > > Thanks for the KIP!
> > >
> > > +1
> > >
> > > Thanks,
> > > Bill
> > >
> > > On Thu, Aug 24, 2017 at 12:25 PM, Damian Guy 
> > wrote:
> > >
> > > > Hi,
> > > >
> > > > I'd like to kick off the voting thread for KIP-182:
> > > > https://cwiki.apache.org/confluence/display/KAFKA/KIP-
> > > > 182%3A+Reduce+Streams+DSL+overloads+and+allow+easier+
> > > > use+of+custom+storage+engines
> > > >
> > > > Thanks,
> > > > Damian
> > > >
> > >
> >
> >
> >
> > --
> > -- Guozhang
> >
>


Build failed in Jenkins: kafka-trunk-jdk7 #2675

2017-08-25 Thread Apache Jenkins Server
See 


Changes:

[damian.guy] KAFKA-5771; 
org.apache.kafka.streams.state.internals.Segments#segments

--
[...truncated 916.05 KB...]

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotResetEpochHistoryTailIfUndefinedPassed STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotResetEpochHistoryTailIfUndefinedPassed PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldReturnUnsupportedIfNoEpochRecorded STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldReturnUnsupportedIfNoEpochRecorded PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldRetainLatestEpochOnClearAllEarliest STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldRetainLatestEpochOnClearAllEarliest PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldPersistEpochsBetweenInstances STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldPersistEpochsBetweenInstances PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotClearAnythingIfOffsetToFirstOffset STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotClearAnythingIfOffsetToFirstOffset PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotLetOffsetsGoBackwardsEvenIfEpochsProgress STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotLetOffsetsGoBackwardsEvenIfEpochsProgress PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldGetFirstOffsetOfSubsequentEpochWhenOffsetRequestedForPreviousEpoch STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldGetFirstOffsetOfSubsequentEpochWhenOffsetRequestedForPreviousEpoch PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldUpdateOffsetBetweenEpochBoundariesOnClearEarliest2 STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldUpdateOffsetBetweenEpochBoundariesOnClearEarliest2 PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldClearEarliestOnEmptyCache 
STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldClearEarliestOnEmptyCache 
PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldPreserveResetOffsetOnClearEarliestIfOneExists STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldPreserveResetOffsetOnClearEarliestIfOneExists PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldUpdateOffsetBetweenEpochBoundariesOnClearEarliest STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldUpdateOffsetBetweenEpochBoundariesOnClearEarliest PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldReturnInvalidOffsetIfEpochIsRequestedWhichIsNotCurrentlyTracked STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldReturnInvalidOffsetIfEpochIsRequestedWhichIsNotCurrentlyTracked PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldFetchEndOffsetOfEmptyCache 
STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldFetchEndOffsetOfEmptyCache 
PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldRetainLatestEpochOnClearAllEarliestAndUpdateItsOffset STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldRetainLatestEpochOnClearAllEarliestAndUpdateItsOffset PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldClearAllEntries STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldClearAllEntries PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldClearLatestOnEmptyCache 
STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > shouldClearLatestOnEmptyCache 
PASSED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotResetEpochHistoryHeadIfUndefinedPassed STARTED

kafka.server.epoch.LeaderEpochFileCacheTest > 
shouldNotResetEpochHistoryHeadIfUndefinedPassed PASSED

kafka.server.epoch.LeaderEpochIntegrationTest > 
shouldIncreaseLeaderEpochBetweenLeaderRestarts STARTED

kafka.server.epoch.LeaderEpochIntegrationTest > 
shouldIncreaseLeaderEpochBetweenLeaderRestarts PASSED

kafka.server.epoch.LeaderEpochIntegrationTest > 
shouldAddCurrentLeaderEpochToMessagesAsTheyAreWrittenToLeader STARTED

kafka.server.epoch.LeaderEpochIntegrationTest > 
shouldAddCurrentLeaderEpochToMessagesAsTheyAreWrittenToLeader PASSED

kafka.server.epoch.LeaderEpochIntegrationTest > 
shouldSendLeaderEpochRequestAndGetAResponse STARTED

kafka.server.epoch.LeaderEpochIntegrationTest > 
shouldSendLeaderEpochRequestAndGetAResponse PASSED

kafka.server.epoch.OffsetsForLeaderEpochTest > shouldGetEpochsFromReplica 
STARTED

kafka.server.epoch.OffsetsForLeaderEpochTest > shouldGetEpochsFromReplica PASSED

kafka.server.epoch.OffsetsForLeaderEpochTest > 
shouldReturnUnknownTopicOrPartitionIfThrown STARTED

kafka.server.epoch.OffsetsForLeaderEpochTest > 
shouldReturnUnknownTopicOrPartitionIfThrown PASSED

kafka.server.epoch.OffsetsForLeaderEpochTest > 
shouldReturnNoLeaderForPartitionIfThrown STARTED

kafka.server.epoch.OffsetsForLeaderEpochTest > 
shouldReturnNoLeaderForPartitionIfThrown PASSED


[jira] [Created] (KAFKA-5790) SocketServer.processNewResponses should not skip a response if exception is thrown

2017-08-25 Thread Ismael Juma (JIRA)
Ismael Juma created KAFKA-5790:
--

 Summary: SocketServer.processNewResponses should not skip a 
response if exception is thrown
 Key: KAFKA-5790
 URL: https://issues.apache.org/jira/browse/KAFKA-5790
 Project: Kafka
  Issue Type: Bug
Reporter: Ismael Juma
Assignee: Ismael Juma
Priority: Critical
 Fix For: 0.11.0.1, 1.0.0


The current code has a try/finally that causes a response to be dropped when an 
exception is thrown during `processNewResponses`. This could be one of the reasons 
for KAFKA-4669.
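
An illustrative sketch of the pattern (not the actual SocketServer code): when the next element is polled inside {{finally}}, any exception in the body lets that freshly polled element escape without ever being handled.

{code:java}
import java.util.ArrayDeque;
import java.util.Queue;

public class DroppedResponseSketch {
    private static final Queue<String> responses = new ArrayDeque<>();

    public static void main(String[] args) {
        responses.add("resp-1"); // processing this one will throw
        responses.add("resp-2"); // this one is silently dropped

        // Long-lived outer loop, like a processor thread: it swallows errors
        // and keeps running.
        while (!responses.isEmpty()) {
            try {
                processNewResponses();
            } catch (Exception e) {
                System.out.println("error: " + e.getMessage());
            }
        }
    }

    private static void processNewResponses() {
        String current = responses.poll();
        while (current != null) {
            try {
                process(current);
            } finally {
                // If process() throws, this still polls the next response,
                // which is then lost when the exception propagates.
                current = responses.poll();
            }
        }
    }

    private static void process(String response) {
        if (response.endsWith("1")) {
            throw new IllegalStateException("failed on " + response);
        }
        System.out.println("processed " + response);
    }
}
{code}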



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[GitHub] kafka pull request #3737: KAFKA-5771: org.apache.kafka.streams.state.interna...

2017-08-25 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/kafka/pull/3737


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Resolved] (KAFKA-5771) org.apache.kafka.streams.state.internals.Segments#segments method returns incorrect results when segments were added out of order

2017-08-25 Thread Damian Guy (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-5771?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Damian Guy resolved KAFKA-5771.
---
   Resolution: Fixed
Fix Version/s: 0.11.0.1
   1.0.0

Issue resolved by pull request 3737
[https://github.com/apache/kafka/pull/3737]

> org.apache.kafka.streams.state.internals.Segments#segments method returns 
> incorrect results when segments were added out of order
> -
>
> Key: KAFKA-5771
> URL: https://issues.apache.org/jira/browse/KAFKA-5771
> Project: Kafka
>  Issue Type: Bug
>  Components: streams
>Affects Versions: 0.11.0.0
>Reporter: Alexander Radzishevsky
> Fix For: 1.0.0, 0.11.0.1
>
>
> following unit test in org.apache.kafka.streams.state.internals.SegmentsTest 
> will fail
> {code:title=org.apache.kafka.streams.state.internals.SegmentsTest.java|borderStyle=solid}
> @Test
> public void shouldGetSegmentsWithinTimeRangeOutOfOrder() throws Exception {
>     segments.getOrCreateSegment(4, context);
>     segments.getOrCreateSegment(2, context);
>     segments.getOrCreateSegment(0, context);
>     segments.getOrCreateSegment(1, context);
>     segments.getOrCreateSegment(3, context);
>     final List<Segment> segments = this.segments.segments(0, 2 * 60 * 1000);
>     assertEquals(3, segments.size());
>     assertEquals(0, segments.get(0).id);
>     assertEquals(1, segments.get(1).id);
>     assertEquals(2, segments.get(2).id);
> }
> {code}
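
One robust way to implement such a range lookup (a sketch under assumed names, not 
necessarily the committed fix in PR 3737) is to derive the candidate segment ids from 
the requested time range and walk them in ascending order, instead of relying on the 
insertion order of the underlying map:

{code:java}
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical, order-independent segments(from, to) lookup; names and types are illustrative.
class SegmentsSketch {
    private static final long SEGMENT_INTERVAL_MS = 60_000L;
    private final Map<Long, String> segmentsById = new ConcurrentHashMap<>();

    void getOrCreateSegment(long id) {
        segmentsById.putIfAbsent(id, "segment-" + id);
    }

    List<String> segments(long fromMs, long toMs) {
        List<String> result = new ArrayList<>();
        // Walk candidate ids in ascending order so the result is sorted
        // regardless of the order in which segments were created.
        for (long id = fromMs / SEGMENT_INTERVAL_MS; id <= toMs / SEGMENT_INTERVAL_MS; id++) {
            String segment = segmentsById.get(id);
            if (segment != null)
                result.add(segment);
        }
        return result;
    }
}
{code}

With the test's inputs (segments 4, 2, 0, 1, 3 created out of order, range 0 to 
2 * 60 * 1000), this returns the three segments with ids 0, 1 and 2 in ascending order.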



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[GitHub] kafka pull request #3465: MINOR: Fixed the invalid link to Log Compaction

2017-08-25 Thread Kamal15
Github user Kamal15 closed the pull request at:

https://github.com/apache/kafka/pull/3465


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] kafka pull request #3740: MINOR: reduce logging to trace in NetworkClient wh...

2017-08-25 Thread dguy
GitHub user dguy opened a pull request:

https://github.com/apache/kafka/pull/3740

MINOR: reduce logging to trace in NetworkClient when an old server API is 
being used

Logging in `NetworkClient#doSend` at debug level spams the logs when a producer is 
sending many requests, which makes it extremely difficult to debug. Reduce the level 
to trace to remove the noise from the logs.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/dguy/kafka minor-network-client-logging

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/kafka/pull/3740.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3740


commit 89a07a5bbdd11bd2e88bd98dae9bde48d3530eaf
Author: Damian Guy 
Date:   2017-08-25T11:25:36Z

reduce logging to trace in NetworkClient when an old server API is being 
used




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Resolved] (KAFKA-4869) 0.10.2.0 release notes incorrectly include KIP-115

2017-08-25 Thread Manikumar (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-4869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Manikumar resolved KAFKA-4869.
--
Resolution: Fixed
  Assignee: Manikumar

> 0.10.2.0 release notes incorrectly include KIP-115
> --
>
> Key: KAFKA-4869
> URL: https://issues.apache.org/jira/browse/KAFKA-4869
> Project: Kafka
>  Issue Type: Bug
>  Components: documentation
>Affects Versions: 0.10.2.0
>Reporter: Yeva Byzek
>Assignee: Manikumar
>Priority: Minor
>
> From http://kafka.apache.org/documentation.html :
> bq. The offsets.topic.replication.factor broker config is now enforced upon 
> auto topic creation. Internal auto topic creation will fail with a 
> GROUP_COORDINATOR_NOT_AVAILABLE error until the cluster size meets this 
> replication factor requirement.
> Even though this feature 
> [KIP-115|https://cwiki.apache.org/confluence/display/KAFKA/KIP-115%3A+Enforce+offsets.topic.replication.factor+upon+__consumer_offsets+auto+topic+creation]
>  did not make it into 0.10.2.0, the 0.10.2.0 documentation still describes it.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[GitHub] kafka pull request #3739: KAFKA-5789: Deleted topic is recreated when consum...

2017-08-25 Thread stakafum
GitHub user stakafum opened a pull request:

https://github.com/apache/kafka/pull/3739

KAFKA-5789: Deleted topic is recreated when consumer subscribe the deleted 
one



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/stakafum/kafka KAFKA-5789

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/kafka/pull/3739.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3739


commit 82a51f999cececc9c3a0dcc99231592b1efdf8dc
Author: saito-takafumi 
Date:   2017-08-25T10:24:16Z

KAFKA-5789: disable auto topic creation by consumer




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] kafka pull request #3735: KAFKA-4869: Update 0.10.2.0 upgrade notes

2017-08-25 Thread omkreddy
Github user omkreddy closed the pull request at:

https://github.com/apache/kafka/pull/3735


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Created] (KAFKA-5789) Deleted topic is recreated when consumer subscribe the deleted one

2017-08-25 Thread Takafumi Saito (JIRA)
Takafumi Saito created KAFKA-5789:
-

 Summary: Deleted topic is recreated when consumer subscribe the 
deleted one
 Key: KAFKA-5789
 URL: https://issues.apache.org/jira/browse/KAFKA-5789
 Project: Kafka
  Issue Type: Bug
  Components: clients
Affects Versions: 0.11.0.0
Reporter: Takafumi Saito


When auto.create.topics.enable is set to true on the broker, some deleted topics will 
be re-created: if a consumer that subscribes to the deleted topic still exists, the 
broker will create a topic with the same name again.
Consumers do not need to trigger topic creation, so automatic topic creation on 
behalf of consumers should be disabled.

I attach the log output from our broker.
It shows that a topic (topic_1) was deleted at 12:02:24,672, but the same topic 
was created again shortly thereafter:

{code:java}
[2017-08-22 12:02:24,666] INFO [ReplicaFetcherManager on broker 1] Removed 
fetcher for partitions topic_1-0 (kafka.server.ReplicaFetcherManager)
[2017-08-22 12:02:24,666] INFO [ReplicaFetcherManager on broker 1] Removed 
fetcher for partitions  (kafka.server.ReplicaFetcherManager)
[2017-08-22 12:02:24,667] INFO [ReplicaFetcherManager on broker 1] Removed 
fetcher for partitions topic_1-0 (kafka.server.ReplicaFetcherManager)
[2017-08-22 12:02:24,672] INFO Log for partition topic_1-0 is renamed to 
/data/topic_1-0.ad490e8326704ae6a6fd9f6399c29614-delete and is scheduled for 
deletion (kafka.log.LogManager)
[2017-08-22 12:02:24,736] INFO Loading producer state from offset 0 for 
partition topic_1-0 with message format version 2 (kafka.log.Log)
[2017-08-22 12:02:24,736] INFO Completed load of log topic_1-0 with 1 log 
segments, log start offset 0 and log end offset 0 in 1 ms (kafka.log.Log)
[2017-08-22 12:02:24,737] INFO Created log for partition [topic_1,0] in /data 
with properties {compression.type -> producer, message.format.version -> 
0.11.0-IV2, file.delete.delay.ms -> 6,
max.message.bytes -> 112, min.compaction.lag.ms -> 0, 
message.timestamp.type -> CreateTime, min.insync.replicas -> 1, 
segment.jitter.ms -> 0, preallocate -> false, min.cleanable.dirty.ratio -> 0.5, 
i
ndex.interval.bytes -> 4096, unclean.leader.election.enable -> false, 
retention.bytes -> -1, delete.retention.ms -> 8640, cleanup.policy -> 
[delete], flush.ms -> 9223372036854775807, segment.ms -> 60
480, segment.bytes -> 1073741824, retention.ms -> 8640, 
message.timestamp.difference.max.ms -> 9223372036854775807, segment.index.bytes 
-> 10485760, flush.messages -> 9223372036854775807}. (kafka
.log.LogManager)
[2017-08-22 12:02:24,738] INFO [ReplicaFetcherManager on broker 1] Removed 
fetcher for partitions topic_1-0 (kafka.server.ReplicaFetcherManager)
[2017-08-22 12:02:24,738] INFO [ReplicaFetcherManager on broker 1] Added 
fetcher for partitions List([topic_1-0, initOffset 0 to broker 
BrokerEndPoint(2,sbx-patriot-kafka02.amb-patriot.incvb.io,
9092)] ) (kafka.server.ReplicaFetcherManager)
[2017-08-22 12:02:25,200] INFO [ReplicaFetcherThread-0-2]: Based on follower's 
leader epoch, leader replied with an offset 0 >= the follower's log end offset 
0 in topic_1-0. No truncation needed
. (kafka.server.ReplicaFetcherThread)
[2017-08-22 12:02:25,200] INFO Truncating topic_1-0 to 0 has no effect as the 
largest offset in the log is -1. (kafka.log.Log)
{code}
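
A broker-side workaround that is available today is to disable automatic topic 
creation entirely, so that a lingering subscription cannot re-create a deleted topic. 
A minimal sketch of the relevant broker setting, assuming topics are then created 
explicitly (for example with kafka-topics.sh or AdminClient):

{code}
# server.properties
# With auto-creation off, a consumer subscribing to a deleted topic can no longer
# cause the broker to silently recreate it; topics must be created explicitly.
auto.create.topics.enable=false
{code}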




--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[GitHub] kafka pull request #3738: MINOR: Remove try/finally from processNewResponses

2017-08-25 Thread ijuma
GitHub user ijuma opened a pull request:

https://github.com/apache/kafka/pull/3738

MINOR: Remove try/finally from processNewResponses

The previous code would skip a response if an exception was thrown.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/ijuma/kafka fix-process-new-responses

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/kafka/pull/3738.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3738


commit 68175a15c327ab9f31e9e2120601310694e1c57a
Author: Ismael Juma 
Date:   2017-08-25T10:16:03Z

MINOR: Remove try/finally from processNewResponses




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Created] (KAFKA-5788) "IllegalArgumentException: long is not a value type" when running ReassignPartitionsCommand

2017-08-25 Thread Ansel Zandegran (JIRA)
Ansel Zandegran created KAFKA-5788:
--

 Summary: "IllegalArgumentException: long is not a value type" when 
running ReassignPartitionsCommand
 Key: KAFKA-5788
 URL: https://issues.apache.org/jira/browse/KAFKA-5788
 Project: Kafka
  Issue Type: Bug
  Components: admin
Affects Versions: 0.11.0.0
 Environment: Windows 
Reporter: Ansel Zandegran


When running ReassignPartitionsCommand with the following arguments, it fails with an 
IllegalArgumentException:

String[] reassignCmdArgs = {
        "--reassignment-json-file=" + Paths.get(reassignmentConfigFileName),
        "--zookeeper=" + client.getZookeeperClient().getCurrentConnectionString(),
        "--execute",
        "--throttle=" + 1000 };
logger.debug("Calling ReassignPartitionsCommand with args:{}", Arrays.toString(reassignCmdArgs));

ReassignPartitionsCommand.main(reassignCmdArgs);


2017-08-22 15:57:28 DEBUG ZookeeperBackedAdoptionLogicImpl:320 - Calling ReassignPartitionsCommand with args:[--reassignment-json-file=partitions-to-move.json.1503417447767, --zookeeper=172.31.14.207:2181, --execute]
java.lang.IllegalArgumentException: long is not a value type
    at joptsimple.internal.Reflection.findConverter(Reflection.java:66)
    at joptsimple.ArgumentAcceptingOptionSpec.ofType(ArgumentAcceptingOptionSpec.java:111)
    at kafka.admin.ReassignPartitionsCommand$ReassignPartitionsCommandOptions.<init>(ReassignPartitionsCommand.scala:301)
    at kafka.admin.ReassignPartitionsCommand$.validateAndParseArgs(ReassignPartitionsCommand.scala:236)
    at kafka.admin.ReassignPartitionsCommand$.main(ReassignPartitionsCommand.scala:34)
    at kafka.admin.ReassignPartitionsCommand.main(ReassignPartitionsCommand.scala)
    at rebalancer.core.ZookeeperBackedAdoptionLogicImpl.reassignPartitionToLocalBroker(ZookeeperBackedAdoptionLogicImpl.java:321)
    at rebalancer.core.ZookeeperBackedAdoptionLogicImpl.adoptRemotePartition(ZookeeperBackedAdoptionLogicImpl.java:267)
    at rebalancer.core.ZookeeperBackedAdoptionLogicImpl.run(ZookeeperBackedAdoptionLogicImpl.java:118)
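
The stack trace points at jopt-simple's converter lookup. As an assumption on my part 
(not confirmed in this report), the usual trigger for this message is passing the 
primitive class literal to ofType, for which jopt-simple has no value converter, 
while the boxed type works. A small standalone sketch:

{code:java}
import joptsimple.OptionParser;

public class ThrottleOptionSketch {
    public static void main(String[] args) {
        OptionParser parser = new OptionParser();

        // Works: java.lang.Long has a static valueOf(String) that jopt-simple can use.
        parser.accepts("throttle-ok").withRequiredArg().ofType(Long.class);

        // Throws IllegalArgumentException("long is not a value type"): the primitive
        // class has neither a String constructor nor a static valueOf(String).
        parser.accepts("throttle-bad").withRequiredArg().ofType(long.class);
    }
}
{code}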



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[GitHub] kafka pull request #3737: KAFKA-5771: org.apache.kafka.streams.state.interna...

2017-08-25 Thread radzish
GitHub user radzish opened a pull request:

https://github.com/apache/kafka/pull/3737

KAFKA-5771: org.apache.kafka.streams.state.internals.Segments#segments 
method returns incorrect results when segments were added out of order

Suggested fix for the bug

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/radzish/kafka KAFKA-5771

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/kafka/pull/3737.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3737


commit 32f8da5d0b39c449303a8dcd34476ed2b68197e6
Author: radzish 
Date:   2017-08-25T09:39:18Z

KAFKA-5771: org.apache.kafka.streams.state.internals.Segments#segments 
method returns incorrect results when segments were added out of order




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] kafka pull request #3736: KAFKA-5787: StoreChangelogReader needs to restore ...

2017-08-25 Thread dguy
GitHub user dguy opened a pull request:

https://github.com/apache/kafka/pull/3736

KAFKA-5787: StoreChangelogReader needs to restore partitions that were 
added post initialization

If a task fails during initialization due to a LockException, its changelog 
partitions are not immediately added to the StoreChangelogReader, as the thread 
doesn't hold the lock. However, StoreChangelogReader#restore will still be called and 
it sets the initialized flag. On a subsequent successful call to initialize the new 
tasks, the partitions are added to the StoreChangelogReader, but as it is already 
initialized these new partitions will never be restored. So the task would remain in 
a non-running state forever.

You can merge this pull request into a Git repository by running:

$ git pull https://github.com/dguy/kafka kafka-5787

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/kafka/pull/3736.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3736


commit 451b6a746cc5173b22d8110497bbbe33842d18a3
Author: Damian Guy 
Date:   2017-08-25T09:36:05Z

StoreChangelogReader needs to restore partitions that were added post 
initialization




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] kafka pull request #3735: KAFKA-4869: Update 0.10.2.0 upgrade notes

2017-08-25 Thread omkreddy
GitHub user omkreddy opened a pull request:

https://github.com/apache/kafka/pull/3735

KAFKA-4869: Update 0.10.2.0 upgrade notes



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/omkreddy/kafka cleanup10

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/kafka/pull/3735.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3735


commit 1409ca459054a4e661863410419f346375e2be22
Author: Manikumar Reddy 
Date:   2017-08-25T09:17:54Z

KAFKA-4869: Update 0.10.2.0 upgrade notes




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[jira] [Created] (KAFKA-5787) StoreChangeLogReader needs to restore partitions that were added post initialization

2017-08-25 Thread Damian Guy (JIRA)
Damian Guy created KAFKA-5787:
-

 Summary: StoreChangeLogReader needs to restore partitions that 
were added post initialization
 Key: KAFKA-5787
 URL: https://issues.apache.org/jira/browse/KAFKA-5787
 Project: Kafka
  Issue Type: Bug
  Components: streams
Affects Versions: 0.11.0.1, 1.0.0
Reporter: Damian Guy
Assignee: Damian Guy
Priority: Blocker


Investigation of {{KStreamRepartitionJoinTest}} failures uncovered this bug. If 
a task fails during initialization due to a {{LockException}}, its changelog 
partitions are not immediately added to the {{StoreChangelogReader}}, as the 
thread doesn't hold the lock. However, {{StoreChangelogReader#restore}} will still be 
called and it sets the initialized flag. On a subsequent successful call to 
initialize the new tasks, the partitions are added to the 
{{StoreChangelogReader}}, but as it is already initialized these new 
partitions will never be restored. So the task will remain in a non-running 
state forever.
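
A simplified, hypothetical sketch of the flag handling being described (class and 
method names are made up; this is not the actual {{StoreChangelogReader}} code): once 
the initialized flag is set by the first restore call, partitions registered 
afterwards are silently ignored.

{code:java}
import java.util.HashSet;
import java.util.Set;

// Hypothetical illustration of the bug pattern, not Kafka's implementation.
class ChangelogReaderSketch {
    private final Set<String> registered = new HashSet<>();
    private final Set<String> restored = new HashSet<>();
    private boolean initialized = false;

    void register(String changelogPartition) {
        registered.add(changelogPartition);
    }

    void restore() {
        if (initialized)
            return; // bug: partitions registered after the first call are never restored
        restored.addAll(registered);
        initialized = true;
    }

    public static void main(String[] args) {
        ChangelogReaderSketch reader = new ChangelogReaderSketch();
        reader.restore();                      // first call: task init failed, nothing registered yet
        reader.register("store-changelog-0");  // registered on a later, successful initialization
        reader.restore();                      // returns immediately, so the partition is never restored
        System.out.println("registered=" + reader.registered + ", restored=" + reader.restored);
    }
}
{code}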



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[GitHub] kafka pull request #3734: KAFKA-5785; Always close connection if KafkaChanne...

2017-08-25 Thread ijuma
GitHub user ijuma opened a pull request:

https://github.com/apache/kafka/pull/3734

KAFKA-5785; Always close connection if KafkaChannel.setSend throws exception



You can merge this pull request into a Git repository by running:

$ git pull https://github.com/ijuma/kafka 
kafka-5785-always-close-connection-if-set-send-throws-exception

Alternatively you can review and apply these changes as the patch at:

https://github.com/apache/kafka/pull/3734.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

This closes #3734


commit 12d2fb4c151b2786b9f6db687cae6cda55155f37
Author: Ismael Juma 
Date:   2017-08-25T09:06:32Z

KAFKA-5785; Always close connection if KafkaChannel.setSend throws exception




---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


Re: [VOTE] KIP-152 - Improve diagnostics for SASL authentication failures

2017-08-25 Thread Mickael Maison
Huge +1 !
Clients getting no feedback when failing SASL authentication is a
major source of confusion. Thanks for the KIP

On Fri, Aug 25, 2017 at 9:48 AM, Manikumar  wrote:
> +1 (non-binding)
>
>
> On Fri, Aug 25, 2017 at 12:37 AM, Vahid S Hashemian <
> vahidhashem...@us.ibm.com> wrote:
>
>> +1
>>
>> Thanks Rajini.
>>
>> --Vahid
>>
>>
>>
>> From:   Edoardo Comar 
>> To: dev@kafka.apache.org
>> Date:   08/24/2017 10:55 AM
>> Subject:Re: [VOTE] KIP-152 - Improve diagnostics for SASL
>> authentication failures
>>
>>
>>
>> Thanks Rajini!
>>
>> +1 (non-binding)
>> --
>>
>> Edoardo Comar
>>
>> IBM Message Hub
>>
>> IBM UK Ltd, Hursley Park, SO21 2JN
>>
>>
>>
>> From:   Rajini Sivaram 
>> To: dev 
>> Date:   24/08/2017 18:30
>> Subject:[VOTE] KIP-152 - Improve diagnostics for SASL
>> authentication failures
>>
>>
>>
>> Hi all,
>>
>> I would like to start vote on KIP-152 to improve diagnostics of
>> authentication failures and to update clients to treat authentication
>> failures as fatal exceptions rather than transient errors:
>> https://cwiki.apache.org/confluence/display/KAFKA/KIP-152+-+Improve+diagnostics+for+SASL+authentication+failures
>>
>>
>>
>> Thank you...
>>
>> Rajini
>>
>>
>>
>> Unless stated otherwise above:
>> IBM United Kingdom Limited - Registered in England and Wales with number
>> 741598.
>> Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU
>>
>>
>>
>>
>>


Re: [VOTE] KIP-152 - Improve diagnostics for SASL authentication failures

2017-08-25 Thread Manikumar
+1 (non-binding)


On Fri, Aug 25, 2017 at 12:37 AM, Vahid S Hashemian <
vahidhashem...@us.ibm.com> wrote:

> +1
>
> Thanks Rajini.
>
> --Vahid
>
>
>
> From:   Edoardo Comar 
> To: dev@kafka.apache.org
> Date:   08/24/2017 10:55 AM
> Subject:Re: [VOTE] KIP-152 - Improve diagnostics for SASL
> authentication failures
>
>
>
> Thanks Rajini!
>
> +1 (non-binding)
> --
>
> Edoardo Comar
>
> IBM Message Hub
>
> IBM UK Ltd, Hursley Park, SO21 2JN
>
>
>
> From:   Rajini Sivaram 
> To: dev 
> Date:   24/08/2017 18:30
> Subject:[VOTE] KIP-152 - Improve diagnostics for SASL
> authentication failures
>
>
>
> Hi all,
>
> I would like to start vote on KIP-152 to improve diagnostics of
> authentication failures and to update clients to treat authentication
> failures as fatal exceptions rather than transient errors:
> https://cwiki.apache.org/confluence/display/KAFKA/KIP-152+-+Improve+diagnostics+for+SASL+authentication+failures
>
>
>
> Thank you...
>
> Rajini
>
>
>
> Unless stated otherwise above:
> IBM United Kingdom Limited - Registered in England and Wales with number
> 741598.
> Registered office: PO Box 41, North Harbour, Portsmouth, Hampshire PO6 3AU
>
>
>
>
>


[jira] [Created] (KAFKA-5786) Yet another exception is causing that streamming app is zombie

2017-08-25 Thread Seweryn Habdank-Wojewodzki (JIRA)
Seweryn Habdank-Wojewodzki created KAFKA-5786:
-

 Summary: Yet another exception is causing that streamming app is 
zombie
 Key: KAFKA-5786
 URL: https://issues.apache.org/jira/browse/KAFKA-5786
 Project: Kafka
  Issue Type: Bug
Reporter: Seweryn Habdank-Wojewodzki
Priority: Critical


An unhandled exception in the streaming app leaves the process in a zombie state.

{code}
2017-08-24 15:17:40 WARN  StreamThread:978 - stream-thread 
[kafka-endpoint-1236e6d5-75f0-4c14-b025-78e632484a26-StreamThread-3] Unexpected 
state transition from RUNNING to DEAD.
2017-08-24 15:17:40 FATAL StreamProcessor:67 - Caught unhandled exception: 
stream-thread 
[kafka-endpoint-1236e6d5-75f0-4c14-b025-78e632484a26-StreamThread-3] Failed to 
rebalance.; 
[org.apache.kafka.streams.processor.internals.StreamThread.pollRequests(StreamThread.java:589),
 
org.apache.kafka.streams.processor.internals.StreamThread.runLoop(StreamThread.java:553),
 
org.apache.kafka.streams.processor.internals.StreamThread.run(StreamThread.java:527)]
 in thread kafka-endpoint-1236e6d5-75f0-4c14-b025-78e632484a26-StreamThread-3
{code}

The final state of the app is similar to KAFKA-5779, but the exception and its 
location are different.

The exception should be handled so that the application either keeps working or 
shuts down completely if the error is not recoverable.

The current situation, where the application is left as a zombie, is not good.
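
Until the library itself handles this, one mitigation that exists in current releases 
is to register an uncaught exception handler on the KafkaStreams instance and shut 
the process down (or trigger a restart) instead of leaving a zombie behind. A minimal 
sketch, assuming the application owns the process lifecycle and using a placeholder 
topology and configuration:

{code:java}
import java.util.Properties;
import java.util.concurrent.TimeUnit;

import org.apache.kafka.streams.KafkaStreams;
import org.apache.kafka.streams.StreamsConfig;
import org.apache.kafka.streams.kstream.KStreamBuilder;

public class StreamsShutdownSketch {
    public static void main(String[] args) {
        Properties props = new Properties();
        props.put(StreamsConfig.APPLICATION_ID_CONFIG, "kafka-endpoint");    // placeholder
        props.put(StreamsConfig.BOOTSTRAP_SERVERS_CONFIG, "localhost:9092"); // placeholder

        KStreamBuilder builder = new KStreamBuilder();
        builder.stream("input-topic").to("output-topic");                    // placeholder topology

        final KafkaStreams streams = new KafkaStreams(builder, props);

        // If a stream thread dies with an unhandled exception, log it and stop the
        // whole process instead of leaving a half-alive (zombie) application behind.
        streams.setUncaughtExceptionHandler((thread, throwable) -> {
            System.err.println("Stream thread " + thread.getName() + " died: " + throwable);
            streams.close(5, TimeUnit.SECONDS);
            System.exit(1);
        });

        streams.start();
    }
}
{code}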



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (KAFKA-5785) Always close connection if KafkaChannel.setSend throws exception

2017-08-25 Thread Ismael Juma (JIRA)
Ismael Juma created KAFKA-5785:
--

 Summary: Always close connection if KafkaChannel.setSend throws 
exception
 Key: KAFKA-5785
 URL: https://issues.apache.org/jira/browse/KAFKA-5785
 Project: Kafka
  Issue Type: Bug
Reporter: Ismael Juma
Assignee: Ismael Juma
 Fix For: 0.11.0.1, 1.0.0


The code is currently:

{code}
try {
channel.setSend(send);
} catch (CancelledKeyException e) {
this.failedSends.add(connectionId);
close(channel, false);
}
{code}
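
The summary suggests widening the handling so the connection is always closed, 
whatever setSend throws. A hedged sketch of that direction, reusing the names from 
the snippet above (a sketch only, not necessarily the committed patch):

{code}
try {
    channel.setSend(send);
} catch (RuntimeException e) {
    // Record the failed send and close the connection for any exception,
    // not only CancelledKeyException; rethrow so callers still see the failure.
    this.failedSends.add(connectionId);
    close(channel, false);
    if (!(e instanceof CancelledKeyException))
        throw e;
}
{code}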



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


Build failed in Jenkins: kafka-trunk-jdk8 #1942

2017-08-25 Thread Apache Jenkins Server
See 


Changes:

[ismael] MINOR: KafkaService should print node hostname on failure

--
[...truncated 2.46 MB...]

org.apache.kafka.streams.integration.RegexSourceIntegrationTest > 
testRegexMatchesTopicsAWhenCreated PASSED

org.apache.kafka.streams.integration.RegexSourceIntegrationTest > 
testMultipleConsumersCanReadFromPartitionedTopic STARTED

org.apache.kafka.streams.integration.RegexSourceIntegrationTest > 
testMultipleConsumersCanReadFromPartitionedTopic PASSED

org.apache.kafka.streams.integration.RegexSourceIntegrationTest > 
testRegexMatchesTopicsAWhenDeleted STARTED

org.apache.kafka.streams.integration.RegexSourceIntegrationTest > 
testRegexMatchesTopicsAWhenDeleted PASSED

org.apache.kafka.streams.integration.RegexSourceIntegrationTest > 
testNoMessagesSentExceptionFromOverlappingPatterns STARTED

org.apache.kafka.streams.integration.RegexSourceIntegrationTest > 
testNoMessagesSentExceptionFromOverlappingPatterns PASSED

org.apache.kafka.streams.integration.RegexSourceIntegrationTest > 
shouldAddStateStoreToRegexDefinedSource STARTED

org.apache.kafka.streams.integration.RegexSourceIntegrationTest > 
shouldAddStateStoreToRegexDefinedSource PASSED

org.apache.kafka.streams.integration.KStreamRepartitionJoinTest > 
shouldCorrectlyRepartitionOnJoinOperationsWithZeroSizedCache STARTED

org.apache.kafka.streams.integration.KStreamRepartitionJoinTest > 
shouldCorrectlyRepartitionOnJoinOperationsWithZeroSizedCache FAILED
java.lang.AssertionError: Condition not met within timeout 6. Expecting 
5 records from topic map-one-join-output-1 while only received 0: []
at org.apache.kafka.test.TestUtils.waitForCondition(TestUtils.java:275)
at 
org.apache.kafka.streams.integration.utils.IntegrationTestUtils.waitUntilMinValuesRecordsReceived(IntegrationTestUtils.java:201)
at 
org.apache.kafka.streams.integration.KStreamRepartitionJoinTest.receiveMessages(KStreamRepartitionJoinTest.java:375)
at 
org.apache.kafka.streams.integration.KStreamRepartitionJoinTest.verifyCorrectOutput(KStreamRepartitionJoinTest.java:296)
at 
org.apache.kafka.streams.integration.KStreamRepartitionJoinTest.verifyRepartitionOnJoinOperations(KStreamRepartitionJoinTest.java:141)
at 
org.apache.kafka.streams.integration.KStreamRepartitionJoinTest.shouldCorrectlyRepartitionOnJoinOperationsWithZeroSizedCache(KStreamRepartitionJoinTest.java:119)

org.apache.kafka.streams.integration.KStreamRepartitionJoinTest > 
shouldCorrectlyRepartitionOnJoinOperationsWithNonZeroSizedCache STARTED

org.apache.kafka.streams.integration.KStreamRepartitionJoinTest > 
shouldCorrectlyRepartitionOnJoinOperationsWithNonZeroSizedCache PASSED

org.apache.kafka.streams.integration.InternalTopicIntegrationTest > 
shouldCompactTopicsForStateChangelogs STARTED

org.apache.kafka.streams.integration.InternalTopicIntegrationTest > 
shouldCompactTopicsForStateChangelogs PASSED

org.apache.kafka.streams.integration.InternalTopicIntegrationTest > 
shouldUseCompactAndDeleteForWindowStoreChangelogs STARTED

org.apache.kafka.streams.integration.InternalTopicIntegrationTest > 
shouldUseCompactAndDeleteForWindowStoreChangelogs PASSED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldReduceSessionWindows STARTED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldReduceSessionWindows PASSED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldReduce STARTED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldReduce PASSED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldAggregate STARTED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldAggregate PASSED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldCount STARTED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldCount PASSED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldGroupByKey STARTED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldGroupByKey PASSED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldCountWithInternalStore STARTED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldCountWithInternalStore PASSED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldReduceWindowed STARTED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldReduceWindowed PASSED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldCountSessionWindows STARTED

org.apache.kafka.streams.integration.KStreamAggregationIntegrationTest > 
shouldCountSessionWindows PASSED


[jira] [Resolved] (KAFKA-5595) Illegal state in SocketServer; attempt to send with another send in progress

2017-08-25 Thread Ismael Juma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-5595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ismael Juma resolved KAFKA-5595.

Resolution: Fixed

I'm going to close this. If we see the issue again, we can reopen it.

> Illegal state in SocketServer; attempt to send with another send in progress
> 
>
> Key: KAFKA-5595
> URL: https://issues.apache.org/jira/browse/KAFKA-5595
> Project: Kafka
>  Issue Type: Bug
>Reporter: Jason Gustafson
>Assignee: Rajini Sivaram
> Fix For: 1.0.0
>
>
> I have seen this a couple times, but I'm not sure the conditions associated 
> with it. 
> {code}
> java.lang.IllegalStateException: Attempt to begin a send operation with prior 
> send operation still in progress.
>   at 
> org.apache.kafka.common.network.KafkaChannel.setSend(KafkaChannel.java:138)
>   at org.apache.kafka.common.network.Selector.send(Selector.java:248)
>   at kafka.network.Processor.sendResponse(SocketServer.scala:488)
>   at kafka.network.Processor.processNewResponses(SocketServer.scala:466)
>   at kafka.network.Processor.run(SocketServer.scala:431)
>   at java.lang.Thread.run(Thread.java:748)
> {code}
> Prior to this event, I see a lot of this message in the logs (always for the 
> same connection id):
> {code}
> Attempting to send response via channel for which there is no open 
> connection, connection id 7
> {code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Reopened] (KAFKA-5595) Illegal state in SocketServer; attempt to send with another send in progress

2017-08-25 Thread Ismael Juma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-5595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ismael Juma reopened KAFKA-5595:


> Illegal state in SocketServer; attempt to send with another send in progress
> 
>
> Key: KAFKA-5595
> URL: https://issues.apache.org/jira/browse/KAFKA-5595
> Project: Kafka
>  Issue Type: Bug
>Reporter: Jason Gustafson
>Assignee: Rajini Sivaram
> Fix For: 1.0.0
>
>
> I have seen this a couple times, but I'm not sure the conditions associated 
> with it. 
> {code}
> java.lang.IllegalStateException: Attempt to begin a send operation with prior 
> send operation still in progress.
>   at 
> org.apache.kafka.common.network.KafkaChannel.setSend(KafkaChannel.java:138)
>   at org.apache.kafka.common.network.Selector.send(Selector.java:248)
>   at kafka.network.Processor.sendResponse(SocketServer.scala:488)
>   at kafka.network.Processor.processNewResponses(SocketServer.scala:466)
>   at kafka.network.Processor.run(SocketServer.scala:431)
>   at java.lang.Thread.run(Thread.java:748)
> {code}
> Prior to this event, I see a lot of this message in the logs (always for the 
> same connection id):
> {code}
> Attempting to send response via channel for which there is no open 
> connection, connection id 7
> {code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Resolved] (KAFKA-5595) Illegal state in SocketServer; attempt to send with another send in progress

2017-08-25 Thread Ismael Juma (JIRA)

 [ 
https://issues.apache.org/jira/browse/KAFKA-5595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ismael Juma resolved KAFKA-5595.

   Resolution: Fixed
 Assignee: Rajini Sivaram
Fix Version/s: 1.0.0

> Illegal state in SocketServer; attempt to send with another send in progress
> 
>
> Key: KAFKA-5595
> URL: https://issues.apache.org/jira/browse/KAFKA-5595
> Project: Kafka
>  Issue Type: Bug
>Reporter: Jason Gustafson
>Assignee: Rajini Sivaram
> Fix For: 1.0.0
>
>
> I have seen this a couple times, but I'm not sure the conditions associated 
> with it. 
> {code}
> java.lang.IllegalStateException: Attempt to begin a send operation with prior 
> send operation still in progress.
>   at 
> org.apache.kafka.common.network.KafkaChannel.setSend(KafkaChannel.java:138)
>   at org.apache.kafka.common.network.Selector.send(Selector.java:248)
>   at kafka.network.Processor.sendResponse(SocketServer.scala:488)
>   at kafka.network.Processor.processNewResponses(SocketServer.scala:466)
>   at kafka.network.Processor.run(SocketServer.scala:431)
>   at java.lang.Thread.run(Thread.java:748)
> {code}
> Prior to this event, I see a lot of this message in the logs (always for the 
> same connection id):
> {code}
> Attempting to send response via channel for which there is no open 
> connection, connection id 7
> {code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


Re: [DISCUSS] KIP-182: Reduce Streams DSL overloads and allow easier use of custom storage engines

2017-08-25 Thread Damian Guy
On Thu, 24 Aug 2017 at 18:31 Xavier Léauté  wrote:

> A few comments on the KIP:
> - I'm a bit confused about the BytesStoreSupplier interface. Nothing in its
> definition is really specific to Bytes, and
> when I see return types like BytesStoreSupplier<KeyValueStore<Bytes, byte[]>>, it seems redundant to have "Bytes" in the supplier name.
> Why can't we reuse the existing StateStoreSupplier interface and move the
> extra logConfig and loggingEnabled methods elsewhere?
>

We can't re-use StateStoreSupplier as it would break compatibility. So we
needed another name.


> - I don't really see any mention of the motivation behind the Materialized
> interface and what the implications are for the user, i.e. what does it
> mean for a store to be materialized.
>

It means that there will be a store either created for you with the store
name provided or with the BytesStoreSupplier. It provides a convenient way
for users to enable/disable caching and logging on a per store basis. And
helps to reduce the current overloads spread throughout the code.


> - Until now, serialization implementation details were decoupled from the
> state store interfaces. With this KIP we are now bubbling up the
> assumptions that state store going to be using Bytes or byte[] into the
> API.

I'm not a fan of this, because it precludes us from providing more
> efficient implementations, e.g. using ByteBuffers, that can avoid costly
> byte array copying and extraneous byte array allocations during serde
> operations.
> A better approach might be to provide a first class ByteStore interface
> that could help abstract the different types of buffers we might want to
> use, or alternatively use a buffer agnostic type in the state store
> definition (similar to what LMDB
> <
> https://github.com/lmdbjava/lmdbjava/blob/master/src/main/java/org/lmdbjava/BufferProxy.java
> >
>  does)
>
>
We decided to do it this way as we want to provide developers with the
ability to use our wrapper stores, i.e, ChangeLogging, Caching, Metered.
Presently they can't do this without jumping through various hoops.
To provide this ability we presently need to use <Bytes, byte[]> as the
CachingStores are <Bytes, byte[]>. They need to remain that way for the
time being as it is how we can put some limits on memory usage.


> On Thu, Aug 24, 2017 at 1:53 AM Damian Guy  wrote:
>
> > I've updated the kip to reflect Bill's comment and also to make
> > StreamBuilder methods have topic as the first param, i.e.,
> > StreamBuilder#stream no longer accepts varargs.
> >
> > On Thu, 24 Aug 2017 at 09:12 Damian Guy  wrote:
> >
> > > On Thu, 24 Aug 2017 at 02:49 Guozhang Wang  wrote:
> > >
> > >> I have a couple of comments but otherwise it LGTM:
> > >>
> > >> 1. For these two functions in StreamsBuilder, the topic String is set
> as
> > >> the second parameter in between of two options. Would that be better
> to
> > be
> > >> set as the first or the last one instead?
> > >>
> > >> It would be better as the first, but then it is different to the
> > > #streams() methods due to varargs.
> > >
> > >
> > >> public synchronized  KTable table(final Consumed
> > >> consumed, final String topic, final Materialized materialized)
> > >>
> > >> public synchronized  GlobalKTable globalTable(final
> > >> Consumed > >> V> consumed, final String topic, final Materialized
> materialized)
> > >>
> > >> I understand that we cannot do it for the first parameter because of
> the
> > >> vararg type. So I'd suggest either
> > >>
> > >> a) set it as the last parameter, but then it is inconsistent with
> other
> > >> functions like these:
> > >>
> > >> void to(final String topic, final Produced options);
> > >>
> > >> KTable through(final String topic, final Materialized
> > >> options);
> > >>
> > >> b) only allow one single topic name parameter in
> StreamsBuilder.stream()
> > >> since in practice we do not see too many usages of multiple topics,
> plus
> > >> it
> > >> can be semi-supported with "merge" as we move it from StreamsBuilder
> to
> > >> KStream (KAFKA-5765),
> > >>
> > >> Perhaps this is the better approach
> > >
> > >
> > >> 2. KGroupedStream's function:
> > >>
> > >>  KTable aggregate(final Initializer initializer,
> > >>  final Aggregator VR>
> > >> aggregator,
> > >>  final Serde aggValueSerde,
> > >>  final Materialized KeyValueStore > >> VR>> materialized);
> > >>
> > >> The "aggValueSerde" seems not needed?
> > >>
> > >> 3. +1 on `KGroupedStream` v.s. `GroupedKStream`. I think
> KGroupedStream
> > >> was
> > >> a bad name as a hind-sight. I personally feel we should just correct
> it
> > >> with a new class and deprecate / remove the old one before 1.0.0, but
> > that
> > >> could be in its own KIP.
> > >>
> > >>
> > > The 

Re: [DISCUSS] KIP-182: Reduce Streams DSL overloads and allow easier use of custom storage engines

2017-08-25 Thread Damian Guy
Matthias, i agree so i've added those two overloads.
Thanks,
Damian

On Thu, 24 Aug 2017 at 21:54 Matthias J. Sax  wrote:

> Thanks for clarification. I see your point. Java varargs are problematic
> in general IMHO as they force you to put them as last argument making
> parameter ordering unnatural for some cases (as we have it currently in
> the API).
>
> Nevertheless, I think that reading a single topic is the most common
> case and thus I would love to see the overloads as mentioned in my last
> email in addition to the overloads taking a Collection of topics. Maybe
> it's just personal taste -- I agree that the overhead of specifying a
> singleton on not severe, but to me it still feels like a "step backward"
> as reading a single topic should be the pattern for like 90% or more of
> the cases.
>
>
> -Matthias
>
>
> On 8/24/17 12:03 PM, Guozhang Wang wrote:
> > Matthias,
> >
> > I think it's my bad that I did not post another comment on the mailing
> list
> > while syncing with Damian. Here it is:
> >
> > Regarding 1) above, a second thought on varargs: though I have not heard
> > from anyone using multiple topics, it is also true that people will just
> > keep silent until their APIs gets removed. So instead of keeping a single
> > topic name in the constructor, it'd better to still allow users to pass
> > multiple topics, as a Collection topic.
> >
> > It does mean that users who would only want a single topic would feel
> > inconvenient with "Collections.singleton(topic)", but I felt it is not
> too
> > big of an issue. On the other hand KafkaConsumer also only allow
> > `subscribe(Collection<String> topics)` so I'd suggest in this KIP we do
> not
> > have two overloads of "stream(topic)" and "stream(topics)" and consider
> > adding that as a syntax-sugar if it does become a big complaint.
> >
> >
> > Guozhang
> >
> >
> >
> > On Thu, Aug 24, 2017 at 11:32 AM, Matthias J. Sax  >
> > wrote:
> >
> >> We now have
> >>
> >>> public synchronized <K, V> KStream<K, V> stream(final Collection<String> topic, final Consumed<K, V> options)
> >>
> >> This would prevent so write code like
> >>
> >> builder.stream("topic", Consumers.with(...));
> >>
> >> I think, we need methods
> >>
> >> StreamsBuilder#stream(String topic);
> >> StreamsBuilder#stream(String topic, Consumed options);
> >>
> >> Or do I miss anything?
> >>
> >>
> >> -Matthias
> >>
> >>
> >> On 8/24/17 1:53 AM, Damian Guy wrote:
> >>> I've updated the kip to reflect Bill's comment and also to make
> >>> StreamBuilder methods have topic as the first param, i.e.,
> >>> StreamBuilder#stream no longer accepts varargs.
> >>>
> >>> On Thu, 24 Aug 2017 at 09:12 Damian Guy  wrote:
> >>>
>  On Thu, 24 Aug 2017 at 02:49 Guozhang Wang 
> wrote:
> 
> > I have a couple of comments but otherwise it LGTM:
> >
> > 1. For these two functions in StreamsBuilder, the topic String is set
> >> as
> > the second parameter in between of two options. Would that be better
> >> to be
> > set as the first or the last one instead?
> >
> > It would be better as the first, but then it is different to the
>  #streams() methods due to varargs.
> 
> 
> > public synchronized  KTable table(final Consumed
> > consumed, final String topic, final Materialized materialized)
> >
> > public synchronized  GlobalKTable globalTable(final
> > Consumed > V> consumed, final String topic, final Materialized
> materialized)
> >
> > I understand that we cannot do it for the first parameter because of
> >> the
> > vararg type. So I'd suggest either
> >
> > a) set it as the last parameter, but then it is inconsistent with
> other
> > functions like these:
> >
> > void to(final String topic, final Produced options);
> >
> > KTable through(final String topic, final Materialized
> > options);
> >
> > b) only allow one single topic name parameter in
> >> StreamsBuilder.stream()
> > since in practice we do not see too many usages of multiple topics,
> >> plus
> > it
> > can be semi-supported with "merge" as we move it from StreamsBuilder
> to
> > KStream (KAFKA-5765),
> >
> > Perhaps this is the better approach
> 
> 
> > 2. KGroupedStream's function:
> >
> >  KTable aggregate(final Initializer initializer,
> >  final Aggregator VR>
> > aggregator,
> >  final Serde aggValueSerde,
> >  final Materialized KeyValueStore > VR>> materialized);
> >
> > The "aggValueSerde" seems not needed?
> >
> > 3. +1 on `KGroupedStream` v.s. `GroupedKStream`. I think
> KGroupedStream
> > was
> > a bad name as a hind-sight. I personally feel we should just correct
> it
> 

Re: [ANNOUNCE] New Kafka PMC member: Jiangjie (Becket) Qin

2017-08-25 Thread James Cheng
Congrats Becket!

-James

> On Aug 23, 2017, at 10:20 PM, Joel Koshy  wrote:
> 
> Hi everyone,
> 
> Jiangjie (Becket) Qin has been a Kafka committer in October 2016 and has
> contributed significantly to several major patches, reviews and discussions
> since. I am glad to announce that Becket is now a member of the Apache Kafka
> PMC.
> 
> Congratulations Becket!
> 
> Joel



Jenkins build is back to normal : kafka-trunk-jdk8 #1941

2017-08-25 Thread Apache Jenkins Server
See 




[GitHub] kafka pull request #3715: KafkaService: print node hostname on failure

2017-08-25 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/kafka/pull/3715


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] kafka-site pull request #71: KAFKA-4869: Update 0.10.2.0 upgrade notes

2017-08-25 Thread asfgit
Github user asfgit closed the pull request at:

https://github.com/apache/kafka-site/pull/71


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---


[GitHub] kafka-site issue #71: KAFKA-4869: Update 0.10.2.0 upgrade notes

2017-08-25 Thread ijuma
Github user ijuma commented on the issue:

https://github.com/apache/kafka-site/pull/71
  
Thanks for the PR, we also need to fix it in the main repo: 
https://github.com/apache/kafka/blob/0.10.2/docs/upgrade.html#L90


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---