(1) should work, but as Jun suggested earlier in the thread it is
slightly misleading. The (intuitive) post-condition of "close" is that
the producer has shutdown - i.e., its sender thread, closed its
metrics, serializer/deserializer, etc. That is not necessarily a
post-condition of "close(0)" although one can contend that if you call
the method in non-blocking mode (zero timeout) then it is reasonable
to not expect that post-condition.

So I think that although (2) adds one more API it brings "simplicity
by virtue of overall clarity".

I would be in favor of (2) but not strongly opposed to (1).

Thanks,

Joel

On Thu, Mar 19, 2015 at 10:05:04AM -0700, Neha Narkhede wrote:
> I'm in favor of (1) for the sake of simplicity and as Jay mentions to
> reduce the number of different APIs. Can you explain when (1) does not work?
> 
> On Wed, Mar 18, 2015 at 2:52 PM, Jay Kreps <jay.kr...@gmail.com> wrote:
> 
> > Personally I'm in favor of (1) just to reduce the number of different APIs.
> > People will find the difference between abort and close subtle and
> > confusing and the only instance where you want it is this somewhat unusual
> > case you guys are pursuing, right?
> >
> > -Jay
> >
> > On Wed, Mar 18, 2015 at 2:13 PM, Jiangjie Qin <j...@linkedin.com.invalid>
> > wrote:
> >
> > > It looks we have another option and are now deciding between the
> > following
> > > two interfaces:
> > >
> > > 1. Close() + close(timeout)
> > >   - timeout could be either positive or zero.
> > >   - only close(0) can be called from sender thread
> > >
> > > 2. Close() + abort() + close(timeout)
> > >   - timeout can either be positive or zero
> > >   - only abort() can be called from sender thread
> > >
> > >   - abort() is equivalent to close(0) in 1) but does not join sender
> > > thread and does not close metrics.
> > >   - Another thread has to call close() or close(timeout) in order to make
> > > sure the resources in producer are gone.
> > >
> > > The tow approach provides the same function we need, the difference is
> > > approach 2) follows convention of close() and abort(). On the other hand,
> > > approach 1) saves one interface compared with approach 2) but does not
> > > follow the convention.
> > >
> > > When the two approaches come to user code, it is probably something like
> > > this:
> > >
> > > Try {
> > >   While(!finished)
> > >     Producer.send(record, callback)
> > > } catch (Exception e) {
> > >   Producer.close(5)
> > > }
> > >
> > > Class CallbackImpl implements Callback {
> > >   onCompletion(RecordMetadata metadata Exception e) {
> > >     If (e != null)
> > >       Abort() / close()
> > >   }
> > > }
> > >
> > > Because the two approach leads to almost the same user code, assuming
> > > users are always calling producer.close() as a clean up step, personally
> > I
> > > prefer approach 2) as it follows convention.
> > >
> > > Any thoughts?
> > >
> > > Jiangjie (Becket) Qin
> > >
> > >
> > > On 3/17/15, 10:25 AM, "Jiangjie Qin" <j...@linkedin.com> wrote:
> > >
> > > >Hi Jun,
> > > >
> > > >Yes, as Guozhang said, the main reason we set a flag is because close(0)
> > > >is expected to be called by sender thread itself.
> > > >If we want to maintain the semantic meaning of close(), one alternative
> > is
> > > >to have an abort() method does the same thing as close(0) except
> > cleanup.
> > > >And in close(timeout), after timeout we call abort() and join the sender
> > > >thread. This was one of the previous proposal. We merged abort to
> > close(0)
> > > >because they are almost doing the same thing. But from what you
> > mentioned,
> > > >it might make sense to have two separate methods.
> > > >
> > > >Thanks.
> > > >
> > > >Jiangjie (Becket) Qin
> > > >
> > > >On 3/16/15, 10:31 PM, "Guozhang Wang" <wangg...@gmail.com> wrote:
> > > >
> > > >>Yeah in this sense the sender thread will not exist immediately in the
> > > >>close(0) call, but will only terminate after the current response batch
> > > >>has
> > > >>been processed, as will the producer instance itself.
> > > >>
> > > >>There is a reason for this though: for a clean shutdown the caller
> > thread
> > > >>has to wait for the sender thread to join before closing the producer
> > > >>instance, but this cannot be achieve if close(0) is called by the
> > sender
> > > >>thread itself (for example in KAFKA-1659, there is a proposal from
> > Andrew
> > > >>Stein on using thread.interrupt and thread.stop, but if it is called by
> > > >>the
> > > >>ioThread itself the stop call will fail). Hence we came up with the
> > flag
> > > >>approach to let the sender thread to close as soon as it is at the
> > > >>barrier
> > > >>of the run loop.
> > > >>
> > > >>Guozhang
> > > >>
> > > >>On Mon, Mar 16, 2015 at 9:41 PM, Jun Rao <j...@confluent.io> wrote:
> > > >>
> > > >>> Hmm, does that mean that after close(0), the sender thread is not
> > > >>>necessary
> > > >>> gone? Normally, after closing an entity, we expect all internal
> > threads
> > > >>> associated with the entity are shut down completely.
> > > >>>
> > > >>> Thanks,
> > > >>>
> > > >>> Jun
> > > >>>
> > > >>> On Mon, Mar 16, 2015 at 3:18 PM, Jiangjie Qin
> > > >>><j...@linkedin.com.invalid>
> > > >>> wrote:
> > > >>>
> > > >>> > Hi Jun,
> > > >>> >
> > > >>> > Close(0) will set two flags in sender. Running=false and a newly
> > > >>>added
> > > >>> > forceClose=true. It will also set accumulator.closed=true so no
> > > >>>further
> > > >>> > producer.send() will succeed.
> > > >>> > The sender thread will finish executing all the callbacks in
> > current
> > > >>> batch
> > > >>> > of responses, then it will see the forceClose flag. It will just
> > fail
> > > >>>all
> > > >>> > the incomplete batches in the producer and exit.
> > > >>> > So close(0) is a non-blocking call and sender thread will not try
> > to
> > > >>>join
> > > >>> > itself in close(0).
> > > >>> >
> > > >>> > Thanks.
> > > >>> >
> > > >>> > Jiangjie (Becket) Qin
> > > >>> >
> > > >>> > On 3/16/15, 2:50 PM, "Jun Rao" <j...@confluent.io> wrote:
> > > >>> >
> > > >>> > >How does close(0) work if it's called from the sender thread? If
> > > >>> close(0)
> > > >>> > >needs to wait for the sender thread to join, wouldn't this cause a
> > > >>> > >deadlock?
> > > >>> > >
> > > >>> > >Thanks,
> > > >>> > >
> > > >>> > >Jun
> > > >>> > >
> > > >>> > >On Mon, Mar 16, 2015 at 2:26 PM, Jiangjie Qin
> > > >>><j...@linkedin.com.invalid
> > > >>> >
> > > >>> > >wrote:
> > > >>> > >
> > > >>> > >> Thanks Guozhang. It wouldn’t be as thoroughly considered without
> > > >>> > >> discussing with you :)
> > > >>> > >>
> > > >>> > >> Jiangjie (Becket) Qin
> > > >>> > >>
> > > >>> > >> On 3/16/15, 1:07 PM, "Guozhang Wang" <wangg...@gmail.com>
> > wrote:
> > > >>> > >>
> > > >>> > >> >Thanks Jiangjie,
> > > >>> > >> >
> > > >>> > >> >After talking to you offline on this, I have been convinced and
> > > >>> > >>changed my
> > > >>> > >> >preference to blocking. The immediate shutdown approach does
> > have
> > > >>> some
> > > >>> > >> >unsafeness in some cases.
> > > >>> > >> >
> > > >>> > >> >Guozhang
> > > >>> > >> >
> > > >>> > >> >On Mon, Mar 16, 2015 at 11:50 AM, Jiangjie Qin
> > > >>> > >><j...@linkedin.com.invalid
> > > >>> > >> >
> > > >>> > >> >wrote:
> > > >>> > >> >
> > > >>> > >> >> It looks that the problem we want to solve and the purpose we
> > > >>>want
> > > >>> to
> > > >>> > >> >> achieve is:
> > > >>> > >> >> If user uses close() in callback, we want to let user be
> > aware
> > > >>>that
> > > >>> > >>they
> > > >>> > >> >> should use close(0) instead of close() in the callback.
> > > >>> > >> >>
> > > >>> > >> >> We have agreed that we will have an error log to inform user
> > > >>>about
> > > >>> > >>this
> > > >>> > >> >> mis-usage. The options differ in the way how we can force
> > user
> > > >>>to
> > > >>> > >>take a
> > > >>> > >> >> look at that error log.
> > > >>> > >> >> There are two scenarios:
> > > >>> > >> >> 1. User does not expect the program to exit.
> > > >>> > >> >> 2. User expect the program to exit.
> > > >>> > >> >>
> > > >>> > >> >> For scenario 1), blocking will probably delay the discovery
> > of
> > > >>>the
> > > >>> > >> >> problem. Calling close(0) exposes the problem quicker. In
> > this
> > > >>> > >>scenario
> > > >>> > >> >> producer just encounter a send failure when running normally.
> > > >>> > >> >> For scenario 2), blocking will expose the problem quick.
> > > >>>Calling
> > > >>> > >> >>close(-1)
> > > >>> > >> >> might hide the problem. This scenario might include: a) Unit
> > > >>>test
> > > >>> > >>for a
> > > >>> > >> >> send failure. b) Message sending during a close() call from a
> > > >>>user
> > > >>> > >> >>thread.
> > > >>> > >> >>
> > > >>> > >> >> So as a summary table:
> > > >>> > >> >>
> > > >>> > >> >>                   Scenario 1)
> >  Scenario
> > > >>>2)
> > > >>> > >> >>
> > > >>> > >> >> Blocking      Delay problem discovery         Guaranteed
> > > >>>problem
> > > >>> > >> >>discovery
> > > >>> > >> >>
> > > >>> > >> >> Close(-1)     Immediate problem discovery     Problem might
> > be
> > > >>> hidden
> > > >>> > >> >>
> > > >>> > >> >>
> > > >>> > >> >> Personally I prefer blocking because it seems providing more
> > > >>> > >>guarantees
> > > >>> > >> >> and safer.
> > > >>> > >> >>
> > > >>> > >> >> Thanks.
> > > >>> > >> >>
> > > >>> > >> >> Jiangjie (Becket) Qin
> > > >>> > >> >>
> > > >>> > >> >>
> > > >>> > >> >> On 3/16/15, 10:11 AM, "Guozhang Wang" <wangg...@gmail.com>
> > > >>>wrote:
> > > >>> > >> >>
> > > >>> > >> >> >HI Jiangjie,
> > > >>> > >> >> >
> > > >>> > >> >> >As far as I understand calling close() in the ioThread is
> > not
> > > >>> > >>common,
> > > >>> > >> >>as
> > > >>> > >> >> >it
> > > >>> > >> >> >may only trigger when we saw some non-retriable error. Hence
> > > >>>when
> > > >>> > >>user
> > > >>> > >> >>run
> > > >>> > >> >> >their program it is unlikely that close() will be triggered
> > > >>>and
> > > >>> > >>problem
> > > >>> > >> >> >will be detected. So it seems to me that from the error
> > > >>>detection
> > > >>> > >> >>aspect
> > > >>> > >> >> >these two options seems to be the same as people will
> > usually
> > > >>> > >>detect it
> > > >>> > >> >> >from the producer metrics all dropping to 0.
> > > >>> > >> >> >
> > > >>> > >> >> >Guozhang
> > > >>> > >> >> >
> > > >>> > >> >> >On Mon, Mar 16, 2015 at 9:52 AM, Jiangjie Qin
> > > >>> > >> >><j...@linkedin.com.invalid>
> > > >>> > >> >> >wrote:
> > > >>> > >> >> >
> > > >>> > >> >> >> It seems there are two options we can choose from when
> > > >>>close()
> > > >>> is
> > > >>> > >> >>called
> > > >>> > >> >> >> from sender thread (callback):
> > > >>> > >> >> >> 1. Log an error and close the producer using close(-1)
> > > >>> > >> >> >> 2. Log an error and block.
> > > >>> > >> >> >> (Throwing an exception will not work because we catch all
> > > >>>the
> > > >>> > >> >>exception
> > > >>> > >> >> >> thrown from user callback. It will just lead to an error
> > > >>>log.)
> > > >>> > >> >> >>
> > > >>> > >> >> >> My concern for the first option is that the producer will
> > be
> > > >>> > >>closed
> > > >>> > >> >>even
> > > >>> > >> >> >> if we logged and error. I am wondering if some user would
> > > >>>not
> > > >>> even
> > > >>> > >> >>take
> > > >>> > >> >> >>a
> > > >>> > >> >> >> look at the log if producer is closed normally. Because
> > from
> > > >>>the
> > > >>> > >> >> >>programs
> > > >>> > >> >> >> behavior, everything looks good. If that is the case, the
> > > >>>error
> > > >>> > >> >>message
> > > >>> > >> >> >>we
> > > >>> > >> >> >> logged probably will just be ignored until some day when
> > > >>>people
> > > >>> > >>check
> > > >>> > >> >> >>the
> > > >>> > >> >> >> log and see it.
> > > >>> > >> >> >>
> > > >>> > >> >> >> As for the second option, because producer does not close
> > > >>>but
> > > >>> > >>blocks.
> > > >>> > >> >> >>User
> > > >>> > >> >> >> will notice this the first time they run the program. They
> > > >>> > >>probably
> > > >>> > >> >>will
> > > >>> > >> >> >> look at the log to see why producer could not be closed
> > and
> > > >>>they
> > > >>> > >>will
> > > >>> > >> >> >>see
> > > >>> > >> >> >> the error log we put there. So they will get informed
> > about
> > > >>>this
> > > >>> > >> >> >>mis-usage
> > > >>> > >> >> >> of close() in sender thread the first time they run the
> > code
> > > >>> > >>instead
> > > >>> > >> >>of
> > > >>> > >> >> >> some time later.
> > > >>> > >> >> >>
> > > >>> > >> >> >> Personally I prefer the second one because it is more
> > > >>>obvious
> > > >>> that
> > > >>> > >> >> >> something was wrong.
> > > >>> > >> >> >>
> > > >>> > >> >> >> Jiangjie (Becket) Qin
> > > >>> > >> >> >>
> > > >>> > >> >> >> On 3/15/15, 4:27 PM, "Guozhang Wang" <wangg...@gmail.com>
> > > >>> wrote:
> > > >>> > >> >> >>
> > > >>> > >> >> >> >Yeah I agree we should not silently change the behavior
> > of
> > > >>>the
> > > >>> > >> >>function
> > > >>> > >> >> >> >with the given parameters; and I would prefer
> > > >>> > >> >> >>error-logging-and-shutdown
> > > >>> > >> >> >> >over blocking when close(>0) is used, since as Neha
> > > >>>suggested
> > > >>> > >> >>blocking
> > > >>> > >> >> >> >would also not proceed with sending any data, bu will
> > just
> > > >>>let
> > > >>> > >> >>users to
> > > >>> > >> >> >> >realize the issue later than sooner.
> > > >>> > >> >> >> >
> > > >>> > >> >> >> >On Sun, Mar 15, 2015 at 3:25 PM, Neha Narkhede
> > > >>> > >><n...@confluent.io>
> > > >>> > >> >> >>wrote:
> > > >>> > >> >> >> >
> > > >>> > >> >> >> >> >
> > > >>> > >> >> >> >> > And I also agree it is better if we can make producer
> > > >>>block
> > > >>> > >>when
> > > >>> > >> >> >> >> > close() is called from sender thread so user will
> > > >>>notice
> > > >>> > >> >>something
> > > >>> > >> >> >> >>went
> > > >>> > >> >> >> >> > wrong.
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >> This isn't a great experience either. Why can't we just
> > > >>>throw
> > > >>> > >>an
> > > >>> > >> >> >> >>exception
> > > >>> > >> >> >> >> for a behavior we know is incorrect and we'd like the
> > > >>>user to
> > > >>> > >> >>know.
> > > >>> > >> >> >> >> Blocking as a means of doing that seems wrong and
> > > >>>annoying.
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >> On Sun, Mar 15, 2015 at 11:56 AM, Jay Kreps
> > > >>> > >><jay.kr...@gmail.com>
> > > >>> > >> >> >> wrote:
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >> > Cool.
> > > >>> > >> >> >> >> >
> > > >>> > >> >> >> >> > I think blocking is good or alternately throwing an
> > > >>> exception
> > > >>> > >> >> >>directly
> > > >>> > >> >> >> >> from
> > > >>> > >> >> >> >> > close(). Basically I would just worry about subtly
> > > >>>doing
> > > >>> > >> >>something
> > > >>> > >> >> >> >> slightly
> > > >>> > >> >> >> >> > different from what the user asked for as it will be
> > > >>>hard
> > > >>> to
> > > >>> > >> >>notice
> > > >>> > >> >> >> >>that
> > > >>> > >> >> >> >> > behavior difference.
> > > >>> > >> >> >> >> >
> > > >>> > >> >> >> >> > -Jay
> > > >>> > >> >> >> >> >
> > > >>> > >> >> >> >> > On Sat, Mar 14, 2015 at 5:48 PM, Jiangjie Qin
> > > >>> > >> >> >> >><j...@linkedin.com.invalid
> > > >>> > >> >> >> >> >
> > > >>> > >> >> >> >> > wrote:
> > > >>> > >> >> >> >> >
> > > >>> > >> >> >> >> > > Hi Jay,
> > > >>> > >> >> >> >> > >
> > > >>> > >> >> >> >> > > I have modified the KIP as you suggested. I thinks
> > as
> > > >>> long
> > > >>> > >>as
> > > >>> > >> >>we
> > > >>> > >> >> >> >>have
> > > >>> > >> >> >> >> > > consistent define for timeout across Kafka
> > interface,
> > > >>> there
> > > >>> > >> >>would
> > > >>> > >> >> >> >>be no
> > > >>> > >> >> >> >> > > problem. And I also agree it is better if we can
> > make
> > > >>> > >>producer
> > > >>> > >> >> >>block
> > > >>> > >> >> >> >> when
> > > >>> > >> >> >> >> > > close() is called from sender thread so user will
> > > >>>notice
> > > >>> > >> >> >>something
> > > >>> > >> >> >> >>went
> > > >>> > >> >> >> >> > > wrong.
> > > >>> > >> >> >> >> > >
> > > >>> > >> >> >> >> > > Thanks.
> > > >>> > >> >> >> >> > >
> > > >>> > >> >> >> >> > > Jiangjie (Becket) Qin
> > > >>> > >> >> >> >> > >
> > > >>> > >> >> >> >> > > On 3/14/15, 11:37 AM, "Jay Kreps"
> > > >>><jay.kr...@gmail.com>
> > > >>> > >> wrote:
> > > >>> > >> >> >> >> > >
> > > >>> > >> >> >> >> > > >Hey Jiangjie,
> > > >>> > >> >> >> >> > > >
> > > >>> > >> >> >> >> > > >I think this is going to be very confusing that
> > > >>> > >> >> >> >> > > >  close(0) waits indefinitely and
> > > >>> > >> >> >> >> > > >  close(-1) waits for 0.
> > > >>> > >> >> >> >> > > >I understand this appears in other apis, but it
> > is a
> > > >>> > >>constant
> > > >>> > >> >> >> >>cause of
> > > >>> > >> >> >> >> > > >bugs. Let's not repeat that mistake.
> > > >>> > >> >> >> >> > > >
> > > >>> > >> >> >> >> > > >Let's make close(0) wait for 0. We don't need a
> > way
> > > >>>to
> > > >>> > >>wait
> > > >>> > >> >> >> >> indefinitely
> > > >>> > >> >> >> >> > > >as
> > > >>> > >> >> >> >> > > >we already have close() so having a magical
> > constant
> > > >>>for
> > > >>> > >> >>that is
> > > >>> > >> >> >> >> > > >redundant.
> > > >>> > >> >> >> >> > > >
> > > >>> > >> >> >> >> > > >Calling close() from the I/O thread was already
> > > >>>possible
> > > >>> > >>and
> > > >>> > >> >> >>would
> > > >>> > >> >> >> >> block
> > > >>> > >> >> >> >> > > >indefinitely. I think trying to silently change
> > the
> > > >>> > >>behavior
> > > >>> > >> >>is
> > > >>> > >> >> >> >> probably
> > > >>> > >> >> >> >> > > >not right. I.e. if the user calls close() in the
> > > >>> callback
> > > >>> > >> >>there
> > > >>> > >> >> >>is
> > > >>> > >> >> >> >> > > >actually
> > > >>> > >> >> >> >> > > >some misunderstanding and they need to think more,
> > > >>> > >>silently
> > > >>> > >> >> >>making
> > > >>> > >> >> >> >> this
> > > >>> > >> >> >> >> > > >not
> > > >>> > >> >> >> >> > > >block will hide the problem from them which is the
> > > >>> > >>opposite
> > > >>> > >> >>of
> > > >>> > >> >> >> >>what we
> > > >>> > >> >> >> >> > > >want.
> > > >>> > >> >> >> >> > > >
> > > >>> > >> >> >> >> > > >-Jay
> > > >>> > >> >> >> >> > > >
> > > >>> > >> >> >> >> > > >On Thu, Mar 12, 2015 at 1:49 AM, Jiangjie Qin
> > > >>> > >> >> >> >> <j...@linkedin.com.invalid
> > > >>> > >> >> >> >> > >
> > > >>> > >> >> >> >> > > >wrote:
> > > >>> > >> >> >> >> > > >
> > > >>> > >> >> >> >> > > >> Hey Joe & Jay,
> > > >>> > >> >> >> >> > > >>
> > > >>> > >> >> >> >> > > >> Thanks for the comments on the voting thread.
> > > >>>Since it
> > > >>> > >> >>seems
> > > >>> > >> >> >>we
> > > >>> > >> >> >> >> > probably
> > > >>> > >> >> >> >> > > >> will have more discussion on this, I am just
> > > >>>replying
> > > >>> > >>from
> > > >>> > >> >>the
> > > >>> > >> >> >> >> > > >>discussion
> > > >>> > >> >> >> >> > > >> thread here.
> > > >>> > >> >> >> >> > > >> I’ve updated the KIP page to make it less like
> > > >>> > >>half-baked,
> > > >>> > >> >> >> >>apologize
> > > >>> > >> >> >> >> > for
> > > >>> > >> >> >> >> > > >> the rush...
> > > >>> > >> >> >> >> > > >>
> > > >>> > >> >> >> >> > > >> The contract in current KIP is:
> > > >>> > >> >> >> >> > > >>   1. close() - wait until all requests either
> > are
> > > >>>sent
> > > >>> > >>or
> > > >>> > >> >> >>reach
> > > >>> > >> >> >> >> > request
> > > >>> > >> >> >> >> > > >> timeout.
> > > >>> > >> >> >> >> > > >>   2. close(-1, TimeUnit.MILLISECONDS) - close
> > > >>> > >>immediately
> > > >>> > >> >> >> >> > > >>   3. close(0, TimeUnit.MILLISECONDS) -
> > equivalent
> > > >>>to
> > > >>> > >> >>close(),
> > > >>> > >> >> >> >>i.e.
> > > >>> > >> >> >> >> > Wait
> > > >>> > >> >> >> >> > > >> until all requests are sent or reach request
> > > >>>timeout
> > > >>> > >> >> >> >> > > >>   4. close(5, TimeUnit.MILLISECONDS) - try the
> > > >>>best to
> > > >>> > >> >>finish
> > > >>> > >> >> >> >> sending
> > > >>> > >> >> >> >> > > >>in 5
> > > >>> > >> >> >> >> > > >> milliseconds, if something went wrong, just
> > > >>>shutdown
> > > >>> the
> > > >>> > >> >> >>producer
> > > >>> > >> >> >> >> > > >>anyway,
> > > >>> > >> >> >> >> > > >> my callback will handle the failures.
> > > >>> > >> >> >> >> > > >>
> > > >>> > >> >> >> >> > > >> About how we define what timeout value stands
> > for,
> > > >>>I
> > > >>> > >> >>actually
> > > >>> > >> >> >> >> > struggled
> > > >>> > >> >> >> >> > > >>a
> > > >>> > >> >> >> >> > > >> little bit when wrote the patch. Intuitively,
> > > >>>close(0)
> > > >>> > >> >>should
> > > >>> > >> >> >> >>mean
> > > >>> > >> >> >> >> > > >> immediately, however it seems that all the
> > > >>>existing
> > > >>> java
> > > >>> > >> >>class
> > > >>> > >> >> >> >>have
> > > >>> > >> >> >> >> > this
> > > >>> > >> >> >> >> > > >> convention of timeout=0 means no timeout or
> > never
> > > >>> > >>timeout
> > > >>> > >> >> >> >> > > >>(Thread.join(0),
> > > >>> > >> >> >> >> > > >> Object.wait(0), etc.) So here the dilemma is
> > > >>>either we
> > > >>> > >> >>follow
> > > >>> > >> >> >>the
> > > >>> > >> >> >> >> > > >> intuition or we follow the convention. What I
> > > >>>chose is
> > > >>> > >>to
> > > >>> > >> >> >>follow
> > > >>> > >> >> >> >>the
> > > >>> > >> >> >> >> > > >> convention but document the interface to let
> > user
> > > >>>be
> > > >>> > >>aware
> > > >>> > >> >>of
> > > >>> > >> >> >>the
> > > >>> > >> >> >> >> > usage.
> > > >>> > >> >> >> >> > > >> The reason is that I think producer.close() is a
> > > >>> public
> > > >>> > >> >> >> >>interface so
> > > >>> > >> >> >> >> > it
> > > >>> > >> >> >> >> > > >> might be better to follow java convention.
> > Whereas
> > > >>> > >> >>selector is
> > > >>> > >> >> >> >>not a
> > > >>> > >> >> >> >> > > >> public interface that used by user, so as long
> > as
> > > >>>it
> > > >>> > >>makes
> > > >>> > >> >> >>sense
> > > >>> > >> >> >> >>to
> > > >>> > >> >> >> >> > us,
> > > >>> > >> >> >> >> > > >>it
> > > >>> > >> >> >> >> > > >> is less a problem to be different from java
> > > >>> convention.
> > > >>> > >> >>That
> > > >>> > >> >> >>said
> > > >>> > >> >> >> >> > since
> > > >>> > >> >> >> >> > > >> consumer.poll(timeout) is also a public
> > interface,
> > > >>>I
> > > >>> > >>think
> > > >>> > >> >>it
> > > >>> > >> >> >> >>also
> > > >>> > >> >> >> >> > makes
> > > >>> > >> >> >> >> > > >> sense to make producer.close() to have the same
> > > >>> > >>definition
> > > >>> > >> >>of
> > > >>> > >> >> >> >> > > >> consumer.poll(timeout).
> > > >>> > >> >> >> >> > > >>
> > > >>> > >> >> >> >> > > >> The main argument for keeping a timeout in close
> > > >>>would
> > > >>> > >>be
> > > >>> > >> >> >> >>separating
> > > >>> > >> >> >> >> > the
> > > >>> > >> >> >> >> > > >> close timeout from request timeout, which
> > probably
> > > >>> makes
> > > >>> > >> >> >>sense. I
> > > >>> > >> >> >> >> > would
> > > >>> > >> >> >> >> > > >> guess typically the request timeout would be
> > long
> > > >>> (e.g.
> > > >>> > >>60
> > > >>> > >> >> >> >>seconds)
> > > >>> > >> >> >> >> > > >> because we might want to consider retries with
> > > >>>back
> > > >>> off
> > > >>> > >> >>time.
> > > >>> > >> >> >>If
> > > >>> > >> >> >> >>we
> > > >>> > >> >> >> >> > have
> > > >>> > >> >> >> >> > > >> multiple batches in accumulator, in worst case
> > > >>>that
> > > >>> > >>could
> > > >>> > >> >>take
> > > >>> > >> >> >> >>up to
> > > >>> > >> >> >> >> > > >> several minutes to complete all the requests.
> > But
> > > >>>when
> > > >>> > >>we
> > > >>> > >> >> >>close a
> > > >>> > >> >> >> >> > > >> producer, we might not want to wait for that
> > long
> > > >>>as
> > > >>> it
> > > >>> > >> >>might
> > > >>> > >> >> >> >>cause
> > > >>> > >> >> >> >> > some
> > > >>> > >> >> >> >> > > >> other problem like deployment tool timeout.
> > > >>> > >> >> >> >> > > >>
> > > >>> > >> >> >> >> > > >> There is also a subtle difference between
> > > >>> close(timeout)
> > > >>> > >> >>and
> > > >>> > >> >> >> >> > > >> flush(timeout). The only purpose for flush() is
> > to
> > > >>> write
> > > >>> > >> >>data
> > > >>> > >> >> >>to
> > > >>> > >> >> >> >>the
> > > >>> > >> >> >> >> > > >> broker, so it makes perfect sense to wait until
> > > >>> request
> > > >>> > >> >> >>timeout.
> > > >>> > >> >> >> >>I
> > > >>> > >> >> >> >> > think
> > > >>> > >> >> >> >> > > >> that is why flush(timeout) looks strange. On the
> > > >>>other
> > > >>> > >> >>hand,
> > > >>> > >> >> >>the
> > > >>> > >> >> >> >>top
> > > >>> > >> >> >> >> > > >> priority for close() is to close the producer
> > > >>>rather
> > > >>> > >>than
> > > >>> > >> >> >>flush()
> > > >>> > >> >> >> >> > data,
> > > >>> > >> >> >> >> > > >>so
> > > >>> > >> >> >> >> > > >> close(timeout) gives guarantee on bounded
> > waiting
> > > >>>for
> > > >>> > >>its
> > > >>> > >> >>main
> > > >>> > >> >> >> >>job.
> > > >>> > >> >> >> >> > > >>
> > > >>> > >> >> >> >> > > >> Sorry for the confusion about forceClose flag.
> > It
> > > >>>is
> > > >>> > >>not a
> > > >>> > >> >> >>public
> > > >>> > >> >> >> >> > > >> interface. I mentioned it in Proposed Changes
> > > >>>section
> > > >>> > >> >>which I
> > > >>> > >> >> >> >> thought
> > > >>> > >> >> >> >> > > >>was
> > > >>> > >> >> >> >> > > >> supposed to provide implementation details.
> > > >>> > >> >> >> >> > > >>
> > > >>> > >> >> >> >> > > >> Thanks again for all the comments and
> > suggestions!
> > > >>> > >> >> >> >> > > >>
> > > >>> > >> >> >> >> > > >> Jiangjie (Becket) Qin
> > > >>> > >> >> >> >> > > >>
> > > >>> > >> >> >> >> > > >> On 3/10/15, 8:57 PM, "Jiangjie Qin" <
> > > >>> j...@linkedin.com>
> > > >>> > >> >> wrote:
> > > >>> > >> >> >> >> > > >>
> > > >>> > >> >> >> >> > > >> >The KIP page has been updated per Jay¹s
> > comments.
> > > >>> > >> >> >> >> > > >> >I¹d like to initiate the voting process if no
> > > >>>further
> > > >>> > >> >> >>comments
> > > >>> > >> >> >> >>are
> > > >>> > >> >> >> >> > > >> >received by tomorrow.
> > > >>> > >> >> >> >> > > >> >
> > > >>> > >> >> >> >> > > >> >Jiangjie (Becket) Qin
> > > >>> > >> >> >> >> > > >> >
> > > >>> > >> >> >> >> > > >> >On 3/8/15, 9:45 AM, "Jay Kreps"
> > > >>><jay.kr...@gmail.com
> > > >>> >
> > > >>> > >> >>wrote:
> > > >>> > >> >> >> >> > > >> >
> > > >>> > >> >> >> >> > > >> >>Hey Jiangjie,
> > > >>> > >> >> >> >> > > >> >>
> > > >>> > >> >> >> >> > > >> >>Can you capture the full motivation and use
> > > >>>cases
> > > >>> for
> > > >>> > >>the
> > > >>> > >> >> >> >>feature?
> > > >>> > >> >> >> >> > > >>This
> > > >>> > >> >> >> >> > > >> >>mentions your interest in having a way of
> > > >>>aborting
> > > >>> > >>from
> > > >>> > >> >> >>inside
> > > >>> > >> >> >> >>the
> > > >>> > >> >> >> >> > > >> >>Callback. But it doesn't really explain that
> > > >>>usage
> > > >>> or
> > > >>> > >>why
> > > >>> > >> >> >>other
> > > >>> > >> >> >> >> > people
> > > >>> > >> >> >> >> > > >> >>would want to do that. It also doesn't list
> > the
> > > >>> > >>primary
> > > >>> > >> >>use
> > > >>> > >> >> >> >>case
> > > >>> > >> >> >> >> for
> > > >>> > >> >> >> >> > > >> >>having
> > > >>> > >> >> >> >> > > >> >>close with a bounded timeout which was to
> > avoid
> > > >>> > >>blocking
> > > >>> > >> >>too
> > > >>> > >> >> >> >>long
> > > >>> > >> >> >> >> on
> > > >>> > >> >> >> >> > > >> >>shutdown.
> > > >>> > >> >> >> >> > > >> >>
> > > >>> > >> >> >> >> > > >> >>-Jay
> > > >>> > >> >> >> >> > > >> >>
> > > >>> > >> >> >> >> > > >> >>
> > > >>> > >> >> >> >> > > >> >>
> > > >>> > >> >> >> >> > > >> >>On Sat, Mar 7, 2015 at 12:25 PM, Jiangjie Qin
> > > >>> > >> >> >> >> > > >><j...@linkedin.com.invalid
> > > >>> > >> >> >> >> > > >> >
> > > >>> > >> >> >> >> > > >> >>wrote:
> > > >>> > >> >> >> >> > > >> >>
> > > >>> > >> >> >> >> > > >> >>> Hi,
> > > >>> > >> >> >> >> > > >> >>>
> > > >>> > >> >> >> >> > > >> >>> I just created a KIP for adding a
> > > >>>close(timeout)
> > > >>> to
> > > >>> > >>new
> > > >>> > >> >> >> >> producer.
> > > >>> > >> >> >> >> > > >>Most
> > > >>> > >> >> >> >> > > >> >>>of
> > > >>> > >> >> >> >> > > >> >>> the previous discussions are in KAFKA-1660
> > > >>>where
> > > >>> > >>Parth
> > > >>> > >> >> >> >> Brahmbhatt
> > > >>> > >> >> >> >> > > >>has
> > > >>> > >> >> >> >> > > >> >>> already done a lot of work.
> > > >>> > >> >> >> >> > > >> >>> Since this is an interface change so we are
> > > >>>going
> > > >>> > >> >>through
> > > >>> > >> >> >>the
> > > >>> > >> >> >> >> KIP
> > > >>> > >> >> >> >> > > >> >>>process.
> > > >>> > >> >> >> >> > > >> >>> Here is the KIP link:
> > > >>> > >> >> >> >> > > >> >>>
> > > >>> > >> >> >> >> > > >> >>>
> > > >>> > >> >> >> >> > > >>
> > > >>> > >> >> >> >> > >
> > > >>> > >> >> >> >>
> > > >>> > >> >> >>
> > > >>> > >> >>
> > > >>> > >>
> > > >>> >
> > > >>>
> > > https://cwiki.apache.org/confluence/pages/viewpage.action?pageId=5373978
> > > >>> > >> >> >> >> > > >> >>>2
> > > >>> > >> >> >> >> > > >> >>>
> > > >>> > >> >> >> >> > > >> >>> Thanks.
> > > >>> > >> >> >> >> > > >> >>>
> > > >>> > >> >> >> >> > > >> >>> Jiangjie (Becket) Qin
> > > >>> > >> >> >> >> > > >> >>>
> > > >>> > >> >> >> >> > > >> >
> > > >>> > >> >> >> >> > > >>
> > > >>> > >> >> >> >> > > >>
> > > >>> > >> >> >> >> > >
> > > >>> > >> >> >> >> > >
> > > >>> > >> >> >> >> >
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >> --
> > > >>> > >> >> >> >> Thanks,
> > > >>> > >> >> >> >> Neha
> > > >>> > >> >> >> >>
> > > >>> > >> >> >> >
> > > >>> > >> >> >> >
> > > >>> > >> >> >> >
> > > >>> > >> >> >> >--
> > > >>> > >> >> >> >-- Guozhang
> > > >>> > >> >> >>
> > > >>> > >> >> >>
> > > >>> > >> >> >
> > > >>> > >> >> >
> > > >>> > >> >> >--
> > > >>> > >> >> >-- Guozhang
> > > >>> > >> >>
> > > >>> > >> >>
> > > >>> > >> >
> > > >>> > >> >
> > > >>> > >> >--
> > > >>> > >> >-- Guozhang
> > > >>> > >>
> > > >>> > >>
> > > >>> >
> > > >>> >
> > > >>>
> > > >>
> > > >>
> > > >>
> > > >>--
> > > >>-- Guozhang
> > > >
> > >
> > >
> >
> 
> 
> 
> -- 
> Thanks,
> Neha

Reply via email to