[ 
https://issues.apache.org/jira/browse/CHUKWA-369?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12743808#action_12743808
 ] 

Ari Rabkin commented on CHUKWA-369:
-----------------------------------

The patch has five pieces; I'm happy to split them up and commit them 
separately if some are uncontroversial.

1) Sender and Connector are refactored to allow the HttpClient to be reused, 
and used more generally.
2) Writers now return an instance of ChukwaWriter.CommitStatus.  This is either 
OK, Failure, or Pending. The first two are singletons; the last includes a 
list of strings.
3) SeqFileWriter returns a CommitPending on writes.
4) A new servlet, CommitCheckServlet, that periodically scans HDFS.
5) A new Sender, the AsyncAckSender, that doesn't commit automatically. It 
commits only when it receives an OK, or once a pending commit becomes 
stable.  The Sender periodically asks a CommitCheckServlet what has been 
committed.
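
As a rough illustration of how pieces (2) and (5) fit together, here is a 
minimal Java sketch. It is not the code in the patch. Every name below other 
than CommitStatus is hypothetical, and since the contents of the Pending 
string list are not spelled out above, the sketch treats them as opaque 
identifiers that a commit check can later report as stable.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Sketch only: all names except CommitStatus are assumptions for illustration.
class CommitStatusSketch {

  /** Result a writer hands back after a write: OK, Failure, or Pending. */
  interface CommitStatus {}

  /** Singleton: the data is already committed. */
  static final CommitStatus COMMIT_OK = new CommitStatus() {
    @Override public String toString() { return "OK"; }
  };

  /** Singleton: the write failed. */
  static final CommitStatus COMMIT_FAIL = new CommitStatus() {
    @Override public String toString() { return "Failure"; }
  };

  /** Not a singleton: carries the strings that must be confirmed later. */
  static final class CommitPending implements CommitStatus {
    final List<String> pendingEntries;

    CommitPending(List<String> entries) {
      // Defensive copy so the status stays immutable after creation.
      this.pendingEntries = Collections.unmodifiableList(new ArrayList<>(entries));
    }

    @Override public String toString() { return "Pending" + pendingEntries; }
  }

  /**
   * Hypothetical sender-side rule: commit on OK, or on Pending once every
   * pending entry appears in the set reported as stable. Never on Failure.
   */
  static boolean readyToCommit(CommitStatus status, Set<String> stableEntries) {
    if (status == COMMIT_OK) {
      return true;
    }
    if (status instanceof CommitPending) {
      return stableEntries.containsAll(((CommitPending) status).pendingEntries);
    }
    return false;
  }

  public static void main(String[] args) {
    CommitStatus pending = new CommitPending(Arrays.asList("entry-1", "entry-2"));
    Set<String> stable = new HashSet<>();

    // Nothing has been reported stable yet.
    System.out.println(pending + " ready: " + readyToCommit(pending, stable));

    // A later commit check reports both entries as stable.
    stable.add("entry-1");
    stable.add("entry-2");
    System.out.println(pending + " ready: " + readyToCommit(pending, stable));
  }
}
```

The point of making OK and Failure singletons is that callers can compare 
them by identity, while each Pending carries its own list to check later.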

I think (1), and possibly (2) and (3), make sense even without (4) and (5), 
which are the pieces that I think need serious testing before we even discuss 
committing them.

> proposed reliability mechanism
> ------------------------------
>
>                 Key: CHUKWA-369
>                 URL: https://issues.apache.org/jira/browse/CHUKWA-369
>             Project: Hadoop Chukwa
>          Issue Type: New Feature
>          Components: data collection
>    Affects Versions: 0.3.0
>            Reporter: Ari Rabkin
>            Assignee: Ari Rabkin
>             Fix For: 0.3.0
>
>         Attachments: delayedAcks.patch
>
>
> We like to say that Chukwa is a system for reliable log collection. It isn't, 
> quite, since we don't handle collector crashes.  Here's a proposed 
> reliability mechanism.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.
