[ https://issues.apache.org/jira/browse/CHUKWA-203?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12717960#action_12717960 ]
Ari Rabkin commented on CHUKWA-203:
-----------------------------------
I don't entirely understand the scope of this. I had thought our model was
that adaptors conceal rotation from stages farther up the line?
I definitely like the idea of issuing an "end of file, adaptor has
deregistered" chunk/marker.
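To make the marker idea concrete, here is a rough sketch of what such a record might carry. Every name in it (AdaptorMarker, MarkerQueue, the tab-separated layout) is hypothetical and made up for illustration; none of it is existing Chukwa code, and the real chunk and queue types would take its place.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical marker record; not an existing Chukwa class. */
final class AdaptorMarker {
    /** ADD when an adaptor starts, REMOVE when it deregisters. */
    enum Kind { ADD, CHECKPOINT, REMOVE }

    final Kind kind;
    final String adaptorId; // assumed identifier for the adaptor instance
    final String source;    // e.g. the path of the file being tailed
    final long offset;      // bytes sent so far; the final offset for REMOVE

    AdaptorMarker(Kind kind, String adaptorId, String source, long offset) {
        this.kind = kind;
        this.adaptorId = adaptorId;
        this.source = source;
        this.offset = offset;
    }

    /** Serialize as one tab-separated line that could be the body of a chunk. */
    String toLine() {
        return kind + "\t" + adaptorId + "\t" + source + "\t" + offset;
    }
}

/** Hypothetical stand-in for the agent's outgoing chunk queue. */
final class MarkerQueue {
    private final List<String> lines = new ArrayList<>();

    /** Queue a marker alongside the regular data chunks. */
    void post(AdaptorMarker marker) {
        lines.add(marker.toLine());
    }

    /** Return everything queued so far and empty the queue. */
    List<String> drain() {
        List<String> out = new ArrayList<>(lines);
        lines.clear();
        return out;
    }
}
```

An adaptor would post an ADD marker when it starts and a REMOVE marker carrying its final offset when it deregisters, so stages farther up the line see an explicit end of stream.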
> Track data loading from agent
> -----------------------------
>
> Key: CHUKWA-203
> URL: https://issues.apache.org/jira/browse/CHUKWA-203
> Project: Hadoop Chukwa
> Issue Type: New Feature
> Components: data collection, Data Processors
> Reporter: Jerome Boulon
> Priority: Critical
>
> Chukwa needs to track progress on all files to verify completeness.
> A first step could be to send adaptor information to the backend for
> post-processing and storage.
> This could be done whenever the checkpoint file is written, by
> building a chunk and posting it to the queue.
> In addition, we need to track all Add/Remove operations and the final
> offset for every file. The easiest way to do this would be to generate this
> information at the beginning and the end of each adaptor.
> Based on that, we should be able to:
> - track any file from the add to the remove
> - validate that all data has been sent
> - track all file rotations
> - record any permission issue (expiration policy)
> - generate alerts
>
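As an illustration of the "validate that all data has been sent" step, the backend could compare the bytes it has received for an adaptor against the final offset reported when that adaptor is removed. The sketch below is only that, under stated assumptions: CompletenessTracker and its methods are hypothetical names, and it assumes each adaptor starts at offset 0 and that the final offset counts bytes.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical backend-side check; not an existing Chukwa class. */
final class CompletenessTracker {
    // Bytes received so far, keyed by an assumed adaptor identifier.
    private final Map<String, Long> received = new HashMap<>();
    // Final offset reported by the adaptor when it was removed.
    private final Map<String, Long> finalOffset = new HashMap<>();

    /** Called when the Add marker for an adaptor arrives. */
    void onAdd(String adaptorId) {
        received.putIfAbsent(adaptorId, 0L);
    }

    /** Called for every data chunk; bytes is the chunk's payload length. */
    void onData(String adaptorId, long bytes) {
        received.merge(adaptorId, bytes, Long::sum);
    }

    /** Called when the Remove marker arrives with the adaptor's final offset. */
    void onRemove(String adaptorId, long offset) {
        finalOffset.put(adaptorId, offset);
    }

    /** True only if the adaptor was removed and every byte up to its final offset arrived. */
    boolean isComplete(String adaptorId) {
        Long end = finalOffset.get(adaptorId);
        Long got = received.get(adaptorId);
        return end != null && got != null && end.longValue() == got.longValue();
    }
}
```

A removed adaptor for which isComplete returns false is a natural candidate for one of the alerts listed above.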