[
https://issues.apache.org/jira/browse/CHUKWA-55?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Eric Yang updated CHUKWA-55:
----------------------------
Attachment: CHUKWA-55.patch
Slot hours is calculate from sum of the task startup and end time * number of
attempts, aggregated for all jobs and group by user. HDFS bytes usage is
calculated from dus periodically. By doing time intersection and time
grouping, we get a time series data for user usage of the slot hours, and hdfs
usage.
> Collect and aggregate slot-hours usage by user, and by scheduler queues.
> ------------------------------------------------------------------------
>
> Key: CHUKWA-55
> URL: https://issues.apache.org/jira/browse/CHUKWA-55
> Project: Hadoop Chukwa
> Issue Type: New Feature
> Components: data collection, Data Processors, input tools, User
> Interface
> Environment: Redhat 5.1, Java 6
> Reporter: Eric Yang
> Assignee: Eric Yang
> Priority: Blocker
> Attachments: CHUKWA-55.patch
>
>
> The easiest model is:
> Step 1: Collect Disk usage by user
> Step 2: Collect Task Slot usage by user
> Step 3: Aggregating data by scheduler queues
> Step 4: Display data on HICC portal
> Step 5: Profit!
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.