[jira] Updated: (CHUKWA-253) Aggregations by User

Cheng (JIRA) Thu, 28 May 2009 13:46:18 -0700

     [ 
https://issues.apache.org/jira/browse/CHUKWA-253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Cheng updated CHUKWA-253:
-------------------------

    Attachment: chukwa-253.patch

Patch submitted.
* the pig script is at chukwa-home/script/pig
* the shell script for cronjob is at chukwa-home/bin. To manually run the shell 
script, use command
        /path/to/chukwa-home/bin/UserDailySummary.sh <YYYYMMDD> <CLUSTER>

> Aggregations by User
> --------------------
>
>                 Key: CHUKWA-253
>                 URL: https://issues.apache.org/jira/browse/CHUKWA-253
>             Project: Hadoop Chukwa
>          Issue Type: New Feature
>            Reporter: Cheng
>         Attachments: chukwa-253.patch
>
>
> Typically a user submits jobs with similar characteristics. Aggregating the 
> following metrics based on users can help
> quickly identify VIP users and how their jobs look like:
> -       slot-hours used for map tasks, for reduce tasks
> -       total jobs, jobs failed
> -       data-local-maps,  rack-local-maps, remote-maps
> -       total map-input-bytes, reduce-output-records
> -       total map tasks, total reduce tasks
> The granularity of the aggregation can be as coarse as daily. Data may be 
> used to report top-K users in certain
> categories. Data shall be available as chukwa records (namely, one record per 
> day per user).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Updated: (CHUKWA-253) Aggregations by User

Reply via email to