[
https://issues.apache.org/jira/browse/CHUKWA-253?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Cheng updated CHUKWA-253:
-------------------------
Attachment: chukwa-253.patch
Patch submitted.
* the pig script is at chukwa-home/script/pig
* the shell script for cronjob is at chukwa-home/bin. To manually run the shell
script, use command
/path/to/chukwa-home/bin/UserDailySummary.sh <YYYYMMDD> <CLUSTER>
> Aggregations by User
> --------------------
>
> Key: CHUKWA-253
> URL: https://issues.apache.org/jira/browse/CHUKWA-253
> Project: Hadoop Chukwa
> Issue Type: New Feature
> Reporter: Cheng
> Attachments: chukwa-253.patch
>
>
> Typically a user submits jobs with similar characteristics. Aggregating the
> following metrics based on users can help
> quickly identify VIP users and how their jobs look like:
> - slot-hours used for map tasks, for reduce tasks
> - total jobs, jobs failed
> - data-local-maps, rack-local-maps, remote-maps
> - total map-input-bytes, reduce-output-records
> - total map tasks, total reduce tasks
> The granularity of the aggregation can be as coarse as daily. Data may be
> used to report top-K users in certain
> categories. Data shall be available as chukwa records (namely, one record per
> day per user).
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.