[ 
https://issues.apache.org/jira/browse/YARN-4063?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14707256#comment-14707256
 ] 

Joep Rottinghuis commented on YARN-4063:
----------------------------------------

Min max should work well.
Wondering how we'd guarantee a flush or compaction even happens around end of 
day time.

Perhaps better to let AM do a daily snapshot so it can decide how to divvy up 
counters across daily boundaries.

Sent from my iPhone



> Populate the flow activity table
> --------------------------------
>
>                 Key: YARN-4063
>                 URL: https://issues.apache.org/jira/browse/YARN-4063
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: timelineserver
>            Reporter: Vrushali C
>            Assignee: Vrushali C
>
> Need to populate the flow_activity table
> -Stores per day flow run pointers and info
> -Written to by RM’s collector for application lifecycle
> primary key: cluster ! day timestamp ! user ! flow id 
> -For the day timestamp we can take the millis since epoch for the end of the 
> day (24:00h).
> columns include runids, start time, end time, snapshot time
> -This table will also be used to efficiently retrieve the flows that had an 
> activity in a certain day. That is needed for daily aggregations, but also 
> for several UIs, including a flow-based UI.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to