Vrushali C created YARN-4063:
--------------------------------

             Summary: Populate the flow activity table
                 Key: YARN-4063
                 URL: https://issues.apache.org/jira/browse/YARN-4063
             Project: Hadoop YARN
          Issue Type: Sub-task
            Reporter: Vrushali C


Need to populate the flow_activity table

-Stores per day flow run pointers and info
-Written to by RM’s collector for application lifecycle
primary key: cluster ! day timestamp ! user ! flow id 
-For the day timestamp we can take the millis since epoch for the end of the 
day (24:00h).
columns include runids, start time, end time, snapshot time
-This table will also be used to efficiently retrieve the flows that had an 
activity in a certain day. That is needed for daily aggregations, but also for 
several UIs, including a flow-based UI.




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to