Sangjin Lee created YARN-5095:
---------------------------------

             Summary: flow activities and flow runs are populated with wrong 
timestamp when RM restarts w/ recovery enabled
                 Key: YARN-5095
                 URL: https://issues.apache.org/jira/browse/YARN-5095
             Project: Hadoop YARN
          Issue Type: Sub-task
          Components: timelineserver
    Affects Versions: YARN-2928
            Reporter: Sangjin Lee
            Priority: Critical


I have RM recovery enabled. Upon restart, the RM writes records into the flow 
activity and flow run tables, but with *wrong* timestamps. By timestamp I mean 
the timestamp portion of the row key:
- flow activity: the row is created with the day of the RM restart
- flow run: the row is created with the RM start time as the "run id"

The following illustrates an example flow run:
{noformat}
metrics: [ ],
events: [ ],
id: "sjlee@Sleep job/1463433569917",
type: "YARN_FLOW_RUN",
createdtime: 1463422860987,
info: {
UID: "yarn_cluster!sjlee!Sleep job!1463433569917",
SYSTEM_INFO_FLOW_RUN_ID: 1463433569917,
SYSTEM_INFO_FLOW_NAME: "Sleep job",
SYSTEM_INFO_FLOW_RUN_END_TIME: 1463422865033,
SYSTEM_INFO_USER: "sjlee"
},
isrelatedto: { },
relatesto: { }
{noformat}
The created time and the end time are correct (i.e. the original times), whereas 
the timestamp in the row key (= run id: 1463433569917) is later than the end 
time and coincides with the RM restart.
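
To make the mismatch concrete, here is a minimal sketch (not code from the 
timeline service) that pulls the run id out of the UID shown above and compares 
it with the recorded end time. The helper names are hypothetical, and the UID 
layout "cluster!user!flow name!run id" is only inferred from the example entity.

```python
from datetime import datetime, timezone

# Values copied from the example flow run entity (epoch milliseconds).
UID = "yarn_cluster!sjlee!Sleep job!1463433569917"
CREATED_TIME = 1463422860987
END_TIME = 1463422865033


def run_id_from_uid(uid):
    """Return the trailing run id of a UID (hypothetical helper).

    Assumes the layout cluster!user!flow name!run id, as inferred from
    the example; the real UID format may differ.
    """
    return int(uid.rsplit("!", 1)[1])


def run_id_after_end(run_id, end_time):
    """True when the run id is later than the end time of the run."""
    return run_id > end_time


def to_utc(ms):
    """Convert epoch milliseconds to a timezone-aware UTC datetime."""
    return datetime.fromtimestamp(ms / 1000, tz=timezone.utc)


run_id = run_id_from_uid(UID)
gap_ms = run_id - END_TIME

print("created:", to_utc(CREATED_TIME).isoformat())
print("ended:  ", to_utc(END_TIME).isoformat())
print("run id: ", to_utc(run_id).isoformat())
print("run id after end time:", run_id_after_end(run_id, END_TIME))
print("gap in ms:", gap_ms)
```

For the entity above the run id is 10704884 ms (just under three hours) later 
than the end time, which is what one would expect if the run id were taken from 
the RM restart rather than from the original submission.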
