[
https://issues.apache.org/jira/browse/YARN-3127?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Naganarasimha G R updated YARN-3127:
------------------------------------
Attachment: YARN-3127.20150329-1.patch
Thanks [~ozawa] for the review and sorry for the delay in the response as i was
held up in other issues ...
bq. Could you add a test case to TestRMRestart to cover the case?
Have taken care in this updated patch
bq. can we preserve following test cases?
As there are changes in the transitions if these methods were there, then
TestRMAppTransitions was failing for multiple testcases. Approach adopted to
fix this issue is : earlier SystemMetricPublisher.appCreated() was invoked
during creation of RMAppImpl itself and also SystemMetricPublisher.ACLsUpdated
was invoked in RMAppManager.createAndPopulateNewRMApp which was common to both
recover and new application execution flow, so I have removed from the above
mentioned places and placed it in RMAppManager.publishSystemMetrics thus
ensuring that only during new application execution flow these updates are sent
to SystemMetricPublisher
> Apphistory url crashes when RM switches with ATS enabled
> --------------------------------------------------------
>
> Key: YARN-3127
> URL: https://issues.apache.org/jira/browse/YARN-3127
> Project: Hadoop YARN
> Issue Type: Bug
> Components: resourcemanager, timelineserver
> Affects Versions: 2.6.0
> Environment: RM HA with ATS
> Reporter: Bibin A Chundatt
> Assignee: Naganarasimha G R
> Attachments: YARN-3127.20150213-1.patch, YARN-3127.20150329-1.patch
>
>
> 1.Start RM with HA and ATS configured and run some yarn applications
> 2.Once applications are finished sucessfully start timeline server
> 3.Now failover HA form active to standby
> 4.Access timeline server URL <IP>:<PORT>/applicationhistory
> Result: Application history URL fails with below info
> {quote}
> 2015-02-03 20:28:09,511 ERROR org.apache.hadoop.yarn.webapp.View: Failed to
> read the applications.
> java.lang.reflect.UndeclaredThrowableException
> at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1643)
> at
> org.apache.hadoop.yarn.server.webapp.AppsBlock.render(AppsBlock.java:80)
> at
> org.apache.hadoop.yarn.webapp.view.HtmlBlock.render(HtmlBlock.java:67)
> at
> org.apache.hadoop.yarn.webapp.view.HtmlBlock.renderPartial(HtmlBlock.java:77)
> at org.apache.hadoop.yarn.webapp.View.render(View.java:235)
> at
> org.apache.hadoop.yarn.webapp.view.HtmlPage$Page.subView(HtmlPage.java:49)
> ...
> Caused by:
> org.apache.hadoop.yarn.exceptions.ApplicationAttemptNotFoundException: The
> entity for application attempt appattempt_1422972608379_0001_000001 doesn't
> exist in the timeline store
> at
> org.apache.hadoop.yarn.server.applicationhistoryservice.ApplicationHistoryManagerOnTimelineStore.getApplicationAttempt(ApplicationHistoryManagerOnTimelineStore.java:151)
> at
> org.apache.hadoop.yarn.server.applicationhistoryservice.ApplicationHistoryManagerOnTimelineStore.generateApplicationReport(ApplicationHistoryManagerOnTimelineStore.java:499)
> at
> org.apache.hadoop.yarn.server.applicationhistoryservice.ApplicationHistoryManagerOnTimelineStore.getAllApplications(ApplicationHistoryManagerOnTimelineStore.java:108)
> at
> org.apache.hadoop.yarn.server.webapp.AppsBlock$1.run(AppsBlock.java:84)
> at
> org.apache.hadoop.yarn.server.webapp.AppsBlock$1.run(AppsBlock.java:81)
> at java.security.AccessController.doPrivileged(Native Method)
> at javax.security.auth.Subject.doAs(Subject.java:415)
> at
> org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1628)
> ... 51 more
> 2015-02-03 20:28:09,512 ERROR org.apache.hadoop.yarn.webapp.Dispatcher: error
> handling URI: /applicationhistory
> org.apache.hadoop.yarn.webapp.WebAppException: Error rendering block:
> nestLevel=6 expected 5
> at
> org.apache.hadoop.yarn.webapp.view.HtmlBlock.render(HtmlBlock.java:69)
> at
> org.apache.hadoop.yarn.webapp.view.HtmlBlock.renderPartial(HtmlBlock.java:77)
> {quote}
> Behaviour with AHS with file based history store
> -Apphistory url is working
> -No attempt entries are shown for each application.
>
> Based on inital analysis when RM switches ,application attempts from state
> store are not replayed but only applications are.
> So when /applicaitonhistory url is accessed it tries for all attempt id and
> fails
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)