[
https://issues.apache.org/jira/browse/SLIDER-426?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14165367#comment-14165367
]
thomas liu commented on SLIDER-426:
-----------------------------------
In sliderClient.actionFreeze():
after checking application state is < YarnApplicationState.FINISHED, if the
application state is not in ‘YarnApplicationState.RUNNING’, then mark boolean
local variable forcekill to be true
Race condition here may happen:
1) The application was in non running state when querying, and however, moved
to running state when killing it.
Given we are killing the application by force anyway, there wouldn’t be bad
results
2) The application was in running state when querying, and however, moved to
non running state when killing it.
Current decision is to leave with this race condition
actionDestroy() is left as it is
> If an app is in ACCEPTED state then it cannot be stopped or destroyed
> ---------------------------------------------------------------------
>
> Key: SLIDER-426
> URL: https://issues.apache.org/jira/browse/SLIDER-426
> Project: Slider
> Issue Type: Bug
> Components: client
> Affects Versions: Slider 0.50
> Reporter: Sumit Mohanty
> Assignee: thomas liu
> Fix For: Slider 0.60
>
>
> {noformat}
> [yarn@c6401 bin]$ ./slider stop cl1
> ...
> 2014-09-13 15:14:24,357 [main] INFO client.RMProxy - Connecting to
> ResourceManager at c6401.ambari.apache.org/192.168.64.101:8050
> 2014-09-13 15:14:26,042 [main] WARN client.SliderClient - Exception while
> trying to terminate cl1:
> org.apache.slider.core.exceptions.BadClusterStateException: Application not
> running: application_1410619964429_0001 state=ACCEPTED
> 2014-09-13 15:14:26,051 [main] INFO util.ExitUtil - Exiting with status -1
> [yarn@c6401 bin]$ ./slider destroy cl1
> ...
> 2014-09-13 02:13:39,837 [main] INFO client.RMProxy - Connecting to
> ResourceManager at c6401.ambari.apache.org/192.168.64.101:8050
> 2014-09-13 02:13:41,687 [main] ERROR main.ServiceLauncher - cl1: Application
> Instance already running :applicationId { id: 1 cluster_timestamp:
> 1410570014723 } user: "yarn" queue: "default" name: "cl1" host: "N/A"
> rpc_port: -1 yarn_application_state: ACCEPTED trackingUrl:
> "http://c6401.ambari.apache.org:8088/proxy/application_1410570014723_0001/"
> diagnostics: "" startTime: 1410574245847 finishTime: 0
> final_application_status: APP_UNDEFINED app_resource_Usage {
> num_used_containers: 0 num_reserved_containers: 0 used_resources { memory: 0
> virtual_cores: 0 } reserved_resources { memory: 0 virtual_cores: 0 }
> needed_resources { memory: 0 virtual_cores: 0 } } originalTrackingUrl: "N/A"
> currentApplicationAttemptId { application_id { id: 1 cluster_timestamp:
> 1410570014723 } attemptId: 1 } progress: 0.0 applicationType:
> "org-apache-slider" applicationTags: "name: hbase" applicationTags: "version:
> 0.98.5-hadoop2" applicationTags: "description: apache hbase is the hadoop
> database, a distributed, scalable, big data store...."
> 2014-09-13 02:13:41,693 [main] INFO util.ExitUtil - Exiting with status 74
> {noformat}
> [~thomas_liu] can you discuss with [[email protected]] regarding this?
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)