[
https://issues.apache.org/jira/browse/AURORA-898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14192788#comment-14192788
]
Maxim Khutornenko commented on AURORA-898:
------------------------------------------
Tried both - no repro:
h6.Master scheduler + 10/22 client
{noformat}
I1031 23:45:31.453 THREAD121
org.apache.aurora.scheduler.thrift.aop.LoggingInterceptor.invoke:
createJob(JobConfiguration(key:JobKey(role:www-data, environment:prod, name:he
llo), owner:Identity(role:www-data, user:vagrant), cronSchedule:null,
cronCollisionPolicy:KILL_EXISTING, taskConfig:TaskConfig(job:null,
owner:Identity(role:www-data, user:v
agrant), environment:prod, jobName:hello, isService:true, numCpus:1.0,
ramMb:128, diskMb:128, priority:0, maxTaskFailures:1, production:false,
constraints:[], requestedPorts
:[], taskLinks:{}, executorConfig:ExecutorConfig(name:BLANKED, data:BLANKED),
metadata:[]), instanceCount:1), null, SessionKey(mechanism:UNAUTHENTICATED,
data:50 D0 14 4C 71 0D 4C 80 80 4C 40))
{noformat}
h6.10/22 scheduler + master client
{noformat}
I1031 23:53:23.884 THREAD119
org.apache.aurora.scheduler.thrift.aop.LoggingInterceptor.invoke:
createJob(JobConfiguration(key:JobKey(role:www-data, environment:prod, name:he
llo), owner:Identity(role:www-data, user:vagrant), cronSchedule:null,
cronCollisionPolicy:KILL_EXISTING,
taskConfig:TaskConfig(owner:Identity(role:www-data, user:vagrant), e
nvironment:prod, jobName:hello, isService:true, numCpus:1.0, ramMb:128,
diskMb:128, priority:0, maxTaskFailures:1, production:false, constraints:[],
requestedPorts:[], taskL
inks:{}, executorConfig:ExecutorConfig(name:BLANKED, data:BLANKED),
metadata:[]), instanceCount:1), null, SessionKey(mechanism:UNAUTHENTICATED,
data:50 D0 14 4C 71 0D 4C 80 80 4C 40))
{noformat}
Any chance your thrift.py \[1\] is different than the one on master or you have
any additional thrift interceptors similar to \[2\] that may mutate the
TaskConfig before it reaches createJob() RPC? Does it repro for you in vagrant?
What command syntax do you use to create a job?
\[1\] -
https://github.com/apache/incubator-aurora/blob/master/src/main/python/apache/aurora/config/thrift.py
\[2\] -
https://github.com/apache/incubator-aurora/blob/master/src/main/java/org/apache/aurora/scheduler/thrift/aop/LoggingInterceptor.java
> unable to kill a job that is in ASSIGNED state
> ----------------------------------------------
>
> Key: AURORA-898
> URL: https://issues.apache.org/jira/browse/AURORA-898
> Project: Aurora
> Issue Type: Bug
> Components: Scheduler
> Affects Versions: 0.5.0
> Reporter: Bhuvan Arumugam
>
> we unable to kill a job that's in ASSIGNED state. it's always reproducible,
> even with a hello world job.
> The {{aurora killall}} command give up after 5mins with this message:
> {code}
> .
> .
> DEBUG "POST /api HTTP/1.1" 200 None
> DEBUG] "POST /api HTTP/1.1" 200 None
> DEBUG] handle_response(): returning <Response [200]>
> DEBUG] Response from scheduler: OK (message: None)
> FATAL] Tasks were not killed in time.
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)