[ https://issues.apache.org/jira/browse/AURORA-483?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14011989#comment-14011989 ]
Bill Farner commented on AURORA-483:
------------------------------------
My regression, introduced in
{noformat}
commit a8fa267f03e2fa36039eb014884858074f7b7575
Author: Bill Farner <[email protected]>
Date:   Tue May 27 11:25:02 2014 -0700

    Enable some additional PMD rules, and fix sources to satisfy them.

    Reviewed at https://reviews.apache.org/r/21849/
{noformat}
Review incoming shortly.
> Following tutorial results in LOST tasks
> ----------------------------------------
>
> Key: AURORA-483
> URL: https://issues.apache.org/jira/browse/AURORA-483
> Project: Aurora
> Issue Type: Bug
> Reporter: Bill Farner
> Assignee: Bill Farner
> Priority: Blocker
>
> Running the vagrant tutorial fails on master:
> {noformat}
> $ git log -1 --oneline
> 3d09a75 Add big job to local test fixture
> $ vagrant up
> $ bash examples/vagrant/test_tutorial.sh
> Checking scheduler health...\c
> PASS
> Checking observer health...\c
> PASS
> Checking master health...\c
> PASS
> Checking slave health...\c
> PASS
> Running create command...\c
> PASS
> Waiting to observe FAILED task...\c
> FAIL
> Command output:\nConnection to 127.0.0.1 closed.
> Running update command...\c
> FAIL
> Command output:\n INFO] Updating job: hello_world
> INFO] Starting job update.
> INFO] Examining instances: [0]
> INFO] Killing instances: [0]
> INFO] Instances killed
> INFO] Adding instances: [0]
> INFO] Instances added
> INFO] Watching instances: [0]
> INFO] Instance 0 was not reported healthy within 60 seconds
> ERROR] Failed instances: set([0])
> WARN] Not restarting failed instances [0], which exceeded maximum allowed
> instance failure limit of 0
> ERROR] 1 failed instances observed, maximum allowed is 0
> ERROR] 1 instance failures for instance 0, maximum allowed is 0
> INFO] Reverting update for [0]
> INFO] Examining instances: [0]
> INFO] Killing instances: [0]
> INFO] Instances killed
> INFO] Adding instances: [0]
> INFO] Instances added
> INFO] Watching instances: [0]
> INFO] Instance 0 was not reported healthy within 60 seconds
> ERROR] Rollback failed for instances: [0]
> ERROR] 1 failed instances observed, maximum allowed is 0
> ERROR] 1 instance failures for instance 0, maximum allowed is 0
> WARN] Update failures threshold reached
> INFO] Response from scheduler: ERROR (message: Update reverted)
> {noformat}
> In the web UI, the task is shown to have failed with the following message:
> {noformat}
> LOST : Task uses invalid resources: ports(*):[]
> {noformat}