[
https://issues.apache.org/jira/browse/AURORA-1223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1223:
--
Assignee: Kai Huang
> Modify scheduler updater to not use "watch_secs" for health-check enabled jobs
>
[
https://issues.apache.org/jira/browse/AURORA-1222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1222:
--
Sprint: Twitter Aurora Q2'16 Sprint 20
> Modify stats and SLA metrics to properly account for STARTING
[
https://issues.apache.org/jira/browse/AURORA-1223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1223:
--
Story Points: 5
> Modify scheduler updater to not use "watch_secs" for health-check enabled jobs
>
[
https://issues.apache.org/jira/browse/AURORA-1222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1222:
--
Sprint: (was: Twitter Aurora Q2'16 Sprint 20)
> Modify stats and SLA metrics to properly account for
[
https://issues.apache.org/jira/browse/AURORA-1222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang reassigned AURORA-1222:
-
Assignee: Kai Huang
> Modify stats and SLA metrics to properly account for STARTING
> --
[
https://issues.apache.org/jira/browse/AURORA-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang reassigned AURORA-1224:
-
Assignee: Kai Huang
> Add a new "min_consecutive_health_checks" setting in .aurora config
>
[
https://issues.apache.org/jira/browse/AURORA-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang reassigned AURORA-1221:
-
Assignee: Kai Huang
> Modify task state machine to treat STARTING as a new active state
> --
[
https://issues.apache.org/jira/browse/AURORA-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang reassigned AURORA-1225:
-
Assignee: Kai Huang
> Modify executor state transition logic to rely on health checks (if enable
[
https://issues.apache.org/jira/browse/AURORA-1222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1222:
--
Sprint: Twitter Aurora Q2'16 Sprint 20
> Modify stats and SLA metrics to properly account for STARTING
[
https://issues.apache.org/jira/browse/AURORA-1223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang resolved AURORA-1223.
---
Resolution: Fixed
After discussion on Aurora dev list, it turns out there will be no
scheduler-side
[
https://issues.apache.org/jira/browse/AURORA-1223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15468067#comment-15468067
]
Kai Huang edited comment on AURORA-1223 at 9/6/16 6:19 PM:
---
Aft
[
https://issues.apache.org/jira/browse/AURORA-1223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang reopened AURORA-1223:
---
Found watch_secs constraint to relax at scheduler side.
> Modify scheduler updater to not use "watch_sec
[
https://issues.apache.org/jira/browse/AURORA-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15468426#comment-15468426
]
Kai Huang commented on AURORA-1221:
---
There are two side-effects after we add STARTING s
[
https://issues.apache.org/jira/browse/AURORA-1222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15468433#comment-15468433
]
Kai Huang commented on AURORA-1222:
---
After discussion with Maxim, I decided not to acco
[
https://issues.apache.org/jira/browse/AURORA-1222?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15468433#comment-15468433
]
Kai Huang edited comment on AURORA-1222 at 9/6/16 8:19 PM:
---
Aft
[
https://issues.apache.org/jira/browse/AURORA-1222?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang resolved AURORA-1222.
---
Resolution: Fixed
> Modify stats and SLA metrics to properly account for STARTING
> -
[
https://issues.apache.org/jira/browse/AURORA-1221?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15468426#comment-15468426
]
Kai Huang edited comment on AURORA-1221 at 9/6/16 8:27 PM:
---
The
[
https://issues.apache.org/jira/browse/AURORA-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15471115#comment-15471115
]
Kai Huang commented on AURORA-1225:
---
Currently, aurora_executor sends the task status u
[
https://issues.apache.org/jira/browse/AURORA-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15471115#comment-15471115
]
Kai Huang edited comment on AURORA-1225 at 9/7/16 4:52 PM:
---
Cur
[
https://issues.apache.org/jira/browse/AURORA-1223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang resolved AURORA-1223.
---
Resolution: Fixed
Modify the assertion of watch_secs in scheduler. So the scheduler can accept 0
for
[
https://issues.apache.org/jira/browse/AURORA-1223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15473026#comment-15473026
]
Kai Huang edited comment on AURORA-1223 at 9/8/16 7:03 AM:
---
Mod
[
https://issues.apache.org/jira/browse/AURORA-1223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1223:
--
Comment: was deleted
(was: Found watch_secs constraint to relax at scheduler side.)
> Modify scheduler
[
https://issues.apache.org/jira/browse/AURORA-1223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15473026#comment-15473026
]
Kai Huang edited comment on AURORA-1223 at 9/8/16 7:03 AM:
---
Mod
[
https://issues.apache.org/jira/browse/AURORA-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15471115#comment-15471115
]
Kai Huang edited comment on AURORA-1225 at 9/8/16 7:52 PM:
---
Cur
[
https://issues.apache.org/jira/browse/AURORA-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1225:
--
Sprint: Twitter Aurora Q2'16 Sprint 21
> Modify executor state transition logic to rely on health check
Kai Huang created AURORA-1775:
-
Summary: Updating pants to 1.1.0-rc7 breaks the
make-pycharm-virtualenv script
Key: AURORA-1775
URL: https://issues.apache.org/jira/browse/AURORA-1775
Project: Aurora
[
https://issues.apache.org/jira/browse/AURORA-1775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1775:
--
Description:
When I run build-support/python/make-pycharm-virtualenv script locally,
exceptions were t
[
https://issues.apache.org/jira/browse/AURORA-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15507123#comment-15507123
]
Kai Huang commented on AURORA-1224:
---
I would propose making the following changes to th
[
https://issues.apache.org/jira/browse/AURORA-216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-216:
-
Assignee: Kai Huang
> allow aurora executor to be customized via the commandline
> ---
[
https://issues.apache.org/jira/browse/AURORA-1426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1426:
--
Assignee: Kai Huang
> thermos kill hangs when killing a Docker-containerized task
> ---
[
https://issues.apache.org/jira/browse/AURORA-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang resolved AURORA-1225.
---
Resolution: Fixed
> Modify executor state transition logic to rely on health checks (if enabled)
> --
[
https://issues.apache.org/jira/browse/AURORA-1224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang resolved AURORA-1224.
---
Resolution: Fixed
> Add a new "min_consecutive_health_checks" setting in .aurora config
> ---
[
https://issues.apache.org/jira/browse/AURORA-1791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15566711#comment-15566711
]
Kai Huang commented on AURORA-1791:
---
Thanks for pointing it out. The current implementa
[
https://issues.apache.org/jira/browse/AURORA-1791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15566721#comment-15566721
]
Kai Huang commented on AURORA-1791:
---
In your case, it's likely your task becomes health
[
https://issues.apache.org/jira/browse/AURORA-1791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15567768#comment-15567768
]
Kai Huang commented on AURORA-1791:
---
To sum up, the issue is caused by failed to reach
[
https://issues.apache.org/jira/browse/AURORA-1791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15567803#comment-15567803
]
Kai Huang commented on AURORA-1791:
---
An issue to implement (b) is that the health check
[
https://issues.apache.org/jira/browse/AURORA-1791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15567768#comment-15567768
]
Kai Huang edited comment on AURORA-1791 at 10/12/16 6:43 AM:
-
[
https://issues.apache.org/jira/browse/AURORA-1791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15569183#comment-15569183
]
Kai Huang commented on AURORA-1791:
---
Thanks for the examples, [~davmclau] It seems the
[
https://issues.apache.org/jira/browse/AURORA-1791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15569600#comment-15569600
]
Kai Huang commented on AURORA-1791:
---
We've decided to revert the commit.
The changes
Kai Huang created AURORA-1793:
-
Summary: Revert
Key: AURORA-1793
URL: https://issues.apache.org/jira/browse/AURORA-1793
Project: Aurora
Issue Type: Bug
Reporter: Kai Huang
[
https://issues.apache.org/jira/browse/AURORA-1793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1793:
--
Summary: Revert Commit ca683 which is not backwards compatible (was:
Revert )
> Revert Commit ca683 w
[
https://issues.apache.org/jira/browse/AURORA-1793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1793:
--
Description:
The commit ca683cb9e27bae76424a687bc6c3af5a73c501b9 is not backwards
compatible. We decid
[
https://issues.apache.org/jira/browse/AURORA-1793?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1793:
--
Assignee: Kai Huang
> Revert Commit ca683 which is not backwards compatible
> -
[
https://issues.apache.org/jira/browse/AURORA-1791?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15569618#comment-15569618
]
Kai Huang commented on AURORA-1791:
---
The ticket to track is: https://issues.apache.org
[
https://issues.apache.org/jira/browse/AURORA-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1225:
--
Assignee: (was: Kai Huang)
> Modify executor state transition logic to rely on health checks (if en
[
https://issues.apache.org/jira/browse/AURORA-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15582921#comment-15582921
]
Kai Huang commented on AURORA-1225:
---
This ticket is almost done. We need to add some mo
[
https://issues.apache.org/jira/browse/AURORA-1225?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15583543#comment-15583543
]
Kai Huang commented on AURORA-1225:
---
Some one at Aurora Team@Twitter will continue work
Kai Huang created AURORA-1879:
-
Summary: /pendingTasks endpoint shows 500 HTTP Error when there
are multiple pending tasks with the same key
Key: AURORA-1879
URL: https://issues.apache.org/jira/browse/AURORA-1879
[
https://issues.apache.org/jira/browse/AURORA-1879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1879:
--
Component/s: Scheduler
> /pendingTasks endpoint shows 500 HTTP Error when there are multiple pending
>
[
https://issues.apache.org/jira/browse/AURORA-1879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1879:
--
Attachment: pending_tasks.png
> /pendingTasks endpoint shows 500 HTTP Error when there are multiple pen
[
https://issues.apache.org/jira/browse/AURORA-1837?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1837:
--
Summary: Improve implicit task history pruning (was: Improve task history
pruning)
> Improve implicit
Kai Huang created AURORA-1929:
-
Summary: Improve explicit task history pruning.
Key: AURORA-1929
URL: https://issues.apache.org/jira/browse/AURORA-1929
Project: Aurora
Issue Type: Task
[
https://issues.apache.org/jira/browse/AURORA-1929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1929:
--
Description:
There are currently two types of task history pruning running by aurora:
# 1) The implicit
[
https://issues.apache.org/jira/browse/AURORA-1929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1929:
--
Description:
There are currently two types of task history pruning running by aurora:
# The implicit ta
[
https://issues.apache.org/jira/browse/AURORA-1929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1929:
--
Description:
There are currently two types of task history pruning running by aurora:
# The implicit ta
[
https://issues.apache.org/jira/browse/AURORA-1929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1929:
--
Component/s: Scheduler
> Improve explicit task history pruning.
> -
[
https://issues.apache.org/jira/browse/AURORA-1929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1929:
--
Description:
There are currently two types of task history pruning running by aurora:
# The implicit ta
[
https://issues.apache.org/jira/browse/AURORA-1929?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16035170#comment-16035170
]
Kai Huang commented on AURORA-1929:
---
https://reviews.apache.org/r/59699/
> Improve exp
Kai Huang created AURORA-1934:
-
Summary: Add a whitelist for TaskStateChange events in Aurora
Scheduler WebHooks
Key: AURORA-1934
URL: https://issues.apache.org/jira/browse/AURORA-1934
Project: Aurora
[
https://issues.apache.org/jira/browse/AURORA-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1934:
--
Description: Aurora Scheduler has a webhook module that watches all
TaskStateChanges and send events to
[
https://issues.apache.org/jira/browse/AURORA-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1934:
--
Priority: Minor (was: Major)
> Add a whitelist for TaskStateChange events in Aurora Scheduler WebHooks
[
https://issues.apache.org/jira/browse/AURORA-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16048550#comment-16048550
]
Kai Huang commented on AURORA-1934:
---
https://reviews.apache.org/r/59940/
> Add a white
Kai Huang created AURORA-1937:
-
Summary: Add metrics for status updates after switching to V1
Mesos Driver implementaion
Key: AURORA-1937
URL: https://issues.apache.org/jira/browse/AURORA-1937
Project: Au
[
https://issues.apache.org/jira/browse/AURORA-1937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1937:
--
Summary: Add metrics for status updates before switching to V1 Mesos Driver
implementaion (was: Add me
[
https://issues.apache.org/jira/browse/AURORA-1937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang reassigned AURORA-1937:
-
Assignee: Kai Huang
> Add metrics for status updates before switching to V1 Mesos Driver
> impl
[
https://issues.apache.org/jira/browse/AURORA-1937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16063422#comment-16063422
]
Kai Huang edited comment on AURORA-1937 at 6/26/17 5:38 PM:
A
[
https://issues.apache.org/jira/browse/AURORA-1937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16063483#comment-16063483
]
Kai Huang commented on AURORA-1937:
---
Add timing metrics for status_update: https://revi
[
https://issues.apache.org/jira/browse/AURORA-1940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang reassigned AURORA-1940:
-
Assignee: Kai Huang
> aurora job restart request should be retryable
> -
Kai Huang created AURORA-1940:
-
Summary: aurora job restart request should be retryable
Key: AURORA-1940
URL: https://issues.apache.org/jira/browse/AURORA-1940
Project: Aurora
Issue Type: Task
[
https://issues.apache.org/jira/browse/AURORA-1940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1940:
--
Description:
There was a recent change to the Aurora client to provide "at most once"
instead of "at l
[
https://issues.apache.org/jira/browse/AURORA-1940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang updated AURORA-1940:
--
Description:
There was a recent change to the Aurora client to provide "at most once"
instead of "at l
[
https://issues.apache.org/jira/browse/AURORA-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang resolved AURORA-1934.
---
Resolution: Fixed
> Add a whitelist for TaskStateChange events in Aurora Scheduler WebHooks
> ---
[
https://issues.apache.org/jira/browse/AURORA-1934?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang reopened AURORA-1934:
---
> Add a whitelist for TaskStateChange events in Aurora Scheduler WebHooks
> -
[
https://issues.apache.org/jira/browse/AURORA-1426?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang reassigned AURORA-1426:
-
Assignee: (was: Kai Huang)
> thermos kill hangs when killing a Docker-containerized task
> -
[
https://issues.apache.org/jira/browse/AURORA-216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Kai Huang reassigned AURORA-216:
Assignee: (was: Kai Huang)
> allow aurora executor to be customized via the commandline
> -
75 matches
Mail list logo