[ https://issues.apache.org/jira/browse/CLOUDSTACK-4604?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Murali Reddy updated CLOUDSTACK-4604:
-------------------------------------
Summary: add checkpoint based cancel/restart/pause async jobs to help
recovery from failures (was: add cancel/restart/pause async jobs to help
recovery from failures)
> add checkpoint based cancel/restart/pause async jobs to help recovery from
> failures
> -----------------------------------------------------------------------------------
>
> Key: CLOUDSTACK-4604
> URL: https://issues.apache.org/jira/browse/CLOUDSTACK-4604
> Project: CloudStack
> Issue Type: Improvement
> Security Level: Public(Anyone can view this level - this is the
> default.)
> Reporter: Murali Reddy
> Fix For: Future
>
>
> This improvement addresses two complementary pieces of functionality:
> - Checkpoint the async jobs. The current journal could perhaps be used to
> record every entity manipulation. For example, the async job for VM create
> could have journal entries like:
> Volume, create, ID
> Network, implement, ID
> Nic, prepare, ID
> and so on.
> - The async job management API currently only exposes the ability to query
> and list async jobs. There is no API support to cancel or restart a job. The
> async job manager can be enhanced to pause, cancel, and restart jobs.
> Job pause (pause at the nearest checkpoint), job restart (restart from the
> last checkpoint), and job cancel (roll back from the last checkpoint) can
> then be used to help recover from failures.
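
The proposal above could be sketched roughly as follows. This is only an
illustration under stated assumptions, not CloudStack code: the names
JournalEntry, CheckpointedJob, requestPause, runNextStep, run and cancel are
all hypothetical, the journal is kept in memory rather than persisted, and
each step's real work is replaced by a comment. It shows one way a journal
of (entity, operation, ID) entries could serve as the checkpoint list that
pause, restart and cancel operate on.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical sketch; none of these types are part of CloudStack.
class JournalEntry {
    final String entity;    // e.g. "Volume"
    final String operation; // e.g. "create"
    final long id;          // ID of the entity that was manipulated

    JournalEntry(String entity, String operation, long id) {
        this.entity = entity;
        this.operation = operation;
        this.id = id;
    }

    @Override
    public String toString() {
        return entity + ", " + operation + ", " + id;
    }
}

class CheckpointedJob {
    enum State { PENDING, RUNNING, PAUSED, COMPLETED, CANCELLED }

    private final List<JournalEntry> steps;                       // planned work, in order
    private final List<JournalEntry> journal = new ArrayList<>(); // one entry per checkpoint reached
    private final List<String> rollbackLog = new ArrayList<>();   // what cancel() undid, in order
    private State state = State.PENDING;
    private boolean pauseRequested = false;

    CheckpointedJob(List<JournalEntry> steps) {
        this.steps = new ArrayList<>(steps);
    }

    // Ask the job to stop at the nearest checkpoint.
    void requestPause() {
        pauseRequested = true;
    }

    // Execute exactly one step and record it in the journal (the checkpoint).
    void runNextStep() {
        if (state == State.COMPLETED || state == State.CANCELLED) {
            throw new IllegalStateException("job is " + state);
        }
        state = State.RUNNING;
        JournalEntry step = steps.get(journal.size());
        // The real entity manipulation would happen here.
        journal.add(step);
        if (journal.size() == steps.size()) {
            state = State.COMPLETED;
        }
    }

    // Run, or restart, from the last checkpoint until done or until a pause is requested.
    void run() {
        while (journal.size() < steps.size()) {
            if (pauseRequested && state != State.CANCELLED) {
                pauseRequested = false;
                state = State.PAUSED;
                return;
            }
            runNextStep();
        }
    }

    // Roll back from the last checkpoint by undoing journal entries in reverse order.
    void cancel() {
        for (int i = journal.size() - 1; i >= 0; i--) {
            rollbackLog.add("undo " + journal.get(i));
        }
        journal.clear();
        state = State.CANCELLED;
    }

    State state() { return state; }
    List<JournalEntry> journal() { return new ArrayList<>(journal); }
    List<String> rollbackLog() { return new ArrayList<>(rollbackLog); }
}

class CheckpointDemo {
    public static void main(String[] args) {
        CheckpointedJob job = new CheckpointedJob(Arrays.asList(
                new JournalEntry("Volume", "create", 1),
                new JournalEntry("Network", "implement", 2),
                new JournalEntry("Nic", "prepare", 3)));
        job.runNextStep();   // Volume step completes and is journaled
        job.requestPause();
        job.run();           // stops at the checkpoint after the Volume step
        System.out.println(job.state() + " " + job.journal());
        job.run();           // restart resumes from the last checkpoint
        System.out.println(job.state() + " " + job.journal());
    }
}
```

In this sketch, restart is simply a second call to run(), because the
journal size already identifies the last checkpoint. A real implementation
would presumably need a persistent journal and a per-operation undo action,
neither of which is specified in the issue.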