Re: [openstack-dev] [Nova] Migration state machine proposal.

Tang Chen Tue, 27 Oct 2015 01:34:06 -0700

Hi Jay, Timofei,

Thank you for the info.


On 10/27/2015 08:02 AM, Jay Pipes wrote:

On 10/22/2015 11:13 AM, Tang Chen wrote:

On 10/22/2015 05:17 AM, Joshua Harlow wrote:

Overall I'm very much inclined to have three state machines (one
for each type), vs the mix-mash of all three into one state machine
(which causes the confusion around states in the first diagram in
that paste).


That is an idea. But I would prefer to have one single state machine
for migration, because resize and evacuate are reusing migration.
They can be in one state machine.

Evacuate does *not* migrate/move anything. Evacuate *rebuilds* VMsfrom their original source image.

Well, I just dug into the source code. I think there could be somedifference between evacuate in nova server side and client side. In novacompute, the evacuate API does call rebuild process as you said. But innovaclient, there is a command "nova host-evacuate-live", which willlive-migrate all running VMs, which made me believe that evacuate alsomigrates VMs. Please refer to:


https://github.com/openstack/python-novaclient/blob/master/novaclient/v2/contrib/host_evacuate_live.py#L72

I think this is also a reason why I always got confused in all theseconcepts: cold-migrate, evacuate, evacuate-live, rebuild, resize.

About the migration type, I can see that Timofei has tried to splitlive-migration into 3 types:

1. block_live_migrate
2. live_migrate_file_level_storage
3. live_migrate_block_stroage

I think it is in driver level, not the user level. It is based on thetype of the storage the VM is using. And I think migration type shouldbe a multi-level thing.

Since I'm still a little confused with all the types of migration, I'dlike to share some of my understanding and if they are correct, I thinkwe can improve it like this.

1. OpenStack is now supporting resize a VM to another compute node. Ifwe set "allow_resize_to_same-host", it also supports local resize. If weare not using memory/CPU hotplug, resize will result in a shutdown andreconfiguration of VM.So, there should be 2 types of resize: live (using hotplug) and cold(often resizing the primary disk).

2. Evacuate also has 2 types: live (equals to live-migrate) and cold(rebuild). But evacuate itself does nothing, I mean there is no actualprocess called evacuate. evacuate() is just an API callingrebuild_instance().


This is from the user level.

So finally, the migration type would be like this:

      user compute                                    driver

  live-migrate
  live-evacuate                     live-migrate
  live-resize                  memory/CPU hotplug

  cold-migrate           storage type, etc
  clod-evacuate                   cold-migrate
  cold-resize                      (to self or not)

    rebuild                               rebuild
                                  (this is not a migration)

I mean maybe we should handle different things in different levels. Incompute, if the flow is too complex, we can define some more helperfunctions to make the main flow easier to understand.

I support Nikola in that I believe the different migration typesshould have different state machines entirely (but be as consistent aspossible in the naming of terminal states like "finished" vs "done" etc)

OK. Agreed. And maybe also introduce state machines for task_state andvm_state.

It would be very helpful if the designer of the migration process
could share his idea. But if it is just some code modified by many
people many times, I think we should remove the confusing states and
give a easier, better state machine.
There isn't a designer of the migration process :( The original (crap,IMHO) API from Rackspace Cloud Servers API was used for the resizefunctionality in the compute API and it's been a source of confusionand frustration ever since. Relying on a manual confirmation or revertinput from the user was and continues to be a horrible idea.


Agreed.

I believe strongly that we should deprecate the existing migrate,resize, an live-migrate APIs in favor of a single consolidated,consistent "move" REST API that would have the following characteristics:
* No manual or wait-input states in any FSM graph


Yes.

* Removal of the term "resize" from the API entirely (the targetresource sizing is an attribute of the move operation, not a differenttype of API operation in and of itself)


Maybe we can define it in a different level, as I said above. Not sure.

* Transition to a task-based API for poll-state requests. This meansthat in order for a caller to determine the state of a VM the callerwould call something like GET /servers/<UUID>/tasks/<UUID> in order tosee the history of state changes or subtask operations for aparticular request to move a VM


Yes.

Timofei Durakov (cc'd) has a blueprint for splitting thelive-migration types into separate task classes here:
https://review.openstack.org/#/c/225910/
I think there's a lot of good ideas in that proposal. Please do have alook at it.


Thanks very much.

__________________________________________________________________________
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: [email protected]?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

Re: [openstack-dev] [Nova] Migration state machine proposal.

Reply via email to