(I unfortunately need to quote a second time since Chris cut half of the message below)

On 06/03/2015 03:48, Jay Pipes wrote:
On 06/02/2015 07:25 PM, Chris Friesen wrote:
On 06/02/2015 02:36 PM, Andrew Laski wrote:
There used to be a project that I think was looking for an API like this to provide a reservation system, Climate or Blazar or something. There was brief talk of providing something like it for that use case, but the idea was put on the backburner to wait for the scheduling rework that's occurring.
The question in my mind is whether the claim requests should be in the Nova API or come from a scheduler API, and I tend to think that they should come from a scheduler API.

Who owns the resources, nova or the scheduler?

In many cases only nova-compute can resolve races (resource tracking of specific CPU cores, specific PCI devices, etc. in the face of parallel scheduling), so unless we're going to guarantee no races, I think claim requests should be a Nova API call, and it should go all the way down to nova-compute to make sure that the resources are actually claimed.
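
To make that race concrete, here is a minimal sketch (every name in it is invented for illustration; this is not actual Nova code). Two scheduler workers each hold a stale snapshot of the same host, both pick the same "free" PCI device, and only a serialized check at the compute node catches the conflict:

# Hypothetical cached views of one compute host, as seen by two
# scheduler workers that each hold a stale copy of the free list.
host_view_a = {"free_pci": ["0000:81:00.0"]}
host_view_b = {"free_pci": ["0000:81:00.0"]}  # same stale snapshot

def pick_pci_device(view):
    # Each scheduler believes the device is free and selects it.
    return view["free_pci"][0]

choice_a = pick_pci_device(host_view_a)
choice_b = pick_pci_device(host_view_b)
assert choice_a == choice_b  # both schedulers picked 0000:81:00.0

# Only the compute node, which serializes claims against the real
# inventory, can reject the second request.
claimed = set()

def compute_claim(device):
    if device in claimed:
        return False  # race detected -> reschedule
    claimed.add(device)
    return True

print(compute_claim(choice_a))  # True
print(compute_claim(choice_b))  # False, so the request must retry

The point is that a scheduler's cached view can always be stale, so whatever serializes against the real inventory has to have the last word.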

That's actually how the system works today. And, IMHO, it's inefficient. The nova-compute node should be the final arbiter of whether a request for resources can be properly fulfilled by the hypervisor; however, the scheduler should be the thing that "owns" resource usage records for the partition of resource providers that the scheduler process is responsible for.

I think the claim IDs should be returned from the scheduler API instead of created within the nova-compute manager.
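
As a hedged sketch of that idea, with entirely hypothetical names (SchedulerAPI, claim(), the claims dict; none of this is the real Nova or scheduler API): the scheduler decrements its own usage records, mints a claim ID, and hands it down, so nova-compute only has to verify the claim against the hypervisor rather than re-create it:

import uuid

class SchedulerAPI:
    """Hypothetical scheduler that owns the usage records for its
    partition of resource providers."""

    def __init__(self, usage):
        self.usage = usage      # e.g. {"host1": {"vcpus": 8, ...}}
        self.claims = {}        # claim_id -> (host, resources)

    def claim(self, host, resources):
        free = self.usage[host]
        if any(free[r] < amount for r, amount in resources.items()):
            raise ValueError("insufficient resources")
        for r, amount in resources.items():
            free[r] -= amount   # usage updated here, not on the compute
        claim_id = str(uuid.uuid4())
        self.claims[claim_id] = (host, resources)
        return claim_id         # passed down with the boot request

scheduler = SchedulerAPI({"host1": {"vcpus": 8, "ram_mb": 16384}})
claim_id = scheduler.claim("host1", {"vcpus": 2, "ram_mb": 2048})
print(claim_id)  # nova-compute verifies this claim, then spawns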


IMHO, having claims done in the scheduler and claim IDs given to computes is a technical answer to an overall problem: since the scheduler doesn't commit a transaction when it does the resource usage check (here, consume_from_instance()), we need to provide a consistent ID for saying "here, this was the ID of the transaction done when the scheduler consumed the resource usage for the request".

That's actually pretty close to what we discussed during the scaling-out scheduler session in Vancouver; the notes are here: https://etherpad.openstack.org/p/YVR-nova-scalling-out-scheduler-for-cells


The idea is that we would like to see a single atomic commit when doing the reservation (the decrement of resource usage), so that distributed schedulers could compete concurrently by being optimistic and retrying when that's not possible (when the commit fails). That's what Google tried to do with Omega, and IMHO it's a good direction. We now have an action item to draft a proposal based on our talks; we have to follow up on that.
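
A minimal sketch of that optimistic scheme, assuming a generation counter on the usage record (the field names and compare_and_swap() helper are invented; in a database this would be something like UPDATE ... WHERE generation = :expected): the decrement commits only if nobody else changed the record since it was read, and a losing scheduler simply retries:

import threading

# Hypothetical shared usage record guarded by a generation counter.
record = {"vcpus_free": 8, "generation": 0}
_commit_lock = threading.Lock()  # stands in for the DB's atomicity

def compare_and_swap(expected_generation, new_vcpus_free):
    """Atomic commit: succeeds only if the record is unchanged."""
    with _commit_lock:
        if record["generation"] != expected_generation:
            return False  # someone else committed first
        record["vcpus_free"] = new_vcpus_free
        record["generation"] += 1
        return True

def consume(vcpus, retries=5):
    # Optimistic loop: read, check, try to commit, retry on conflict.
    for _ in range(retries):
        snapshot = dict(record)
        if snapshot["vcpus_free"] < vcpus:
            raise ValueError("not enough vCPUs")
        if compare_and_swap(snapshot["generation"],
                            snapshot["vcpus_free"] - vcpus):
            return True
    return False  # give up after repeated conflicts

consume(2)
print(record)  # {'vcpus_free': 6, 'generation': 1}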

-Sylvain

Best,
-jay
