[openstack-dev] [nova] [cyborg] Race condition in the Cyborg/Nova flow

Nadathur, Sundar Thu, 22 Mar 2018 21:29:30 -0700

Hi all,

There seems to be a possibility of a race condition in the Cyborg/Nova flow. Apologies for missing this earlier. (You can refer to the proposed Cyborg/Nova spec <https://review.openstack.org/#/c/554717/1/doc/specs/rocky/cyborg-nova-sched.rst> for details.)

Consider the scenario where the flavor specifies a resource class for a device type, and also specifies a function (e.g. encrypt) in the extra specs. The Nova scheduler would only track the device type as a resource, and Cyborg needs to track the availability of functions. Further, to keep it simple, say all the functions exist all the time (no reprogramming involved).


To recap, here is the scheduler flow for this case:

 * A request spec with a flavor comes to Nova conductor/scheduler. The
   flavor has a device type as a resource class, and a function in the
   extra specs.
 * Placement API returns the list of RPs (compute nodes) which contain
   the requested device types (but not necessarily the function).
 * Cyborg will provide a custom filter which queries Cyborg DB. This
   needs to check which hosts contain the needed function, and filter
   out the rest.
 * The scheduler selects one node from the filtered list, and the
   request goes to the compute node.
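
The flow above can be sketched roughly as follows. This is only an illustration, not Nova or Cyborg code: the dictionaries stand in for Placement inventory and the Cyborg DB, and every name here (RequestSpec, CUSTOM_FPGA, placement_candidates, cyborg_filter, schedule) is hypothetical.

```python
from dataclasses import dataclass


@dataclass
class RequestSpec:
    resource_class: str  # device type, tracked as a resource by Placement
    function: str        # function named in the flavor extra specs


# Hypothetical stand-in for Placement: host -> device types it provides.
PLACEMENT_INVENTORY = {
    "host1": {"CUSTOM_FPGA"},
    "host2": {"CUSTOM_FPGA"},
    "host3": {"CUSTOM_GPU"},
}

# Hypothetical stand-in for the Cyborg DB: host -> functions it offers.
CYBORG_FUNCTIONS = {
    "host1": {"encrypt"},
    "host2": {"compress"},
}


def placement_candidates(spec):
    """Hosts that have the requested device type (function not checked)."""
    return sorted(
        host for host, classes in PLACEMENT_INVENTORY.items()
        if spec.resource_class in classes
    )


def cyborg_filter(hosts, spec):
    """Keep only the hosts that also offer the requested function."""
    return [h for h in hosts if spec.function in CYBORG_FUNCTIONS.get(h, set())]


def schedule(spec):
    """Pick the first host that survives both steps, or None."""
    filtered = cyborg_filter(placement_candidates(spec), spec)
    return filtered[0] if filtered else None
```

With this toy data, a request for CUSTOM_FPGA plus encrypt gets two candidates from the Placement step (host1 and host2), and the filter narrows them to host1.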

For the filter to work, the Cyborg DB needs to maintain a table with triples of (host, function type, #free units). The filter checks if a given host has one or more free units of the requested function type. But, to keep the #free units up to date, Cyborg on the selected compute node needs to notify the Cyborg API to decrement the #free units when an instance is spawned, and to increment them when resources are released.
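
To make the bookkeeping concrete, here is a minimal in-memory sketch of such a table. It assumes a plain dictionary in place of the real Cyborg DB, and the class and method names (FunctionInventory, has_free, on_spawn, on_release) are made up for illustration.

```python
class FunctionInventory:
    """Toy table of (host, function type) -> #free units."""

    def __init__(self):
        self._free = {}

    def set_free(self, host, function_type, units):
        self._free[(host, function_type)] = units

    def has_free(self, host, function_type):
        """What the filter would check: one or more free units on the host."""
        return self._free.get((host, function_type), 0) >= 1

    def on_spawn(self, host, function_type):
        """Notification from the compute node: an instance took one unit."""
        key = (host, function_type)
        if self._free.get(key, 0) < 1:
            raise RuntimeError("no free unit of %s on %s" % (function_type, host))
        self._free[key] -= 1

    def on_release(self, host, function_type):
        """Notification from the compute node: one unit was given back."""
        key = (host, function_type)
        self._free[key] = self._free.get(key, 0) + 1
```

Note that has_free and on_spawn are separate calls here, made at different times by different parties (the filter on the controller, then the compute node after the spawn).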

Therein lies the catch: this loop from the compute node to controller is susceptible to race conditions. For example, if two simultaneous requests each ask for function A, and there is only one unit of that available, the Cyborg filter will approve both, both may land on the same host, and one will fail. This is because Cyborg on the controller does not decrement resource usage due to one request before processing the next request.
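
The interleaving can be reproduced deterministically with a toy model. This is a sketch under assumed names (free_units, filter_passes, spawn), not real Cyborg code; it simply runs the filter for both requests before either spawn has updated the count.

```python
# One unit of function "A" on host1, as in the example above.
free_units = {("host1", "A"): 1}


def filter_passes(host, function_type):
    """Controller-side check: read the count, but do not change it."""
    return free_units.get((host, function_type), 0) >= 1


def spawn(host, function_type):
    """Compute-side step: take a unit if one is left, report success."""
    key = (host, function_type)
    if free_units.get(key, 0) < 1:
        return False
    free_units[key] -= 1
    return True


# Both requests are filtered before either spawn decrements the count.
approved = [filter_passes("host1", "A") for _ in ("req1", "req2")]

# Both then land on host1; only the first finds a free unit.
results = [spawn("host1", "A") for _ in ("req1", "req2")]

print(approved)  # [True, True]
print(results)   # [True, False]
```

The filter approves both requests because the check and the decrement are separated in time, which is the window described above.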

This is similar to this previous Nova scheduling issue <https://specs.openstack.org/openstack/nova-specs/specs/pike/implemented/placement-claims.html>. That was solved by having the scheduler claim a resource in Placement for the selected node. I don't see an analog for Cyborg, since it would not know which node is selected.
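
For reference, the general idea behind a claim is that the check and the decrement happen as one atomic step at selection time, so a second request sees the updated count. The sketch below illustrates only that idea with an in-process lock; it is not the Placement API, the names (ClaimTable, claim, schedule_with_claim) are hypothetical, and it does not answer where such a claim could live for Cyborg.

```python
import threading


class ClaimTable:
    """Toy table where checking and decrementing is a single atomic step."""

    def __init__(self, free_units):
        self._free = dict(free_units)
        self._lock = threading.Lock()

    def claim(self, host, function_type):
        """Take one unit on the host if available; return whether it worked."""
        key = (host, function_type)
        with self._lock:
            if self._free.get(key, 0) < 1:
                return False
            self._free[key] -= 1
            return True


def schedule_with_claim(table, hosts, function_type):
    """Try hosts in order and return the first one where the claim succeeds."""
    for host in hosts:
        if table.claim(host, function_type):
            return host
    return None
```

With one unit of A on each of host1 and host2, two back-to-back requests would get host1 and host2 respectively, and a third would get None instead of failing later on the compute node.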


Thanks in advance for suggestions and solutions.

Regards,
Sundar

