Hi Miguel Angel, comments/answers inline :)

On 04/08/2016 09:17 AM, Miguel Angel Ajo Pelayo wrote:
Hi!,

    In the context of [1] (generic resource pools / scheduling in nova)
and [2] (minimum bandwidth guarantees -egress- in neutron), I had a talk
a few weeks ago with Jay Pipes.

    The idea was leveraging the generic resource pools and scheduling
mechanisms defined in [1] to find the right hosts and track the total
available bandwidth per host (and per host "physical network");
something in neutron (still to be defined where) would notify the new
API about the total amount of "NIC_BW_KB" available on every host/physnet.

Yes, what we discussed was making it per host initially, meaning the host would advertise a single aggregate bandwidth amount covering all the NICs it uses for the data plane.

The other way to track this resource class (NIC_BW_KB) would be to make the NICs themselves be resource providers and then the scheduler could pick a specific NIC to bind the port to based on available NIC_BW_KB on a particular NIC.

The former method makes things conceptually easier at the expense of introducing greater potential for retrying placement decisions (since the specific NIC to bind a port to wouldn't be known until the claim is made on the compute host). The latter method adds complexity to the filtering and scheduling logic in order to make more accurate placement decisions that would result in fewer retries.
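
To make the two options a bit more concrete, here is a minimal sketch of the inventory each approach would advertise. The provider names and dict shapes here are illustrative assumptions on my part, not the actual resource-providers schema:

# Option 1: the compute host is the resource provider and advertises a
# single aggregate amount of NIC_BW_KB for all of its data-plane NICs.
per_host_inventory = {
    'resource_provider': 'compute-node-1',
    'inventories': {
        'NIC_BW_KB': {'total': 20000000},  # sum over all data-plane NICs
    },
}

# Option 2: each NIC is its own resource provider, so the scheduler can
# pick the specific NIC to bind the port to.
per_nic_inventory = [
    {'resource_provider': 'compute-node-1:eth2',
     'inventories': {'NIC_BW_KB': {'total': 10000000}}},
    {'resource_provider': 'compute-node-1:eth3',
     'inventories': {'NIC_BW_KB': {'total': 10000000}}},
]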

    That part is quite clear to me,

    From [1] I'm not sure which blueprint introduces the ability to
schedule based on the resource allocation/availability itself
("resource-providers-scheduler" seems more like an optimization of the
scheduler/DB interaction, right?)

Yes, you are correct about the above blueprint; it's only for moving the Python-side filters to be a DB query.

The resource-providers-allocations blueprint:

https://review.openstack.org/300177

Is the one where we convert the various consumed resource amount fields to live in the single allocations table that may be queried for usage information.
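
As a rough sketch of what that buys us, assuming an allocations table keyed by resource provider, resource class and consumer (the exact column names here are my assumption rather than the final schema), usage rolls up with a single aggregation instead of per-resource fields:

from collections import defaultdict

# Illustrative rows: (resource_provider, resource_class, consumer, used)
allocations = [
    ('compute-node-1', 'VCPU',      'instance-A', 4),
    ('compute-node-1', 'MEMORY_MB', 'instance-A', 8192),
    ('compute-node-1', 'NIC_BW_KB', 'port-X',     2048),
    ('compute-node-1', 'NIC_BW_KB', 'port-Y',     4096),
]

usage = defaultdict(int)
for provider, resource_class, _consumer, used in allocations:
    usage[(provider, resource_class)] += used

# usage[('compute-node-1', 'NIC_BW_KB')] == 6144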

We aim to use the ComputeNode object as a facade that hides the migration of these data fields as much as possible so that the scheduler does not actually need to know that the schema has changed underneath it. Of course, this only works for *existing* resource classes, like vCPU, RAM, etc. It won't work for *new* resource classes like the discussed NIC_BW_KB because, clearly, we don't have an existing field in the instance_extra or other tables that contains that usage amount, and therefore we can't use the ComputeNode object as a facade over a non-existent piece of data.

Eventually, the intent is to change the ComputeNode object to return a new AllocationList object that would contain all of the compute node's resources in a tabular format (mimicking the underlying allocations table):

https://review.openstack.org/#/c/282442/20/nova/objects/resource_provider.py

Once this is done, the scheduler can be fitted to query this AllocationList object to make resource usage and placement decisions in the Python-side filters.
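
A hypothetical sketch of what a Python-side filter could then do with such an AllocationList view (the attribute names 'allocations', 'resource_class' and 'used' are assumptions here, not the real nova.objects API):

def has_enough_nic_bw(compute_node, requested_kb, total_nic_bw_kb):
    # Sum what is already allocated for NIC_BW_KB on this node and check
    # whether the remaining headroom covers the request.
    used_kb = sum(alloc.used
                  for alloc in compute_node.allocations
                  if alloc.resource_class == 'NIC_BW_KB')
    return (total_nic_bw_kb - used_kb) >= requested_kb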

We are still debating on the resource-providers-scheduler-db-filters blueprint:

https://review.openstack.org/#/c/300178/

Whether to change the existing FilterScheduler or create a brand new scheduler driver. I could go either way, frankly. If we made a brand new scheduler driver, it would do a query against the compute_nodes table in the DB directly. The legacy FilterScheduler would manipulate the AllocationList object returned by the ComputeNode.allocations attribute. Either way we get to where we want to go: representing all quantitative resources in a standardized and consistent fashion.
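
For the brand-new-driver option, the direct query could look conceptually like the following; the table and column names are purely illustrative assumptions, not the real schema:

# Illustrative only: find hosts with enough free NIC_BW_KB by comparing
# advertised inventory against summed allocations.
FIND_HOSTS_WITH_BW_SQL = """
SELECT cn.hypervisor_hostname
  FROM compute_nodes cn
  JOIN inventories inv
    ON inv.resource_provider_id = cn.id
   AND inv.resource_class = 'NIC_BW_KB'
  LEFT JOIN (SELECT resource_provider_id, SUM(used) AS used
               FROM allocations
              WHERE resource_class = 'NIC_BW_KB'
              GROUP BY resource_provider_id) alloc
    ON alloc.resource_provider_id = cn.id
 WHERE inv.total - COALESCE(alloc.used, 0) >= :requested_kb
"""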

     And that brings me to another point: at the moment of filtering
hosts, nova, I guess, will have the neutron port information; it has to
somehow identify whether the port is tied to a minimum bandwidth QoS policy.

Yes, Nova's conductor gathers information about the requested networks *before* asking the scheduler where to place the instance:

https://github.com/openstack/nova/blob/stable/mitaka/nova/conductor/manager.py#L362

     That would require identifying that the port has a "qos_policy_id"
attached to it, then asking neutron for the specific QoS policy [3],
then looking for a minimum bandwidth rule (still to be defined),
and extracting the required bandwidth from it.

Yep, exactly correct.
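
Something along these lines, as a rough sketch only, since the minimum bandwidth rule type and its fields were still to be defined; the client method names and the 'minimum_bandwidth'/'min_kbps' keys are assumptions:

def required_bw_kb_for_port(neutron, port_id):
    # Look up the port and see whether a QoS policy is attached.
    port = neutron.show_port(port_id)['port']
    policy_id = port.get('qos_policy_id')
    if policy_id is None:
        return 0  # no QoS policy, no bandwidth requirement

    # Fetch the policy [3] and look for a minimum bandwidth rule.
    policy = neutron.show_qos_policy(policy_id)['policy']
    for rule in policy.get('rules', []):
        if rule.get('type') == 'minimum_bandwidth':
            return rule.get('min_kbps', 0)
    return 0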

    That moves, again, some of the responsibility for examining and
understanding external resources to nova.

Yep, it does. The alternative is more retries for placement decisions because accurate decisions cannot be made until the compute node is already selected and the claim happens on the compute node.

     Could it make sense to make that part pluggable via stevedore? So
we would provide something that takes the "resource id" (for a port in
this case) and returns the requirements translated to resource classes
(NIC_BW_KB in this case).

Not sure Stevedore makes sense in this context. Really, we want *less* extensibility and *more* consistency. So I would rather envision a system where, when Nova has received a port or network ID in the boot request, it would call Neutron before scheduling and ask whether the port or network has any resource constraints on it. Neutron would return a standardized response containing each resource class and the amount requested in a dictionary (or better yet, an os_vif.objects.* object, serialized). Something like:

{
  'resources': {
    '<UUID of port or network>': {
      'NIC_BW_KB': 2048,
      'IPV4_ADDRESS': 1
    }
  }
}

In the case of the NIC_BW_KB resource class, Nova's scheduler would look for compute nodes that had a NIC with that amount of bandwidth still available. In the case of the IPV4_ADDRESS resource class, Nova's scheduler would use the generic-resource-pools interface to find a resource pool of IPV4_ADDRESS resources (i.e. a Neutron routed network or subnet allocation pool) that has available IP space for the request.
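
As a hypothetical sketch of the Nova side, folding such a response into a single set of requested amounts for the scheduler could be as simple as the following (the response shape simply mirrors the example above; none of this is an agreed API):

def merge_port_resource_requests(neutron_response):
    # Sum the requested amounts per resource class across all ports and
    # networks in the boot request.
    requested = {}
    for _port_or_net_id, amounts in neutron_response['resources'].items():
        for resource_class, amount in amounts.items():
            requested[resource_class] = requested.get(resource_class, 0) + amount
    return requested

# For the example above this yields {'NIC_BW_KB': 2048, 'IPV4_ADDRESS': 1}.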

Best,
-jay

Best regards,
Miguel Ángel Ajo


[1]
http://lists.openstack.org/pipermail/openstack-dev/2016-February/086371.html
[2] https://bugs.launchpad.net/neutron/+bug/1560963
[3] http://developer.openstack.org/api-ref-networking-v2-ext.html#showPolicy
