Re: [openstack-dev] [nova] blueprint about multiple workers supported in nova-scheduler

Jay Pipes Wed, 04 Mar 2015 12:25:56 -0800

On 03/04/2015 01:51 AM, Attila Fazekas wrote:

Hi,


I wonder what the planned future of scheduling is.

The scheduler runs a lot of queries with a high number of fields,
which is CPU expensive when you are using sqlalchemy-orm.
Has anyone tried to switch those operations to sqlalchemy-core?

Actually, the scheduler does virtually no SQLAlchemy ORM queries. Almost all database access is serialized from the nova-scheduler through the nova-conductor service via the nova.objects remoting framework.

The scheduler does a lot of things in the application, like filtering,
that could be done more efficiently at the DB level. Why is it not done
on the DB side?

That's a pretty big generalization. Many filters (check out NUMA configuration, host aggregate extra_specs matching, any of the JSON filters, etc) don't lend themselves to SQL column-based sorting and filtering.
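
As a rough illustration of that point (this is not Nova code; the host records, field names, and filter functions below are all hypothetical), compare a filter on a plain numeric column with one that has to parse free-form JSON whose keys differ from host to host. The first maps directly onto a WHERE clause; the second is much more awkward to express as a column comparison:

```python
import json

# Hypothetical host records. 'extra_specs' is free-form JSON stored as text,
# so the set of keys is not the same for every host.
HOSTS = [
    {"name": "node1", "free_ram_mb": 4096,
     "extra_specs": '{"ssd": "true", "numa_nodes": 2}'},
    {"name": "node2", "free_ram_mb": 8192,
     "extra_specs": '{"numa_nodes": 1}'},
    {"name": "node3", "free_ram_mb": 2048,
     "extra_specs": '{"ssd": "true"}'},
]


def ram_filter(host, request):
    # Structured column: equivalent to "WHERE free_ram_mb >= :ram_mb".
    return host["free_ram_mb"] >= request["ram_mb"]


def extra_specs_filter(host, request):
    # Unstructured data: the JSON must be parsed per host, and a requested
    # key may simply be absent.
    specs = json.loads(host["extra_specs"])
    return all(
        key in specs and str(specs[key]) == str(wanted)
        for key, wanted in request["extra_specs"].items()
    )


def filter_hosts(hosts, request, filters):
    # A host passes only if every filter accepts it.
    return [h["name"] for h in hosts if all(f(h, request) for f in filters)]


request = {"ram_mb": 2048, "extra_specs": {"ssd": "true"}}
selected = filter_hosts(HOSTS, request, [ram_filter, extra_specs_filter])
print(selected)
```

Only ram_filter has an obvious SQL equivalent; extra_specs_filter depends on data whose shape is not known up front.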

There are use cases where the scheduler would need to know even more data.
Is there a plan for keeping `everything` in every scheduler process's memory
up-to-date?
(Maybe zookeeper)

Zookeeper has nothing to do with scheduling decisions -- only whether or not a compute node's "service descriptor" is active or not. The end goal (after splitting the Nova scheduler out into Gantt hopefully at the start of the L release cycle) is to have the Gantt database be more optimized to contain the resource usage amounts of all resources consumed in the entire cloud, and to use partitioning/sharding to scale the scheduler subsystem, instead of having each scheduler process handle requests for all resources in the cloud (or cell...)
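
A minimal sketch of the partitioning idea, assuming compute hosts are assigned to scheduler processes by a stable hash of the host name. That assignment scheme is an assumption made purely for illustration; nothing here reflects how Gantt is or will be implemented, and the names partition_for and hosts_for_scheduler are made up:

```python
import hashlib


def partition_for(host_name, num_partitions):
    # Use a stable hash so every process computes the same partition for a
    # host (Python's built-in hash() is randomized per process for strings).
    digest = hashlib.sha256(host_name.encode("utf-8")).hexdigest()
    return int(digest, 16) % num_partitions


def hosts_for_scheduler(host_names, scheduler_index, num_partitions):
    # Each scheduler process only considers the hosts in its own partition.
    return [
        name for name in host_names
        if partition_for(name, num_partitions) == scheduler_index
    ]


hosts = ["compute-%d" % i for i in range(10)]
num_schedulers = 3
shards = [
    hosts_for_scheduler(hosts, index, num_schedulers)
    for index in range(num_schedulers)
]

for index, shard in enumerate(shards):
    print("scheduler %d handles %d hosts" % (index, len(shard)))
```

With this kind of split, each scheduler process works on a disjoint subset of hosts rather than on every resource in the cloud or cell.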

The opposite way would be to move most operations to the DB side,
since the DB already knows everything.
(stored procedures?)

See above. This assumes that the data the scheduler is iterating over is well-structured and consistent, and that is a false assumption.


Best,
-jay

Best Regards,
Attila


----- Original Message -----

From: "Rui Chen" <[email protected]>
To: "OpenStack Development Mailing List (not for usage questions)" 
<[email protected]>
Sent: Wednesday, March 4, 2015 4:51:07 AM
Subject: [openstack-dev] [nova] blueprint about multiple workers supported in nova-scheduler

Hi all,

I want to make it easy to launch a bunch of scheduler processes on a host.
Multiple scheduler workers will make use of the host's multiple processors
and enhance the performance of nova-scheduler.

I have registered a blueprint and committed a patch to implement it.
https://blueprints.launchpad.net/nova/+spec/scheduler-multiple-workers-support

This patch has been applied in our performance environment and passes some
test cases, such as concurrently booting multiple instances. So far we have
not found any inconsistency issues.

IMO, nova-scheduler should be easy to scale horizontally, and multiple
workers should be supported as an out-of-the-box feature.

Please feel free to discuss this feature, thanks.

Best Regards

__________________________________________________________________________
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: [email protected]?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev


