Re: [openstack-dev] [Heat] Convergence proof-of-concept showdown

Zane Bitter Wed, 03 Dec 2014 21:21:55 -0800

On 01/12/14 02:02, Anant Patil wrote:

On GitHub: https://github.com/anantpatil/heat-convergence-poc

I'm trying to review this code at the moment, and finding some stuff I don't understand:


https://github.com/anantpatil/heat-convergence-poc/blob/master/heat/engine/stack.py#L911-L916

This appears to loop through all of the resources *prior* to kicking off any actual updates to check if the resource will change. This is impossible to do in general, since a resource may obtain a property value from an attribute of another resource and there is no way to know whether an update to said other resource would cause a change in the attribute value.

In addition, no attempt to catch UpdateReplace is made. Although that looks like a simple fix, I'm now worried about the level to which this code has been tested.

I'm also trying to wrap my head around how resources are cleaned up in dependency order. If I understand correctly, you store in the ResourceGraph table the dependencies between various resource names in the current template (presumably there could also be some left around from previous templates too?). For each resource name there may be a number of rows in the Resource table, each with an incrementing version. As far as I can tell though, there's nowhere that the dependency graph for _previous_ templates is persisted? So if the dependency order changes in the template we have no way of knowing the correct order to clean up in any more? (There's not even a mechanism to associate a resource version with a particular template, which might be one avenue by which to recover the dependencies.)

I think this is an important case we need to be able to handle, so I added a scenario to my test framework to exercise it and discovered that my implementation was also buggy. Here's the fix: https://github.com/zaneb/heat-convergence-prototype/commit/786f367210ca0acf9eb22bea78fd9d51941b0e40

It was difficult, for me personally, to completely understand Zane's PoC
and how it would lay the foundation for aforementioned design goals. It
would be very helpful to have Zane's understanding here. I could
understand that there are ideas like async message passing and notifying
the parent which we also subscribe to.

So I guess the thing to note is that there are essentially two parts to my PoC:

1) A simulation framework that takes what will be in the final implementation multiple tasks running in parallel in separate processes and talking to a database, and replaces it with an event loop that runs the tasks sequentially in a single process with an in-memory data store. I could have built a more realistic simulator using Celery or something, but I preferred this way as it offers deterministic tests.

2) A toy implementation of Heat on top of this framework.

The files map roughly to Heat something like this:

converge.engine       -> heat.engine.service
converge.stack        -> heat.engine.stack
converge.resource     -> heat.engine.resource
converge.template     -> heat.engine.template
converge.dependencies -> actually is heat.engine.dependencies
converge.sync_point   -> no equivalent
converge.converger    -> no equivalent (this is convergence "worker")
converge.reality      -> represents the actual OpenStack services

For convenience, I just use the @asynchronous decorator to turn an ordinary method call into a simulated message.
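To make the simulation idea concrete, here is a minimal sketch of how a decorator can turn a method call into a queued message that a single-process event loop delivers later. This is an illustration only, not the code from the prototype: `EventLoop`, `Worker`, the module-level `loop`, and this particular `asynchronous` implementation are all assumptions.

```python
import collections
import functools


class EventLoop:
    """Delivers queued 'messages' one at a time, in FIFO order."""

    def __init__(self):
        self._queue = collections.deque()

    def post(self, func, *args, **kwargs):
        # Record the call instead of executing it immediately.
        self._queue.append((func, args, kwargs))

    def run_until_empty(self):
        # Sequential delivery keeps test runs deterministic.
        while self._queue:
            func, args, kwargs = self._queue.popleft()
            func(*args, **kwargs)


# Hypothetical shared loop for the simulation.
loop = EventLoop()


def asynchronous(method):
    """Turn a direct call into a message that the loop delivers later."""
    @functools.wraps(method)
    def send(*args, **kwargs):
        loop.post(method, *args, **kwargs)
    return send


class Worker:
    """Hypothetical stand-in for a convergence worker."""

    def __init__(self):
        self.log = []

    @asynchronous
    def check_resource(self, name):
        self.log.append(name)
```

Calling `Worker().check_resource('server')` returns immediately without doing any work; the body only runs once `loop.run_until_empty()` is called, which is what lets one process stand in for many.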


The concept is essentially as follows:

At the start of a stack update (creates and deletes are also just updates) we create any new resources in the DB and calculate the dependency graph for the update from the data in the DB and template. This graph is the same one used by updates in Heat currently, so it contains both the forward and reverse (cleanup) dependencies. The stack update then kicks off checks of all the leaf nodes, passing the pre-calculated dependency graph.
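As a rough sketch of what a graph with both forward and cleanup dependencies might look like, the example below models each node as a `(resource name, phase)` pair. The node shape, the `UPDATE`/`CLEANUP` constants, and the helpers `build_graph` and `leaves` are assumptions made for illustration; they are not the data structures used by Heat or by either prototype.

```python
UPDATE, CLEANUP = 'update', 'cleanup'


def build_graph(new_deps, old_deps):
    """Return {node: set of nodes that must be checked first}.

    new_deps and old_deps map every resource name in the new template
    and in the existing stack, respectively, to the names it depends on.
    """
    graph = {}
    # Forward edges: a resource is updated after its requirements.
    for name, reqs in new_deps.items():
        node = graph.setdefault((name, UPDATE), set())
        node.update((req, UPDATE) for req in reqs)
    for name, reqs in old_deps.items():
        node = graph.setdefault((name, CLEANUP), set())
        # Clean up an existing resource only after its update is done.
        if name in new_deps:
            node.add((name, UPDATE))
        # Reverse edges: a requirement is cleaned up after its dependents.
        for req in reqs:
            graph.setdefault((req, CLEANUP), set()).add((name, CLEANUP))
    return graph


def leaves(graph):
    """Nodes with no outstanding requirements, where the traversal starts."""
    return {node for node, reqs in graph.items() if not reqs}
```

With `server` depending on `net` in both templates and a `vol` that exists only in the old stack, the leaves are the update of `net` and the cleanup of `vol`: everything else has to wait for something.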

Each resource check may result in a call to the create(), update() or delete() methods of a Resource plugin. The resource also reads any attributes that will be required from it. Once this is complete, it triggers any dependent resources that are ready, or updates a SyncPoint in the database if there are dependent resources that have multiple requirements. The message triggering the next resource will contain the dependency graph again, as well as the RefIds and required attributes of any resources it depends on.
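The triggering rule can be sketched roughly as follows. This is an in-memory illustration under stated assumptions: the `SyncPoint` class, the `propagate` helper, and the `trigger` callback are hypothetical names, and a real implementation would need the sync point update to be atomic in the database.

```python
class SyncPoint:
    """Collects results until every requirement of a node has reported."""

    def __init__(self, requirements):
        self.waiting = set(requirements)
        self.inputs = {}

    def satisfy(self, requirement, data):
        """Record one finished requirement; return True once all are in."""
        self.waiting.discard(requirement)
        self.inputs[requirement] = data
        return not self.waiting


def propagate(done, data, dependents_of, requirements_of, sync_points, trigger):
    """Called when `done` has been checked; `data` holds its RefId and attributes."""
    for dep in dependents_of.get(done, ()):
        reqs = requirements_of[dep]
        if len(reqs) == 1:
            # Single requirement: the dependent is ready right away.
            trigger(dep, {done: data})
            continue
        # Multiple requirements: record progress in the sync point.
        if dep not in sync_points:
            sync_points[dep] = SyncPoint(reqs)
        sync_point = sync_points[dep]
        if sync_point.satisfy(done, data):
            trigger(dep, dict(sync_point.inputs))
```

In this sketch a resource with two requirements is triggered exactly once, by whichever requirement finishes last, and receives the collected data from both.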

The new dependencies thus created are added to the resource itself in the database at the time it is checked, allowing it to record the changes caused by a requirement being unexpectedly replaced without needing a global lock on anything.

When cleaning up resources, we also endeavour to remove any that are successfully deleted from the dependencies graph.

Each traversal has a unique ID that is both stored in the stack and passed down through the resource check triggers. (At present this is the template ID, but it may make more sense to have a unique ID since old template IDs can be resurrected in the case of a rollback.) As soon as these fail to match, the resource checks stop propagating, so only an update of a single field is required (rather than locking an entire table) before beginning a new stack update.
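A minimal sketch of the traversal ID check, assuming a fresh UUID per traversal (the alternative floated in the parenthetical above, not what the prototype does at present). `Stack`, `start_update`, and `check_resource` here are hypothetical stand-ins, not the prototype's functions.

```python
import uuid


class Stack:
    """Hypothetical stack record holding only the current traversal ID."""

    def __init__(self):
        self.current_traversal = None


def start_update(stack):
    # Updating this single field is what invalidates any older traversal.
    stack.current_traversal = str(uuid.uuid4())
    return stack.current_traversal


def check_resource(stack, traversal_id, name, checked):
    """Do the check only if the message belongs to the current traversal."""
    if traversal_id != stack.current_traversal:
        # Stale message from a superseded update: stop propagating.
        return False
    checked.append(name)
    return True
```

A check message carrying the ID of an earlier traversal is simply dropped once a newer update has started, so no lock on the old traversal's work is needed.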

Hopefully that helps a little. Please let me know if you have specific questions. I'm *very* happy to incorporate other ideas into it, since it's pretty quick to change, has tests to check for regressions, and is intended to be thrown away anyhow (so I genuinely don't care if some bits get thrown away earlier than others).

In retrospective, we had to struggle a lot to understand the existing
Heat engine. We couldn't have done justice by just creating another
project in GitHub and without any concrete understanding of existing
state-of-affairs.

I completely agree, and you guys did the right thing by starting out looking at Heat. But remember, the valuable thing isn't the code, it's what you learned. My concern is that now that you have Heat pretty well figured out, you won't be able to continue to learn nearly as fast trying to wrestle with the Heat codebase as you could with the simulator. We don't want to fall into the trap of just shipping whatever we have because it's too hard to explore the other options, we want to identify a promising design and iterate it as quickly as possible.


cheers,
Zane.

