On 6 October 2016 at 10:43, Jay Pipes <jaypi...@gmail.com> wrote:
> On 10/06/2016 11:58 AM, Naveen Joy (najoy) wrote:
>> It’s primarily because we have seen better stability and scalability
>> with etcd over rabbitmq.
> Well, that's kind of comparing apples to oranges. :)
> One is a distributed k/v store. The other is a message queue broker.
> The way that we (IMHO) over-use the peer-to-peer RPC communication
> paradigm in Nova and Neutron has resulted in a number of design choices and
> awkward code in places like oslo.messaging because of the use of
> broker-based message queue systems as the underlying transport mechanism.
> It's not that RabbitMQ or AMQP isn't scalable or reliable. It's that we're
> using it in ways that don't necessarily fit well.
> One might argue that in using etcd and etcd watches in the way you are in
> networking-vpp, that you are essentially using those tools to create a
> simplified pub-sub messaging system and that isn't really what etcd was
> built for and you will end up running into similar fitness issues
> long-term. But, who knows? It might end up being a genius implementation. :)
> I'm happy to see innovation flourish here and encourage new designs and
> strategies. Let's just make sure we compare apples to apples when making
> statements about performance or reliability.
Sorry to waken an old thread, but I chose a perfect moment to go on
So yes: I don't entirely trust the way we use RabbitMQ, and that's largely
because what we're doing with it - distributing state, or copies of state,
or information derived from state - leads to some fragility and odd
situations when using a tool perhaps better suited to listing off tasks.
We've tried to find a different model of working that is closer to the
behaviour we're after. It is, I believe, similar to the Calico team's
thinking, but not derived from their code. I have to admit at this point
that it's not been tested at scale in our use of it, and that's something
we will be doing, but I can say that this is working in a way that is in
line with how etcd is intended to be used, we have tested representative
etcd performance, and we don't expect problems.
As mentioned before, Neutron's SQL database is the source of truth - you
need to have one, and that one represents what the client asked for in its
purest form. In the nature of keeping two datastores in sync, there is a
worker thread outside of the REST call to do the synchronisation (because
we don't want the cloud user to be waiting on our internal workings, and
because consistently committing to two databases is a recipe for disaster)
- etcd lags the Neutron DB commits very slightly, and the Neutron DB is
always right. This allows the API to be quick while the backend will run
as efficiently as possible.
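
As a rough illustration of the arrangement described above (this is not the
networking-vpp code; FakeStore, Syncer and every method name here are
hypothetical, and plain dicts stand in for both the Neutron DB and etcd), the
API path commits to the source of truth and returns, while a worker thread
copies the change to the backend store a moment later:

```python
import queue
import threading


class FakeStore:
    """Hypothetical stand-in for a datastore: a dict guarded by a lock."""

    def __init__(self):
        self._data = {}
        self._lock = threading.Lock()

    def put(self, key, value):
        with self._lock:
            self._data[key] = value

    def get(self, key):
        with self._lock:
            return self._data.get(key)


class Syncer:
    """Commits to the source of truth inline; updates the backend later."""

    def __init__(self, source_db, backend_kv):
        self.source_db = source_db    # plays the role of the Neutron DB
        self.backend_kv = backend_kv  # plays the role of etcd
        self._pending = queue.Queue()
        self._worker = threading.Thread(target=self._run, daemon=True)
        self._worker.start()

    def api_update(self, key, value):
        """The REST path: one commit, then return without waiting."""
        self.source_db.put(key, value)
        self._pending.put(key)  # only a hint that this key changed
        return "accepted"

    def _run(self):
        while True:
            key = self._pending.get()
            try:
                if key is None:  # sentinel queued by stop()
                    return
                # Re-read the source of truth so the backend converges on
                # the latest value rather than replaying stale ones.
                self.backend_kv.put(key, self.source_db.get(key))
            finally:
                self._pending.task_done()

    def wait_until_synced(self):
        """Block until every queued change has been copied (for demos)."""
        self._pending.join()

    def stop(self):
        self._pending.put(None)
        self._worker.join()
```

Queuing only the key, and re-reading the value in the worker, is one way to
keep the first store authoritative: the backend may lag, but it never ends up
holding a value the source of truth did not hold.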
It does also mean that failures to communicate in the backend don't result
in failed API calls - the call succeeds but state updates don't happen.
This is in line with a 'desired state' model. A user tells Neutron what
they want to do and Neutron should generally accept the request if it's
well formatted and consistent. Exceptional error codes like 500s are
annoying to deal with, as you never know if that means 'I failed to save
that' or 'I failed to implement that' or 'I saved and implemented that, but
didn't quite get the answer to you' - having simple frontend code ensures
the answer is highly likely to be 'I will do that in a moment', in
keeping with the eventually consistent model OpenStack has. The
driver will then work its magic and update object states when the work is
Watching changes - and the pub-sub model you end up with - is a means of
being efficient, but should we miss notifications there's a fallback
mechanism to get back into state sync with the most recent version of the
state. In the worst case, we focus on the currently desired state, and not
the backlog of recent changes to state.
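
The watch-plus-fallback idea can be sketched in a few lines. This is an
in-memory model under stated assumptions, not etcd's client API and not the
networking-vpp agent: FakeKV, Agent, and the single increasing revision
counter are all hypothetical, and deletions are left out for brevity. The
agent applies events in order, and if it notices a gap it stops replaying and
reads the current desired state instead:

```python
class FakeKV:
    """Hypothetical versioned key/value store held in memory."""

    def __init__(self):
        self.revision = 0
        self.data = {}

    def put(self, key, value):
        self.revision += 1
        self.data[key] = value
        # The tuple is the notification a watcher would normally receive.
        return (self.revision, key, value)

    def snapshot(self):
        """Return the current revision and a copy of the whole state."""
        return self.revision, dict(self.data)


class Agent:
    """Follows the store via events, falling back to a full read."""

    def __init__(self, kv):
        self.kv = kv
        self.state = {}
        self.revision = 0
        self.resyncs = 0

    def on_event(self, event):
        revision, key, value = event
        if revision <= self.revision:
            return  # stale or duplicate notification, already covered
        if revision != self.revision + 1:
            # At least one notification was missed: ignore the backlog
            # and fetch the currently desired state.
            self.resync()
            return
        self.state[key] = value
        self.revision = revision

    def resync(self):
        self.revision, self.state = self.kv.snapshot()
        self.resyncs += 1


kv = FakeKV()
agent = Agent(kv)
agent.on_event(kv.put("port/1", "up"))
kv.put("port/1", "down")                 # this notification is lost
agent.on_event(kv.put("port/2", "up"))   # gap detected, full resync
```

After the last call the agent holds both ports in their latest state, having
resynced once and never having seen the intermediate event.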
And Jay, you're right. What we should be comparing here is how well it
works. Is it easy to use, is it easy to maintain, is it annoyingly
fragile, and does it eat network or CPU? I believe it does well on all of
those counts (or I wouldn't have chosen to do it this way), and I hope
we've produced something simple to
understand while being easier to operate. However, the proof of the
pudding is in the eating, so let's see how this works as we continue to
develop and test it.
OpenStack Development Mailing List (not for usage questions)