subject:"\[openstack\-dev\] \[Cinder\] A possible solution for HA Active\-Active"

Re: [openstack-dev] [Cinder] A possible solution for HA Active-Active

2015-08-07 Thread Andrew Beekhof


 On 5 Aug 2015, at 1:34 am, Joshua Harlow harlo...@outlook.com wrote:
 
 Philipp Marek wrote:
 If we end up using a DLM then we have to detect when the connection to
 the DLM is lost on a node and stop all ongoing operations to prevent
 data corruption.
 
 It may not be trivial to do, but we will have to do it in any solution
 we use, even on my last proposal that only uses the DB in Volume Manager
 we would still need to stop all operations if we lose connection to the
 DB.
 
 Well, is it already decided that Pacemaker would be chosen to provide HA in
 Openstack? There's been a talk Pacemaker: the PID 1 of Openstack IIRC.
 
 I know that Pacemaker's been pushed aside in an earlier ML post, but IMO
 there's already *so much* been done for HA in Pacemaker that Openstack
 should just use it.
 
 All HA nodes needs to participate in a Pacemaker cluster - and if one node
 looses connection, all services will get stopped automatically (by
 Pacemaker) - or the node gets fenced.
 
 
 No need to invent some sloppy scripts to do exactly the tasks (badly!) that
 the Linux HA Stack has been providing for quite a few years.
 
 
 Yes, Pacemaker needs learning - but not more than any other involved
 project, and there are already quite a few here, which have to be known to
 any operator or developer already.
 
 
 (BTW, LINBIT sells training for the Linux HA Cluster Stack - and yes,
  I work for them ;)
 
 So just a piece of information, but yahoo (the company I work for, with vms 
 in the tens of thousands, baremetal in the much more than that...) hasn't 
 used pacemaker, and in all honesty this is the first project (openstack) that 
 I have heard that needs such a solution. I feel that we really should be 
 building our services better so that they can be A-A vs having to depend on 
 another piece of software to get around our 'sloppiness' (for lack of a 
 better word).

HA is a deceptively hard problem.
There is really no need for every project to attempt to solve it on their own.
Having everyone consuming/calculating a different membership list is a very 
good way to go insane.

Aside from the usual bugs, the HA space lends itself to making simplifying 
assumptions early on, only to trap you with them down the road.
Its even worse if you’re trying to bolt it on after-the-fact...

Perhaps try to think of pacemaker as a distribute finite state machine instead 
of a cluster manager.
That is part of the value we bring to projects like galera and rabbitmq.

Sure they are A-A, and once they’re up they can survive many failures, but 
bringing them up can be non-trivial.
We also provide the additional context (eg. quorum and fencing) that allow more 
kinds of failures to be safely recovered from.

Something to think about perhaps.

— Andrew

 
 Nothing against pacemaker personally... IMHO it just doesn't feel like we are 
 doing this right if we need such a product in the first place.
 
 
 __
 OpenStack Development Mailing List (not for usage questions)
 Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
 http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
 
 __
 OpenStack Development Mailing List (not for usage questions)
 Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
 http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev


__
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

Re: [openstack-dev] [Cinder] A possible solution for HA Active-Active

2015-08-05 Thread Mike Perez

On Tue, Aug 4, 2015 at 7:47 PM, Morgan Fainberg
morgan.fainb...@gmail.com wrote:

 On Tue, Aug 4, 2015 at 1:43 AM, Gorka Eguileor gegui...@redhat.com wrote:

 On Tue, Aug 04, 2015 at 05:47:44AM +1000, Morgan Fainberg wrote:
 
   On Aug 4, 2015, at 01:42, Fox, Kevin M kevin@pnnl.gov wrote:
  
   I'm usually for abstraction layers, but they don't always pay off very
   well due to catering to the lowest common denominator.
  
   Lets clearly define the problem space first. IFF the problem space can
   be fully implemented using Tooz, then lets do that. Then the operator can
   choose. If Tooz cant and wont handle the problem space, then we're 
   trying to
   fit a square peg in a round hole.
 
  +1 and specifically around tooz, it is narrow in comparison to the
  feature sets of some the DLMs (since it has to mostly-implement to the
  lowest common denominator, as abstraction layers do). Defining the space we
  are trying to target will let us make the informed decision on what we use.

 Again with this?

 Yes, I was reiterating that we should not talk about a specific choice but
 continue with the other discussion. Tooz, ZooKeeper, Consul, etc, is all
 irrelevant to the rest of the conversation we are having. The specific
 technology used can be discussed in an x-project spec, but I really would
 rather see a very opinionated choice. That can again be delayed until a
 later point.

 We already what we want to get out of Tooz, where we want it and for how
 long we'll be using it in each of those places.

 My response was also before the rest of the convo that occurred post
 Flavio's summary.

 To answer those questions all that's needed is to read this thread and
 the links referred on some conversations.

 I am fine with using a DLM. I see a significant benefit (without putting too
 fine a point on it, Keystone *will* benefit from a choice for a DLM to be
 available in OpenStack, and I like the idea). I was hoping to continue (and
 we did) identify where we had DLM-like/DLM uses in OpenStack so we knew
 where to focus.

Hey all,

This thread is a mess.

I'm going to put together facts with what projects are doing and why.
I will present my findings at the session that I will be moderating in
the cross project track of the summit [1], if accepted. Spec may
follow.

[1] - https://etherpad.openstack.org/p/mitaka-cross-project-session-planning

--
Mike Perez

__
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

Re: [openstack-dev] [Cinder] A possible solution for HA Active-Active

2015-08-05 Thread Flavio Percoco


On 04/08/15 23:39 -0700, Mike Perez wrote:

On Tue, Aug 4, 2015 at 7:47 PM, Morgan Fainberg
morgan.fainb...@gmail.com wrote:


On Tue, Aug 4, 2015 at 1:43 AM, Gorka Eguileor gegui...@redhat.com wrote:


On Tue, Aug 04, 2015 at 05:47:44AM +1000, Morgan Fainberg wrote:

  On Aug 4, 2015, at 01:42, Fox, Kevin M kevin@pnnl.gov wrote:
 
  I'm usually for abstraction layers, but they don't always pay off very
  well due to catering to the lowest common denominator.
 
  Lets clearly define the problem space first. IFF the problem space can
  be fully implemented using Tooz, then lets do that. Then the operator can
  choose. If Tooz cant and wont handle the problem space, then we're trying to
  fit a square peg in a round hole.

 +1 and specifically around tooz, it is narrow in comparison to the
 feature sets of some the DLMs (since it has to mostly-implement to the
 lowest common denominator, as abstraction layers do). Defining the space we
 are trying to target will let us make the informed decision on what we use.

Again with this?


Yes, I was reiterating that we should not talk about a specific choice but
continue with the other discussion. Tooz, ZooKeeper, Consul, etc, is all
irrelevant to the rest of the conversation we are having. The specific
technology used can be discussed in an x-project spec, but I really would
rather see a very opinionated choice. That can again be delayed until a
later point.


We already what we want to get out of Tooz, where we want it and for how
long we'll be using it in each of those places.


My response was also before the rest of the convo that occurred post
Flavio's summary.


To answer those questions all that's needed is to read this thread and
the links referred on some conversations.


I am fine with using a DLM. I see a significant benefit (without putting too
fine a point on it, Keystone *will* benefit from a choice for a DLM to be
available in OpenStack, and I like the idea). I was hoping to continue (and
we did) identify where we had DLM-like/DLM uses in OpenStack so we knew
where to focus.


Hey all,

This thread is a mess.

I'm going to put together facts with what projects are doing and why.
I will present my findings at the session that I will be moderating in
the cross project track of the summit [1], if accepted. Spec may
follow.

[1] - https://etherpad.openstack.org/p/mitaka-cross-project-session-planning


FWIW, there are 2 threads now. This one that you just replied to is
supposed to be related to Cinder and not to the cross-project
discussion. It's a mess, I agree! :(

That said, you may want to sync with Joshua since he's going to work
on a cross-project spec as well (as he mentioned in the other
thread).[0]

Thanks for taking the time,
Flavio

[0] http://lists.openstack.org/pipermail/openstack-dev/2015-August/071400.html



--
Mike Perez

__
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev


--
@flaper87
Flavio Percoco


pgpjhEZ_lKeP9.pgp
Description: PGP signature
__
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

Re: [openstack-dev] [Cinder] A possible solution for HA Active-Active

2015-08-05 Thread Philipp Marek

Well, is it already decided that Pacemaker would be chosen to provide HA in
Openstack? There's been a talk Pacemaker: the PID 1 of Openstack IIRC.

I know that Pacemaker's been pushed aside in an earlier ML post, but IMO
there's already *so much* been done for HA in Pacemaker that Openstack
should just use it.

All HA nodes needs to participate in a Pacemaker cluster - and if one node
looses connection, all services will get stopped automatically (by
Pacemaker) - or the node gets fenced.

No need to invent some sloppy scripts to do exactly the tasks (badly!) that
the Linux HA Stack has been providing for quite a few years.
So just a piece of information, but yahoo (the company I work for, with vms
in the tens of thousands, baremetal in the much more than that...) hasn't
used pacemaker, and in all honesty this is the first project (openstack)
that I have heard that needs such a solution. I feel that we really should
be building our services better so that they can be A-A vs having to depend
on another piece of software to get around our 'sloppiness' (for lack of a
better word).

Nothing against pacemaker personally... IMHO it just doesn't feel like we
are doing this right if we need such a product in the first place.
Well, Pacemaker is *the* Linux HA Stack.

So, before trying to achieve similar goals by self-written scripts (and
having to re-discover all the gotchas involved), it would be much better to
learn from previous experiences - even if they are not one's own.

Pacemaker has eg. the concept of clones[1] - these define services that run
multiple instances within a cluster. And behold! the instances get some
Pacemaker-internal unique id[2], which can be used to do sharding.

Yes, that still means that upon service or node crash the failed instance
has to be started on some other node; but as that'll typically be up and
running already, the startup time should be in the range of seconds.

We'd instantly get
* a supervisor to start/stop/restart/fence/monitor the service(s)
* node/service failure detection
* only small changes needed in the services
* and all that in a tested software that's available in all distributions,
and that already has its own testsuite...

If we decide that this solution won't fulfill all our expectations, fine -
let's use something else.

But I don't think it makes *any* sense to try to redo some (existing)
High-Availability code in some quickly written scripts, just because it
looks easy - there are quite a few traps for the unwary.

Ad 1:
http://clusterlabs.org/doc/en-US/Pacemaker/1.1/html/Pacemaker_Explained/s-resource-clone.html
Ad 2: OCF_RESKEY_CRM_meta_clone; that's not guaranteed to be an unbroken
sequence, though.

__
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev

70 matches

Mail list logo