Re: [openstack-dev] [Openstack-operators] Destructive / HA / fail-over scenarios
Hi Timur,

You can also consider Stepler from Mirantis: https://github.com/Mirantis/stepler

I never tried it, but the documentation states: "Stepler framework is intended to provide the community with a testing framework that is capable of perform advanced scenario and destructive test cases, like batch instances launching, instances migration, services restarts and different HA-specific cases."

Cheers,
Jorge

-----Original Message-----
From: Adam Spiers [mailto:aspi...@suse.com]
Sent: Monday, November 28, 2016 3:09 PM
To: openstack-dev@lists.openstack.org
Subject: Re: [openstack-dev] [Openstack-operators] Destructive / HA / fail-over scenarios

Timur Nurlygayanov <tnurlygaya...@mirantis.com> wrote:
> Hi OpenStack developers and operators,
>
> we are going to create the test suite for destructive testing of
> OpenStack clouds. We want to hear your feedback and ideas about
> possible destructive and failover scenarios which we need to check.
>
> Which scenarios we need to check if we want to make sure that some
> OpenStack cluster is configured in High Availability mode and can be
> published as a "production/enterprise" cluster.
>
> Your ideas are welcome, let's discuss the ideas of test scenarios in
> this email thread.

I applaud the effort to boost automated testing of failure scenarios! And thanks a lot for polling the list before starting any work on this.

Regarding the implementation, did you consider reusing Cloud 99, and if not, please could you? :-) Obviously it would be good to avoid reinventing wheels where possible.

https://www.openstack.org/summit/vancouver-2015/summit-videos/presentation/high-availability-and-resiliency-testing-strategies-for-openstack-clouds
https://github.com/cisco-oss-eng/Cloud99

If there are some gaps between Cloud99 and what is needed then it would be worth evaluating them in order to determine whether it makes sense to start from scratch versus simply develop Cloud99 further.

Also it would be great if you could join the #openstack-ha IRC channel where you will find friendly folks from the broader OpenStack HA sub-community who I'm sure will be happy to discuss this further. You are also very welcome to join our weekly IRC meetings:

https://wiki.openstack.org/wiki/Meetings/HATeamMeeting

Thanks!
Adam

__
OpenStack Development Mailing List (not for usage questions)
Unsubscribe: openstack-dev-requ...@lists.openstack.org?subject:unsubscribe
http://lists.openstack.org/cgi-bin/mailman/listinfo/openstack-dev
Re: [openstack-dev] [Openstack-operators] Destructive / HA / fail-over scenarios
Adam,

So Rally actually filled the gap described in that video.

There are now so-called "hooks" that allow you to trigger other tools or code during the load (after some amount of time or on a specific iteration).

Basically, reliability testing and load testing during upgrades can be implemented as a single Rally task, which is quite convenient.

Best regards,
Boris Pavlovic

On Wed, Nov 30, 2016 at 6:02 AM, Dulko, Michal wrote:
> On Mon, 2016-11-28 at 15:51 +0300, Timur Nurlygayanov wrote:
> > Hi OpenStack developers and operators,
> >
> > we are going to create the test suite for destructive testing of
> > OpenStack clouds. We want to hear your feedback and ideas
> > about possible destructive and failover scenarios which we need
> > to check.
>
> In Cinder we're pursuing A/A for our cinder-volume service. It would be
> useful to run some destructive tests on patch chain [1] to make sure no
> volume operations are failing while clustered cinder-volume service
> gets killed. In the future we should have a CI testing that in periodic
> zuul queue.
>
> [1] https://review.openstack.org/#/c/355968
>
> > Which scenarios we need to check if we want to make sure that
> > some OpenStack cluster is configured in High Availability mode
> > and can be published as a "production/enterprise" cluster.
> >
> > Your ideas are welcome, let's discuss the ideas of test scenarios in
> > this email thread.
> >
> > The spec for High Availability testing is on review: [1]
> > The user story for destructive testing of OpenStack clouds is
> > on review: [2].
> >
> > Thank you!
> >
> > [1] https://review.openstack.org/#/c/399618/
> > [2] https://review.openstack.org/#/c/396142
> >
> > --
> >
> > Timur,
> > QA Manager
> > OpenStack Projects
> > Mirantis Inc
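
To illustrate the hook idea Boris describes above (triggering other code on a specific iteration of a load run), here is a minimal sketch in plain Python. It is not Rally's API: `IterationHook`, `LoadRun`, and `demo` are hypothetical names invented for this illustration, and the "service" is just a flag in a dictionary.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Set, Tuple


@dataclass
class IterationHook:
    """Hypothetical hook: call `action` when the run reaches given iterations."""
    at_iterations: Set[int]
    action: Callable[[int], None]


@dataclass
class LoadRun:
    """Toy load runner (assumption: this only mimics the idea, not Rally)."""
    workload: Callable[[int], bool]
    hooks: List[IterationHook] = field(default_factory=list)

    def run(self, iterations: int) -> List[bool]:
        results = []
        for i in range(1, iterations + 1):
            # Fire every hook registered for this iteration, then do the work.
            for hook in self.hooks:
                if i in hook.at_iterations:
                    hook.action(i)
            results.append(self.workload(i))
        return results


def demo() -> dict:
    state = {"service_up": True}
    events: List[Tuple[str, int]] = []

    def kill_service(i: int) -> None:
        # Stand-in for a destructive action such as stopping a service.
        events.append(("kill", i))
        state["service_up"] = False

    def restore_service(i: int) -> None:
        events.append(("restore", i))
        state["service_up"] = True

    def workload(i: int) -> bool:
        # One load iteration succeeds only while the service is up.
        return state["service_up"]

    run = LoadRun(
        workload,
        [IterationHook({3}, kill_service), IterationHook({5}, restore_service)],
    )
    return {"results": run.run(6), "events": events}
```

Calling `demo()` runs six iterations, injects the failure at iteration 3, and restores the service at iteration 5, so the per-iteration results show exactly which part of the load was affected by the fault.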
Re: [openstack-dev] [Openstack-operators] Destructive / HA / fail-over scenarios
On Mon, 2016-11-28 at 15:51 +0300, Timur Nurlygayanov wrote:
> Hi OpenStack developers and operators,
>
> we are going to create the test suite for destructive testing of
> OpenStack clouds. We want to hear your feedback and ideas
> about possible destructive and failover scenarios which we need
> to check.

In Cinder we're pursuing active/active (A/A) for our cinder-volume service. It would be useful to run some destructive tests on the patch chain [1] to make sure no volume operations fail while a clustered cinder-volume service gets killed. In the future we should have a CI job testing that in the periodic Zuul queue.

[1] https://review.openstack.org/#/c/355968

> Which scenarios we need to check if we want to make sure that
> some OpenStack cluster is configured in High Availability mode
> and can be published as a "production/enterprise" cluster.
>
> Your ideas are welcome, let's discuss the ideas of test scenarios in
> this email thread.
>
> The spec for High Availability testing is on review: [1]
> The user story for destructive testing of OpenStack clouds is
> on review: [2].
>
> Thank you!
>
> [1] https://review.openstack.org/#/c/399618/
> [2] https://review.openstack.org/#/c/396142
>
> --
>
> Timur,
> QA Manager
> OpenStack Projects
> Mirantis Inc
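
The check Michal asks for (no operation fails while one member of a clustered service is killed) could be sketched roughly as follows. This is a pure-Python simulation under stated assumptions, not Cinder code: `Cluster`, `run_destructive_test`, and the node names `vol-1` to `vol-3` are hypothetical, and failover is modelled as simple round-robin over live nodes.

```python
import itertools
from typing import Dict, List


class Cluster:
    """Toy stand-in for an active/active service cluster (hypothetical)."""

    def __init__(self, nodes: List[str]) -> None:
        self.alive: Dict[str, bool] = {name: True for name in nodes}
        self._round_robin = itertools.cycle(nodes)

    def kill(self, node: str) -> None:
        # Simulates the destructive step: the node stops serving requests.
        self.alive[node] = False

    def handle(self, operation: str) -> str:
        # Try each node at most once, skipping dead ones.
        for _ in range(len(self.alive)):
            node = next(self._round_robin)
            if self.alive[node]:
                return node
        raise RuntimeError(f"no live node available for {operation}")


def run_destructive_test(ops: int, kill_at: int, victim: str) -> dict:
    cluster = Cluster(["vol-1", "vol-2", "vol-3"])
    failures = 0
    served_by: List[str] = []
    for i in range(ops):
        if i == kill_at:
            cluster.kill(victim)  # inject the failure mid-run
        try:
            served_by.append(cluster.handle(f"create-volume-{i}"))
        except RuntimeError:
            failures += 1
    return {"failures": failures, "served_by": served_by}
```

For example, `run_destructive_test(9, 3, "vol-2")` kills one of three nodes before the fourth operation; the test passes if `failures` stays at zero and the victim serves nothing after the kill. A real test would also have to cover operations that are in flight on the victim at the moment it dies, which this sketch does not model.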
Re: [openstack-dev] [Openstack-operators] Destructive / HA / fail-over scenarios
Timur Nurlygayanov <tnurlygaya...@mirantis.com> wrote:
> Hi OpenStack developers and operators,
>
> we are going to create the test suite for destructive testing of
> OpenStack clouds. We want to hear your feedback and ideas
> about possible destructive and failover scenarios which we need
> to check.
>
> Which scenarios we need to check if we want to make sure that
> some OpenStack cluster is configured in High Availability mode
> and can be published as a "production/enterprise" cluster.
>
> Your ideas are welcome, let's discuss the ideas of test scenarios in
> this email thread.

I applaud the effort to boost automated testing of failure scenarios! And thanks a lot for polling the list before starting any work on this.

Regarding the implementation, did you consider reusing Cloud99, and if not, please could you? :-) Obviously it would be good to avoid reinventing wheels where possible.

https://www.openstack.org/summit/vancouver-2015/summit-videos/presentation/high-availability-and-resiliency-testing-strategies-for-openstack-clouds
https://github.com/cisco-oss-eng/Cloud99

If there are gaps between Cloud99 and what is needed, then it would be worth evaluating them in order to determine whether it makes sense to start from scratch or simply to develop Cloud99 further.

Also it would be great if you could join the #openstack-ha IRC channel, where you will find friendly folks from the broader OpenStack HA sub-community who I'm sure will be happy to discuss this further. You are also very welcome to join our weekly IRC meetings:

https://wiki.openstack.org/wiki/Meetings/HATeamMeeting

Thanks!
Adam
[openstack-dev] [Openstack-operators] Destructive / HA / fail-over scenarios
Hi OpenStack developers and operators,

we are going to create a test suite for destructive testing of OpenStack clouds. We want to hear your feedback and ideas about possible destructive and failover scenarios which we need to check.

Which scenarios do we need to check if we want to make sure that an OpenStack cluster is configured in High Availability mode and can be published as a "production/enterprise" cluster?

Your ideas are welcome; let's discuss the ideas for test scenarios in this email thread.

The spec for High Availability testing is under review: [1]
The user story for destructive testing of OpenStack clouds is under review: [2].

Thank you!

[1] https://review.openstack.org/#/c/399618/
[2] https://review.openstack.org/#/c/396142

--

Timur,
QA Manager
OpenStack Projects
Mirantis Inc