Re: [Gluster-users] To RAID or not to RAID...

2020-01-14 Thread Hu Bert
Hi,

our old setup is not really comparable, but i thought i'd drop some
lines... we once had a Distributed-Replicate setup with 4 x 3 = 12
disks (10 TB hdd). Simple JBOD, every disk == brick. Was running
pretty good, until one of the disks died. The restore (reset-brick)
took about a month, because the application has a quite high I/O and
therefore slows down the volume and the disks.

Next step: take servers with 10x10TB disks and build a RAID 10; raid
array == brick, replicate volume (1 x 3 = 3). When a disk fails, you
only have to rebuild the SW RAID which takes about 3-4 days, plus the
periodic redundany checks. This was way better than the
JBOD/reset-scenario before. But still not optimal. Upcoming step:
build a distribute-replicate with lots of SSDs (maybe again with a
RAID underneath) .

tl;dr what i wanted to say: we waste a lot of disks. It simply depends
on which setup you have and how to handle the situation when one of
the disks fails - and it will! ;-(


regards
Hubert

Am Di., 14. Jan. 2020 um 12:36 Uhr schrieb Markus Kern :
>
>
> Greetings again!
>
> After reading RedHat documentation regarding optimizing Gluster storage
> another question comes to my mind:
>
> Let's presume that I want to go the distributed dispersed volume way.
> Three nodes which two bricks each.
> According to RedHat's recommendation, I should use RAID6 as underlying
> RAID for my planned workload.
> I am frightened by that "waste" of disks in such a case:
> When each brick is a RAID6, I would "loose" two disks per brick - 12
> lossed disks in total.
> In addition to this, distributed dispersed volume adds another layer of
> lossed disk space.
>
> Am I wrong here? Maybe I didn't understand the recommendations wrong?
>
> Markus
> 
>
> Community Meeting Calendar:
>
> APAC Schedule -
> Every 2nd and 4th Tuesday at 11:30 AM IST
> Bridge: https://bluejeans.com/441850968
>
> NA/EMEA Schedule -
> Every 1st and 3rd Tuesday at 01:00 PM EDT
> Bridge: https://bluejeans.com/441850968
>
> Gluster-users mailing list
> Gluster-users@gluster.org
> https://lists.gluster.org/mailman/listinfo/gluster-users


Community Meeting Calendar:

APAC Schedule -
Every 2nd and 4th Tuesday at 11:30 AM IST
Bridge: https://bluejeans.com/441850968

NA/EMEA Schedule -
Every 1st and 3rd Tuesday at 01:00 PM EDT
Bridge: https://bluejeans.com/441850968

Gluster-users mailing list
Gluster-users@gluster.org
https://lists.gluster.org/mailman/listinfo/gluster-users


Re: [Gluster-users] To RAID or not to RAID...

2020-01-14 Thread Strahil
Hi Markus,


You are right  .I think that the 3 node setup matches distributed volume.

According to 
https://docs.gluster.org/en/latest/Administrator%20Guide/Setting%20Up%20Volumes/

Dispersed  volumes  use erasure codes to have a 'parity' on a separate brick. 
In such case you can afford to  loose a brick without loosing data and you will 
need more bricks.

Yet, I don't see anything about RAID6 being required.

Use the gluster's official documentation (if possible) as it is the most recent 
info.

Maybe you can share the ammount of disks , raid controllers and servers you 
have and your tolerance to data loss. Then I can share my thoughts on the 
possible volume types.

Best Regards,
Strahil Nikolov

On Jan 14, 2020 20:33, Markus Kern  wrote:
>
> Hi Strahil, 
>
> thanks for you answer - but now I am completely lost :) 
>
> From this documentation: 
> https://docs.oracle.com/cd/E52668_01/F10040/html/gluster-312-volume-distr-disp.html
>  
>
> "As a dispersed volume must have a minimum of three bricks, a 
> distributed dispersed volume must have at least six bricks. For example, 
> six nodes with one brick, or three nodes with two bricks on each node 
> are needed for this volume type." 
>
> So for a distributed dispersed volume I need at least six bricks. If 
> each brick is a RAID6, I have 6 x 2 Parity disks = 12 disks for parity. 
>
> In your example you only have one brick per node in a three node setup. 
> This is no distributed dispersed volume then, right? 
>
> A confused Markus 
>
>
> Am 14.01.2020 16:29, schrieb Strahil: 
> > Hi Markus, 
> > 
> > Distributed dispersed volume is just LVM's linear LV -> so in case of 
> > brick failiure - you loose the data on it. 
> > 
> > Raid 6 requires  2 disks  for parity, so you can make a large RAID6 
> > and use that as a single brick - so the disks that hold  the parity 
> > data are  only 6 ( 3 nodes x 2 disks). 
> > 
> > Of course  if you have too many disks  for a single raid controller 
> > ,that you can consider a  replica volume  with an arbiter. 
> > 
> > 
> > Best Regards, 
> > Strahil Nikolov 
> > 
> > On Jan 14, 2020 13:36, Markus Kern  wrote: 
> >> 
> >> 
> >> Greetings again! 
> >> 
> >> After reading RedHat documentation regarding optimizing Gluster 
> >> storage 
> >> another question comes to my mind: 
> >> 
> >> Let's presume that I want to go the distributed dispersed volume way. 
> >> Three nodes which two bricks each. 
> >> According to RedHat's recommendation, I should use RAID6 as underlying 
> >> RAID for my planned workload. 
> >> I am frightened by that "waste" of disks in such a case: 
> >> When each brick is a RAID6, I would "loose" two disks per brick - 12 
> >> lossed disks in total. 
> >> In addition to this, distributed dispersed volume adds another layer 
> >> of 
> >> lossed disk space. 
> >> 
> >> Am I wrong here? Maybe I didn't understand the recommendations wrong? 
> >> 
> >> Markus 
> >>  
> >> 
> >> Community Meeting Calendar: 
> >> 
> >> APAC Schedule - 
> >> Every 2nd and 4th Tuesday at 11:30 AM IST 
> >> Bridge: https://bluejeans.com/441850968 
> >> 
> >> NA/EMEA Schedule - 
> >> Every 1st and 3rd Tuesday at 01:00 PM EDT 
> >> Bridge: https://bluejeans.com/441850968 
> >> 
> >> Gluster-users mailing list 
> >> Gluster-users@gluster.org 
> >> https://lists.gluster.org/mailman/listinfo/gluster-users 


Community Meeting Calendar:

APAC Schedule -
Every 2nd and 4th Tuesday at 11:30 AM IST
Bridge: https://bluejeans.com/441850968

NA/EMEA Schedule -
Every 1st and 3rd Tuesday at 01:00 PM EDT
Bridge: https://bluejeans.com/441850968

Gluster-users mailing list
Gluster-users@gluster.org
https://lists.gluster.org/mailman/listinfo/gluster-users


Re: [Gluster-users] To RAID or not to RAID...

2020-01-14 Thread Markus Kern

Hi Strahil,

thanks for you answer - but now I am completely lost :)

From this documentation:
https://docs.oracle.com/cd/E52668_01/F10040/html/gluster-312-volume-distr-disp.html

"As a dispersed volume must have a minimum of three bricks, a 
distributed dispersed volume must have at least six bricks. For example, 
six nodes with one brick, or three nodes with two bricks on each node 
are needed for this volume type."


So for a distributed dispersed volume I need at least six bricks. If 
each brick is a RAID6, I have 6 x 2 Parity disks = 12 disks for parity.


In your example you only have one brick per node in a three node setup. 
This is no distributed dispersed volume then, right?


A confused Markus


Am 14.01.2020 16:29, schrieb Strahil:

Hi Markus,

Distributed dispersed volume is just LVM's linear LV -> so in case of
brick failiure - you loose the data on it.

Raid 6 requires  2 disks  for parity, so you can make a large RAID6
and use that as a single brick - so the disks that hold  the parity
data are  only 6 ( 3 nodes x 2 disks).

Of course  if you have too many disks  for a single raid controller
,that you can consider a  replica volume  with an arbiter.


Best Regards,
Strahil Nikolov

On Jan 14, 2020 13:36, Markus Kern  wrote:



Greetings again!

After reading RedHat documentation regarding optimizing Gluster 
storage

another question comes to my mind:

Let's presume that I want to go the distributed dispersed volume way.
Three nodes which two bricks each.
According to RedHat's recommendation, I should use RAID6 as underlying
RAID for my planned workload.
I am frightened by that "waste" of disks in such a case:
When each brick is a RAID6, I would "loose" two disks per brick - 12
lossed disks in total.
In addition to this, distributed dispersed volume adds another layer 
of

lossed disk space.

Am I wrong here? Maybe I didn't understand the recommendations wrong?

Markus


Community Meeting Calendar:

APAC Schedule -
Every 2nd and 4th Tuesday at 11:30 AM IST
Bridge: https://bluejeans.com/441850968

NA/EMEA Schedule -
Every 1st and 3rd Tuesday at 01:00 PM EDT
Bridge: https://bluejeans.com/441850968

Gluster-users mailing list
Gluster-users@gluster.org
https://lists.gluster.org/mailman/listinfo/gluster-users



Community Meeting Calendar:

APAC Schedule -
Every 2nd and 4th Tuesday at 11:30 AM IST
Bridge: https://bluejeans.com/441850968

NA/EMEA Schedule -
Every 1st and 3rd Tuesday at 01:00 PM EDT
Bridge: https://bluejeans.com/441850968

Gluster-users mailing list
Gluster-users@gluster.org
https://lists.gluster.org/mailman/listinfo/gluster-users