Re: [DRBD-user] DRBD with CentOS in Production?

2013-08-19 Thread Florian Haas
On Mon, Aug 19, 2013 at 10:11 PM, Nick Khamis wrote: > Bottom line is there are people who are sincere in there endeavours, ones > that actually spend countless hours pumping out messages helping each other. > You know exactly who you are. > And then there are opportunists, that see an angle and t

Re: [DRBD-user] DRBD with CentOS in Production?

2013-08-19 Thread Florian Haas
On Mon, Aug 19, 2013 at 9:57 PM, Nick Khamis wrote: >>> Nick was never a customer of ours -- unless he used a pseudonym :) ] > > I think we have enough brain power here at UoT And at the beginning > there > was a small learning curve. Terminology etc... > > I have no problem cc'ing you in any

Re: [DRBD-user] DRBD with CentOS in Production?

2013-08-19 Thread Florian Haas
Sorry for prolonging the list pollution here. But since this one was targeted personally at me, I hope people are OK with me responding. Also, sorry for the late reply, I took a few days off to spend with my family. On 08/14/2013 09:02 PM, Nick Khamis wrote: > Whatever you do, don't by Florian, h

Re: [DRBD-user] Highly available iSCSI storage with DRBD and Pacemaker - how to set VPD page 83 (Device Identification) using pacemaker

2013-05-03 Thread Florian Haas
On Fri, May 3, 2013 at 8:38 AM, Vladislav Bogdanov wrote: > 02.05.2013 13:22, Maurits van de Lande wrote: >> Hello, >> >> I have setup “Highly available iSCSI storage with DRBD and Pacemaker” >> using Centos 6 and tgt. http://wiki.skytech.dk/images/4/44/Ha-iscsi.pdf >> I would like to use the iscs

Re: [DRBD-user] Highly available iSCSI storage with DRBD and Pacemaker - how to set VPD page 83 (Device Identification) using pacemaker

2013-05-02 Thread Florian Haas
Hi Maurits, On Thu, May 2, 2013 at 12:22 PM, Maurits van de Lande wrote: > > Hello, > > > > I have setup “Highly available iSCSI storage with DRBD and Pacemaker” using > Centos 6 and tgt. http://wiki.skytech.dk/images/4/44/Ha-iscsi.pdf Try looking at page 12, section 5.2.1. :) > I would like t

Re: [DRBD-user] Fast write performance on backing device, slow write Performance on DRBD

2012-12-20 Thread Florian Haas
On Tue, Dec 18, 2012 at 10:58 AM, Tom Fernandes wrote: > --- DRBD - > tom@hydra04 [1526]:~$ sudo drbdadm dump > # /etc/drbd.conf > common { > protocol C; > syncer { > rate 150M; > } >

Re: [DRBD-user] DRBD over LVM - Cannot open backing device

2012-12-12 Thread Florian Haas
On Tue, Dec 11, 2012 at 9:44 PM, Paul Shannon - NOAA Federal wrote: > I'm trying to setup DRBD for the first time on a 2-node cluster using DRBD > over a lvm lvgroup. I believe I have set it up just as outlined in sec10.2 > of the DRBD User's Guide. My r0.res is setup with: > resource 0 { Is th

Re: [DRBD-user] DRBD Replication Modes

2012-12-09 Thread Florian Haas
On Fri, Dec 7, 2012 at 7:09 PM, Nathan Joyes wrote: > We are investigating the possibility of using DRBD to perform synchronous > replication of our PostgreSQL database server. After reading the > Replication modes section of the DRBD features documentation we concluded > that we would need to use

Re: [DRBD-user] Failover Behavior in Server-Crash Scenario

2012-12-06 Thread Florian Haas
On 12/07/2012 01:14 AM, Robinson, Eric wrote: >> All looks reasonable. Of course, given the fact that you're >> missing crm-fence-peer.sh, if I were you I'd double check the >> existence (and >> executability) of all other handler scripts as well. >> > > The path is wrong. None of the handler sc

Re: [DRBD-user] Failover Behavior in Server-Crash Scenario

2012-12-06 Thread Florian Haas
On 12/07/2012 12:53 AM, Robinson, Eric wrote: Any concurrent log entries in your kernel log, from the >> drbd0 device? >>> >>> >>> In fact, there are... >>> >>> Dec 6 13:51:17 ha09a kernel: d-con ha02_mysql: conn( >> Unconnected -> >>> WFConnection ) Dec 6 13:51:19 ha09a root: drbd S

Re: [DRBD-user] Failover Behavior in Server-Crash Scenario

2012-12-06 Thread Florian Haas
On 12/07/2012 12:04 AM, Robinson, Eric wrote: >> Have you checked your Pacemaker and DRBD logs to >> determine whether (a) Pacemaker is not even _attempting_ a >> promotion, or (b) Pacemaker does attempt it but it fails at >> the DRBD level? >> > > Pacemaker successfully demotes the resources o

Re: [DRBD-user] Failover Behavior in Server-Crash Scenario

2012-12-06 Thread Florian Haas
On 12/07/2012 12:26 AM, Robinson, Eric wrote: >> Any concurrent log entries in your kernel log, from the drbd0 device? >> > > > In fact, there are... > > Dec 6 13:51:17 ha09a kernel: d-con ha02_mysql: conn( Unconnected -> > WFConnection ) > Dec 6 13:51:19 ha09a root: drbd SA notify > Dec 6 1

Re: [DRBD-user] Failover Behavior in Server-Crash Scenario

2012-12-06 Thread Florian Haas
On Thu, Dec 6, 2012 at 5:38 PM, Robinson, Eric wrote: > With Pacemaker 1.1.8 and drbd 8.4.2, we are observing that when the primary > node is put into standby mode ('crm node standby') the drbd resource on the > secondary node refuses to be promoted because it is in a WFConnection state. > Is this

Re: [DRBD-user] Dual Primary dead with 8.4.2

2012-12-04 Thread Florian Haas
On Fri, Nov 16, 2012 at 5:01 PM, Henning Ryll wrote: > Hello, > > I have tried to make a dual primary setup, but can't get it work. > I'm using two Celisus 550 Workstations with two sata drives on each machine > building a softraid 0. > There are dedicated 1GBit network cards for use with drbd dir

Re: [DRBD-user] DRBD 8.4.2 Doesn't Compile Against 3.7 Kernel

2012-12-03 Thread Florian Haas
On Fri, Nov 30, 2012 at 6:15 PM, Shaun Thomas wrote: > Hey guys, > > We were asked to do some Ubuntu testing against an upstream 3.7 kernel, and > it turns out the 8.4.2 included with a vanilla 3.7 kernel does not come > anywhere near the code for 8.4.2 available for download. APIs have changed >

Re: [DRBD-user] Sockets Direct Protocol (SDP)

2012-10-25 Thread Florian Haas
On Wed, Oct 24, 2012 at 2:28 PM, Matthew Goulding wrote: > Hi All!, > > I have been looking into to using Infiniband with SDP. > > Though from what i read SDP is now deprecated and the "ib_sdp" module is no > longer part of OFED or the kernel. > > Ref: > http://comments.gmane.org/gmane.network.ope

Re: [DRBD-user] Secondary node io-error

2012-10-10 Thread Florian Haas
On Wed, Oct 10, 2012 at 5:42 AM, Velayutham, Prakash wrote: > Just wanted to add this. I repeated my test again and get the exact same > results again. Here is /proc/drbd of the primary (bmimysqlt3) and secondary > (bmimysqlt4) before the secondary's disk is cut off (disabling the fiber > switc

Re: [DRBD-user] Version issue? - DRBD Primary on 8.3.2 - Secondary on 8.3.13

2012-10-08 Thread Florian Haas
On Mon, Oct 8, 2012 at 10:24 AM, Lars Ellenberg wrote: >> Lars, did any _default_ settings change in 8.3.3, when proto 91 was >> introduced? > > You are on the completely wrong track there. That's encouraging. Cheers, Florian -- Need help with High Availability? http://www.hastexo.com/now ___

Re: [DRBD-user] Version issue? - DRBD Primary on 8.3.2 - Secondary on 8.3.13

2012-10-07 Thread Florian Haas
On Sun, Oct 7, 2012 at 6:56 PM, ShockwaveCS wrote: > > Can I downgrade the secondary host back to 8.3.2 without any negative side > effects? -Or- Can I upgrade the 8.3.2 to 8.3.13 while the node is online, > serving data, and still Primary? Doesn't seem likely, but hey why not ask. I > want to do

Re: [DRBD-user] Does oversize disk hurt anything?

2012-10-07 Thread Florian Haas
On Sun, Oct 7, 2012 at 2:20 PM, Dan Barker wrote: >>Well if you had created a partition (/dev/sdc1) rather than use the full disk >>(/dev/sdc), then you could have set up that partition to match the size of >>the disk on your primary. > > Partition. Great idea. If I had thought of that, I'd have

Re: [DRBD-user] Does oversize disk hurt anything?

2012-10-07 Thread Florian Haas
On Fri, Oct 5, 2012 at 7:59 PM, Dan Barker wrote: > I just lost a disk on my secondary node. I looked EVERYWHERE and can't find > the spare disks I bought for such an occurrence. So, I put in a handy disk, > twice the size. > > drbdadm create-md r1 > drbdadm attach r1 > > and off we go. > > If mem

Re: [DRBD-user] two node (master/slave) failover not working

2012-10-01 Thread Florian Haas
On 10/01/2012 10:36 PM, Lonni J Friedman wrote: > order FS0_drbd-after-FS0 inf: FS0_Clone:promote g_services This your problem. That line is functionally equivalent to order FS0_drbd-after-FS0 inf: FS0_Clone:promote g_services:promote which means "you can start g_services at any time, but you ne

Re: [DRBD-user] DRBD with GlusterFS

2012-09-28 Thread Florian Haas
On 09/28/2012 04:59 AM, Sayuri Komatsu wrote: > Hi all. > > In the past I try to use DRBD in a load balancer enviroment. And for > that, I need read/write for several server, but DRBD only support two nodes. > > Can I use DRBD with a Distribute File System? > > I think to use the DRBD block for

Re: [DRBD-user] drbd-8.4.2rc1 test results

2012-08-20 Thread Florian Haas
On Mon, Aug 20, 2012 at 3:39 PM, Dirk Bonenkamp - ProActive wrote: > Op 17-8-2012 17:56, Brian R. Hellman schreef: >> On 08/17/2012 04:24 AM, Dirk Bonenkamp - ProActive wrote: >>> Hi All, >>> >>> I guess a blog post and maybe some extra detail in the manual would help >>> on this matter. >>> Cheer

Re: [DRBD-user] DRBD and iSCSI (which? ^o^) versus scalability

2012-07-27 Thread Florian Haas
Hello, On Fri, Jul 27, 2012 at 4:32 AM, Christian Balzer wrote: > > Hello, > > I'm pondering a HA iSCSI (really iSER or SRP, Infiniband backend) storage > cluster based on DRBD and Pacemaker. So something that has been documented > and implemented numerous times. > > However setting up things on

Re: [DRBD-user] crm-fence-peer.sh & maintenance / reboots

2012-07-25 Thread Florian Haas
On Wed, Jul 25, 2012 at 12:30 PM, Dirk Bonenkamp - ProActive wrote: > Op 24-7-2012 17:00, Florian Haas schreef: >> On Tue, Jul 24, 2012 at 2:17 PM, Dirk Bonenkamp - ProActive >> wrote: >>> Hi all, >>> >>> I'm having trouble with the crm-fen

Re: [DRBD-user] crm-fence-peer.sh & maintenance / reboots

2012-07-24 Thread Florian Haas
On Tue, Jul 24, 2012 at 2:17 PM, Dirk Bonenkamp - ProActive wrote: > Hi all, > > I'm having trouble with the crm-fence-peer.sh script and maintenance / > reboots. > > I have a simple Master / Slave set-up with 1 resource. Without the > crm-fence-peer.sh and crm-unfence-peer.sh scripts activated, t

Re: [DRBD-user] Resource-manger with Heartbeat 2.0.7 and DRBD 8.0.6

2012-07-24 Thread Florian Haas
On Tue, Jul 24, 2012 at 12:50 PM, Richard Avilez wrote: > Hi > > I am a newcomer to HA/DRBD and had to setup a simplified 2 node VM cluster > test envionment with OpenSuse 10.3/2.6.22 (i586) > over the last few weeks to at least try to reproduce couple of serious > problems which happened in a pro

Re: [DRBD-user] Problem to use DRBD with LVM controlled by Pacemaker

2012-07-03 Thread Florian Haas
On Tue, Jul 3, 2012 at 4:17 PM, Lars Marowsky-Bree wrote: > On 2012-06-29T11:23:09, Phil Frost wrote: > >> >Jun 29 12:34:53 ss2 LVM[2386]: INFO: 0 logical volume(s) in volume >> >group "storage" now active >> Looks like your problem is right there. The LVM RA considers >> starting the VG as faile

Re: [DRBD-user] Concurrent local write detected

2012-06-26 Thread Florian Haas
On Tue, Jun 26, 2012 at 10:51 AM, Thilo Uttendorfer wrote: >> Libvirt configuration or qemu/kvm command line please? (don't post >> them here; pastebin and share the URL instead). > > thanks for your help: > http://pastebin.com/mZxaJr5a OK, can we see the DRBD config too, please? Cheers, Florian

Re: [DRBD-user] Concurrent local write detected

2012-06-25 Thread Florian Haas
On Fri, Jun 22, 2012 at 6:53 PM, Thilo Uttendorfer wrote: > Hi, > > we have a KVM cluster with each virtual machine running on top of a DRBD > device. After running this setup for many month we recently saw these log > messages on two (of total 15) DRBD devices: > > Jun 22 17:53:01 server-v1 kerne

Re: [DRBD-user] upgrade drbd 8.3.7 -> 8.3.11

2012-06-12 Thread Florian Haas
On 06/12/12 07:35, Marcel Kraan wrote: > Is this not a split brain? No. Hence, the rest of your suggestions are moot, sadly. Cheers, Florian -- Need help with High Availability? http://www.hastexo.com/now ___ drbd-user mailing list drbd-user@lists.li

Re: [DRBD-user] Performance regression with DRBD 8.3.12 and newer

2012-06-11 Thread Florian Haas
On 06/11/12 23:00, Matthias Hensler wrote: > On Mon, Jun 11, 2012 at 10:31:16PM +0200, Florian Haas wrote: >> On 06/11/12 22:14, Matthias Hensler wrote: >>> Indeed, the problem lies within the kernel version used to build the >>> drbd.ko module. I double checked by us

Re: [DRBD-user] Performance regression with DRBD 8.3.12 and newer

2012-06-11 Thread Florian Haas
On 06/11/12 22:14, Matthias Hensler wrote: > On Mon, Jun 11, 2012 at 06:35:18PM +0200, Matthias Hensler wrote: >> [...] >> I checked the changelog for 8.3.12, but nothing obviously struck me. >> Also diffing the sourcetrees 8.3.11->8.3.12 I did not find any >> obvious. > > Let me follow up on this

Re: [DRBD-user] drbd 8.3.12 not writing meta data

2012-06-08 Thread Florian Haas
On Fri, Jun 8, 2012 at 4:50 PM, Dirk wrote: > Sorry for the nonsense. > > I have pasted drbd.conf, global_common.conf and resource.res into > > http://pastebin.com/PtZYbWnA OK. Nothing unusual in that config (except the usual warnings that apply for dual-Primary configurations). Follow Lars' adv

Re: [DRBD-user] drbd 8.3.12 not writing meta data

2012-06-06 Thread Florian Haas
On Wed, Jun 6, 2012 at 9:28 PM, Dirk wrote: > Hi folks, > > on a freshly installed server pair I am activating 3 drbd volumes each. > The first two go without problem, the third returns >> >> [root@server-01 ~]# drbdadm create-md RESOURCE >> Writing meta data... >> initializing activity log >> NOT

Re: [DRBD-user] Three-way replication setup

2012-06-05 Thread Florian Haas
On Tue, Jun 5, 2012 at 8:34 PM, Arnold Krille wrote: > Hi, > > > Luca Fornasari schrieb: >>I have already setup a two  node HA Proxmox cluster configuring as per >> >>http://pve.proxmox.com/wiki/DRBD >>So I have a primary/primary DRB

Re: [DRBD-user] Performance hit DRBD vs raw block device, even when disconnected

2012-06-04 Thread Florian Haas
On Mon, Jun 4, 2012 at 2:17 PM, Wiebe Cazemier wrote: > - Original Message - >> From: "Lars Ellenberg" >> To: "Wiebe Cazemier" >> Cc: drbd-user@lists.linbit.com >> Sent: Monday, 4 June, 2012 11:29:20 AM >> Subject: Re: [DRBD-user] Performance hit DRBD vs raw block device, even when >> d

Re: [DRBD-user] Question on how DRBD connects to peer on a floating disk configuration

2012-06-01 Thread Florian Haas
On Thu, May 31, 2012 at 3:24 AM, David Coulson wrote: > No, it explicitly uses the IPs and ports defined in your configuration for > the resource. > > What exactly do you mean by 'floating IP setup'? This: http://www.drbd.org/users-guide-8.3/s-floating-peers.html Which links to: http://www.drbd.

Re: [DRBD-user] Split Brain due to 'diskless' state with pacemaker/heartbeat

2012-06-01 Thread Florian Haas
On 06/01/12 18:22, Lars Ellenberg wrote: > There is one improvement we could make in DRBD: > call the fence-peer handler not only for connection loss, > but also for peer disk failure. That sounds like a good and simple idea to me. >>> Alternitively, a constraint in pacemaker on diskless state un

Re: [DRBD-user] Performance hit DRBD vs raw block device, even when disconnected

2012-06-01 Thread Florian Haas
On Fri, Jun 1, 2012 at 6:14 PM, Wiebe Cazemier wrote: > Hi, > > I'm setting up a DRBD system and I don't really understand the performance > hits I'm getting. > > I have two hosts (with dedicated GB LAN): > > Host1: 3Ware 9650 SE SATA2 hardware RAID6, 5x 2TB disks > Host2: 2x250GB Linux MD RAID1.

Re: [DRBD-user] Split Brain due to 'diskless' state with pacemaker/heartbeat

2012-06-01 Thread Florian Haas
On 06/01/12 11:16, Philip Gaw wrote: > 1. Secondary goes into diskless state due to a broken array (as > expected) while primary is being written to > 2. Primary then dies (power failure) > 3. Secondary gets rebooted, or dies and comes back online etc. > > The secondary will become primary - and a

Re: [DRBD-user] Problem between two nodes but only on 1 ressource

2012-05-29 Thread Florian Haas
> Somebody have a way for me?! http://www.drbd.org/users-guide-8.3/s-resolve-split-brain.html Cheers, Florian -- Need help with High Availability? http://www.hastexo.com/now ___ drbd-user mailing list drbd-user@lists.linbit.com http://lists.linbit.com

Re: [DRBD-user] Rescue after reduce :(

2012-05-25 Thread Florian Haas
On Fri, May 25, 2012 at 11:08 AM, Felix Frank wrote: > On 05/25/2012 11:03 AM, Florian Haas wrote: >>>> fsck of your /dev/drbdX device, I hope? >>>> >> >>> > >>> > Yes, on the /dev/drbd0. Otherwise fsck would have thrown errors, right

Re: [DRBD-user] Rescue after reduce :(

2012-05-25 Thread Florian Haas
On Fri, May 25, 2012 at 9:37 AM, Christian Völker wrote: > Hi >>> Steps I did: >>> - fsck -f (ext3) >> >> fsck of your /dev/drbdX device, I hope? >> > > Yes, on the /dev/drbd0. Otherwise fsck would have thrown errors, right? Only if the filesystem had errors. :) You can always pretty much do anyt

Re: [DRBD-user] Rescue after reduce :(

2012-05-25 Thread Florian Haas
On Fri, May 25, 2012 at 8:32 AM, Christian Völker wrote: > Hi, > > I've a drbd device (8.3) on both sides on aLVM volume. > > I tried to reduce the device now. Steps I did: > - fsck -f (ext3) fsck of your /dev/drbdX device, I hope? > - reduced filesystem to 1,400G > - drbdadm -- --new-size=1450G

Re: [DRBD-user] "PingAck not received" messages

2012-05-24 Thread Florian Haas
On Thu, May 24, 2012 at 3:09 PM, Matthew Bloch wrote: >>> We are preparing to jump to a 2.6.32 sourced from CentOS because this >>> Debian >>> kernel seems to crash with one bug or another every few months. >> >> That would seem like an odd thing to do. FWIW, we've been running >> happily on squee

Re: [DRBD-user] "PingAck not received" messages

2012-05-24 Thread Florian Haas
On Thu, May 24, 2012 at 1:53 PM, Matthew Bloch wrote: > Hmm, thanks. Unrelated to any of this, the v3a kernel (Debian 2.6.32-4) > crashed pretty badly 48hrs ago. Since it has been rebooted - there have > been no "PingAck not received" messages. Sure, if you have kernel-induced network problems

Re: [DRBD-user] DRBD initial settings for two disks

2012-05-24 Thread Florian Haas
On Thu, May 24, 2012 at 1:25 PM, Tero Mäntyvaara wrote: > I was reading document > https://help.ubuntu.com/10.04/serverguide/drbd.html. Would the following > configuration be correct with two disks? > > |/etc/drbd.conf|: > resource r0 { >  volume 0 { >    device    /dev/drbd1; >    disk      /dev/

Re: [DRBD-user] Recovering from erroneous sync state

2012-05-23 Thread Florian Haas
On Wed, May 23, 2012 at 10:34 PM, Zev Weiss wrote: > > On May 23, 2012, at 3:22 PM, Florian Haas wrote: > >> On Wed, May 23, 2012 at 10:14 PM, Zev Weiss wrote: >>> Hi, >>> >>> I'm running DRBD 8.3.12, and recently hit what looks to me like a bug tha

Re: [DRBD-user] Recovering from erroneous sync state

2012-05-23 Thread Florian Haas
On Wed, May 23, 2012 at 10:14 PM, Zev Weiss wrote: > Hi, > > I'm running DRBD 8.3.12, and recently hit what looks to me like a bug that > was listed as fixed in 8.3.13 -- getting into a state where both nodes are in > SyncSource (it's just stuck like that, going nowhere).  Luckily this happened

Re: [DRBD-user] "PingAck not received" messages

2012-05-23 Thread Florian Haas
On Tue, May 22, 2012 at 5:05 PM, Matthew Bloch wrote: > I'm not using drbdadm and the helper, my "pairvm" script manages DRBD > for VMs using this command to attach the disc: > >      drbd.setup("disk", drbd_backing_device, drbd_meta_device, 0) > > and these commands to connect the network, pretty

Re: [DRBD-user] "PingAck not received" messages

2012-05-22 Thread Florian Haas
On Tue, May 22, 2012 at 12:45 PM, Matthew Bloch wrote: > On 22/05/12 08:16, Felix Frank wrote: >> On 05/21/2012 06:56 PM, Matthew Bloch wrote: >>> Thanks for the accounts Pascal and Felix, though Felix I'm pretty >>> certain Debian/lenny's kernel had a virtio bug that does cause its >>> network to

Re: [DRBD-user] Reasons not to use allow-two-primaries with DRDB

2012-05-21 Thread Florian Haas
On Fri, May 18, 2012 at 6:29 PM, kare...@gmail.com wrote: > Hello, > > I am in the process of setting up DRBD on my servers, the network > bandwidth being the bottleneck.  After having evaluated GlusterFS I > realised, that I need the instant read access offered by DRBD. Out of curiosity (even th

Re: [DRBD-user] "PingAck not received" messages

2012-05-20 Thread Florian Haas
Matthew, On Wed, May 16, 2012 at 10:11 PM, Matthew Bloch wrote: > I'm trying to understand a symptom for a client who uses drbd to run > sets of virtual machines between three pairs of servers (v1a/v1b, > v2a/v2b, v3a/v3b), and I wanted to understand a bit better how DRBD I/O > is buffered depend

Re: [DRBD-user] NFS not starting with heartbeat

2012-05-16 Thread Florian Haas
On Wed, May 16, 2012 at 10:36 PM, Matt Graham wrote: > From: Lars Ellenberg > [snippage] >> May I ask why you chose to use heartbeat haresource mode instead of >> RHCS or Pacemaker, or any other potential candidate for the job? >> >> Just curious here. I'm trying to figure out how common it is no

Re: [DRBD-user] Drbd many blocks out of sync

2012-05-16 Thread Florian Haas
On Wed, May 16, 2012 at 3:02 PM, Vladimir Kuklin wrote: > Hi, Lars > > Thank you for the  response. > > The proposed solution does not help. How about we back up a bit and ask Question Number One. Vladimir, what filesystem are you running on your dual-Primary DRBD? Florian -- Need help with Hi

Re: [DRBD-user] servers out of sync

2012-05-13 Thread Florian Haas
On Sun, May 13, 2012 at 3:25 PM, Marcel Kraan wrote: > i don't get it synced again. > they are now both stand alone? > i can ping them both. > > don't  have any options left. Yes you do. http://www.drbd.org/users-guide-8.3/s-resolve-split-brain.html Googling this log message would have led you

Re: [DRBD-user] ib or 10gbe?

2012-04-30 Thread Florian Haas
Hi James, On Mon, Apr 30, 2012 at 1:11 PM, James Harper wrote: > I'm considering configurations for a pair of new servers - a 2 node Xen > cluster with shared storage. > > It looks like I can build a HP server with direct connected 10gbe or ib for > approximately the same price. Given the choic

Re: [DRBD-user] Three node cluster?

2012-04-17 Thread Florian Haas
On Tue, Apr 17, 2012 at 3:05 AM, Arnold Krille wrote: > Hi, > > > On 15.04.2012 22:05, Björn Enroth wrote: >> >> I am looking for information of how to deal with a KVM three node cluster >> with DRBD >> I have a "baby machine" ubuntu 11.10 pacemaker/drbd cluster with two >> nodes, >> local disks w

Re: [DRBD-user] How to create WFConnection resource with one DRBD host?

2012-04-07 Thread Florian Haas
On Sat, Apr 7, 2012 at 2:43 PM, Dan Barker wrote: > I need to test that DRBD will peacefully cohabit with Oracle VM. I want to > build a single-node DRBD array and need my resource in WFConnection > Primary/unknown Uptodate/DUnknown. > > Otherwise I need two drbds (I can do that, but it's not germ

Re: [DRBD-user] Sync speed (getting to the limit)

2012-04-06 Thread Florian Haas
On Fri, Apr 6, 2012 at 3:10 AM, Brian R. Hellman wrote: > > > On 04/05/2012 05:58 PM, Marcelo Pereira wrote: >> Hi DRBD masters, >> >> I have been trying to push the sync speed to the limit, as the server >> was offline for a while and I will release it for general use only >> after it's sync'ed.

Re: [DRBD-user] question on recovery from network failure on primary/primary

2012-04-05 Thread Florian Haas
On Thu, Apr 5, 2012 at 8:34 PM, Brian Chrisman wrote: > I have a shared/parallel filesystem on top of drbd dual primary/protocol C > (using 8.3.11 right now). _Which_ filesystem precisely? > My question is about recovering after a network outage where I have a > 'resource-and-stonith' fence hand

Re: [DRBD-user] block: BARRIER is deprecated, use FLUSH/FUA instead

2012-04-05 Thread Florian Haas
On Thu, Apr 5, 2012 at 3:30 PM, Lars Ellenberg wrote: > On Thu, Apr 05, 2012 at 08:56:52AM +, Maurits van de Lande wrote: >> >Well you've mentioned here you're already working with Linbit, so >> >what did they say about this? Surely the same issue would appear >on >> >vmhost6a and vmhost6b who

Re: [DRBD-user] Question before upgrade

2012-04-05 Thread Florian Haas
On Thu, Apr 5, 2012 at 11:53 AM, Erik Schwalbe wrote: > > Hi, > Is our drbd status ok?? >From the information you've given, yes. But what is it that prompts you to upgrade to 8.4 at this point? Florian -- Need help with High Availability? http://www.hastexo.com/now _

Re: [DRBD-user] block: BARRIER is deprecated, use FLUSH/FUA instead

2012-04-05 Thread Florian Haas
On Thu, Apr 5, 2012 at 10:56 AM, Maurits van de Lande wrote: >>Well you've mentioned here you're already working with Linbit, so what did >>they say about this? Surely the same issue would appear >on vmhost6a and >>vmhost6b whose configurations you've previously posted here > Yes the same issue

Re: [DRBD-user] block: BARRIER is deprecated, use FLUSH/FUA instead

2012-04-05 Thread Florian Haas
On Thu, Apr 5, 2012 at 10:15 AM, Maurits van de Lande wrote: > When I start drbd 8.3.12 (elrepo build) I get the same message, this is my > drbd.conf content. I also included the bug report Well you've mentioned here you're already working with Linbit, so what did they say about this? Surely the

Re: [DRBD-user] block: BARRIER is deprecated, use FLUSH/FUA instead

2012-04-05 Thread Florian Haas
On Wed, Apr 4, 2012 at 10:29 PM, Ryan Shannon wrote: > Hi folks, > > We are running a file-server cluster using centos6, drbd, corosync, and > pacemaker. Each time drbd is started, a kernel oops is triggered (see > below). Nope. That's a call trace, but not an oops. > I have the 'no-disk-barrier

Re: [DRBD-user] drbd storage for Oracle VM

2012-04-03 Thread Florian Haas
On Tue, Apr 3, 2012 at 1:36 PM, Dan Barker wrote: >>> Eduardo, thank you for your input, but Virtual Box (workstation >>> product) is a completely different animal than Oracle VM (bare-metal >>> hypervisor). I don't get to choose the underlying storage, and OVM3 choses >>> OCFS2. >>> >>> >>> >>>

Re: [DRBD-user] Hardware-recomendation needed

2012-04-03 Thread Florian Haas
On Tue, Apr 3, 2012 at 12:28 PM, Maurits van de Lande wrote: >>What do the experts think: Should this be sufficient to get the perfomance of >>a single SATA-Disk without DRBD? > > Probably not, nothing will. Beg to differ. The question was, will DRBD pair of SSDs, when replicated with protocol

Re: [DRBD-user] drbd storage for Oracle VM

2012-04-03 Thread Florian Haas
On Tue, Apr 3, 2012 at 12:31 PM, Dan Barker wrote: > Eduardo, thank you for your input, but Virtual Box (workstation product) is > a completely different animal than Oracle VM (bare-metal hypervisor). I > don’t get to choose the underlying storage, and OVM3 choses OCFS2. > > > > I guess I’ll simpl

Re: [DRBD-user] Sync gets stalled at 100%

2012-04-02 Thread Florian Haas
On Mon, Apr 2, 2012 at 2:06 AM, Marcelo Pereira wrote: > Hello, > > I have been trying to sync two DRBD nodes, but it gets stalled at 100%. > > They are only 16Tb big. It has 6Tb of data to be sync'ed. How I know that?? > Well, I have never seen these nodes sync'ed, and I have transferred 6Tb of >

Re: [DRBD-user] compile drbd 8.4.1 kernel 3.3

2012-03-23 Thread Florian Haas
On Fri, Mar 23, 2012 at 4:57 PM, Jose Ildefonso Camargo Tolosa wrote: >> I have error on compiling drbd 8.4.1 with kernel 3.3 > > Any particular reason for using 3.3 now? (I mean, you are just > testing, right?, 3.3 is not stable). Define "stable"? It's definitely been released, a little less tha

[DRBD-user] Quick heads-up in case you see us disappearing from the list (was Re: Switch from MD to DRBD)

2012-03-20 Thread Florian Haas
On Sat, Mar 17, 2012 at 10:01 AM, Florian Haas wrote: > On Sat, Mar 17, 2012 at 4:42 AM, Jake Smith wrote: >> - Original Message - >>> From: "Arnold Krille" >>> To: drbd-user@lists.linbit.com >>> Sent: Friday, March 16, 2012 5:11:59 PM >&

Re: [DRBD-user] missing scripts or do I have to make them by myself?

2012-03-19 Thread Florian Haas
On Mon, Mar 19, 2012 at 6:04 PM, Digimer wrote: > On 03/19/2012 12:33 PM, Carlos Xavier wrote: >> Hi. >> >> We have an old cluster made of OCFS2 running over DRBD using the >> heartbeat protocol. Now we are moving to DRBD + OCFS2 + Pacemaker. >> Following the steps from the user's manual I tried t

Re: [DRBD-user] missing scripts or do I have to make them by myself?

2012-03-19 Thread Florian Haas
On Mon, Mar 19, 2012 at 5:33 PM, Carlos Xavier wrote: > Hi. > > We have an old cluster made of OCFS2 running over DRBD using the heartbeat > protocol. Now we are moving to DRBD + OCFS2 + Pacemaker. > Following the steps from the user's manual I tried to make this > configuration: > > disk { >    

Re: [DRBD-user] Switch from MD to DRBD

2012-03-17 Thread Florian Haas
On Sat, Mar 17, 2012 at 4:42 AM, Jake Smith wrote: > - Original Message - >> From: "Arnold Krille" >> To: drbd-user@lists.linbit.com >> Sent: Friday, March 16, 2012 5:11:59 PM >> Subject: Re: [DRBD-user] Switch from MD to DRBD >> >> On 16.03.2012 14:38, Jake Smith wrote: >> > Florian beat

Re: [DRBD-user] Switch from MD to DRBD

2012-03-16 Thread Florian Haas
On Fri, Mar 16, 2012 at 2:38 PM, Jake Smith wrote: > Florian beat me to it! > > He has those links on speed dial :-) I wrote them. :) Florian -- Need help with High Availability? http://www.hastexo.com/now ___ drbd-user mailing list drbd-user@lists.l

Re: [DRBD-user] Switch from MD to DRBD

2012-03-16 Thread Florian Haas
On Fri, Mar 16, 2012 at 1:32 PM, pierpaolo.fasano wrote: > Hi, > I'm in the middle of switching from a software RAID1 based on MD to another > one based on DRBD but using the same disks. > I've verified that no data loss occurs during this operation, but I'm > wondering if it's possible to avoid

Re: [DRBD-user] Page allocation failure (IPOIB, Infiniband, connected mode)

2012-03-16 Thread Florian Haas
On Wed, Mar 14, 2012 at 7:48 AM, Christian Balzer wrote: > Hello, > > This is basically a repeat of: > http://lists.linbit.com/pipermail/drbd-user/2011-August/016758.html > > 32GB RAM, Debian Squeeze, 3.2 (debian backport) kernel, 8.3.12 DRBD, > IPOIB in connected mode with a 64k MTU. Just 2 DRBD

Re: [DRBD-user] drbd kernel BUG: unable to handle kernel NULL pointer dereference at 0000000000000038

2012-03-16 Thread Florian Haas
On Fri, Mar 16, 2012 at 10:36 AM, France wrote: > Hi, > > i'm hitting a bug in drbd, with latest CentOs and drbd 8.3.12 using GFS2 on > top with cman and rgmanager. > > Here is the simplest method to have it occur. > 1. Start drbd on node s2 > 2. Start drbd on node s3 > They sync up: > [root@s3 ~]

Re: [DRBD-user] DRBD uses a wrong interface.

2012-03-07 Thread Florian Haas
On Wed, Mar 7, 2012 at 7:41 AM, Digimer wrote: > On 03/06/2012 05:39 PM, Ivan Pavlenko wrote: > I think you are confusing totem (corosync)'s communication (the log file > entries and DRBD. > > Can you paste your cluster config to confirm? Just to clarify, what's meant here is either your corosync

Re: [DRBD-user] Resync stalled at 100%

2012-03-07 Thread Florian Haas
On Tue, Mar 6, 2012 at 8:59 PM, Marcelo Pereira wrote: > Hello guys, > > I have been trying to resync a pair of servers and working on this for the > last two weeks. Lots of effort considering you're using a long-obsolete DRBD version. Upgrade to at least 8.3.11 and see if your issues persist. C

Re: [DRBD-user] Kernel hung on DRBD / MD RAID

2012-03-05 Thread Florian Haas
On Mon, Mar 5, 2012 at 11:45 PM, Andreas Bauer wrote: > I can share an observation: > > (Disclaimer: my knowledge of the Linux I/O stack is very limited) > > Kernel 3.1.0, DRBD 8.3.11, DRBD->LVM->MD-RAID1->SATA DISKS > (Disks use CFQ scheduler) > > issue command: drbdadm verify all > (with combine

Re: [DRBD-user] Off-site Quorum Provider?

2012-03-02 Thread Florian Haas
On Fri, Mar 2, 2012 at 11:21 PM, Digimer wrote: > On 03/02/2012 04:41 PM, Robinson, Eric wrote: >> We have two geographically separate data centers connected by 4 x >> Gigabit links (in 2 trunks). Our HA clusters are distributed between the >> data centers, with each node of a 2-node cluster in a

Re: [DRBD-user] Pacemaker + Dual Primary, handlers and fail-back issues

2012-03-01 Thread Florian Haas
On Thu, Mar 1, 2012 at 6:12 PM, Daniel Grunblatt wrote: > Andreas, Lars, > > Thanks much for the quick response. > > I made the changes. > [lots of stuff] > > And here's what happened: > [lots of stuff] Ever heard of pastebin? > /sbin/drbdadm fence-peer minor-3 exit code 126 (0x7e00) > Mar  1 13

Re: [DRBD-user] Downgrading 8.4.1 to 8.3.12

2012-02-29 Thread Florian Haas
On Wed, Feb 29, 2012 at 12:32 AM, William Seligman wrote: > Is there any way of downgrading 8.4.1 to 8.3.12 without erasing the drbd > partition? As this question has popped up a few times on the list and on IRC recently, I've taken the liberty to add this to the "Hints and Kinks" section on our

Re: [DRBD-user] flashcache + drbd + LVM2 + GFS2 + KVM live migration -> data corruption

2012-02-24 Thread Florian Haas
On 02/24/12 10:22, Maurits van de Lande wrote: >> One more thing; your libvirt configuration for that KVM instance (virsh >> dumpxml ) would be helpful too. > > done Thanks. I was looking for the block device caching configuration, but that part looks fine. Cheers, Florian -- Need help with H

Re: [DRBD-user] flashcache + drbd + LVM2 + GFS2 + KVM live migration -> data corruption

2012-02-24 Thread Florian Haas
One more thing; your libvirt configuration for that KVM instance (virsh dumpxml ) would be helpful too. Cheers, Florian -- Need help with High Availability? http://www.hastexo.com/now ___ drbd-user mailing list drbd-user@lists.linbit.com http://lists.l

Re: [DRBD-user] flashcache + drbd + LVM2 + GFS2 + KVM live migration -> data corruption

2012-02-24 Thread Florian Haas
On 02/23/12 12:15, Maurits van de Lande wrote: >> You're having us rely on crystal balls at this time. Feed us more >> information and maybe someone will be able to help out. > Okay, thanks for your reply. I didn't know how detailed the post should have > been. I'm sorry for the inconvenience. >

Re: [DRBD-user] flashcache + drbd + LVM2 + GFS2 + KVM live migration -> data corruption

2012-02-22 Thread Florian Haas
Maurits, Just reposting here isn't going to help. Neither in your original post nor in the bug report you reference did you mention anything other than that you got a "corrupted" VM. In what way? What was your _complete_ DRBD configuration? (You gave us only an obviously incomplete snippet). What

Re: [DRBD-user] Procedure to migrate from 8.4.1 to 8.3.11

2012-02-21 Thread Florian Haas
On Mon, Feb 20, 2012 at 11:58 PM, Adam Sobieraj wrote: > Hi, > > I have question about procedure do downgrade drbd from 8.4.1 to 8.3.11, > what is the simple way? > > I want do that because i have problems with 8.4.1 on 3.2.6 kernel. Are you using a self-built out-of-tree DRBD, or the one that sh

Re: [DRBD-user] missing /dev/drbd0

2012-02-18 Thread Florian Haas
On Sat, Feb 18, 2012 at 2:04 AM, K.Radacki wrote: > Dear all, > yesterday I had to turn off for an hour all our servers (replacement of > emergency power generator and simulation of failure). > After reboot the main server "lost" all 3 internet interfaces. For that > reason DRBD failed to start. A

Re: [DRBD-user] drbd stuck in WFbitmaps state in WAN link

2012-02-18 Thread Florian Haas
On Sat, Feb 18, 2012 at 3:58 PM, papu bhattacharya wrote: > Hi All, > I have successfully setup a DRBD setup between two server in my lan setup. > The speed is 100 Mbps.  The next staage was to do a DR setup with wan link. > The WAN link speed is 2Mbps.  In wan link , i see the DRBD is stuck in >

Re: [DRBD-user] drbd + flashcache

2012-02-12 Thread Florian Haas
On Sat, Feb 11, 2012 at 7:30 PM, Pascal BERTON wrote: > Florian, > > I've watched your video, very interesting indeed... Unfortunately for you it > raised some more questions to me :-) > First, just to be sure I correctly understood : flashcache will only (But > significantly) improve read operati

Re: [DRBD-user] drbd + flashcache

2012-02-10 Thread Florian Haas
On Fri, Feb 10, 2012 at 3:38 PM, Maurits van de Lande wrote: > Hello, > > I'm exploring the features of flashcache see: > http://www.github.com/facebook/flashcache and I like it's possibilities. > Fortunately toracat from elrepo has been so kind to create an el6 package. > http://elrepo.org/bugs

Re: [DRBD-user] Seeking extra info on loopback mounted devices

2012-02-09 Thread Florian Haas
On Thu, Feb 9, 2012 at 12:08 PM, Justin Cattle wrote: > Is this still the case - DRBD 8.3 (or even 8.4) using Linux 3.1 ? http://lists.linbit.com/pipermail/drbd-user/2011-May/016009.html Hope this helps. Cheers, Florian -- Need help with High Availability? http://www.hastexo.com/now _

Re: [DRBD-user] Any DRBD 8.3 package that work with kernel version 2.6.16.60-0.54.5-smp

2012-02-09 Thread Florian Haas
On Thu, Feb 9, 2012 at 9:31 AM, Felix Frank wrote: > Hi, > > what Linux distribution is this? I think many distributors build > separate kernel module packages for DRBD, so you may install this as well. > > For SUSE, it should have "kmp" somewhere in its name. IIRC, that's a SLES 10 kernel, and a

Re: [DRBD-user] Full resync after reboot

2012-02-02 Thread Florian Haas
On 02/02/12 21:29, Richard Baverstock wrote: > I found the checksum based sync last night after sending the email - > thanks for pointing it out though. > > I'm doing the sync now. Is there any way to see what drbd is able to > match via the checksums? There's a log message, when the resync compl

Re: [DRBD-user] can't mount on secondary node

2012-02-01 Thread Florian Haas
d, Feb 1, 2012 at 10:02 AM, Lawrence Strydom wrote: > Hi List. > > I am having trouble mounting my drbd device on the seccondary node.  It > mounts fine on the primary but when i try and mount it on the seccondary I > get this error: > > mount /dev/drbd0 /var/www > mount: block device /dev/drbd0 i

  1   2   3   4   >