Re: High Availability
On Apr 22, 2015, 12:51 AM, Bron Gondwana br...@fastmail.fm wrote:

> On Wed, Apr 22, 2015, at 02:27 PM, Ciro Iriarte wrote:
> > Interesting, is the use of several instances needed because Cyrus cannot scale with threads in a single-instance scenario?
>
> There are two interesting reasons:
>
> 1) global locks. There are some - mailboxes.db for example. If you have multiple instances on a single machine, then a lock never blocks up the entire machine.
>
> 2) replication and load spreading - right now there's no support for a partial replica - a Cyrus instance replicates every mailbox to its replica.
>
> The second one is the kicker. If we replicated everything from one machine to another machine, then we'd have 100% user load on one machine and nothing on the other - not an efficient use of resources, because the second one needs to have the capacity to run at 100% in a failover situation too.
>
> Our first thought was to run two instances per machine and pair them - so there was a master on one and a replica on the other. At least then we're running equally in the general situation, and only in a failover situation are we loaded at 100%. But it's still nasty - you go from 50% load to 100% load.
>
> So we have about 10 different replicas for each machine, and every machine is running at 50% capacity. If we need to take one machine down, then 10 other machines run at 55% capacity instead for that time. The load change is much less.
>
> (As of about a year ago, we're fully paired odd-host-number to even-host-number, and odd and even are in different cabinets, so we can shut down an entire cabinet by raising the load on its replicas.)
>
> Bron.
> --
> Bron Gondwana br...@fastmail.fm

Hi Bron, it makes sense from that perspective, although it seems to imply a management nightmare. Do you use any management/automation ("webscale", if you want) framework?

Regards,

Ciro
Re: High Availability
On Wed, Apr 22, 2015, at 11:32 PM, Ciro Iriarte wrote:
> Hi Bron, it makes sense from that perspective, although it seems to imply a management nightmare. Do you use any management/automation ("webscale", if you want) framework?

Less than you might imagine :)

We have a single file (production.dat) which contains all the layout information, mapping from machines to slot numbers and slot numbers to disks, for example:

    i30 30 t15 0 1000 e 10.202.80.1

Which says that slots sloti30t01 through sloti30t15 are on server number 30, they have a zero-sized meta drive (all meta is on the SSD) and a 1000 Mb sized data drive running an ext4 filesystem, and IP addresses from 10.202.80.1 through 80.15.

And then a store based on that is:

    store23 n 0 90 sloti30t01 sloti15t03 slotti5t02 slotsi2d2t01

That's where my br...@fastmail.fm user lives - it has replicas on imap15 (New York), timap5 (Iceland) and simap2 (Amsterdam). The 'n' says that the master should live in New York, the '0' is a bit bogus actually, as we'll see in a sec, and the 90 says that it has a target maximum disk usage of 90%.

    store254 n future 0 sloti30t15 sloti29t15 slotti1t06 slotsi1d2t40

This is a testing store; only one real user lives here, and that's my personal non-work account. All the other users are test users. The 'future' says that it's running on the future branch of Cyrus, which is where we try out experimental code. This means that all the commands which find the correct binary for tools will look in the correct paths, like this:

    [brong@imap30 ~]$ cyr store254
    Store: store254
      Master:  sloti30t15 (imap30) 10.202.80.15
      Primary: sloti30t15 (imap30) 10.202.80.15
      This:    sloti30t15 (imap30) 10.202.80.15
      Other:   sloti29t15 (imap29) 10.202.79.15
      Other:   slotsi1d2t40 (simap1) 10.206.51.80
      Other:   slotti1t06 (timap1) 10.205.161.6
    sudo -u cyrus /usr/cyrus-future/bin/cyr_dbtool -C /etc/cyrus/imapd-sloti30t15.conf /mnt/ssd30/sloti30t15/store254/conf/mailboxes.db twoskip
    sudo -u cyrus /usr/cyrus-future/bin/reconstruct -C /etc/cyrus/imapd-sloti30t15.conf
    sudo -u cyrus /usr/cyrus-future/bin/dav_reconstruct -C /etc/cyrus/imapd-sloti30t15.conf
    sudo -u cyrus /usr/cyrus-future/bin/cyr_synclog -C /etc/cyrus/imapd-sloti30t15.conf -v
    sudo -u cyrus /usr/cyrus-future/bin/ctl_conversationsdb -C /etc/cyrus/imapd-sloti30t15.conf
    sudo -u cyrus /usr/cyrus-future/bin/squatter -C /etc/cyrus/imapd-sloti30t15.conf -v -i
    sudo -u cyrus /usr/cyrus-future/bin/sync_client -C /etc/cyrus/imapd-sloti30t15.conf -n sloti29t15 -v
    sudo -u cyrus /usr/cyrus-future/bin/sync_client -C /etc/cyrus/imapd-sloti30t15.conf -n slotsi1d2t40 -v
    sudo -u cyrus /usr/cyrus-future/bin/sync_client -C /etc/cyrus/imapd-sloti30t15.conf -n slotti1t06 -v

So I can even run 'cyr br...@fastmail.fm' and it will give me the correct commands to run for my user. If it wasn't heavily automated, it would be a pain. Configuration files are built with Perl Template-Toolkit using Makefiles and data from the production.dat file.

What we don't have so much yet is automated user moves or disk layout building, though it's semi-automated. I have a script which can be told "make config for 5 new stores" and it will find the least used machines, within the constraints we have for placing slots, and pick out empty slots on them. For moving users, 'MultiMove.pl' knows about disk usage on backends and can pick random users on busy backends to move.

Our MoveServer.pl script is very smart; it does what Ken at CMU and now Ellie have done in the upstream branch with sync-based XFER, but externally. It runs sync_client 3 times, plus squatter, plus cyr_expire for archiving, and locks out users in the DB, fiddles caches, etc. The upshot is that the user gets about a 3-second pause and their connections drop, then they keep on working as if nothing happened.

Bron.

--
Bron Gondwana
br...@fastmail.fm
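[Editor's note: for readers who want to see the shape of such automation, here is a minimal, purely hypothetical sketch of a 'cyr'-style wrapper. The paths, the replicas.dat mapping file, and the slot naming are invented for illustration - this is not FastMail's actual tooling.]

    #!/bin/sh
    # Hypothetical "cyr"-style wrapper: given a slot name, print the
    # per-instance commands with the right -C config path. The layout
    # (config paths, binary dir, replicas.dat format) is assumed here.
    SLOT="$1"
    CONF="/etc/cyrus/imapd-${SLOT}.conf"
    BIN="/usr/cyrus/bin"    # a 'future' store would point at /usr/cyrus-future/bin
    echo "sudo -u cyrus $BIN/reconstruct -C $CONF"
    echo "sudo -u cyrus $BIN/squatter -C $CONF -v -i"
    # one sync_client invocation per replica listed for this slot,
    # where replicas.dat has lines of the form: <slot> <replica-slot>
    awk -v s="$SLOT" '$1 == s { print $2 }' /etc/cyrus/replicas.dat |
    while read -r REPLICA; do
        echo "sudo -u cyrus $BIN/sync_client -C $CONF -n $REPLICA -v"
    done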
Re: High Availability
On Apr 20, 2015, 3:55 AM, Bron Gondwana br...@fastmail.fm wrote:

> (taking it back to the list in case it's useful to others)
>
> On Mon, Apr 20, 2015, at 05:45 PM, Lalot Dominique wrote:
> > Hello Bron. Unfortunately I wouldn't be able to go to The Hague..
>
> Oh well :)
>
> > Just as a simple question, the only drawback of not using it is that you won't be able to share folders?
>
> That's the only drawback we have. You can only share folders with users on the same server. For our family/business accounts, we just make sure all users are on the same backend. We run hundreds of servers, and use nginx as a proxy in front of them so we can move users without them knowing or having to update settings.
>
> > Can we have several imap servers without using murder?
>
> Sure. We do a thing we call slots and stores, where we split each machine up into up to 40 separate instances of Cyrus with 1Tb of storage each, replicating to different machines.
>
> > I had only used a simple setup, one imap server with several spools
>
> That works fine, but it won't give you high availability.
>
> > Is there some more information somewhere?
>
> Not much, unfortunately. We've written about our setup many times, most recently here:
>
> http://blog.fastmail.com/2014/12/04/standalone-mail-servers/
>
> and in more detail here:
>
> https://www.fastmail.com/help/technical/architecture.html
>
> But they don't give you quite enough configuration detail to just plug and play. Our plan with the Cyrus Foundation and developing Cyrus 3.0 is to have pre-configured Docker images which you can just run and add storage, and they will work in a cluster. It's very ambitious, and we might not have it fully stable by July when we launch 3.0, but it's definitely the eventual goal. What is your timeframe for setting up this new system?
>
> Regards,
>
> Bron.
> --
> Bron Gondwana br...@fastmail.fm

Interesting, is the use of several instances needed because Cyrus cannot scale with threads in a single-instance scenario?

Regards,

Ciro
Re: High Availability
On Wed, Apr 22, 2015, at 02:27 PM, Ciro Iriarte wrote:
> Interesting, is the use of several instances needed because Cyrus cannot scale with threads in a single-instance scenario?

There are two interesting reasons:

1) global locks. There are some - mailboxes.db for example. If you have multiple instances on a single machine, then a lock never blocks up the entire machine.

2) replication and load spreading - right now there's no support for a partial replica - a Cyrus instance replicates every mailbox to its replica.

The second one is the kicker. If we replicated everything from one machine to another machine, then we'd have 100% user load on one machine and nothing on the other - not an efficient use of resources, because the second one needs to have the capacity to run at 100% in a failover situation too.

Our first thought was to run two instances per machine and pair them - so there was a master on one and a replica on the other. At least then we're running equally in the general situation, and only in a failover situation are we loaded at 100%. But it's still nasty - you go from 50% load to 100% load.

So we have about 10 different replicas for each machine, and every machine is running at 50% capacity. If we need to take one machine down, then 10 other machines run at 55% capacity instead for that time. The load change is much less.

(As of about a year ago, we're fully paired odd-host-number to even-host-number, and odd and even are in different cabinets, so we can shut down an entire cabinet by raising the load on its replicas.)

Bron.

--
Bron Gondwana
br...@fastmail.fm
Re: High Availability
(taking it back to the list in case it's useful to others)

On Mon, Apr 20, 2015, at 05:45 PM, Lalot Dominique wrote:
> Hello Bron. Unfortunately I wouldn't be able to go to The Hague..

Oh well :)

> Just as a simple question, the only drawback of not using it is that you won't be able to share folders?

That's the only drawback we have. You can only share folders with users on the same server. For our family/business accounts, we just make sure all users are on the same backend. We run hundreds of servers, and use nginx as a proxy in front of them so we can move users without them knowing or having to update settings.

> Can we have several imap servers without using murder?

Sure. We do a thing we call slots and stores, where we split each machine up into up to 40 separate instances of Cyrus with 1Tb of storage each, replicating to different machines.

> I had only used a simple setup, one imap server with several spools

That works fine, but it won't give you high availability.

> Is there some more information somewhere?

Not much, unfortunately. We've written about our setup many times, most recently here:

http://blog.fastmail.com/2014/12/04/standalone-mail-servers/

and in more detail here:

https://www.fastmail.com/help/technical/architecture.html

But they don't give you quite enough configuration detail to just plug and play. Our plan with the Cyrus Foundation and developing Cyrus 3.0 is to have pre-configured Docker images which you can just run and add storage, and they will work in a cluster. It's very ambitious, and we might not have it fully stable by July when we launch 3.0, but it's definitely the eventual goal. What is your timeframe for setting up this new system?

Regards,

Bron.

--
Bron Gondwana
br...@fastmail.fm
High Availability
Hi,

We used Cyrus for many years and switched to a proprietary system. We are just now looking back at Cyrus. I would like to know the status of Cyrus and HA. This documentation seems to consider that replication is still bleeding edge:

http://cyrusimap.org/docs/cyrus-imapd/2.4.9/install-replication.php

and it was written in 2007:

"Note that Cyrus replication is still relatively young in the grand scheme of things, and if you choose to deploy you are doing so at your own risk."

Is there documentation somewhere, a howto for HA (proxies, murder and replication)?

Thanks

Dom

--
Dominique LALOT
Ingénieur Systèmes et Réseaux
http://annuaire.univ-amu.fr/showuser.php?uid=lalot
Re: High Availability
On Monday, 20 April 2015, 08:32:52, Lalot Dominique wrote:
> I would like to know the status of Cyrus and HA. This documentation seems to consider that replication is still bleeding edge: http://cyrusimap.org/docs/cyrus-imapd/2.4.9/install-replication.php and it was written in 2007.

Cyrus is a product mainly developed for large-scale / ISP / enterprise-level applications, and in good old internet terms anything is "edge" which has not been deployed over many years in such highly demanding environments with a lot of experience around. But very few commercial / proprietary solutions really had more experience and productive field testing behind them when their marketing called them "stable"...

> "Note that Cyrus replication is still relatively young in the grand scheme of things, and if you choose to deploy you are doing so at your own risk."
>
> Is there documentation somewhere, a howto for HA (proxies, murder and replication)?

For HA alone you are not required to use the new Cyrus-internal technologies - there are still many large-scale Cyrus installations which built their own HA infrastructure / logic with standard or less standard tools / techniques.

But yes, some of the docs are a bit edgy, though over the last years the situation has been improving step by step. I.e. see for murder:

https://cyrusimap.org/docs/cyrus-imapd/2.4.6/install-murder.php
https://cyrusimap.org/mediawiki/index.php/Cyrus_Murder_Design

and even well-known computer magazines have written about setup details (sorry for the German version, but it may exist in the English version of LM too):

http://www.linux-magazin.de/Ausgaben/2007/11/Mailvertreter

hth a bit

cheerioh,
Niels.

--
---
Niels Dettenbach
Syndicat IT Internet
http://www.syndicat.com
PGP: https://syndicat.com/pub_key.asc
---
Re: High Availability
On Mon, Apr 20, 2015, at 04:32 PM, Lalot Dominique wrote:
> Hi, We used Cyrus for many years and switched to a proprietary system. We are just now looking back at Cyrus. I would like to know the status of Cyrus and HA. This documentation seems to consider that replication is still bleeding edge: http://cyrusimap.org/docs/cyrus-imapd/2.4.9/install-replication.php and it was written in 2007: "Note that Cyrus replication is still relatively young in the grand scheme of things, and if you choose to deploy you are doing so at your own risk."

Yeah, that's pretty ancient.

> Is there documentation somewhere, a howto for HA (proxies, murder and replication)? Thanks

So I'll be talking about this in a couple of weeks, if you want to make your way over to The Hague :)

https://conference.kolab.org/kolab-summit/sessions/cyrus-imapd-past-current-and-future

The short version: replication in 2.4/2.5 is very stable. We're using it at FastMail and have been in production for about 5 years now. It doesn't integrate very well with murder yet, though. We don't use murder at FastMail. My plan in the short to medium term is to merge replication/murder into a generally better HA system. I'd be very interested in having test cases :) (as well as FastMail moving to it)

Bron.

--
Bron Gondwana
br...@fastmail.fm
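[Editor's note: for readers who have not set up 2.4-era replication before, a minimal master/replica configuration looks roughly like the sketch below. Hostnames and the sync user are placeholders; check the option set against the install-replication documentation for your version.]

    # master imapd.conf (sketch; hostnames/credentials are placeholders)
    sync_log: 1                        # log changes for rolling replication
    sync_host: replica.example.com
    sync_authname: repluser
    sync_password: secret

    # master cyrus.conf, START section: run a rolling sync client
    syncclient    cmd="sync_client -r"

    # replica cyrus.conf, SERVICES section: accept sync connections
    syncserver    cmd="sync_server" listen="csync"

    # replica imapd.conf: let the sync user administer mailboxes
    admins: repluser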
Re: high-availability Cyrus (i.e. glusterfs)?
On Wed, 29 Sep 2010, Tomasz Chmielewski wrote:

> Hmm - I added this to imapd.conf:
>
>   annotation_db: skiplist
>   duplicate_db: skiplist
>   mboxlist_db: skiplist
>   ptscache_db: skiplist
>   quota_db: skiplist
>   seenstate_db: skiplist
>   tlscache_db: skiplist
>
> When starting cyrus, I have this:
>
>   Sep 29 02:53:48 omega cyrus/master[1089]: process started
>   Sep 29 02:53:48 omega cyrus/ctl_cyrusdb[1090]: recovering cyrus databases
>   Sep 29 02:53:48 omega cyrus/ctl_cyrusdb[1090]: done recovering cyrus databases
>   Sep 29 02:53:49 omega cyrus/cyr_expire[1091]: DBERROR db4: Program version 4.2 doesn't match environment version
>   Sep 29 02:53:49 omega cyrus/cyr_expire[1091]: DBERROR: dbenv->open '/shared/var/lib/cyrus/db' failed: Invalid argument
>   Sep 29 02:53:49 omega cyrus/cyr_expire[1091]: DBERROR: init() on berkeley
>   Sep 29 02:53:49 omega cyrus/cyr_expire[1091]: duplicate_prune: pruning back 3 days
>   Sep 29 02:53:49 omega cyrus/cyr_expire[1091]: duplicate_prune: purged 0 out of 0 entries
>   Sep 29 02:53:49 omega cyrus/cyr_expire[1091]: expunged 0 out of 0 messages from 0 mailboxes
>   Sep 29 02:53:49 omega cyrus/tls_prune[1092]: tls_prune: purged 0 out of 0 entries
>   Sep 29 02:53:49 omega cyrus/master[1089]: ready for work
>   Sep 29 02:53:49 omega cyrus/ctl_cyrusdb[1093]: checkpointing cyrus databases
>   Sep 29 02:53:49 omega cyrus/ctl_cyrusdb[1093]: done checkpointing cyrus databases
>
>   # file /shared/var/lib/cyrus/db/*
>   /shared/var/lib/cyrus/db/__db.001: data
>   /shared/var/lib/cyrus/db/__db.002: data
>   /shared/var/lib/cyrus/db/__db.003: data
>   /shared/var/lib/cyrus/db/__db.004: data
>   /shared/var/lib/cyrus/db/__db.005: data
>   /shared/var/lib/cyrus/db/log.01: Berkeley DB (Log, version 8, native byte-order)
>   /shared/var/lib/cyrus/db/skipstamp: data
>
> The error and the Berkeley DB log file are there even if I empty this directory and start Cyrus. Did I miss some value in imapd.conf?

Cyrus is always linked with Berkeley DB, so it always tries to init the Berkeley DB environment. Even with all your backends set to skiplist, you'll still see the Berkeley DB log files in {configdir}/db/. You can safely ignore them.

I'm not sure why you still get Berkeley DB errors when starting Cyrus. I have converted everything to skiplist, and I do not get those errors.

Andy
Re: high-availability Cyrus (i.e. glusterfs)?
On 28 Sep 2010, at 08:50, Tomasz Chmielewski wrote:

> Sep 28 01:10:10 omega cyrus/ctl_cyrusdb[21728]: DBERROR db4: Program version 4.2 doesn't match environment version

Are you sure that on each node the _SAME_ Cyrus version, linked to the _SAME_ bdb libs, is running?

And - just a little side note - you can dump bdb in favor of skiplist... I bet you'll have much less trouble in your cluster environment setup.

Pascal
Re: high-availability Cyrus (i.e. glusterfs)?
--On 28 September 2010 08:50:00 +0200 Tomasz Chmielewski man...@wpkg.org wrote:

> How do you manage your Cyrus installations highly-available?

Check the archives. There have been many discussions regarding this.

> I thought a minimal example could be like below:
>
>   internet
>      |
>   server1 - server2
>
> There would be Heartbeat/Pacemaker running on both servers. Its role would be:
>
> - assign the Cyrus IP to a given server,
> - start Cyrus where the Cyrus IP is up.
>
> Still, we need to have the Cyrus database and mail storage accessible for both servers. I thought using glusterfs for it would be a good idea (assuming Cyrus only runs on one of the servers at a given time).

We use a similar setup with standard ext3 file systems that are mounted and unmounted as needed; in our case that's done by the RHEL 3 Cluster Suite. That's been working great for almost 6 years now.

--
.:.Sebastian Hagedorn - RZKR-R1 (Gebäude 52), Zimmer 18.:.
.:.Regionales Rechenzentrum (RRZK).:.
.:.Universität zu Köln / Cologne University - ✆ +49-221-478-5587.:.
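[Editor's note: a failover stack of this shape is usually expressed as a Pacemaker resource group. The following crm-shell sketch is illustrative only - the IP, device, mount point and init script name are placeholders, not either poster's configuration.]

    # Pacemaker "crm configure" sketch for active/passive Cyrus failover.
    # All names, addresses and devices here are placeholders.
    primitive cyrus_ip ocf:heartbeat:IPaddr2 \
        params ip=192.0.2.10 cidr_netmask=24
    primitive cyrus_fs ocf:heartbeat:Filesystem \
        params device=/dev/sdb1 directory=/var/spool/cyrus fstype=ext3
    primitive cyrus_svc lsb:cyrus-imapd
    # start in order: filesystem, then IP, then the daemon; stop in reverse
    group cyrus_grp cyrus_fs cyrus_ip cyrus_svc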
Re: high-availability Cyrus (i.e. glusterfs)?
On 28.09.2010 09:13, Pascal Gienger wrote:
> On 28 Sep 2010, at 08:50, Tomasz Chmielewski wrote:
> > Sep 28 01:10:10 omega cyrus/ctl_cyrusdb[21728]: DBERROR db4: Program version 4.2 doesn't match environment version
>
> Are you sure that on each node the _SAME_ Cyrus version, linked to the _SAME_ bdb libs, is running?

100% sure. If I copy everything off glusterfs to a local filesystem, Cyrus doesn't report any errors.

> And - just a little side note - you can dump bdb in favor of skiplist... I bet you'll have much less trouble in your cluster environment setup.

Yep, I found it could more or less be some mmap problem with BDB.

Is there a way to convert the existing BDB databases to skiplist? Or to initialize empty skiplist databases for Cyrus?

--
Tomasz Chmielewski
http://wpkg.org
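[Editor's note: Cyrus ships a cvt_cyrusdb tool for exactly this conversion. A typical invocation, with Cyrus stopped, looks something like the sketch below; the paths and database file names are placeholders that must be adjusted to your layout and imapd.conf.]

    # stop cyrus first, then convert each database
    cvt_cyrusdb /var/lib/cyrus/mailboxes.db berkeley \
                /var/lib/cyrus/mailboxes.db.skiplist skiplist
    mv /var/lib/cyrus/mailboxes.db.skiplist /var/lib/cyrus/mailboxes.db
    # repeat for annotations.db, deliver.db, tls_sessions.db, etc.,
    # then set the matching *_db: skiplist entries in imapd.conf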
Re: high-availability Cyrus (i.e. glusterfs)?
Quoting Tomasz Chmielewski man...@wpkg.org:

> How do you manage your Cyrus installations highly-available? I thought a minimal example could be like below:
>
>   internet
>      |
>   server1 - server2
>
> There would be Heartbeat/Pacemaker running on both servers. Its role would be:
>
> - assign the Cyrus IP to a given server,
> - start Cyrus where the Cyrus IP is up.
>
> Still, we need to have the Cyrus database and mail storage accessible for both servers. I thought using glusterfs for it would be a good idea (assuming Cyrus only runs on one of the servers at a given time).

Cyrus depends on locks and mmap, so your fs must support them. I had written a summary of the discussions about Cyrus and HA in the old wiki, but the wiki was replaced by the new wiki. I will have a look to see if I have a copy.

If you plan to run in active-passive mode, did you consider Cyrus replication? You will need twice the disk space, but you remove a single point of failure (glusterfs).

Regards,

    Michael Menge

--
M. Menge                        Tel.: (49) 7071/29-70316
Universität Tübingen            Fax.: (49) 7071/29-5912
Zentrum für Datenverarbeitung   mail: michael.me...@zdv.uni-tuebingen.de
Wächterstraße 76
72074 Tübingen
Re: high-availability Cyrus (i.e. glusterfs)?
On 28.09.2010 10:56, Michael Menge wrote:
> Cyrus depends on locks and mmap, so your fs must support them. I had written a summary of the discussions about Cyrus and HA in the old wiki, but the wiki was replaced by the new wiki. I will have a look to see if I have a copy.

I would be grateful.

> If you plan to run in active-passive mode, did you consider Cyrus replication? You will need twice the disk space, but you remove a single point of failure (glusterfs).

Glusterfs is there to avoid a SPOF - the filesystem sits on two servers. So assuming I won't do "rm -rf /gluster-filesystem", it should be quite safe. And it too needs twice the disk space, since it's replicated with glusterfs on both servers.

However, I'm of course open to better alternatives.

I'm running Debian Lenny, which ships with Cyrus 2.2.13 - not sure if Cyrus replication is possible there? I'd like to stick with distro packages, but if a newer Cyrus version provides features which let you do HA without too many hackarounds, I'll consider upgrading.

--
Tomasz Chmielewski
http://wpkg.org
Re: high-availability Cyrus (i.e. glusterfs)?
Quoting Tomasz Chmielewski man...@wpkg.org:

> On 28.09.2010 10:56, Michael Menge wrote:
> > Cyrus depends on locks and mmap, so your fs must support them. I had written a summary of the discussions about Cyrus and HA in the old wiki, but the wiki was replaced by the new wiki. I will have a look to see if I have a copy.
>
> I would be grateful.

I didn't find the wiki text, but here is the thread that was the basis of the wiki text:

http://www.irbs.net/internet/info-cyrus/0611/0279.html

> > If you plan to run in active-passive mode, did you consider Cyrus replication? You will need twice the disk space, but you remove a single point of failure (glusterfs).
>
> Glusterfs is there to avoid a SPOF - the filesystem sits on two servers. So assuming I won't do "rm -rf /gluster-filesystem", it should be quite safe. And it too needs twice the disk space, since it's replicated with glusterfs on both servers.

So there is no difference in disk space, whether glusterfs keeps two copies of each file or you have two Cyrus servers. But with Cyrus replication you don't have the problems with mmap and locking.

It may help not to use BDB for the databases. But I don't know how good skiplist is in 2.2.13 - many skiplist bugs have been fixed in 2.3.x.

> However, I'm of course open to better alternatives. I'm running Debian Lenny, which ships with Cyrus 2.2.13 - not sure if Cyrus replication is possible there? I'd like to stick with distro packages, but if a newer Cyrus version provides features which let you do HA without too many hackarounds, I'll consider upgrading.

Replication was introduced in 2.3.x. There are other features in 2.3.x I don't want to live without (e.g. delayed expunge). There was a discussion on the lists about Debian wanting to upgrade Cyrus; the main problem is the upgrade path (update of the BDB databases).

--
M. Menge                        Tel.: (49) 7071/29-70316
Universität Tübingen            Fax.: (49) 7071/29-5912
Zentrum für Datenverarbeitung   mail: michael.me...@zdv.uni-tuebingen.de
Wächterstraße 76
72074 Tübingen
Re: high-availability Cyrus (i.e. glusterfs)?
On 28.09.2010 11:55, Michael Menge wrote:
> Replication was introduced in 2.3.x. There are other features in 2.3.x I don't want to live without (e.g. delayed expunge). There was a discussion on the lists about Debian wanting to upgrade Cyrus; the main problem is the upgrade path (update of the BDB databases).

Assuming I start with an empty mail pool (no accounts) - how can I trigger the creation of the Cyrus databases (in skiplist format - I assume adding the relevant skiplist info to the config file is not enough)?

--
Tomasz Chmielewski
http://wpkg.org
Re: high-availability Cyrus (i.e. glusterfs)?
On Tue, Sep 28, 2010 at 12:13:14PM +0200, Tomasz Chmielewski wrote:
> On 28.09.2010 11:55, Michael Menge wrote:
> > Replication was introduced in 2.3.x. There are other features in 2.3.x I don't want to live without (e.g. delayed expunge). There was a discussion on the lists about Debian wanting to upgrade Cyrus; the main problem is the upgrade path (update of the BDB databases).
>
> Assuming I start with an empty mail pool (no accounts) - how can I trigger the creation of the Cyrus databases (in skiplist format - I assume adding the relevant skiplist info to the config file is not enough)?

All databases will be created automatically upon use. Just set the type in the config file.

Bron.
Re: high-availability Cyrus (i.e. glusterfs)?
> Still, we need to have the Cyrus database and mail storage accessible for both servers. I thought using glusterfs for it would be a good idea (assuming Cyrus only runs on one of the servers at a given time).

IMO, don't use glusterfs for this. I found it to not even be sufficient for a PHP session store; it'll certainly fall over with IMAP loads.

John

--
John Madden
Sr UNIX Systems Engineer
Ivy Tech Community College of Indiana
jmad...@ivytech.edu
Re: high-availability Cyrus (i.e. glusterfs)?
On 28.09.2010 15:01, John Madden wrote:
> > Still, we need to have the Cyrus database and mail storage accessible for both servers. I thought using glusterfs for it would be a good idea (assuming Cyrus only runs on one of the servers at a given time).
>
> IMO, don't use glusterfs for this. I found it to not even be sufficient for a PHP session store; it'll certainly fall over with IMAP loads.

Any other suggestions? There are alternatives like Ceph [1], but it is just too new (and can potentially have some edge cases). DRBD + GFS/OCFS2 just seems too complex for such a setup.

Other than that, I use glusterfs in several setups, and I don't have any dramatic performance problems with it (still slower than bare metal, of course) - it will depend on workload and expected performance, of course.

[1] http://ceph.newdream.net/about/

--
Tomasz Chmielewski
http://wpkg.org
Re: high-availability Cyrus (i.e. glusterfs)?
> Any other suggestions? There are alternatives like Ceph [1], but it is just too new (and can potentially have some edge cases). DRBD + GFS/OCFS2 just seems too complex for such a setup.

If you're doing failover, you don't need a cluster filesystem. You can use just plain DRBD+ext4 if you don't have real shared storage.

John

--
John Madden
Sr UNIX Systems Engineer
Ivy Tech Community College of Indiana
jmad...@ivytech.edu
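[Editor's note: for reference, a two-node DRBD resource of the kind John describes is only a few lines of drbd.conf. This is a generic 8.x-era sketch with placeholder hostnames, devices and addresses, not a tested configuration.]

    # /etc/drbd.conf sketch: one synchronously replicated block device
    # backing the Cyrus spool; format it with ext4 and let the cluster
    # manager mount it on whichever node is primary.
    resource cyrus {
      protocol C;                # fully synchronous replication
      on server1 {
        device    /dev/drbd0;
        disk      /dev/sdb1;
        address   10.0.0.1:7788;
        meta-disk internal;
      }
      on server2 {
        device    /dev/drbd0;
        disk      /dev/sdb1;
        address   10.0.0.2:7788;
        meta-disk internal;
      }
    }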
Re: high-availability Cyrus (i.e. glusterfs)?
Quoting Tomasz Chmielewski man...@wpkg.org:

> On 28.09.2010 15:01, John Madden wrote:
> > IMO, don't use glusterfs for this. I found it to not even be sufficient for a PHP session store; it'll certainly fall over with IMAP loads.
>
> Any other suggestions? There are alternatives like Ceph [1], but it is just too new (and can potentially have some edge cases). DRBD + GFS/OCFS2 just seems too complex for such a setup. Other than that, I use glusterfs in several setups, and I don't have any dramatic performance problems with it (still slower than bare metal, of course) - it will depend on workload and expected performance, of course.
>
> [1] http://ceph.newdream.net/about/

Most cluster / shared filesystems are good with a few big files. But because of the metadata handling, these filesystems all lose performance if you have many small files - and Cyrus has many small files.

--
M. Menge                        Tel.: (49) 7071/29-70316
Universität Tübingen            Fax.: (49) 7071/29-5912
Zentrum für Datenverarbeitung   mail: michael.me...@zdv.uni-tuebingen.de
Wächterstraße 76
72074 Tübingen
Re: high-availability Cyrus (i.e. glusterfs)?
Hello,

AFAIK, Cyrus needs POSIX file locks and mmap support. GlusterFS needs FUSE, and FUSE only supports writable mmap'ed files after kernel 2.6.26 or so. Therefore, you need a recent kernel and recent FUSE. Further, you need to tune your configuration extremely finely, as even the most robust clustered filesystems suffer under load over small files. It is their Achilles heel... and Cyrus uses small files and hot-spot files.

We have been evaluating clustered filesystems like GlusterFS, GFS, OCFS2 (and other shared/mirrored alternatives) since 2007, and they are not there yet for such a heavy load profile (small files). GlusterFS is the most elegant, flexible and promising of them. But clustered filesystems are only worth their performance penalty if you need active-active servers.

For such an active-active setup, you may consider using Dovecot, which was designed taking clustered filesystems, shared storage and multiple servers into account. It has four file-locking methods to choose from, for best suitability to a given storage method, and even SQL backends for the mailer-internal DB (not for messages). But Dovecot does not yet support shared folders across multiple backends as Cyrus does. And *this* is a killer feature for us.

If you want an active-passive configuration, it is best to stay away from any clustered filesystem, so as not to pay the heavy performance cost for small files (and another layer of bugs) without REALLY needing the active-active fs sharing. Keep it simple.

Maybe you do not even need real-time, up-to-the-microsecond replication/mirroring or sharing. This allows even simpler and/or more reliable, recoverable or less resource-hungry solutions, as more sync delay is accepted. Low-level (byte or even file) solutions will replicate crashes like BDB corruptions and will slow down your app. Byte, block and file replication also needs REALLY FAST and EXTREMELY LOW LATENCY networks, notably for small files. Answer for yourself: what do you desire? What do you actually need?

You may consider it worthwhile to read some articles to bring some light to the subject. Also, remember that GlusterFS has evolved since these articles were written, and newer versions use somewhat different confs and tuning, which depend on YOUR infrastructure. You will need a translation service for the articles in Brazilian Portuguese - look for the "Translate this page" link near the bottom of each page.

Good luck.

Andre Felipe Machado

[0] http://www.techforce.com.br/news/linux_blog/glusterfs_tuning_small_files
[1] http://www.techforce.com.br/news/linux_blog/lvm_raid_xfs_ext3_tuning_for_small_files_parallel_i_o_on_debian
[2] http://www.techforce.com.br/news/linux_blog/storage_space_for_debian_on_ibm_ds_8300
[3] http://www.techforce.com.br/news/linux_blog/how_to_configure_multipath_debian_centos_for_ibm_ds8300
[4] http://www.techforce.com.br/news/linux_blog/postgresql_ha_p1_5_com_glusterfs
[5] http://www.techforce.com.br/news/linux_blog/postgresql_ha_p1_com_glusterfs
[6] http://www.techforce.com.br/news/media/multimedia/video_1_da_palestra_postgresql_em_alta_disponibilidade_parte_1_usando_sistema_de_arquivos_distribuido_glusterfs
[7] http://www.techforce.com.br/news/linux_blog/red_hat_cluster_suite_debian_etch
[8] http://www.techforce.com.br/news/linux_blog/virtualizacao_e_servico_de_arquivos_em_cluster_ha_com_debian_etch_parte_1
[9] http://www.techforce.com.br/news/linux_blog/virtualizacao_e_servico_de_arquivos_em_cluster_ha_com_debian_etch_parte_2
[10] http://www.techforce.com.br/news/linux_blog/virtualizacao_e_servico_de_arquivos_em_cluster_ha_com_debian_etch_parte_3
[11] http://www.techforce.com.br/news/linux_blog/postgresql_ha_p1_5_com_glusterfs
[12] http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=595370
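[Editor's note: a quick way to sanity-check the writable-mmap requirement on a candidate filesystem is a small probe along these lines; it is purely illustrative, and the test path is a placeholder for a file on the filesystem under test.]

    /* Check that a filesystem supports writable shared mmap (a Cyrus
     * prerequisite). Compile with: cc -o mmap_probe mmap_probe.c */
    #include <fcntl.h>
    #include <stdio.h>
    #include <string.h>
    #include <sys/mman.h>
    #include <unistd.h>

    int main(int argc, char **argv)
    {
        const char *path = argc > 1 ? argv[1] : "/mnt/test/mmap_probe";
        int fd = open(path, O_RDWR | O_CREAT, 0600);
        if (fd < 0) { perror("open"); return 1; }
        if (ftruncate(fd, 4096) < 0) { perror("ftruncate"); return 1; }
        char *map = mmap(NULL, 4096, PROT_READ | PROT_WRITE, MAP_SHARED, fd, 0);
        if (map == MAP_FAILED) { perror("mmap"); return 1; }  /* old FUSE fails here */
        memcpy(map, "ok", 2);                    /* write through the mapping */
        if (msync(map, 4096, MS_SYNC) < 0) { perror("msync"); return 1; }
        munmap(map, 4096);
        close(fd);
        puts("writable shared mmap works");
        return 0;
    }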
Re: high-availability Cyrus (i.e. glusterfs)?
On 28.09.2010 12:55, Bron Gondwana wrote:
> On Tue, Sep 28, 2010 at 12:13:14PM +0200, Tomasz Chmielewski wrote:
> > Assuming I start with an empty mail pool (no accounts) - how can I trigger the creation of the Cyrus databases (in skiplist format - I assume adding the relevant skiplist info to the config file is not enough)?
>
> All databases will be created automatically upon use. Just set the type in the config file.

Hmm - I added this to imapd.conf:

  annotation_db: skiplist
  duplicate_db: skiplist
  mboxlist_db: skiplist
  ptscache_db: skiplist
  quota_db: skiplist
  seenstate_db: skiplist
  tlscache_db: skiplist

When starting cyrus, I have this:

  Sep 29 02:53:48 omega cyrus/master[1089]: process started
  Sep 29 02:53:48 omega cyrus/ctl_cyrusdb[1090]: recovering cyrus databases
  Sep 29 02:53:48 omega cyrus/ctl_cyrusdb[1090]: done recovering cyrus databases
  Sep 29 02:53:49 omega cyrus/cyr_expire[1091]: DBERROR db4: Program version 4.2 doesn't match environment version
  Sep 29 02:53:49 omega cyrus/cyr_expire[1091]: DBERROR: dbenv->open '/shared/var/lib/cyrus/db' failed: Invalid argument
  Sep 29 02:53:49 omega cyrus/cyr_expire[1091]: DBERROR: init() on berkeley
  Sep 29 02:53:49 omega cyrus/cyr_expire[1091]: duplicate_prune: pruning back 3 days
  Sep 29 02:53:49 omega cyrus/cyr_expire[1091]: duplicate_prune: purged 0 out of 0 entries
  Sep 29 02:53:49 omega cyrus/cyr_expire[1091]: expunged 0 out of 0 messages from 0 mailboxes
  Sep 29 02:53:49 omega cyrus/tls_prune[1092]: tls_prune: purged 0 out of 0 entries
  Sep 29 02:53:49 omega cyrus/master[1089]: ready for work
  Sep 29 02:53:49 omega cyrus/ctl_cyrusdb[1093]: checkpointing cyrus databases
  Sep 29 02:53:49 omega cyrus/ctl_cyrusdb[1093]: done checkpointing cyrus databases

  # file /shared/var/lib/cyrus/db/*
  /shared/var/lib/cyrus/db/__db.001: data
  /shared/var/lib/cyrus/db/__db.002: data
  /shared/var/lib/cyrus/db/__db.003: data
  /shared/var/lib/cyrus/db/__db.004: data
  /shared/var/lib/cyrus/db/__db.005: data
  /shared/var/lib/cyrus/db/log.01: Berkeley DB (Log, version 8, native byte-order)
  /shared/var/lib/cyrus/db/skipstamp: data

The error and the Berkeley DB log file are there even if I empty this directory and start Cyrus. Did I miss some value in imapd.conf?

--
Tomasz Chmielewski
http://wpkg.org
High Availability approaches for Cyrus
Greetings all,

I've spent a good deal of time searching the Info-Cyrus archives (and various googled articles) to identify the recommended ways to improve Cyrus availability and reduce disaster recovery time. The two main approaches appear to be Cyrus replication and file system replication using DRBD with Heartbeat/Pacemaker/RHCS. Cyrus replication appears to be the preferred approach, since with DRBD a corrupted file system on the master would be replicated to the slave. I have a few questions.

- Am I missing something? Is there a third approach that is better than Cyrus or file system replication?
- Cyrus replication seems to be used in conjunction with manual failover procedures. Is anyone using Heartbeat, etc. with Cyrus replication?
- We have three Cyrus servers, each with a single large mailstore. Would there be a significant advantage to splitting them into multiple smaller mailstores? We're using Perdition but not Murder / Aggregator.
- Are there any situations where DRBD would be preferred to Cyrus replication?

Thank you for your time.

John

John Simpson
Senior Software Engineer, I.T. Engineering and Operations
Re: High Availability approaches for Cyrus
> - We have three Cyrus servers, each with a single large mailstore. Would there be a significant advantage to splitting them into multiple smaller mailstores? We're using Perdition but not Murder / Aggregator.

Murder rocks, IMO - well worth the learning curve of the setup. If you're going to take the extra step of doing HA for your storage nodes, I think Murder makes even more sense. We deployed our Murder cluster back in November and recently cut off access to our old Cyrus (single-instance, multiple-spool) system, and 6 nodes with FC meta partitions and SATA storage partitions plus a single frontend absolutely rocks for our over 450,000 users (2.6m mailboxes). We don't do HA, but Murder makes it easy to do if needed.

John

--
John Madden
Sr UNIX Systems Engineer
Ivy Tech Community College of Indiana
jmad...@ivytech.edu
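[Editor's note: for orientation, the moving parts of a Murder are one mupdate master plus per-backend and per-frontend settings in imapd.conf. The sketch below uses placeholder hostnames and credentials and should be checked against the install-murder documentation for your version.]

    # on the mupdate master, cyrus.conf SERVICES section:
    mupdate       cmd="mupdate -m" listen=3905 prefork=1

    # on each backend, imapd.conf: push mailbox changes to the master
    mupdate_server: mupdate.example.edu
    mupdate_authname: mupdateuser
    mupdate_password: secret
    proxyservers: proxyuser          # account the frontends log in as

    # on each frontend, imapd.conf: proxy logins to the right backend
    mupdate_server: mupdate.example.edu
    mupdate_authname: mupdateuser
    mupdate_password: secret
    proxy_authname: proxyuser
    proxy_password: secret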
Re: High Availability approaches for Cyrus
Quoting Simpson, John R john_simp...@reyrey.com:

> Greetings all, I've spent a good deal of time searching the Info-Cyrus archives (and various googled articles) to identify the recommended ways to improve Cyrus availability and reduce disaster recovery time. The two main approaches appear to be Cyrus replication and file system replication using DRBD with Heartbeat/Pacemaker/RHCS. Cyrus replication appears to be the preferred approach, since with DRBD a corrupted file system on the master would be replicated to the slave. I have a few questions.
>
> - Am I missing something? Is there a third approach that is better than Cyrus or file system replication?

I don't know of any other.

> - Cyrus replication seems to be used in conjunction with manual failover procedures. Is anyone using Heartbeat, etc. with Cyrus replication?

You could write scripts to do the failover with Heartbeat, but IMHO the reaction time you win by using Heartbeat does not outweigh the risk of a Heartbeat running amok (e.g. split brain).

> - We have three Cyrus servers, each with a single large mailstore. Would there be a significant advantage to splitting them into multiple smaller mailstores? We're using Perdition but not Murder / Aggregator.

Running two active instances of Cyrus would allow you to share the load of a failed server between the two others, instead of one server doing the work of two.

> - Are there any situations where DRBD would be preferred to Cyrus replication?

Cyrus replication is very new, so you have to use a recent version of Cyrus. If you have to use an older version of Cyrus, DRBD might be the only option.

> Thank you for your time.
>
> John
>
> John Simpson
> Senior Software Engineer, I.T. Engineering and Operations

--
M. Menge                        Tel.: (49) 7071/29-70316
Universität Tübingen            Fax.: (49) 7071/29-5912
Zentrum für Datenverarbeitung   mail: michael.me...@zdv.uni-tuebingen.de
Wächterstraße 76
72074 Tübingen
RE: High Availability approaches for Cyrus
-----Original Message-----
From: John Madden [mailto:jmad...@ivytech.edu]
Sent: Monday, March 15, 2010 2:07 PM
To: Simpson, John R
Cc: info-cyrus@lists.andrew.cmu.edu
Subject: Re: High Availability approaches for Cyrus

> > - We have three Cyrus servers, each with a single large mailstore. Would there be a significant advantage to splitting them into multiple smaller mailstores? We're using Perdition but not Murder / Aggregator.
>
> Murder rocks, IMO - well worth the learning curve of the setup. If you're going to take the extra step of doing HA for your storage nodes, I think Murder makes even more sense. We deployed our Murder cluster back in November and recently cut off access to our old Cyrus (single-instance, multiple-spool) system, and 6 nodes with FC meta partitions and SATA storage partitions plus a single frontend absolutely rocks for our over 450,000 users (2.6m mailboxes). We don't do HA, but Murder makes it easy to do if needed.
>
> John

Thank you, John. I'm not a Cyrus expert, but I'll be working with our Cyrus team on this project. I'll definitely bring up Murder.

> --
> John Madden
> Sr UNIX Systems Engineer
> Ivy Tech Community College of Indiana
> jmad...@ivytech.edu
Re: High availability email server...
Well, as far as I know, the mailboxes.db and other databases are only opened and modified by the master process. But I'm not sure here. As your assumption sounds correct, and because this seems to work with a cluster (and I fully believe you here, no question), your assumption regarding the DBs must be somewhat correct.

Thanks! I would be glad if some list member who has in-depth knowledge here could comment!

Best,
Daniel

Andrew Morgan schrieb:
> On Tue, 1 Aug 2006, Daniel Eckl wrote:
> > Well, I don't have cluster knowledge, and so of course I simply believe you that a good cluster system will never have file locking problems. I already stated this below! But how will the cluster affect application-level database locking? That was my primary question, and you didn't address it at all.
> >
> > A database file which is in use is practically always inconsistent until it is closed by the database application. That's why databases can be corrupt after an application crash and have to be reconstructed. When you have two applications changing the same database file, you have a never-ending fight, because every application thinks the database is inconsistent when it's just in use by another application. And every app will try to reconstruct it and so break it for the other app(s). It's like letting two cyrus masters run on the same single node! It will break, in my opinion.
> >
> > Can you shed some light on this subject?
>
> I think the point here is that the situation you describe already occurs all the time on a stand-alone Cyrus server. There are multiple imapd processes accessing the mailboxes.db database concurrently. If you are using Berkeley DB, it has an API to manage concurrent access. I assume the same is true of skiplist and the other backend formats.
>
> I don't know enough about the Berkeley DB internals to explain how it actually works, but it does. :)
>
> Andy
Re: High availability email server...
Hi Scott!

Your statements cannot be correct, for logical reasons. While on the file-locking level you are fully right, Cyrus heavily depends on critical database access where you need application-level database locking. As only one master process can lock the database, a second one either cannot lock the database or just crashes it with simultaneous write access. I didn't try it myself, for obvious reasons...

If that didn't occur to you, then you had incredible luck that there was no situation where both processes wanted to change the same db file simultaneously.

Best,
Daniel

Scott Adkins schrieb:
> Okay, okay, I just can't *NOT* say something here :)
>
> First, I disagree with all the statements below. Cyrus CAN run in an Active/Active mode, you CAN have multiple servers reading and writing to the same files, and clustering IS a good way to achieve HA/DR/BC in a Cyrus environment. Why do I say that? I say that because we have been doing it for many many years. The key is to have a GOOD clustered filesystem technology that does proper file locking while still providing good performance.
>
> For years, we have been running our Cyrus IMAP system on a Tru64 Alpha TruCluster system. Our cluster has 4 members in it, 2 of which serve up Cyrus IMAP/IMSP, and the other 2 accept and deliver mail via Sendmail. All of the servers have access to the full set of files across the cluster and everything just works.
>
> I would like to address a few specific comments listed below:
>
> > GFS, Lustre, and other cluster filesystems do file-level locking; in order to properly read and write to the BDB backend, you'd need DB-level locking, which is not possible from a filesystem.
>
> I wrote a lengthy response to this, but when I got to the conclusion, it all came down to a really simple point. How is having multiple servers any different than a single server? You still have tons of different processes all trying to acquire read/write locks on the same files. There is no one process in Cyrus that opens the database and shares it with all the other processes running under Cyrus. How is this different from an IMAP process running on one server and a different IMAP process running on a different server? It isn't. The end result is that file locking is the most important feature Cyrus has to rely upon... what if you are using the flat file format for your mailboxes.db file? At that point, that is the ONLY thing you can rely upon...
>
> > IMAP is also a stateful connection; depending on how you set up your cluster, some clients might not handle it gracefully (e.g., Pine).
>
> True, true... stateful it is... but at the same time, what kind of problems do you see? When an IMAP connection is opened, the client auths and starts doing stuff with it. When it closes the connection, it is done with it. That is it. If another connection is opened, presumably it is because of a user initiating a new action, and the client will simply do the same thing again... auth and work. Most clients keep at least one connection open to the server at all times. Even if the client has more than one connection, and one connection is on one server and another connection is on another server, there still shouldn't be any problems. The data is the same on the other end.
>
> Incidentally, we have Pine users in our environment that do not have problems with our multi-server clustered Cyrus environment. In fact, we have not seen any client have problems with it. Webmail-based clients are a different animal. It isn't because of the fact that we are running multiple servers in the environment, it is because of the non-stateful nature of the client. Users don't have problems with anything from a data-consistency standpoint, it is simply a problem with performance. It is the same issue faced in a single-server environment. Using some kind of middleware piece to cache IMAP connections is usually how this problem is solved.
>
> > As already said in this thread: Cyrus cannot share its spool. No 2 cyrus instances can use the same spool, databases and lockfiles.
>
> This simply isn't true. However, I must say, there is a reason why NFS shouldn't be used... it doesn't do proper file locking (though I am going to watch for the responses on the NFSv4 thread that somebody asked about). Without proper file locking, even a single Cyrus server on the backend is jeopardized by multiple IMAP processes wanting to write to a single DB at the same time.
>
> > Clustered filesystems don't make any sense for Cyrus, since the application itself doesn't allow simultaneous read/write.
>
> I completely disagree... Clustered filesystems (if they implement proper file-locking techniques) actually SIMPLIFY your setup significantly. You don't have to have a complex Murder/Perdition environment with replication, failover, etc. You simply run 2 or more servers on the clustered filesystem and run things as you would normally expect. Surprisingly, it runs
Re: High availability email server...
Daniel Eckl wrote:
> Hi Scott!
>
> Your statements cannot be correct, for logical reasons. While on the file-locking level you are fully right, Cyrus heavily depends on critical database access where you need application-level database locking. As only one master process can lock the database, a second one either cannot lock the database or just crashes it with simultaneous write access. I didn't try it myself, for obvious reasons...
>
> If that didn't occur to you, then you had incredible luck that there was no situation where both processes wanted to change the same db file simultaneously.

Hi Daniel,

Scott is not just lucky, he's using clustering technology that works. When using a cluster filesystem that works, the locking semantics across cluster nodes will be the same as those on a single-node filesystem. What you say above is simply not correct.

The University of Pittsburgh is also running a 4-node active/active cluster using Veritas Cluster Filesystem, and it works very well. The performance is incredible, and as Scott pointed out, you don't need the complexity of murder or application-level replication. Using a cluster instead of Cyrus murder gives you both scalability and redundancy. The big tradeoff is that Veritas Cluster Filesystem costs money, while Cyrus does not.

Thanks,

Dave
Re: High availability email server...
Well, I don't have cluster knowledge, and so of course I simply believe you that a good cluster system will never have file locking problems. I already stated this below! But how will the cluster affect application-level database locking? That was my primary question, and you didn't address it at all.

A database file which is in use is practically always inconsistent until it is closed by the database application. That's why databases can be corrupt after an application crash and have to be reconstructed. When you have two applications changing the same database file, you have a never-ending fight, because every application thinks the database is inconsistent when it's just in use by another application. And every app will try to reconstruct it and so break it for the other app(s). It's like letting two cyrus masters run on the same single node! It will break, in my opinion.

Can you shed some light on this subject?

Best,
Daniel

Dave McMurtrie schrieb:
> Daniel Eckl wrote:
> > Hi Scott! Your statements cannot be correct, for logical reasons. While on the file-locking level you are fully right, Cyrus heavily depends on critical database access where you need application-level database locking. As only one master process can lock the database, a second one either cannot lock the database or just crashes it with simultaneous write access. I didn't try it myself, for obvious reasons... If that didn't occur to you, then you had incredible luck that there was no situation where both processes wanted to change the same db file simultaneously.
>
> Hi Daniel,
>
> Scott is not just lucky, he's using clustering technology that works. When using a cluster filesystem that works, the locking semantics across cluster nodes will be the same as those on a single-node filesystem. What you say above is simply not correct.
>
> The University of Pittsburgh is also running a 4-node active/active cluster using Veritas Cluster Filesystem, and it works very well. The performance is incredible, and as Scott pointed out, you don't need the complexity of murder or application-level replication. Using a cluster instead of Cyrus murder gives you both scalability and redundancy. The big tradeoff is that Veritas Cluster Filesystem costs money, while Cyrus does not.
>
> Thanks,
>
> Dave
Re: High availability email server...
On Tue, 1 Aug 2006, Daniel Eckl wrote:

> Well, I don't have cluster knowledge, and so of course I simply believe you that a good cluster system will never have file locking problems. I already stated this below! But how will the cluster affect application-level database locking? That was my primary question, and you didn't address it at all.
>
> A database file which is in use is practically always inconsistent until it is closed by the database application. That's why databases can be corrupt after an application crash and have to be reconstructed. When you have two applications changing the same database file, you have a never-ending fight, because every application thinks the database is inconsistent when it's just in use by another application. And every app will try to reconstruct it and so break it for the other app(s). It's like letting two cyrus masters run on the same single node! It will break, in my opinion.
>
> Can you shed some light on this subject?

I think the point here is that the situation you describe already occurs all the time on a stand-alone Cyrus server. There are multiple imapd processes accessing the mailboxes.db database concurrently. If you are using Berkeley DB, it has an API to manage concurrent access. I assume the same is true of skiplist and the other backend formats.

I don't know enough about the Berkeley DB internals to explain how it actually works, but it does. :)

Andy
Re: High availability email server...
Michael--

One of the major problems you'd run into is /var/lib/imap, the config directory. It contains, among other things, a Berkeley DB of information about the mail store. GFS, Lustre, and other cluster filesystems do file-level locking; in order to properly read and write to the BDB backend, you'd need DB-level locking, which is not possible from a filesystem. If you tried putting /var/lib/imap on shared storage, you'd have data corruption and loss in no time.

IMAP is also a stateful connection; depending on how you set up your cluster, some clients might not handle it gracefully (e.g., Pine).

Chris St. Pierre
Unix Systems Administrator
Nebraska Wesleyan University

On Sat, 29 Jul 2006, Michael Menge wrote:

> Hi,
>
> Quoting Pascal Gienger [EMAIL PROTECTED]:
> > I would NEVER suggest mounting the cyrus mail spool via NFS; locking is important, and for these crucial things I like to have a real block device with a real filesystem, so SANs are OK to me.
>
> Does someone use Lustre as a cyrus mail spool? Would it be possible to run cyrus on 2 or more systems with a shared spool, for load balancing and HA, with Lustre?
>
> Michael
RE: High availability email server...
At 11:49 PM +0200 7/28/06, Pascal Gienger wrote: In the Apple case we need to distinguish the Apple XSAN hard disk chassis and the XSAN software. The XSAN software seems to give you a special filesystem for SAN issues (at least I read this on their webpage). Let me dissect this a bit. The Xserve RAID is Apple's RAID appliance box, two non-redundant/failover 7-disk controllers in one box (14 disks total), each with FC connectors connecting to (whatever). The administrative application is a Java app. No Mac OS necessary, no special Apple goo. A group here is using it as raw storage for VMWare (VMFS). Xsan is Apple's licensed implementation of ADIC's StorNext file system. It's its own file system, *NOT* HFS+. StorNext requires a dedicated, private Ethernet network for communicating metadata information between the nodes and controllers. This topology is where it falls down with lots of little files -- lots of little files means more metadata flying between the nodes and controllers; it's inherent to the StorNext design. On the flip side, Apple's Xsan product is a very cheap way to implement a StorNext controller; Apple's client licenses are (relatively) cheap as well, but ADIC will happily sell clients for Linux/AIX/Solaris/other that all work with Xsan. -- Andrew Laurence [EMAIL PROTECTED] Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
At 4:18 PM -0400 7/28/06, John Madden wrote: Sorry, please bear with my ignorance, I'm not very informed about NFS, but what's wrong with locking against a real block device? NFS is a file sharing protocol that doesn't provide full locking semantics the way block devices do. Has Cyrus been tested against NFSv4? My understanding is that NFSv4 fixes all the locking issues. The legends of NFS's insufficient locking continue, but I can only assume they refer to NFSv3 and earlier. Thanks, -Andrew -- Andrew Laurence [EMAIL PROTECTED] Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
On Fri, 2006-07-28 at 15:33 -0700, Andrew Morgan wrote: On Fri, 28 Jul 2006, Rich Graves wrote: My question: So is *anyone* here happy with Cyrus on ext3? We're a small site, only 3200 users, 246GB mail. I'd really rather not try anything more exotic for supportability reasons, but I'm getting worried that our planned move from Solaris 9/VxFS to RHEL4/ext3 on significantly newer and faster hardware is going to be a downgrade. We run Cyrus on ext3 under Debian Linux without complaints here. We have approximately 35000 mailboxes/users split between 2 backend servers. Each backend server is connected to an EMC Cx500 SAN (no shared access or anything fancy) with 800GB of mail spool each. The commands used to build the filesystems were: mkfs -t ext3 -j -m 1 -O dir_index /dev/sdb1 tune2fs -c 0 -i 0 /dev/sdb1 The filesystem is mounted like so: /dev/sdb1 /private ext3 defaults,data=ordered,noatime 0 2 If you want more information, just ask. :) How big is your journal? I have instructions for determining the size here, because it's non-obvious: http://nakedape.cc/wiki/PlatformNotes_2fLinuxNotes (BTW, you can drop the 'defaults' from the entry in your fstab; 'defaults' exists to fill the column in the table when nothing else is there.) Wil -- Wil Cooley [EMAIL PROTECTED] Naked Ape Consulting, Ltd Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
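The journal-size check Wil alludes to can be done with debugfs: on ext3 the journal lives in reserved inode 8, so dumping that inode shows its size. A hedged sketch (the device name is borrowed from Andy's mkfs line above; debugfs opens read-only by default, so this is safe on a mounted filesystem):

    # print the journal inode; the Size: field is the journal size in bytes
    debugfs -R "stat <8>" /dev/sdb1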
Re: High availability email server...
On Mon, 31 Jul 2006, Wil Cooley wrote: How big is your journal? I have instructions for determining the size here, because it's non-obvious: http://nakedape.cc/wiki/PlatformNotes_2fLinuxNotes (BTW, you can drop the 'defaults' from the entry in your fstab; 'defaults' exists to fill the column in the table when nothing else is there.) Those tools are a little scary, but here is what it reported: Inode: 8 Type: regular Mode: 0600 Flags: 0x0 Generation: 0 User: 0 Group: 0 Size: 33554432 ... Performance has been okay for me so far. Do you have any feeling for whether it is worth changing the journal size? Andy Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
On Mon, 31 Jul 2006, Wil Cooley wrote: Well, 32MB is small for a write-heavy filesystem. But if you're not seeing any problems with kjournald stalling while it flushes, then it might not be worth the trouble of re-creating the journal at a larger size. It's unlikely to hurt anything, but I wouldn't make it a huge priority. Did you also read the LOPSA post from Ted Ts'o that I linked to in the section above the instructions? Yeah, I guess this is something I should do when we have a downtime window. The only performance problem I've noticed is that things get pretty sluggish when I unplug half of the power feed to the SAN (we were doing a power upgrade in our data center). The write cache is disabled in this situation and things got really bad for a while. :) Increasing the size of the journal may have helped during that time, but who knows. In general, I think an IMAP server does more reads than writes, but every little bit helps! Andy Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
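Re-creating the journal at a larger size is a short offline operation. A hedged sketch, assuming the same /dev/sdb1 device; the 128MB target and the mount point are illustrative, not from the thread:

    # the filesystem must be unmounted to change the journal
    umount /var/spool/imap
    tune2fs -O ^has_journal /dev/sdb1   # remove the existing 32MB journal
    e2fsck -f /dev/sdb1                 # a full fsck is required after dropping the journal
    tune2fs -j -J size=128 /dev/sdb1    # add a new journal, 128MB
    mount /var/spool/imap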
Re: High availability email server...
On Mon, 2006-07-31 at 15:40 -0700, Andrew Morgan wrote: On Mon, 31 Jul 2006, Wil Cooley wrote: How big is your journal? I have instructions for determining the size here, because it's non-obvious: http://nakedape.cc/wiki/PlatformNotes_2fLinuxNotes (BTW, you can drop the 'defaults' from the entry in your fstab; 'defaults' exists to fill the column in the table when nothing else is there.) Those tools are a little scary, but here is what it reported: Yeah, but debugfs opens the filesystem read-only w/o '-w'. Inode: 8 Type: regular Mode: 0600 Flags: 0x0 Generation: 0 User: 0 Group: 0 Size: 33554432 ... Performance has been okay for me so far. Do you have any feeling for whether it is worth changing the journal size? Well, 32MB is small for a write-heavy filesystem. But if you're not seeing any problems with kjournald stalling while it flushes, then it might not be worth the trouble of re-creating the journal at a larger size. It's unlikely to hurt anything, but I wouldn't make it a huge priority. Did you also read the LOPSA post from Ted Ts'o that I linked to in the section above the instructions? Wil -- Wil Cooley [EMAIL PROTECTED] Naked Ape Consulting, Ltd Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
Kinda surprising, but it DOES have something to do with Cyrus. Caspur did their case study on cluster filesystems with their e-mail environment. It used Cyrus IMAP and some kind of SMTP server (I think it was Postfix, but I'm not sure). Their paper talks about Maildir. If you connect to mailbox.caspur.it:993 you'll see a Courier-IMAP greeting. Maildir is specifically designed not to need anything more fine-grained than filesystem metadata-level locking. In theory, it's even NFSv2-safe, though user-visible performance on typical IMAP loads sucks (see recent Thunderbird rants, and double them). Cyrus usually expects more advanced locking facilities, but it depends on the db backend. I obviously can't argue with your own positive experiences with Tru64 clustering (as with other former DEC products, I've heard only good things about Tru64, except that it's been mismanaged and mismarketed into irrelevance), but those of us on crankier OSes need to worry about such things. Sendmail 8.12.5 release notes: NOTE: Linux appears to have broken flock() again; UW-IMAP FAQ http://www.washington.edu/imap/documentation/FAQ.html#6.11, etc. Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
Hi, Quoting Pascal Gienger [EMAIL PROTECTED]: I would NEVER suggest to mount the cyrus mail spool via NFS, locking is important and for these crucial things I like to have a real block device with a real filesystem, so SANs are ok to me. Does someone use Lustre as a cyrus mail spool? Would it be possible to run cyrus on 2 or more systems with a shared spool for load balancing and HA with Lustre? Michael Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
Hi Michael! As already said in this thread: Cyrus cannot share its spool. No 2 cyrus instances can use the same spool, databases and lockfiles. For load balancing you can use a murder setup, and for HA you can use replication. Best, Daniel Michael Menge wrote: Hi, Quoting Pascal Gienger [EMAIL PROTECTED]: I would NEVER suggest to mount the cyrus mail spool via NFS, locking is important and for these crucial things I like to have a real block device with a real filesystem, so SANs are ok to me. Does someone use Lustre as a cyrus mail spool? Would it be possible to run cyrus on 2 or more systems with a shared spool for load balancing and HA with Lustre? Michael Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
Pascal Gienger wrote: David Korpiewski [EMAIL PROTECTED] wrote: I spent about 6 months fighting with Apple XSAN and Apple OSX mail to try to create a redundant cyrus mail cluster. First of all, don't try it, it is a waste of time. Apple states that mail on an XSAN is not supported. The reason is that it simply won't run. The Xsan can't handle the large number of small files and will do things like disconnecting or corrupting the file system. STOP! The capability to handle small files efficiently is related to the filesystem carrying the files and NOT to the physical and logical storage media (block device) under it. Apple is the one confusing people. Xsan is the name of the Apple cluster file system. So you configure a couple of hosts on your SAN, with a shared volume, and then run Xsan. Xsan is more similar to Linux GFS (Global File System). So I believe the original poster is right: Xsan is crap for lots of small files. That is not surprising. It is really hard to come up with a shared file system that doesn't suck. The nodes have to lock for meta-data updates, so shared file systems can be pretty slow too. Tom Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
Chad-- We've put /var/lib/imap and /var/spool/imap on a SAN and have two machines -- one active, and one hot backup. If the active server fails, the other mounts the storage and takes over. This is not yet in production, but it's a pretty simple setup and can be done without running any bleeding edge software, and it appears that it will work fine. There's no need to use a SAN, either -- you could share your mail storage out via NFS with the same effect. We're going production with this in mid-August; if you'd like to know how everything goes, drop me a note in a month or so. Chris St. Pierre Unix Systems Administrator Nebraska Wesleyan University On Thu, 27 Jul 2006, Chad A. Prey wrote: OK...I'm searching for strategies to have a realtime email backup in the event of backend failure. We've been running cyrus-imap for about a year and a half with incredible success. Our failures have all been due to using junky storage. One idea is to have a continuous rsync of the cyrus /var/spool/imap and /var/lib/imap to another server. I've also considered delivering email to two discrete email backends and keeping the /var/lib/imap files sync'd. I don't think I can use murder to do this. Is anyone out there using RHEL in a cluster that would like to share their architecture? Any contractors out there that want to get paid to help us implement? Chad P. [EMAIL PROTECTED] Salk Institute for Biological Studies -- Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
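For reference, the takeover Chris describes usually boils down to a couple of commands wrapped in a cluster agent. A hedged manual sketch -- the device names, mount points and service IP are invented for illustration, not Chris's actual setup:

    # on the standby node, once the primary is confirmed down
    mount /dev/san/imap-conf  /var/lib/imap
    mount /dev/san/imap-spool /var/spool/imap
    ip addr add 192.0.2.25/24 dev eth0    # take over the service address
    /etc/init.d/cyrus-imapd start

The one rule that must hold: the primary can never still have these filesystems mounted when the standby mounts them, or (as noted elsewhere in this thread) the data is gone.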
Re: High availability email server...
I spent about 6 months fighting with Apple XSAN and Apple OSX mail to try to create a redundant cyrus mail cluster. First of all, don't try it, it is a waste of time. Apple states that mail on an XSAN is not supported. The reason is that it simply won't run. The Xsan can't handle the large number of small files and will do things like disconnecting or corrupting the file system. So my suggestion is to make sure that your SAN can handle the large number of files that a mail server will be reading and dumping. The Xsan had severe problems to the point of file system corruption while trying to deal with them, so make sure your SAN can handle it. Our final solution, which we are still in the process of finishing, is to create a linux cluster with two servers, each with their own XRAID backend. Using replication, we mirror one server to the other. Using some sophisticated software we wrote, we can validate the mailstore ages and then create a fully see-sawable cluster system. Good luck! David Chris St. Pierre wrote: Chad-- We've put /var/lib/imap and /var/spool/imap on a SAN and have two machines -- one active, and one hot backup. If the active server fails, the other mounts the storage and takes over. This is not yet in production, but it's a pretty simple setup and can be done without running any bleeding edge software, and it appears that it will work fine. There's no need to use a SAN, either -- you could share your mail storage out via NFS with the same effect. We're going production with this in mid-August; if you'd like to know how everything goes, drop me a note in a month or so. Chris St. Pierre Unix Systems Administrator Nebraska Wesleyan University On Thu, 27 Jul 2006, Chad A. Prey wrote: OK...I'm searching for strategies to have a realtime email backup in the event of backend failure. We've been running cyrus-imap for about a year and a half with incredible success. Our failures have all been due to using junky storage. One idea is to have a continuous rsync of the cyrus /var/spool/imap and /var/lib/imap to another server. I've also considered delivering email to two discrete email backends and keeping the /var/lib/imap files sync'd. I don't think I can use murder to do this. Is anyone out there using RHEL in a cluster that would like to share their architecture? Any contractors out there that want to get paid to help us implement? Chad P. [EMAIL PROTECTED] Salk Institute for Biological Studies -- David Korpiewski Phone: 413-545-4319 Software Specialist I Fax: 413-577-2285 Department of Computer Science ICQ: 7565766 University of Massachusetts Amherst Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
David Korpiewski wrote: I spent about 6 months fighting with Apple XSAN and Apple OSX mail to try to create a redundant cyrus mail cluster. First of all, don't try it, it is a waste of time. Apple states that mail on an XSAN is not supported. The reason is that it simply won't run. The Xsan can't handle the large number of small files and will do things like disconnecting or corrupting the file system. So my suggestion is to make sure that your SAN can handle the large number of files that a mail server will be reading and dumping. The Xsan had severe problems to the point of file system corruption while trying to deal with them, so make sure your SAN can handle it. Our final solution, which we are still in the process of finishing, is to create a linux cluster with two servers, each with their own XRAID backend. Using replication, we mirror one server to the other. Using some sophisticated software we wrote, we can validate the mailstore ages and then create a fully see-sawable cluster system. Good luck! David Chris St. Pierre wrote: Chad-- We've put /var/lib/imap and /var/spool/imap on a SAN and have two machines -- one active, and one hot backup. If the active server fails, the other mounts the storage and takes over. This is not yet in production, but it's a pretty simple setup and can be done without running any bleeding edge software, and it appears that it will work fine. There's no need to use a SAN, either -- you could share your mail storage out via NFS with the same effect. One note on this. The SAN configuration sounds great and is a very common backup/failover solution; however, do not try NFS. It is well documented that Cyrus does not play nice with NFS. We're going production with this in mid-August; if you'd like to know how everything goes, drop me a note in a month or so. Chris St. Pierre Unix Systems Administrator Nebraska Wesleyan University On Thu, 27 Jul 2006, Chad A. Prey wrote: OK...I'm searching for strategies to have a realtime email backup in the event of backend failure. We've been running cyrus-imap for about a year and a half with incredible success. Our failures have all been due to using junky storage. One idea is to have a continuous rsync of the cyrus /var/spool/imap and /var/lib/imap to another server. I've also considered delivering email to two discrete email backends and keeping the /var/lib/imap files sync'd. I don't think I can use murder to do this. Is anyone out there using RHEL in a cluster that would like to share their architecture? Any contractors out there that want to get paid to help us implement? Chad P. [EMAIL PROTECTED] Salk Institute for Biological Studies -- Kevin Baker Mission Vi Inc. [EMAIL PROTECTED] 858.454.5532 Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
Chris St. Pierre wrote: We've put /var/lib/imap and /var/spool/imap on a SAN and have two machines -- one active, and one hot backup. If the active server fails, the other mounts the storage and takes over. This is not yet Also consider /var/spool/{mqueue,clientmqueue,postfix}. Depending on how your server fails, there could be important incoming mail stuck in those queues, and silently failing over to a server without them could be bad. Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
RE: High availability email server...
Hi, you can also use DRBD for replication at the block level. Then you need no SAN and have a shared-nothing architecture. You will need a high-speed link between the sites (GBit). An alternative is a SAN with replication. You can also use md for this purpose (host-based SAN RAID). Two cheap MSA 1000/1500s will do just fine. We have had an installation with 2 MSA 1500 SAN arrays mirrored with md at the host level in production for a year now; it runs without a problem. There are approx. 3500 active users on the servers. Exim and other services (e.g. LDAP) are also protected in the cluster. We used SteelEye LifeKeeper as cluster software. Contact me if you need more information. Regards, Robert Heinzmann COMPUTER CONCEPT CC Computersysteme und Kommunikationstechnik GmbH Robert Heinzmann Wiener Str. 114 - 116, 01219 Dresden Email: [EMAIL PROTECTED] Telefon: +49 (0)351/8 76 92-0 Telefax: +49 (0)351/8 76 92-99 Internet: http://www.cc-dresden.de -----Original Message----- From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On Behalf Of Chris St. Pierre Sent: Friday, July 28, 2006 15:12 To: Chad A. Prey Cc: info-cyrus@lists.andrew.cmu.edu Subject: Re: High availability email server... Chad-- We've put /var/lib/imap and /var/spool/imap on a SAN and have two machines -- one active, and one hot backup. If the active server fails, the other mounts the storage and takes over. This is not yet in production, but it's a pretty simple setup and can be done without running any bleeding edge software, and it appears that it will work fine. There's no need to use a SAN, either -- you could share your mail storage out via NFS with the same effect. We're going production with this in mid-August; if you'd like to know how everything goes, drop me a note in a month or so. Chris St. Pierre Unix Systems Administrator Nebraska Wesleyan University On Thu, 27 Jul 2006, Chad A. Prey wrote: Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
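A minimal sketch of the DRBD piece Robert mentions, following the classic drbd.conf resource format; the hostnames, devices and addresses are invented for illustration:

    # /etc/drbd.conf -- one replicated volume for the cyrus spool
    resource r0 {
      protocol C;                    # synchronous: a write completes on both nodes
      on mail1 {
        device    /dev/drbd0;
        disk      /dev/sdb1;         # local backing store
        address   10.0.0.1:7788;     # dedicated replication link
        meta-disk internal;
      }
      on mail2 {
        device    /dev/drbd0;
        disk      /dev/sdb1;
        address   10.0.0.2:7788;
        meta-disk internal;
      }
    }

Only the current primary mounts /dev/drbd0; the cluster software (LifeKeeper, heartbeat, or similar) promotes the peer and mounts it there on failover.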
Re: High availability email server...
Pascal Gienger wrote: There are techniques to handle these situations - for xfs (as an example) consider having *MUCH* RAM in your machine and always mount it with logbufs=8. Is XFS so RAM intensive? I would NEVER suggest to mount the cyrus mail spool via NFS, locking is important and for these crucial things I like to have a real block device with a real filesystem, so SANs are ok to me. Sorry, please bear with my ignorance, I'm not very informed about NFS, but what's wrong with locking against a real block device? We have a RAID device here with 1.5 TB which is shared between 2 mail nodes and 2 test nodes. The switch can be done manually (10 seconds downtime) and - if you wish - via Heartbeat HA software. The only dangerous thing is to ensure that NEVER, really NEVER, a second node mounts your SAN partition while another has it mounted already. Kernel halts and data loss are the immediate result. There are file systems like GFS that have been written for that, even if they are pretty CPU and I/O intensive (I use it for multimedia sharing - a lot of images that need to be shared across 4 nodes, with Apache serving them). Fabio Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
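For Pascal's XFS tip, logbufs simply goes on the mount. A hedged fstab sketch -- the device and mount point are invented:

    # more in-memory log buffers smooths metadata-heavy loads like a mail spool
    /dev/sdc1  /var/spool/imap  xfs  logbufs=8,noatime  0 2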
Re: High availability email server...
Sorry, please bear with my ignorance, I'm not very informed about NFS, but what's wrong with locking against a real block device? NFS is a file sharing protocol that doesn't provide full locking semantics the way block devices do. There are file systems like GFS that have been written for that, even if they are pretty CPU and I/O intensive (I use it for multimedia sharing - a lot a lot of images that needs to be shared across 4 nodes having Apache to serve them). GFS is a *cluster* filesystem. We're talking about high availability here, not clustering. GFS could certainly be used in this case, but would be overkill. John -- John Madden Sr. UNIX Systems Engineer Ivy Tech Community College of Indiana [EMAIL PROTECTED] Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
On Jul 28, 2006, at 1:40 PM, Pascal Gienger wrote: So if Apple says that Xsan does not handle many files, they admit that their HFS+ file system is crap for many small files. This is completely untrue. Xsan, although branded by Apple, is not completely an Apple product. ADIC makes StorNext, which is what Xsan is based on. Basically, think of Xsan as StorNext with a pretty interface. I just spent an hour on the phone with the ADIC guys talking about using their product on a few Linux servers. Their concern is that their product doesn't work well with a lot of small files. It has absolutely nothing to do with HFS, jfs, xfs, or anything else. It's basically the design of their software. It's better suited for the management of large files like images, audio and video clips, 3D renderings, etc. If you have a lot of small mail files, it might not be the best solution. That is, unless your users spend most of their email time sending 15MB files around to people and almost none of it responding with thanks. I've used HFS+ to store mail using cyrus. I've had exactly zero problems using it. I've also used jfs and had exactly zero problems. -Michael --- There will always be those who dare to take great risks. Rather than mourn their loss, we should value their contributions. --Jesse Brown Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
Clustered filesystems don't make any sense for Cyrus, since the application itself doesn't allow simultaneous read/write. Just use a normal journaling filesystem and fail over by mounting the FS on the backup server. Consider replication such as DRBD or proprietary SAN replication if you feel you must physically mirror the storage. My question: So is *anyone* here happy with Cyrus on ext3? We're a small site, only 3200 users, 246GB mail. I'd really rather not try anything more exotic for supportability reasons, but I'm getting worried that our planned move from Solaris 9/VxFS to RHEL4/ext3 on significantly newer and faster hardware is going to be a downgrade. Anyway, it has nothing to do with Cyrus, but if anyone does have another application that wants lots of small files on a clustered FS: http://web.caspur.it/Files/2005/01/10/1105354214692.pdf http://polyserve.com/pdf/Caspur_CS.pdf -- Rich Graves [EMAIL PROTECTED] Sr UNIX and Security Administrator Ofc 507-646-7079 Cell 952-292-6529 Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
Rich Graves wrote: Clustered filesystems don't make any sense for Cyrus, since the application itself doesn't allow simultaneous read/write. Just use a normal journaling filesystem and fail over by mounting the FS on the backup server. Consider replication such as DRBD or proprietary SAN replication if you feel you must physically mirror the storage. That means forget about cyrus being active/active? Sounds like a BIG limitation to me, especially when we talk about horizontal scalability. Anyway, it has nothing to do with Cyrus, but if anyone does have another application that wants lots of small files on a clustered FS: http://web.caspur.it/Files/2005/01/10/1105354214692.pdf http://polyserve.com/pdf/Caspur_CS.pdf Thanks very much for pointing out those documents. Regarding your questions, I've never tried to do comparisons between VxFS and ext3, but as far as I know the former performs better. If it is available for Solaris 10 x86, consider an AMD64 architecture; it should be pretty cheap compared to SPARC and perform very well. Fabio Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
RE: High availability email server...
David S. Madole [EMAIL PROTECTED] wrote: That's just not true as a general statement. SAN is a broad term that applies to much more than just farming out block devices. Some of the more sophisticated SANs are filesystem-based, not block-based. This allows them to implement more advanced functionality like cross-platform sharing of volumes, simultaneous mounts of volumes from different hosts, backups (and single-file restores) performed by the SAN system, pooling of free space, transparent migration to offline storage, etc., etc., etc. In my classical view a SAN is a network used for storage applications to give a view onto shareable block devices. There are hardware applications giving access to the same filesystem in a shareable manner (such as GFS or OCFS), but this is software logic at the filesystem and firmware level and not in the classical SAN components like JBOD arrays, RAID controllers and FC or IP switches. In the Apple case we need to distinguish the Apple XSAN hard disk chassis and the XSAN software. The XSAN software seems to give you a special filesystem for SAN issues (at least I read this on their webpage). So if Apple says that this is not well suited for many small files, I would not use it for that. Another instance of a SAN filesystem that I do happen to be familiar with is IBM's: http://www-03.ibm.com/servers/storage/software/virtualization/sfs/index.html Also this filesystem lives above the FCP (Fibre Channel) protocol, forming a filesystem including multipathing elements and concurrent access strategies. Still you have to distinguish the block-level access to SAN devices and the filesystems built above them. It is true that SAN is marketing speech for all kinds of things. Pascal Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
Fabio Corazza wrote: Rich Graves wrote: Clustered filesystems don't make any sense for Cyrus, since the application itself doesn't allow simultaneous read/write. Just use a normal journaling filesystem and fail over by mounting the FS on the backup server. Consider replication such as DRBD or proprietary SAN replication if you feel you must physically mirror the storage. That means forget about cyrus being active/active? Sounds like a BIG limitation to me, especially when we talk about horizontal scalability. No. You scale horizontally with Murder. Or front-end with another proxy like perdition, or have the clients connect to other servers directly by using ACAP (mostly dead), IMAP referrals (mostly unimplemented), or simply telling users which server to use (historically, universities would advertise user-specific load-balancing hostnames like rgraves.imap.carleton.edu). You get active/active N+1 redundancy by allowing failover server(s) to mount other servers' filesystems in the SAN. Anyway, it's not Exchange 5.5. It doesn't crash every week. And when you perform 10 times better than the competition, you have 1/10 the need for horizontal scalability. Regarding your questions, I've never tried to do comparisons between VxFS and ext3, but as far as I know the former performs better. If it is available for Solaris 10 x86, consider an AMD64 architecture; it should be pretty cheap compared to SPARC and perform very well. If we were going to stay in the Solaris game I think we'd be looking at ZFS. Interesting. http://www.sun.com/software/whitepapers/solaris10/fs_performance.pdf suggests that ext3 is better than reiserfs for their test workload. Just goes to show you that benchmarks are entirely parameter-dependent. Anyone have postmark parameters that they feel accurately reflect Cyrus's needs? -- Rich Graves [EMAIL PROTECTED] Sr UNIX and Security Administrator Ofc 507-646-7079 Cell 952-292-6529 Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
On Fri, 28 Jul 2006, Rich Graves wrote: My question: So is *anyone* here happy with Cyrus on ext3? We're a small site, only 3200 users, 246GB mail. I'd really rather not try anything more exotic for supportability reasons, but I'm getting worried that our planned move from Solaris 9/VxFS to RHEL4/ext3 on significantly newer and faster hardware is going to be a downgrade. We run Cyrus on ext3 under Debian Linux without complaints here. We have approximately 35000 mailboxes/users split between 2 backend servers. Each backend server is connected to an EMC Cx500 SAN (no shared access or anything fancy) with 800GB of mail spool each. The commands used to build the filesystems were: mkfs -t ext3 -j -m 1 -O dir_index /dev/sdb1 tune2fs -c 0 -i 0 /dev/sdb1 The filesystem is mounted like so: /dev/sdb1 /private ext3 defaults,data=ordered,noatime 0 2 If you want more information, just ask. :) Andy Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
-- Rich Graves [EMAIL PROTECTED] is rumored to have mumbled on 28 July 2006 15:52:17 -0500 regarding Re: High availability email server...: My question: So is *anyone* here happy with Cyrus on ext3? Yes. We use it on a SAN with an 800 GB partition for /var/spool/imap. -- Sebastian Hagedorn - RZKR-R1 (Flachbau), Zi. 18, Robert-Koch-Str. 10 Zentrum für angewandte Informatik - Universitätsweiter Service RRZK Universität zu Köln / Cologne University - Tel. +49-221-478-5587 Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High availability email server...
Andrew Morgan wrote: On Fri, 28 Jul 2006, Rich Graves wrote: My question: So is *anyone* here happy with Cyrus on ext3? We're a small site, only 3200 users, 246GB mail. I'd really rather not try anything more exotic for supportability reasons, but I'm getting worried that our planned move from Solaris 9/VxFS to RHEL4/ext3 on significantly newer and faster hardware is going to be a downgrade. We run Cyrus on ext3 under Debian Linux without complaints here. We have approximately 35000 mailboxes/users split between 2 backend servers. Each backend server is connected to an EMC Cx500 SAN (no shared access or anything fancy) with 800GB of mail spool each. The commands used to build the filesystems were: mkfs -t ext3 -j -m 1 -O dir_index /dev/sdb1 tune2fs -c 0 -i 0 /dev/sdb1 The filesystem is mounted like so: /dev/sdb1 /private ext3 defaults,data=ordered,noatime 0 2 If you want more information, just ask. :) Andy We also use ext3, not because I think it's the fastest or has the most features, but because it just works. We do volume management with EVMS, and I had a lot of trouble getting XFS and other file systems to snapshot correctly under heavy load without the box eventually running into a situation where all processes started to hang waiting for IO, eventually causing a system crash. Ext3 worked every time, so the choice was obvious. I figured if it would survive a snapshot while I'm hitting it very hard with postal, then the odds of having problems in prod are going to be pretty slim. One thing that ext3 does have going for it is the fact that it is the most tested and most common file system on linux. schu Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
High availability email server...
OK...I'm searching for strategies to have a realtime email backup in the event of backend failure. We've been running cyrus-imap for about a year and a half with incredible success. Our failures have all been due to using junky storage. One idea is to have a continuous rsync of the cyrus /var/spool/imap and /var/lib/imap to another server. I've also considered delivering email to two discrete email backends and keeping the /var/lib/imap files sync'd. I don't think I can use murder to do this. Is anyone out there using RHEL in a cluster that would like to share their architecture? Any contractors out there that want to get paid to help us implement? Chad P. [EMAIL PROTECTED] Salk Institute for Biological Studies -- Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
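A hedged sketch of Chad's rsync idea (the standby hostname is invented; -H matters because Cyrus hard-links identical message files):

    # mirror spool and config dirs to a warm standby
    rsync -aH --delete /var/spool/imap/ standby:/var/spool/imap/
    rsync -aH --delete /var/lib/imap/   standby:/var/lib/imap/

The caveat, as the rest of this thread explains, is that the databases under /var/lib/imap can be caught mid-write by the copy and may need recovery or reconstruction on the standby before Cyrus is started there.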
Re: High availability email server...
Chad A. Prey wrote: OK...I'm searching for strategies to have a realtime email backup in the event of backend failure. We've been running cyrus-imap for about a year and a half with incredible success. Our failures have all been due to using junky storage. One idea is to have a continuous rsync of the cyrus /var/spool/imap and /var/lib/imap to another server The newest version of Cyrus supports replication. I'd suggest looking into this. http://cyrusimap.web.cmu.edu/imapd/install-replication.html I've also considered delivering email to two discrete email backends and keeping the /var/lib/imap files sync'd. I don't think I can use murder to do this. Is anyone out there using RHEL in a cluster that would like to share their architecture? Any contractors out there that want to get paid to help us implement? Chad P. [EMAIL PROTECTED] Salk Institute for Biological Studies -- Kevin Baker Mission Vi Inc. [EMAIL PROTECTED] 858.454.5532 Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
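For orientation, the replication described at that link is configured with a handful of sync_* options. A hedged sketch assuming the Cyrus 2.3-style names from that document; the hostname and credentials are invented:

    # imapd.conf on the master: log changes and name the replica
    sync_log: 1
    sync_host: replica.example.com
    sync_authname: repluser
    sync_password: secret

    # cyrus.conf on the master (START section): run rolling replication
    syncclient    cmd="sync_client -r"

    # cyrus.conf on the replica (SERVICES section): accept sync connections
    syncserver    cmd="sync_server" listen="csync"

With sync_log enabled, mailbox changes are appended to a log that the rolling sync_client replays against the replica continuously, rather than replicating everything in one batch.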
Re: High-Availability IMAP server
Scott Adkins wrote: --On Monday, September 26, 2005 6:45 PM +0200 David [EMAIL PROTECTED] wrote: Hello, I have a 'pseudo' High Availability SMTP system consisting of two servers running cyrus 2.2.5. The main problem I have is that only one of the two nodes can access the mailboxes in order to keep the integrity of the cyrus databases, even though the filesystem (GFS) supports two different servers accessing it in R/W mode. I am curious about this statement... What kind of locking is being used on GFS that prevents two nodes from accessing mailboxes without destroying the integrity of the cyrus database? Yeah, that's what I asked myself too. Unfortunately I haven't yet had the chance to test such a setup with multiple cyrus instances on the same shared GFS filesystem. Both Cyrus instances would use the same databases; only the lock files for the imapd's and popd's would need to be different, I expect. Has anybody ever had such a setup in production use using Linux and GFS? I haven't even tried it, but am very interested in it. What integrity problem do you mean exactly, David? Have you already experienced a problem? regards -- Wolfgang Powisch [EMAIL PROTECTED] www.powo.priv.at Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High-Availability IMAP server
On Tue, 27 Sep 2005, Patrick Radtke wrote: We made great use of it Monday morning when one of our backend machines failed. Switching to the replica was quite simple and relatively fast (maybe 5 to 10 minutes from deciding to switch to the replica before the replica was fully in action) We use the replication engine all the time to move users back and forth between systems so that we can patch and upgrade operating systems and/or Cyrus without any user-visible downtime. There have also been a number of forced failovers because of hardware problems, specifically some dodgy RAID controller firmware that we were running for a few months until we got a fix. It's worked very nicely for us, but it is important that people don't just trust the software blindly. We maintain and constantly regenerate a database of MD5 checksums for all of the messages and cache entries across the cluster. It's been a long time now since this has turned up errors, but I still check it religiously. I consider the code to be stable, though on occasion strange things happen Which is not really my definition of stable :). (e.g. when a user renames user.INBOX to user.saved.INBOX) and you have to restart the replication process (no downtime to Cyrus involved). This one is odd behaviour on the part of mboxlist_renamemailbox(): it does special magic when running as a non-admin user. There's actually a more serious underlying bug in Cyrus here which I believe Ken is working on. Again we don't see this one. Partly because our replication engine doesn't run as an admin user (I'm afraid you don't have that option), partly because of overenthusiastic hacking on my part in other parts of the Cyrus code. -- David Carter Email: [EMAIL PROTECTED] University Computing Service, Phone: (01223) 334502 New Museums Site, Pembroke Street, Fax: (01223) 334679 Cambridge UK. CB2 3QH. Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High-Availability IMAP server
On Tue, 2005-09-27 at 08:51 +0800, Ow Mun Heng wrote: On Mon, 2005-09-26 at 10:03 -0700, Aaron Glenn wrote: On 9/26/05, David [EMAIL PROTECTED] wrote: Is there any way to achieve this goal using cyrus? Which is the best approach to this scenario? Run daily imapsync via cron and a Load Balancer forward the requests to the active one? Any help would be appreciated. There is replication code in the 2.3 branch; though from what I can tell it hasn't been touched in a few months and makes me wonder if it's being actively developed still. Nevertheless, in my exhaustive search for any and all information on IMAP replication, I came across a few list posts detailing the 2.3 replication code in production, without many issues, for over a year. I would be eternally grateful if someone on the list more knowledgeable detailed their experiences with replication. I would be very interested in this solution as well. I would also be interested in replication advice. Thanks, -- Brad Crotchett, RHCE [EMAIL PROTECTED] Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High-Availability IMAP server
brad wrote: On Tue, 2005-09-27 at 08:51 +0800, Ow Mun Heng wrote: On Mon, 2005-09-26 at 10:03 -0700, Aaron Glenn wrote: On 9/26/05, David [EMAIL PROTECTED] wrote: Is there any way to achieve this goal using cyrus? Which is the best approach to this scenario? Run daily imapsync via cron and a Load Balancer forward the requests to the active one? Any help would be appreciated. There is replication code in the 2.3 branch; though from what I can tell it hasn't been touched in a few months and makes me wonder if it's being actively developed still. Nevertheless, in my exhaustive search for any and all information on IMAP replication, I came across a few list posts detailing the 2.3 replication code in production, without many issues, for over a year. I would be eternally grateful if someone on the list more knowledgeable detailed their experiences with replication. I would be very interested in this solution as well. I would also be interested in replication advice. Thanks, I too am very interested in this replication solution. Where can I get the src and documentation ? Regards, João Assad Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High-Availability IMAP server
On Wed, 2005-09-28 at 09:02 +0100, David Carter wrote: We use the replication engine all the time to move users back and forth between systems so that we can patch and upgrade operating systems and/or Cyrus without any user visible downtime. I read the documentation on replication and am interested in trying it. I have several servers that run a single domain, but are using virtdomain anyway. I would like to have one virtdomain replica server that serves as a hot spare to all of these servers. In other words server A would replicate domain A mailboxes to the replica and server B would replicate domain B to the replica. If server A fails then I could bring up the replica server for domain A but not domain B (just by not pointing domain B to the replica server). Is this possible? Thanks, -- Brad Crotchett, RHCE [EMAIL PROTECTED] Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High-Availability IMAP server
On Wed, 2005-09-28 at 12:41 -0300, João Assad wrote: I too am very interested in this replication solution. Where can I get the src and documentation ? Regards, João Assad This is a good start: http://www-uxsup.csx.cam.ac.uk/~dpc22/cyrus/replication.html Thanks, -- Brad Crotchett, RHCE [EMAIL PROTECTED] Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High-Availability IMAP server
João Assad wrote: brad wrote: On Tue, 2005-09-27 at 08:51 +0800, Ow Mun Heng wrote: On Mon, 2005-09-26 at 10:03 -0700, Aaron Glenn wrote: On 9/26/05, David [EMAIL PROTECTED] wrote: Is there any way to achieve this goal using cyrus? Which is the best approach to this scenario? Run daily imapsync via cron and a Load Balancer forward the requests to the active one? Any help would be appreciated. There is replication code in the 2.3 branch; though from what I can tell it hasn't been touched in a few months and makes me wonder if it's being actively developed still. Nevertheless, in my exhaustive search for any and all information on IMAP replication, I came across a few list posts detailing the 2.3 replication code in production, without many issues, for over a year. I would be eternally grateful if someone on the list more knowledgeable detailed their experiences with replication. I would be very interested in this solution as well. I would also be interested in replication advice. Thanks, I too am very interested in this replication solution. Where can I get the src and documentation? It's in the 2.3 branch of Cyrus CVS (tag cyrus-imapd-2_3) -- Kenneth Murchison Oceana Matrix Ltd. Software Engineer 21 Princeton Place 716-662-8973 x26 Orchard Park, NY 14127 --PGP Public Key--http://www.oceana.com/~ken/ksm.pgp Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High-Availability IMAP server
David Carter wrote: On Wed, 28 Sep 2005, brad wrote: I read the documentation on replication and am interested in trying it. I have several servers that run a single domain, but are using virtdomain anyway. I would like to have one virtdomain replica server that serves as a hot spare to all of these servers. In other words, server A would replicate domain A mailboxes to the replica and server B would replicate domain B to the replica. If server A fails then I could bring up the replica server for domain A but not domain B (just by not pointing domain B to the replica server). Is this possible? We typically run with half the accounts on a given server as masters and the other half as replicas to reduce fallout from a single server failing. We don't use virtual domains. I'm afraid I don't know if the code in 2.3 supports virtual domains. Ken? I haven't tried it, but I've done nothing to purposely break replication of virtdomains. Note that the code in CVS does NOT yet allow load balancing the mailboxes across servers as David's does. Shared mailboxes make this a more difficult nut to crack. -- Kenneth Murchison Oceana Matrix Ltd. Software Engineer 21 Princeton Place 716-662-8973 x26 Orchard Park, NY 14127 --PGP Public Key--http://www.oceana.com/~ken/ksm.pgp Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High-Availability IMAP server
On Wed, 2005-09-28 at 13:45 -0400, Ken Murchison wrote: I haven't tried it, but I've done nothing to purposely break replication of virtdomains. I might give it a try and report back then. Thanks, -- Brad Crotchett, RHCE [EMAIL PROTECTED] Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High-Availability IMAP server
On Mon, 26 Sep 2005, Aaron Glenn wrote: There is replication code in the 2.3 branch; though from what I can tell it hasn't been touched in a few months and makes me wonder if it's being actively developed still. Nevertheless, in my exhaustive search for any and all information on IMAP replication, I came across a few list posts detailing the 2.3 replication code in production, without many issues, for over a year. I wrote the code which was eventually merged into the 2.3 branch back in Autumn 2002. We've been using it on our production systems for a little over two years now, and all of our users have been replicated (rolling replication to a hot spare system, plus nightly replication to a tape spooling array) for about 18 months. The last significant change to my code base was November last year. That's not a sign of neglect; the code just does everything we need right now. Ken merged the code into 2.3 at the start of this year. He put in quite a lot of work to merge the code properly into Cyrus (I had deliberately left it to one side to make updates easier) and add support for features such as shared mailboxes and annotations that we just don't need right now. It's still conceptually the same code and the same design. Invariably, working on code introduces new bugs (including a particularly exciting one caused by a stray semicolon). People are also pushing the code in new and interesting ways. Ken fixed a bug involving account renaming (user.xxx -> user.yyy) a couple of weeks back: that's something our nightly useradmin scripts just never try to do. Looking back at features which were introduced into earlier versions of Cyrus, I imagine that people will start to test the new code seriously when Cyrus 2.3 is released, and that there will be a period of fixing bugs which will tail off as 2.3 stabilises. The complication is that there doesn't appear to be anyone left at CMU to release new versions of Cyrus at the moment. Poor Jeffrey Eaton seems to be the last man standing there. My own experience of running things single-handed is that it doesn't leave much time for development work. -- David Carter Email: [EMAIL PROTECTED] University Computing Service, Phone: (01223) 334502 New Museums Site, Pembroke Street, Fax: (01223) 334679 Cambridge UK. CB2 3QH. Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High-Availability IMAP server
David Carter wrote: The complication is that there doesn't appear to be anyone left at CMU to release new versions of Cyrus at the moment. Poor Jeffrey Eaton seems to be the last man standing there. My own experience of running things single handed is that it doesn't leave much time for development work. Jeff will have development help real soon now. -- Kenneth Murchison Oceana Matrix Ltd. Software Engineer 21 Princeton Place 716-662-8973 x26 Orchard Park, NY 14127 --PGP Public Key--http://www.oceana.com/~ken/ksm.pgp Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High-Availability IMAP server
We are running the replication code in production at Columbia. We made great use of it Monday morning when one of our backend machines failed. Switching to the replica was quite simple and relatively fast (maybe 5 to 10 minutes from deciding to switch to the replica before the replica was fully in action). I consider the code to be stable, though on occasion strange things happen (e.g. when a user renames user.INBOX to user.saved.INBOX) and you have to restart the replication process (no downtime to Cyrus involved). -Patrick Radtke On Sep 27, 2005, at 8:24 AM, Ken Murchison wrote: David Carter wrote: The complication is that there doesn't appear to be anyone left at CMU to release new versions of Cyrus at the moment. Poor Jeffrey Eaton seems to be the last man standing there. My own experience of running things single-handed is that it doesn't leave much time for development work. Jeff will have development help real soon now. -- Kenneth Murchison Oceana Matrix Ltd. Software Engineer 21 Princeton Place 716-662-8973 x26 Orchard Park, NY 14127 --PGP Public Key--http://www.oceana.com/~ken/ksm.pgp Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
High-Availability IMAP server
Hello, I have a 'pseudo' High Availability SMTP system consisting of two servers running cyrus 2.2.5. The main problem I have is that only one of the two nodes can access the mailboxes in order to keep the integrity of the cyrus databases, even though the filesystem (GFS) supports two different servers accessing it in R/W mode. I've read about cyrus-murder, which allows you to distribute mailboxes across different servers, but if the server that has the mailbox for [EMAIL PROTECTED] goes offline, this mailbox is not available. With the maildir/mailbox format, there is no additional integrity mechanism, so any server with R/W access to the filesystem can provide the mailbox via POP3/IMAP, etc. Is there any way to achieve this goal using cyrus? Which is the best approach to this scenario? Run daily imapsync via cron and have a Load Balancer forward the requests to the active one? Any help would be appreciated. Regards, David Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High-Availability IMAP server
On 9/26/05, David [EMAIL PROTECTED] wrote: Is there any way to achieve this goal using cyrus? Which is the best approach to this scenario? Run daily imapsync via cron and a Load Balancer forward the requests to the active one? Any help would be appreciated. There is replication code in the 2.3 branch; though from what I can tell it hasn't been touched in a few months and makes me wonder if it's being actively developed still. Nevertheless, in my exhaustive search for any and all information on IMAP replication, I came across a few list posts detailing the 2.3 replication code in production, without many issues, for over a year. I would be eternally grateful if someone on the list more knowledgeable detailed their experiences with replication. regards, aaron.glenn Cyrus Home Page: http://asg.web.cmu.edu/cyrus Cyrus Wiki/FAQ: http://cyruswiki.andrew.cmu.edu List Archives/Info: http://asg.web.cmu.edu/cyrus/mailing-list.html
Re: High-Availability IMAP server
--On Monday, September 26, 2005 6:45 PM +0200 David [EMAIL PROTECTED] wrote: [snip]

I am curious about this statement... What kind of locking is being used on GFS that prevents two nodes from accessing mailboxes without destroying the integrity of the Cyrus database? In our environment we have a cluster of four Alpha machines: two ES40s and two ES80s. They run Tru64 5.1 (TruCluster) and are attached to an HA SAN using AdvFS. All members of the cluster can see all the filesystems and can access all the files and directories. We are currently only running Cyrus on the two ES80 machines, but we could easily run it on all four cluster members if we wanted to... we don't, because we also run other things (i.e. Sendmail) and it is better not to mix Cyrus and Sendmail on the same machines in our environment. That being said, the mailboxes are all available from the Cyrus servers running on any cluster member. We don't see any integrity issues and it seems to run pretty well. Since Tru64 and Alphas are on their way out the door, we are looking for a future solution that gives us as many of the capabilities of our current environment as possible. This will most likely involve Linux, which means we need to find a suitable cluster filesystem to replace AdvFS; that could be GFS. Anyway, I am interested in the shortcomings you have encountered with reliability and integrity when trying to run an HA Cyrus server... Thanks, Scott

-- Scott W. Adkins, UNIX Systems Engineer, http://www.cns.ohiou.edu/~sadkins/
Re: High-Availability IMAP server
Is there any way to achieve this goal using Cyrus? What is the best approach to this scenario? Run a daily imapsync via cron and have a load balancer forward requests to the active node?

Here's my approach: set up heartbeat with two Ethernet heartbeat links and shared storage (SAN), and pray hard that split-brain doesn't happen. :) John

-- John Madden, UNIX Systems Engineer, Ivy Tech Community College of Indiana
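For concreteness, a sketch of what John's setup might look like with Linux-HA heartbeat v1-style configuration files. Node names, interfaces, the floating IP, and device paths are hypothetical; this is the general shape, not his actual config.

    # /etc/ha.d/ha.cf -- two independent Ethernet heartbeat paths
    bcast eth1              # dedicated crossover link
    bcast eth2              # second, independent heartbeat path
    auto_failback on
    node imap-a.example.org
    node imap-b.example.org

    # /etc/ha.d/haresources -- preferred node, floating service IP,
    # SAN filesystem, then the cyrus init script, in takeover order
    imap-a.example.org IPaddr::192.0.2.10 Filesystem::/dev/san/imap::/var/spool/imap::ext3 cyrus

Two heartbeat paths reduce, but do not eliminate, the split-brain risk John mentions; fencing (STONITH) is the usual extra safeguard when both nodes can mount the same LUN.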
Re: High-Availability IMAP server
On Mon, 2005-09-26 at 10:03 -0700, Aaron Glenn wrote: [snip] I would be eternally grateful if someone more knowledgeable on the list detailed their experiences with replication.

I would be very interested in this solution as well.

-- Ow Mun Heng
Re: high-availability again
Hi, OK, the solution with a SAN seems good, but has anyone tried this with the Linux Virtual Server (LVS)?

Dave McMurtrie wrote: [snip]
Re: high-availability again
Hi, could you give me some more explanation of what "the stage. files used during LMTP delivery have unique filenames" means? If I understand what you are saying: if the stage. files used during LMTP delivery are the same for all the nodes of the cluster sharing the same SAN, then there won't be any problem? Thanks

Ben Carter wrote: [snip]
Re: high-availability again
Amos wrote: So y'all are doing active/active? What version of Cyrus?

Yes. We're running 2.1.17. Thanks, Dave
Re: high-availability again
zorg wrote: [snip]

In imap/append.c (at least in our Cyrus version) there is a function called append_newstage, which lmtpd uses for mail delivery. The temporary file created in this code has a name of the form pid-timestampseconds, which of course is not guaranteed to be unique across the cluster, so we changed this code to create a filename of the form clusternode-pid-timestampseconds. (If you truss/strace lmtpd during message delivery, you'll understand this right away.)

What should probably also happen in the standard code, if cluster support is officially added, is that master should take some exclusive lock when it starts, to guarantee that a sysadmin's typo in imapd.conf can't let two cluster nodes share a node ID (assuming the technique of configuring a cluster node ID in imapd.conf is used). And a node ID should probably always be required (our code requires one). When we get a chance, we're going to talk to Derrick about getting some cluster support into the standard code. Ben

-- Ben Carter, University of Pittsburgh/CSSD
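A shell rendering of the naming rule and startup guard Ben describes (the real change is C code in append_newstage); CLUSTERNODE stands in for the imapd.conf parameter he mentions, and the lock directory is a hypothetical location on the shared filesystem.

    # Stage-file name as described: clusternode-pid-seconds, so two
    # nodes delivering at the same instant can never collide.
    CLUSTERNODE=node1                    # from the 'clusternode' setting
    STAGEFILE="${CLUSTERNODE}-$$-$(date +%s)"
    echo "stage file would be: $STAGEFILE"

    # Startup guard: claim this node ID on the shared filesystem with
    # an atomic symlink; refuse to start if another host already owns it.
    LOCKDIR=/var/spool/imap/cluster-locks    # hypothetical path
    mkdir -p "$LOCKDIR"
    if ! ln -s "$(hostname)" "$LOCKDIR/$CLUSTERNODE" 2>/dev/null; then
        owner=$(readlink "$LOCKDIR/$CLUSTERNODE")
        if [ "$owner" != "$(hostname)" ]; then
            echo "node ID $CLUSTERNODE already claimed by $owner" >&2
            exit 1
        fi
    fi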
Re: high-availability again
Ben Carter wrote: When we get a chance, we're going to talk to Derrick about getting some cluster support into the std. code.

That would be most impressive. I wonder how much Ken's work with 2.3 would fit in with this?
Re: high-availability again
Amos wrote: That would be most impressive. I wonder how much Ken's work with 2.3 would fit in with this?

My code in 2.3 uses the Murder code to keep local copies of mailboxes.db on each node in the cluster.

-- Kenneth Murchison, Software Engineer, Oceana Matrix Ltd.
high-availability again
Hi, I've seen a lot of discussion about availability on this list, but none of it seems to give a complete answer. I have been asked to build high availability for 5000 users, and I was wondering what the best solution actually is.

Using Murder: I don't really understand if it can help me; its purpose is load balancing. But some people on this list seem to use it for availability like this:

- Server A - active accounts 1-100 - replicates accounts 101-200 from Server B
- Server B - active accounts 101-200 - replicates accounts 1-100 from Server A

If B goes down, A takes over the accounts it had replicated from B. Can someone explain the details of this configuration? What tool is used to replicate? What MUPDATE configuration makes it switch the users from server B to server A?

Replication with rsync seems too slow for 5000 users.

Cluster with a block device: but if you end up with a heavily corrupted filesystem, you are stuck, and recovery can be long.

Using a SAN: connect your two servers to a SAN and store all of Cyrus' data on one LUN which both servers have access to, then set your cluster software to mount the filesystem automatically before starting Cyrus. But again, if you end up with a heavily corrupted filesystem, you are stuck, and recovery can be long.
Re: high-availability again
zorg wrote: Using Murder: I don't really understand if it can help me; its purpose is load balancing.

Murder, by itself, does not give you high availability. It does give you scalability.

zorg wrote: But some people on this list seem to use it for availability... can someone explain the details of this configuration?

I'm not familiar with this.

zorg wrote: Replication with rsync seems too slow for 5000 users.

It'd be tough to do this in real time. We used to have a setup where we'd rsync to a standby server each night. The plan was to use it as a warm standby in case the primary server happened to fail. Fortunately that never happened.

zorg wrote: Cluster with a block device: but if you end up with a heavily corrupted filesystem, you are stuck, and recovery can be long.

I'm not sure exactly what you mean here, but I think it's safe to say that any time you have a corrupted filesystem it's bad, whether it's a clustered filesystem or not.

zorg wrote: Using a SAN: connect your two servers to a SAN and store all of Cyrus' data on one LUN which both servers have access to...

We're doing this. We have a 4-node Veritas cluster with all IMAP data residing on a SAN. Overall it's working quite well. We had to make some very minor Cyrus code changes so it would get along well with Veritas' cluster filesystem. This setup gives us high availability and scalability. And again, yes, it would be bad if we had a corrupt filesystem. Thanks, Dave
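The nightly warm-standby copy Dave mentions can be as simple as the following shape; the standby hostname and paths are illustrative, and since rsync of a live spool is not atomic, Cyrus should be quiesced or the copy taken from a snapshot.

    # Nightly rsync of the mail spool and Cyrus metadata to a warm
    # standby. -a preserves ownership/times, -H preserves the hard
    # links Cyrus uses for single-instance store, --delete mirrors
    # removals on the standby.
    rsync -aH --delete /var/spool/imap/ standby:/var/spool/imap/
    rsync -aH --delete /var/imap/       standby:/var/imap/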
Re: high-availability again
We're doing this. We have a 4-node Veritas cluster with all IMAP data residing on a SAN... [snip]

What sort of changes did you have to make? We're planning on doing something similar with Sun Cluster. Why Sun Cluster? Well, it's the only cluster environment that WebCT supports, and managing two completely different clustering technologies would be a bit much for our little brains. ;-) We're not planning on doing anything particularly sophisticated, just failover when one node fails. Amos
Re: high-availability again
Amos wrote: What sort of changes did you have to make?

We just had to change map_refresh() to call mmap() with MAP_PRIVATE instead of MAP_SHARED. Since mmap() is being called with PROT_READ anyway, this doesn't affect the operation of the application, because the mapped region is never written through the mapping. Veritas CFS was not very efficient about maintaining cache coherency across all cluster nodes when we were using MAP_SHARED. Everything worked, but under heavy load it became extremely slow.

Amos wrote: We're planning on doing something similar with Sun Cluster... [snip]

The folks who ran the upgrade project here tested both Sun Cluster and Veritas Cluster, and chose Veritas Cluster. We have a few other systems here running Sun Cluster (not related to our Cyrus installation). Personally, I haven't been overwhelmingly impressed by Sun Cluster, and I really like Veritas Cluster. I have to admit that I try to avoid the systems here that have Sun Cluster on them, so my opinion might be unfairly biased. I can also say that we received unmatched technical support from Veritas when we did encounter problems. Thanks, Dave
Re: high-availability again
Dave McMurtrie wrote: We just had to change map_refresh() to call mmap() with MAP_PRIVATE instead of MAP_SHARED... [snip]

Actually, the important code change for any active/active cluster configuration is to make sure the stage. files used during LMTP delivery have unique filenames across the cluster. There are some other setup differences related to the same issue, such as symlinking /var/imap/proc, /var/imap/socket, and (if you care) /var/imap/log to local filesystem space on each cluster node; you could make these names unique across the cluster with code changes too, if you wanted to make code changes for these as well. We added a clusternode parameter to imapd.conf to accomplish this for the LMTP stage. files. Otherwise, it just worked. Ben

-- Ben Carter, University of Pittsburgh/CSSD
Re: high-availability again
Ben Carter wrote: [snip]

So y'all are doing active/active? What version of Cyrus? Amos
Re: Funding Cyrus High Availability
David Carter wrote: [David Lang had written: 5. Active/Active: designate one of the boxes as primary and identify all items in the datastore that absolutely must not be subject to race conditions between the two boxes (message UUIDs, for example). In addition to implementing the replication needed for #1, modify all functions that need to update these critical pieces of data to update them on the master and let the master update the other box.] We may be talking at cross purposes (and it's entirely likely that I've got the wrong end of the stick!), but I consider active-active to be the case where there is no primary: users can make changes to either system, and if the two systems lose touch with each other they have to resolve their differences when contact is re-established.

I'd go for #5 as well. Since this is a setup where there is no primary at all, I suppose it is quite a different design from the #1-4 solutions. Because of that, I would think it rather useless to complete those steps in order to get #5 right, but I may well be wrong. I would be most happy if the work started on #5. Personally I don't care that much at the moment about #6, but I can imagine that this is different for others.

But then, if the design is that every machine tracks changes and has them propagated (actively or passively) to n hosts, there is no risk of missing things or failing to recover, I guess. (It's not so hard to keep track of that: once all hosts have a given change, drop it. It's only possible that a slave is out of sync for a very short time, and why would that be so wrong? And if it is so wrong, then maybe fix that later, since that would make the work easier.) This could be the task of the cyrus daemon, but it could just as well be the job of murder, as Jure suggests (or both?). I'm not entirely sure that is what we want, but it could be done if it fits nicely (and it can be assured that there is always a murder to talk to).

If there is a problem with UID selection, I don't see a problem in making one of the servers responsible for that task. We don't even need an election system for that: you could define a fixed preference order for the servers; if the server with the highest preference is down, the next one takes over its job. It's just that to the users all machines should appear active (and in case of failover the remaining machines should stay active, not go read-only or require manual intervention). Paul
Re: Funding Cyrus High Availability
On Sun, 19 Sep 2004, David Lang wrote: here is the problem: you have a new message created on both servers at the same time. How do you allocate the UID without any possibility of stepping on each other?

With a new UIDVALIDITY you can choose any ordering you like. Of course one of the two servers has to make that choice, and the potential for race conditions here and elsewhere in an active-active solution is amusing.

-- David Carter, University Computing Service, Cambridge
Re: Funding Cyrus High Availability
On Sun, 19 Sep 2004, David Lang wrote: assuming that the simplest method would cost ~$3000 to code, I would make a wild guess that the ballpark figures would be:

1. active/passive without automatic failover: $3k
2. active/passive with automatic failover (limited to two nodes or within a murder cluster): $4k
3. active/passive with updates pushed to the master: $5k
4. #3 with auto failover (failover not limited to two nodes or a single murder cluster): $7k
5. active/active (limited to a single geographic location): $10k
6. active/active/active (no limits): $30k

In addition, automatically re-merging things after a split-brain has happened would probably be another $5k.

I think that you are missing a zero (or at least a fairly substantial multiplier!) from 5. Numbers 1-4 can be done without substantial changes to the Cyrus core code, and Ken would be able to use my code as a reference implementation, even if he wanted to recode everything from scratch. 5 and 6 would require a much more substantial redesign, and I suspect quite a lot of trial and error, as this is unexplored territory for IMAP servers.

-- David Carter, University Computing Service, Cambridge
Re: Funding Cyrus High Availability
On Mon, 20 Sep 2004, David Carter wrote: I think that you are missing a zero (or at least a fairly substantial multiplier!) from 5. [snip]

Thanks, this is exactly the type of feedback that I was hoping to get. So you are saying that #5 is more like $50k-$100k, and #6 goes up from there. OK folks, how much are you really willing to pay for this? And, since the amount of work involved translates fairly directly into both cost and time, how long are you willing to go with nothing? David Lang
Re: Funding Cyrus High Availability
On Mon, 20 Sep 2004, Paul Dekkers wrote: I'd go for #5 as well. Since this is a setup where there is no primary at all, I suppose it is quite a different design from the #1-4 solutions. [snip]

Actually, I think most of the work necessary for #1 is also needed for #5-6. For #1 you need the ability for a system to report all its changes to a daemon, and the ability for a system to read in changes and implement them. #5 needs the same abilities, plus the ability to resolve conflicts. The HA steps of #2 and #4 don't gain that much, but they can also be done externally to Cyrus, so it's not a problem to skip them. #3 involves changes to the update code to have Cyrus take special actions with some types of updates; there would need to be changes in the same area for #5, but they would be different. David Lang
Re: Funding Cyrus High Availability
On Mon, 20 Sep 2004, David Lang wrote: Thanks, this is exactly the type of feedback that I was hoping to get. So you are saying that #5 is more like $50k-$100k, and #6 goes up from there.

If anyone could implement active-active for Cyrus from scratch in 100 to 150 hours it would be Ken, but I think that it's a tall order. Sorry.

-- David Carter, University Computing Service, Cambridge
RE: Funding Cyrus High Availability
On Fri, 17 Sep 2004 [EMAIL PROTECTED] wrote: From: David Lang [mailto:[EMAIL PROTECTED]] Mike, one of the problems with this is that different databases have different interfaces and capabilities. If you design it to work on Oracle and then try to make it work on MySQL, there are going to be quite a few things you need to change. --snip-- Another issue in all this is the maintenance of the resulting code. If this code can be used in many different situations, then more people will use it (probably including CMU) and it will be maintained as a side effect of any other changes. However, if it's tailored to a very narrow situation, then only the people who have that particular problem will use it, and it's likely to have issues with new changes.

I'd actually figured something like ODBC would be used, with prepared statements. /shrug. Abstract the whole interface issue.

Unfortunately there are a few problems with this. To start with, ODBC is not readily available on all platforms. Secondly, ODBC can't cover up the fact that different database engines have vastly differing capabilities. If you don't use any of those capabilities you don't run into this pitfall, but if you want to use them, you will. I really wish that ODBC lived up to its hype, but in practice only the most trivial database users can transparently switch from database to database by changing the ODBC config. David Lang
Re: Funding Cyrus High Availability
There are many ways of doing high availability. This is an attempt to outline the various methods with their advantages and disadvantages. Ken and David (and anyone else who has thoughts on this), please feel free to add to this. I'm attempting to outline them roughly in order of complexity.

1. Active-slave replication with manual failover

This is where you configure one machine to output all changes to a local daemon and another machine to implement the changes that are read from a local daemon.

Pro: the simplest implementation. Since it makes no assumptions about how you are going to use it, it also sets no limits on how it is used. This is the basic functionality that all other variations will need, so it's not wasted work no matter what is done later. It allows multiple slaves from a single master, and allows the propagation traffic pattern to be defined by the sysadmin (either master directly to all slaves, or a tree-like propagation to save on WAN bandwidth when multiple slaves are co-located). By involving a local daemon at each server there is a lot of flexibility in exactly how the replication takes place. For example (see the sketch after this message), you could: use netcat as your daemon for instant transmission of the messages; have a daemon that caches the messages so that if the link drops the messages are saved; have a daemon that gets an acknowledgement from the far side that the message got through; have a daemon that batches the messages up and compresses them for more efficient transport; have a daemon that delays all messages by a given time period, to give you a way to recover from logical corruption without having to go to a backup; have a daemon that filters the messages (say, one that updates everything except that it won't delete any messages, so you have a known safe archive of all messages); etc.

Con: since it makes no assumptions about how you are going to use it, it also gives you no help in using it in any particular way.

2. Active-slave replication with automatic failover

This takes #1, limits it to a pair of boxes, and through changes to murder or other parts of Cyrus swaps the active/slave status of the two boxes.

Pro: makes setting up an HA pair of boxes easier; increases availability by decreasing downtime.

Con: this functionality can be duplicated without changes to Cyrus by using an external HA/cluster software package. Since this now assumes a particular mode of operation, it starts to limit other uses (for example, if this is implemented as part of murder, it won't help much if you are trying to replicate to a DR datacenter several thousand miles away). Split-brain conditions become Cyrus' responsibility to prevent or resolve, and those are fundamentally hard problems to get right in all cases.

3. Active-slave replication with the slave able to accept client connections

This takes #1 and further modifies the slave so that requests that would change the contents of things get relayed to the active box, and the results of the change get propagated back down before they are visible to the client.

Pro: simulates active/active operation, although it does cause longer delays when clients issue some commands. Use of slaves for local access can reduce the load on the master, resulting in higher performance. It can be cascaded to multiple slaves and multiple tiers of slaves as needed. In case of problems on the master, the slaves can continue to operate as read-only servers, providing degraded service while the master is fixed. Depending on the problem with the master, this may be much preferable to having to re-sync the master or recover from a split-brain situation.

Con: more extensive modifications are needed to trap all changes and propagate them up to the master. How does the slave know when the master has implemented a change (so that it can give the result to the client)? It raises questions about the requirement to confirm all updates before the slave can respond to the client (for example, if a client on a slave reads a message that is flagged as new, should the slave wait until the master confirms that it knows the message has been read before giving it to the client, or should it hand over the message and not worry if the update fails on the master?). Since the slave needs to send updates to the master, the latency of the link between them can become a limiting factor in the performance that clients see when connecting to the slave.

4. #3 with automatic failover

Since #3 supports multiple slaves, the number of failover scenarios grows significantly: you have multiple machines that could be the new master, and you have the split-brain scenario to watch out for.

Pro: increased availability by decreasing failover time; potentially easier to set up than with external clustering software.

Con: increased complexity.
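As a concrete (and deliberately naive) rendering of option #1's daemon idea, here is the netcat variant mentioned above. The change-log path, the port, and the apply-change script are hypothetical, and traditional-netcat option syntax is assumed.

    # Master side: stream each change record to the slave as it is
    # appended to a hypothetical replication log.
    tail -F /var/imap/replication.log | nc slave.example.org 2005

    # Slave side: listen on the agreed port (traditional netcat
    # syntax; some nc variants want 'nc -l 2005') and hand each
    # record to a hypothetical apply script.
    nc -l -p 2005 | while read -r record; do
        /usr/local/bin/apply-change "$record"
    done

Any of the fancier daemons in the list (caching, acknowledging, batching, delaying, filtering) would slot into the same pipe in place of plain nc.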
Re: Funding Cyrus High Availability
On Sun, 19 Sep 2004, David Lang wrote: 5. Active/Active: designate one of the boxes as primary and identify all items in the datastore that absolutely must not be subject to race conditions between the two boxes (message UUIDs, for example). In addition to implementing the replication needed for #1, modify all functions that need to update these critical pieces of data to update them on the master and let the master update the other box.

We may be talking at cross purposes (and it's entirely likely that I've got the wrong end of the stick!), but I consider active-active to be the case where there is no primary: users can make changes to either system, and if the two systems lose touch with each other they have to resolve their differences when contact is re-established.

UUIDs aren't a problem (each machine in a cluster owns its own fraction of the address space). Message UIDs are a big problem. I guess in the case of conflict you could bump the UIDVALIDITY value on a mailbox and reassign UIDs for all the messages, using timestamps to determine the eventual ordering of messages. Now that I think about it, maybe that's not a totally absurd idea. It would involve a lot of work, though.

David Lang wrote: Pro: best use of available hardware, as the load is split almost evenly between the boxes; best availability, because if there is a failure half of the clients won't see it at all.

Actually, this is what I do right now by having two live mailstores. Half the mailboxes on each system are active, the remainder are passive.

-- David Carter, University Computing Service, Cambridge
Re: Funding Cyrus High Availability
On Sun, 19 Sep 2004, David Carter wrote: [snip] UUIDs aren't a problem (each machine in a cluster owns its own fraction of the address space). Message UIDs are a big problem. I guess in the case of conflict you could bump the UIDVALIDITY value on a mailbox and reassign UIDs for all the messages, using timestamps to determine the eventual ordering of messages. Now that I think about it, maybe that's not a totally absurd idea. It would involve a lot of work, though.

The problem is that when both are up, you have to have one of them allocate the message UIDs, or you have to change the UIDVALIDITY for every new message that arrives. Here is the problem: you have a new message created on both servers at the same time. How do you allocate the UID without any possibility of stepping on each other? The only way to do this is to have some sort of locking so that only one machine at a time can allocate UIDs. You can shuffle this responsibility back and forth between machines, but there's a significant amount of overhead in doing that, so the usual answer is just to have one machine issue the numbers and the other ask the first for a number when it needs one.

Changing UIDVALIDITY while recovering from a split-brain is probably going to be needed, but as you say it's a lot of work (which is why I'm advocating that the simpler options get released first :-)

David Carter wrote: Actually, this is what I do right now by having two live mailstores. Half the mailboxes on each system are active, the remainder are passive.

Right, but what this would allow is sharing the load on individual mailboxes. Usually this won't matter, but I could see it for shared mailboxes. David Lang
Re: Funding Cyrus High Availability
On Sun, 19 Sep 2004 00:52:08 -0700 (PDT), David Lang [EMAIL PROTECTED] wrote: Nice review of the replication ABC :) Here are my thoughts:

1. Active-slave replication with manual failover

This is really the simplest way to do it. Rsync (and friends) does 90% of the required job here; the only thing it lacks is the concept of the mailbox as a unit. It would be nice if our daemon here did its job in an atomic way. A few days ago someone was asking for an event notification system that could call some program when a certain action happened on a mailbox; something like that would come in handy here, I think. :)

2. Active-slave replication with automatic failover

2 is really just 1 + your heartbeat package of choice and some scripts to tie it all together.

3. Active-slave replication with the slave able to accept client connections

Here it would be good to start thinking about the app itself and define connections better. Cyrus has three kinds of connections that modify a mailbox: LMTP, which puts new mail into the mailbox; POP, which (generally) retrieves (and deletes) mail; and IMAP, which does both plus more (folder operations and moving mail around). Now, if you decide that it does not hurt if the slave is a bit out of date when it accepts a connection (though I guess most of us would find that unacceptable), you can ditch some of the complexity; but you'd still want the changes made on the slave in that connection to propagate up to the master. I don't really like this, because the concepts of master and slave get blurred here and things can easily end in a mess. Once you have mailstores synchronizing each other in a way that is not very well defined, you'll end up with conflicts sooner or later. There are unpredictable factors like network latency that can easily lead you into unexpected situations.

4. #3 with automatic failover

Another level of mess over 3. :)

5. Active/Active [snip]

Exactly. This is the atomicity I was mentioning above. I'd say this is going to be the larger part of the job.

6. active/active/active/...

This is what most of us would want. Despite everything you've said, I still think this *can* be done in a relatively simple way. See my previous mail where I was dreaming about the whole HA concept in a RAID way. There I assumed murder as the only agent through which clients would be able to access their mailboxes. If you think of murder handling all the jobs of your daemon in 1-4, one thing you gain immediately is much simpler synchronization of actions between the mailstore machines. If you start empty, or with exactly the same data on two machines, all that murder needs to do is take care that both receive the same commands and data in the same order. Also, if you put all the logic in one place, the backend mailstores need not be taught any special tricks and can remain pretty much as they are today. Or am I missing something?

David Lang wrote: personally I would like to see #1 (with a sample daemon or two to provide basic functionality and leave the doors open for more creative uses) followed by #3, while people try to figure out all the problems with #5 and #6.

And I would like to see us come to a conclusion here about what kind of HA setup would be best for everyone, and focus our energy on a single implementation. I have enough old hardware here (and I'm getting more in about a month) that I can set up a nice little test environment. It also looks like I'll have plenty of time from February to June 2005, so I can volunteer to be a tester.

David Lang wrote: there are a lot of scenarios that are possible with #1 or #3 that are not possible with #5.

One, I think, is a slave-of-a-slave-of-a-slave (...) kind of setup. Does anybody really need such a setup for mail? I understand it for LDAP, for example, and there are even some cases where it is useful for an SQL database, but I see no reason to have it for a mail server.

-- Jure Pecar, http://jure.pecar.org/