Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down !

2015-03-16 Thread Chu Duc Minh
March 16, 2015 at 7:49 AM > To: Florent B > Cc: "ceph-users@lists.ceph.com" > Subject: Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down ! > >I'm using the latest Giant and have the same issue. When i increase > PG_num of a pool from 2048 to 2148, my VMs is

Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down !

2015-03-16 Thread Michael Kuriger
il.com>> Date: Monday, March 16, 2015 at 7:49 AM To: Florent B mailto:flor...@coppint.com>> Cc: "ceph-users@lists.ceph.com<mailto:ceph-users@lists.ceph.com>" mailto:ceph-users@lists.ceph.com>> Subject: Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down ! I'

Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down !

2015-03-16 Thread Chu Duc Minh
I'm using the latest Giant and have the same issue. When i increase PG_num of a pool from 2048 to 2148, my VMs is still ok. When i increase from 2148 to 2400, some VMs die (Qemu-kvm process die). My physical servers (host VMs) running kernel 3.13 and use librbd. I think it's a bug in librbd with cr

Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down !

2015-03-16 Thread Azad Aliyar
May I know your ceph version.?. The latest version of firefly 80.9 has patches to avoid excessive data migrations during rewighting osds. You may need set a tunable inorder make this patch active. This is a bugfix release for firefly. It fixes a performance regression in librbd, an important CRUS

Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down !

2015-03-16 Thread Steffen W Sørensen
On 16/03/2015, at 12.23, Alexandre DERUMIER wrote: >>> We use Proxmox, so I think it uses librbd ? > > As It's me that I made the proxmox rbd plugin, I can confirm that yes, it's > librbd ;) > Is the ceph cluster on dedicated nodes ? or vms are running on same nodes > than osd daemons ? My c

Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down !

2015-03-16 Thread Alexandre DERUMIER
derumier" Cc: "ceph-users" Envoyé: Lundi 16 Mars 2015 12:35:11 Objet: Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down ! On 03/16/2015 12:23 PM, Alexandre DERUMIER wrote: >>> We use Proxmox, so I think it uses librbd ? > As It's me that I made t

Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down !

2015-03-16 Thread Alexandre DERUMIER
ent Bautista" À: "aderumier" Cc: "ceph-users" Envoyé: Lundi 16 Mars 2015 11:14:45 Objet: Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down ! On 03/16/2015 11:03 AM, Alexandre DERUMIER wrote: > This is strange, that could be: > > - qemu crash, maybe

Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down !

2015-03-16 Thread Steffen W Sørensen
On 16/03/2015, at 11.14, Florent B wrote: > On 03/16/2015 11:03 AM, Alexandre DERUMIER wrote: >> This is strange, that could be: >> >> - qemu crash, maybe a bug in rbd block storage (if you use librbd) >> - oom-killer on you host (any logs ?) >> >> what is your qemu version ? >> > > Now, we

Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down !

2015-03-16 Thread Alexandre DERUMIER
t;ceph-users" Envoyé: Lundi 16 Mars 2015 10:11:43 Objet: Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down ! Of course but it does not explain why VMs stopped... That full system slows down, OK, but brutal stop... On 03/14/2015 07:00 PM, Andrija Panic wrote: changin PG

Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down !

2015-03-14 Thread Andrija Panic
changin PG number - causes LOOOT of data rebalancing (in my case was 80%) which I learned the hard way... On 14 March 2015 at 18:49, Gabri Mate wrote: > I had the same issue a few days ago. I was increasing the pg_num of one > pool from 512 to 1024 and all the VMs in that pool stopped. I came to

Re: [ceph-users] [SPAM] Changing pg_num => RBD VM down !

2015-03-14 Thread Gabri Mate
I had the same issue a few days ago. I was increasing the pg_num of one pool from 512 to 1024 and all the VMs in that pool stopped. I came to the conclusion that doubling the pg_num caused such a high load in ceph that the VMs were blocked. The next time I will test with small increments. On 12:3