--- Begin Message ---
Hi Stefan,

El 3/6/23 a las 13:47, Stefan Radman via pve-user escribió:
I want to create a Proxmox VE HCI cluster on 3 old but indentical DL380 Gen9 
hosts (128GB, Dual CPU, 4x1GbE, 2x10GbE, 6x1.2T SFF 10K 12Gb SAS HDD on P440ar 
controller).

Corosync will run over 2 x 1GbE, connected to separate VLANs on different 
switches.
Ceph storage network will be a 10GbE routed mesh.

The P440ar controller will be switched to HBA mode.

I am planning to use 2 HDDs as redundant boot disks with ZFS (a waste, I know).

The other 4 HDDs will be used as Ceph OSDs in a single HDD pool.
Considering a single OSD failure the HDD pool should provide ~3TB usable 
capacity.

With 2 SFF slots still available I am considering adding one or two SSDs to 
each host for a Ceph SSD pool to improve performance for some virtual disks.

I am thinking to install a single SSD in each host as the failure of a second 
SSD would limit the usable capacity to 50% of the SSD pool because Ceph would 
immediately try re-create the 3rd replica on the still working SSD on the same 
node (from what I have read up to now).
A second SSD would thus not buy me any further usable capacity (I cannot create 
a pool of 4 SSDs because there are no more slots available).
Is that correct?

Yes, you got it right if you're planning to use replicated pools with size=3 min=2 as recommended. You can consider using a second SSD in each node to provide fast wal/db space for HDD OSDs.


With a single SSD in each host if that SSD fails, how would VMs on that same 
host behave?

VMs don't know about what happens to local OSDs in their host; for them they're as any other (remote) Ceph OSD.

Are they going to continue to run happily or is I/O to their virtual disks 
going to stop until the SSD OSD is replaced?

This depends on the min value in the replica pool. Recommended value 2 will keep all your VMs working. If that was 3, then all VMs in the cluster will stop their I/O.

If I/O to the SSD pool stops for all VMs running on the affected host, would HA 
fail them over to another host? (considering that 2 copies of the data exist on 
the other 2 hosts)

That won't happen (and wouldn't help at all).

Cheers

Eneko Lacunza
Zuzendari teknikoa | Director técnico
Binovo IT Human Project

Tel. +34 943 569 206 | https://www.binovo.es
Astigarragako Bidea, 2 - 2º izda. Oficina 10-11, 20180 Oiartzun

https://www.youtube.com/user/CANALBINOVO
https://www.linkedin.com/company/37269706/



--- End Message ---
_______________________________________________
pve-user mailing list
[email protected]
https://lists.proxmox.com/cgi-bin/mailman/listinfo/pve-user

Reply via email to