Overall, the Ceph GUI is great. I actually got Ceph up and running (and working) this time! Syncing ceph.conf through corosync is such an obvious way to simplify things... for small clusters, anyway.

I am seeing some problems, however, and I'm not sure if they're just me, or if I should be opening bugs:


1. I have one node that's up and running just fine, and pvecm claims everything is fine, but I can't migrate VMs to it if they were started somewhere else - migration always fails, claiming the node is dead. Nothing unusual appears in any logfile that I can see... or at least nothing that looks bad to me. I can create a new VM on that node, migrate it (online) to another node and migrate it back (online again), but VMs that were started on another node won't migrate.

2. CPU usage in the "Summary" screen of each VM sometimes reports nonsensical values: right now one VM is supposedly using 126% of 1 vCPU.

3. The Wiki page on setting up CEPH Server doesn't mention that you can do most of the setup from within the GUI. Since I have write access there, I guess I should fix it myself :-).

4. (This isn't really new...) SPICE continues to be a major PITA when running Ubuntu 12.04 LTS as the management client. Hmm, I just found a PPA with virt-viewer packages that work. I should update the Wiki with that info, too.

5. Stopping VMs with HA enabled is now an *extremely* slow process. If I disable HA for a particular VM, I notice that Stop also produces a Shutdown task and takes longer than it used to, but not unreasonably so. I don't understand why Stop isn't instantaneous, though. Typing "stop" into a qm monitor is also slow... the only way I have to stop a VM quickly is to kill the KVM process running it.

6. I'm not sure if this is new, but when a VM is under HA and I stop it manually, it immediately restarts. I don't know if I ever tried that under 3.1 Enterprise... maybe it always worked this way? (See the sketch below this list for my guess at what's going on.)
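
My guess for items 5 and 6 is that rgmanager is involved: once a VM is an HA service, stopping the KVM process directly just looks like a failure the cluster should recover from, and stop/shutdown requests get routed through the resource manager. If I'm reading the wiki right, the HA-aware way to stop and start such a VM on 3.x is to disable its rgmanager service rather than using qm, roughly like this (VMID 100 is just an example, and I haven't actually tested this on the affected node yet):

   # stop an HA-managed VM by disabling its rgmanager service
   clusvcadm -d pvevm:100

   # start it again by re-enabling the service
   clusvcadm -e pvevm:100

   # show current service states
   clustat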


Ceph speeds are barely acceptable (10-20 MB/s) but that's typical of Ceph in my experience so far, even with caching turned on. (Still a bit of a letdown compared to Sheepdog's 300 MB/s burst throughput, though.)
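
In case anyone wants to compare numbers, I believe the raw pool throughput can be measured with rados bench, independent of QEMU; something like this (the pool name "rbd" and 16 threads are just examples):

   # 60-second write benchmark, keeping the objects around for a read test
   rados -p rbd bench 60 write -t 16 --no-cleanup

   # sequential read benchmark against the objects written above
   rados -p rbd bench 60 seq -t 16

   # remove the benchmark objects when done
   rados -p rbd cleanup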

One thing I'm not sure of is OSD placement: if I have two drives per host dedicated to Ceph (and thus two OSDs per host), and my pool "size" is 2, does that mean a single node failure could render some data unreachable? I've adjusted my "size" to 3 just in case, but I don't understand how this works. Sheepdog guarantees that multiple copies of an object won't be stored on the same host for exactly this reason, but I can't tell what Ceph does.
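
From what I've read since writing that, the answer depends on the CRUSH rule rather than just the pool size: if the rule's chooseleaf step uses type "host", replicas go to different hosts; if it uses type "osd", two copies can land on the same box. Something like this should show what a pool is actually doing (pool name is just an example, and I haven't double-checked the exact field names):

   # replica count for the pool
   ceph osd pool get rbd size

   # which CRUSH ruleset the pool uses
   ceph osd pool get rbd crush_ruleset

   # dump the rules and look for "chooseleaf ... type host"
   ceph osd crush rule dump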

Also not sure what's going on with thin provisioning; I guess Ceph and QEMU/KVM don't do thin provisioning at all, in any way, shape or form?
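
My understanding is that RBD images are supposed to be sparse, so maybe it's only the usage reporting that's confusing me. If that's right, something like this should show how much of an image is actually allocated versus its provisioned size (the image name is just an example):

   # provisioned size and image details
   rbd info rbd/vm-100-disk-1

   # sum the extents that actually contain data
   rbd diff rbd/vm-100-disk-1 | awk '{ sum += $2 } END { print sum/1024/1024 " MB used" }'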

--
-Adam Thompson
 [email protected]
