Re: [Users] ComputeNode installation failed ovirt 3.3
Hi, can nobody shed some light on this? This is reproducible, we always get the exact same error during this command: [org.ovirt.engine.core.utils.ssh.SSHDialog] (pool-6-thread-32) SSH stderr during command root@10.0.1.4:'umask 0077; MYTMP=$(mktemp -t ovirt-XX); trap chmod -R u+rwX \${M YTMP}\ /dev/null 21; rm -fr \${MYTMP}\ /dev/null 21 0; rm -fr ${MYTMP} mkdir ${MYTMP} tar --warning=no-timestamp -C ${MYTMP} -x ${MYTMP}/setup DIALOG/dialect=str:machine DIALOG/cust omization=bool:True': stderr: bash: line 1: 8635 Segmentation fault ${MYTMP}/setup DIALOG/dialect=str:machine DIALOG/customization=bool:True it's an segmentation fault, so it seems to me there must be a bug hiding somewhere. Can anyone confirm to me that what I try to do is supported and/or works? Adding a 2.6.1 oVirt-Node to an 3.3.0-4.el6 Management Node? We are really stuck at this point. This error always shows up when trying to acknowledge the node through the webadmin UI, after registration. Regards Sven On 09.10.2013 11:22, Sven Kieske wrote: Hi, we have successfully deployed ovirt engine 3.3. However, when adding a node based on ovirt node 2.6.1. we get the following error after registration completed successfully and we try to activate the host via webadmin: Installing Host server4 Starting vdsm Host server4 Installation failed. Unexpected connection termination. I attached an excerpt from the engine.log, which shows that an ssh commands fails, but I don't know why. manual ssh connections to the node from the management node work just fine. The node has the IP 10.0.1.4 in the log. Any help would be appreciated. Regards Sven PS: The same Node works fine with oVirt management 3.2. We then did use the reinstall feature from the node iso to redeploy the node to the management 3.3 server. ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] ComputeNode installation failed ovirt 3.3
Hi, additional testing revealed that we are able to add an Host based on CentOS 6.4 minimal (full blown host) so it can't be a general setup problem or problem with our infrastructure. It must be related to the 2.6.1 node. ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] ComputeNode installation failed ovirt 3.3
- Original Message - From: Sven Kieske s.kie...@mittwald.de To: users@ovirt.org Sent: Wednesday, October 9, 2013 12:22:23 PM Subject: [Users] ComputeNode installation failed ovirt 3.3 Hi, we have successfully deployed ovirt engine 3.3. However, when adding a node based on ovirt node 2.6.1. we get the following error after registration completed successfully and we try to activate the host via webadmin: Installing Host server4 Starting vdsm Host server4 Installation failed. Unexpected connection termination. I attached an excerpt from the engine.log, which shows that an ssh commands fails, but I don't know why. manual ssh connections to the node from the management node work just fine. The node has the IP 10.0.1.4 in the log. You are trying to update not deploy... and you missing the iso image? 2013-10-09 10:50:51,423 ERROR [org.ovirt.engine.core.bll.GetoVirtISOsQuery] (ajp--127.0.0.1-8702-11) ovirt ISOs directory not found. Search in: /usr/share/ovirt-node-iso Any help would be appreciated. Regards Sven PS: The same Node works fine with oVirt management 3.2. We then did use the reinstall feature from the node iso to redeploy the node to the management 3.3 server. ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] ComputeNode installation failed ovirt 3.3
No, we installed the 2.6.1 Node via CD/ISO over Network and this error occurs after registration to the management server, when you try to acknowledge the host via webadmin UI. The Datacenter has no ISO-Domain at this point in the setup. Is this a problem? I noticed this error in the log, but it shouldn't affect acknowledging a host, should it? On 10.10.2013 10:05, Alon Bar-Lev wrote: - Original Message - From: Sven Kieske s.kie...@mittwald.de To: users@ovirt.org Sent: Wednesday, October 9, 2013 12:22:23 PM Subject: [Users] ComputeNode installation failed ovirt 3.3 Hi, we have successfully deployed ovirt engine 3.3. However, when adding a node based on ovirt node 2.6.1. we get the following error after registration completed successfully and we try to activate the host via webadmin: Installing Host server4 Starting vdsm Host server4 Installation failed. Unexpected connection termination. I attached an excerpt from the engine.log, which shows that an ssh commands fails, but I don't know why. manual ssh connections to the node from the management node work just fine. The node has the IP 10.0.1.4 in the log. You are trying to update not deploy... and you missing the iso image? 2013-10-09 10:50:51,423 ERROR [org.ovirt.engine.core.bll.GetoVirtISOsQuery] (ajp--127.0.0.1-8702-11) ovirt ISOs directory not found. Search in: /usr/share/ovirt-node-iso Any help would be appreciated. Regards Sven PS: The same Node works fine with oVirt management 3.2. We then did use the reinstall feature from the node iso to redeploy the node to the management 3.3 server. ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] ComputeNode installation failed ovirt 3.3
Am Mittwoch, den 09.10.2013, 09:22 + schrieb Sven Kieske: Hi, we have successfully deployed ovirt engine 3.3. However, when adding a node based on ovirt node 2.6.1. we get the following error after registration completed successfully and we try to activate the host via webadmin: Installing Host server4 Starting vdsm Host server4 Installation failed. Unexpected connection termination. I attached an excerpt from the engine.log, which shows that an ssh commands fails, but I don't know why. manual ssh connections to the node from the management node work just fine. The node has the IP 10.0.1.4 in the log. Any help would be appreciated. Hey Sven, as said on IRC, I'd recommend to try a draft build form [0], they contain some fixes for the 3.0 branch. 2.6. might be to old and not contain all the stuff that is needed by engines setup script. Greetings fabian -- [0] http://fedorapeople.org/~fabiand/node/ ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] ComputeNode installation failed ovirt 3.3
Am Donnerstag, den 10.10.2013, 04:05 -0400 schrieb Alon Bar-Lev: - Original Message - From: Sven Kieske s.kie...@mittwald.de To: users@ovirt.org Sent: Wednesday, October 9, 2013 12:22:23 PM Subject: [Users] ComputeNode installation failed ovirt 3.3 Hi, we have successfully deployed ovirt engine 3.3. However, when adding a node based on ovirt node 2.6.1. we get the following error after registration completed successfully and we try to activate the host via webadmin: Installing Host server4 Starting vdsm Host server4 Installation failed. Unexpected connection termination. I attached an excerpt from the engine.log, which shows that an ssh commands fails, but I don't know why. manual ssh connections to the node from the management node work just fine. The node has the IP 10.0.1.4 in the log. You are trying to update not deploy... and you missing the iso image? 2013-10-09 10:50:51,423 ERROR [org.ovirt.engine.core.bll.GetoVirtISOsQuery] (ajp--127.0.0.1-8702-11) ovirt ISOs directory not found. Search in: /usr/share/ovirt-node-iso Hey Alon, besides this error we also saw this: /var/log/ovirt-engine/engine.log:2013-10-10 08:35:27,016 ERROR [org.ovirt.engine.core.utils.ssh.SSHDialog] (pool-6-thread-4) SSH stderr during command root@10.0.1.4:'umask 0077; MYTMP=$(mktemp -t ovirt-XX); trap chmod -R u+rwX \${MYTMP}\ /dev/null 21; rm -fr \${MYTMP}\ /dev/null 21 0; rm -fr ${MYTMP} mkdir ${MYTMP} tar --warning=no-timestamp -C ${MYTMP} -x ${MYTMP}/setup DIALOG/dialect=str:machine DIALOG/customization=bool:True': stderr: bash: line 1: 10803 Segmentation fault ${MYTMP}/setup DIALOG/dialect=str:machine DIALOG/customization=bool:True which shows that there is some segfault. Is there a way to enable debugging for this bash snippet? E.g. to keep the MYTMP dir if something goes wrong? Or enable bash -x (if setup is a bash script) to see what is being called. To me it's a black box what's happening inside setup. Greetings fabian ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Re: [Users] ComputeNode installation failed ovirt 3.3
Hi, just in case someone else runs into the same problem. We were able to workaround this by using an alternate draft-iso image kindly provided by fabian: http://fedorapeople.org/~fabiand/node/ovirt-node-iso-3.0.1-1.0.201310020841draft.vdsm.el6.iso if you want the fedora version it's: http://fedorapeople.org/~fabiand/node/ovirt-node-iso-3.1.0-0.999.201310020841draft.vdsm.fc19.iso We didn't test the fedora version. This iso registers completly with Mangement 3.3 on the same host, no more segfaults :-) A big thank you goes out to fabian, who helped me debugging this. Regards Sven ___ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
[Users] ComputeNode installation failed ovirt 3.3
Hi, we have successfully deployed ovirt engine 3.3. However, when adding a node based on ovirt node 2.6.1. we get the following error after registration completed successfully and we try to activate the host via webadmin: Installing Host server4 Starting vdsm Host server4 Installation failed. Unexpected connection termination. I attached an excerpt from the engine.log, which shows that an ssh commands fails, but I don't know why. manual ssh connections to the node from the management node work just fine. The node has the IP 10.0.1.4 in the log. Any help would be appreciated. Regards Sven PS: The same Node works fine with oVirt management 3.2. We then did use the reinstall feature from the node iso to redeploy the node to the management 3.3 server. 2013-10-09 10:50:49,697 INFO [org.ovirt.engine.core.bll.InstallerMessages] (VdsDeploy) Installation 10.0.1.4: Starting vdsm 2013-10-09 10:50:49,725 INFO [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (VdsDeploy) Correlation ID: 38f064c, Call Stack: null, Custom Event ID: -1, Message: Installing Host server4. Starting vdsm. 2013-10-09 10:50:51,423 ERROR [org.ovirt.engine.core.bll.GetoVirtISOsQuery] (ajp--127.0.0.1-8702-11) ovirt ISOs directory not found. Search in: /usr/share/ovirt-node-iso 2013-10-09 10:50:52,784 ERROR [org.ovirt.engine.core.bll.VdsDeploy] (VdsDeploy) Error during deploy dialog: java.io.IOException: Unexpected connection termination at org.ovirt.otopi.dialog.MachineDialogParser.nextEvent(MachineDialogParser.java:388) [otopi.jar:] at org.ovirt.otopi.dialog.MachineDialogParser.nextEvent(MachineDialogParser.java:405) [otopi.jar:] at org.ovirt.engine.core.bll.VdsDeploy._threadMain(VdsDeploy.java:750) [bll.jar:] at org.ovirt.engine.core.bll.VdsDeploy.access$1800(VdsDeploy.java:77) [bll.jar:] at org.ovirt.engine.core.bll.VdsDeploy$45.run(VdsDeploy.java:893) [bll.jar:] at java.lang.Thread.run(Thread.java:724) [rt.jar:1.7.0_25] 2013-10-09 10:50:52,785 ERROR [org.ovirt.engine.core.utils.ssh.SSHDialog] (pool-6-thread-32) SSH stderr during command root@10.0.1.4:'umask 0077; MYTMP=$(mktemp -t ovirt-XX); trap chmod -R u+rwX \${M YTMP}\ /dev/null 21; rm -fr \${MYTMP}\ /dev/null 21 0; rm -fr ${MYTMP} mkdir ${MYTMP} tar --warning=no-timestamp -C ${MYTMP} -x ${MYTMP}/setup DIALOG/dialect=str:machine DIALOG/cust omization=bool:True': stderr: bash: line 1: 8635 Segmentation fault ${MYTMP}/setup DIALOG/dialect=str:machine DIALOG/customization=bool:True 2013-10-09 10:50:52,787 ERROR [org.ovirt.engine.core.utils.ssh.SSHDialog] (pool-6-thread-32) SSH error running command root@10.0.1.4:'umask 0077; MYTMP=$(mktemp -t ovirt-XX); trap chmod -R u+rwX \${M YTMP}\ /dev/null 21; rm -fr \${MYTMP}\ /dev/null 21 0; rm -fr ${MYTMP} mkdir ${MYTMP} tar --warning=no-timestamp -C ${MYTMP} -x ${MYTMP}/setup DIALOG/dialect=str:machine DIALOG/cust omization=bool:True': java.io.IOException: Command returned failure code 139 during SSH session 'root@10.0.1.4' at org.ovirt.engine.core.utils.ssh.SSHClient.executeCommand(SSHClient.java:508) [utils.jar:] at org.ovirt.engine.core.utils.ssh.SSHDialog.executeCommand(SSHDialog.java:311) [utils.jar:] at org.ovirt.engine.core.bll.VdsDeploy.execute(VdsDeploy.java:1039) [bll.jar:] at org.ovirt.engine.core.bll.InstallVdsCommand.installHost(InstallVdsCommand.java:192) [bll.jar:] at org.ovirt.engine.core.bll.InstallVdsCommand.executeCommand(InstallVdsCommand.java:105) [bll.jar:] at org.ovirt.engine.core.bll.ApproveVdsCommand.executeCommand(ApproveVdsCommand.java:49) [bll.jar:] at org.ovirt.engine.core.bll.CommandBase.executeWithoutTransaction(CommandBase.java:1128) [bll.jar:] at org.ovirt.engine.core.bll.CommandBase.executeActionInTransactionScope(CommandBase.java:1213) [bll.jar:] at org.ovirt.engine.core.bll.CommandBase.runInTransaction(CommandBase.java:1871) [bll.jar:] at org.ovirt.engine.core.utils.transaction.TransactionSupport.executeInSuppressed(TransactionSupport.java:174) [utils.jar:] at org.ovirt.engine.core.utils.transaction.TransactionSupport.executeInScope(TransactionSupport.java:116) [utils.jar:] at org.ovirt.engine.core.bll.CommandBase.execute(CommandBase.java:1231) [bll.jar:] at org.ovirt.engine.core.bll.CommandBase.executeAction(CommandBase.java:365) [bll.jar:] at org.ovirt.engine.core.bll.MultipleActionsRunner.executeValidatedCommand(MultipleActionsRunner.java:175) [bll.jar:] at org.ovirt.engine.core.bll.MultipleActionsRunner.RunCommands(MultipleActionsRunner.java:156) [bll.jar:] at org.ovirt.engine.core.bll.MultipleActionsRunner$1.run(MultipleActionsRunner.java:94) [bll.jar:] at org.ovirt.engine.core.utils.threadpool.ThreadPoolUtil$InternalWrapperRunnable.run(ThreadPoolUtil.java:71) [utils.jar:] at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)