Re: [Users] ComputeNode installation failed ovirt 3.3

2013-10-10 Thread Sven Kieske
Hi,

can nobody shed some light on this?
This is reproducible, we always get the exact same error during this
command:

[org.ovirt.engine.core.utils.ssh.SSHDialog] (pool-6-thread-32) SSH
stderr during command root@10.0.1.4:'umask 0077; MYTMP=$(mktemp -t
ovirt-XX); trap chmod -R u+rwX \${M
YTMP}\  /dev/null 21; rm -fr \${MYTMP}\  /dev/null 21 0; rm
-fr ${MYTMP}  mkdir ${MYTMP}  tar --warning=no-timestamp -C
${MYTMP} -x   ${MYTMP}/setup DIALOG/dialect=str:machine DIALOG/cust
omization=bool:True': stderr: bash: line 1:  8635 Segmentation fault
  ${MYTMP}/setup DIALOG/dialect=str:machine DIALOG/customization=bool:True

it's an segmentation fault, so it seems to me there must be a bug
hiding somewhere.

Can anyone confirm to me that what I try to do is supported and/or
works?


Adding a 2.6.1 oVirt-Node to an 3.3.0-4.el6 Management Node?

We are really stuck at this point.

This error always shows up when trying to acknowledge the node
through the webadmin UI, after registration.

Regards

Sven

On 09.10.2013 11:22, Sven Kieske wrote:
 Hi,
 
 we have successfully deployed ovirt engine 3.3.
 However, when adding a node based on ovirt node 2.6.1.
 we get the following error after registration completed successfully
 and we try to activate the host via webadmin:
 
 Installing Host server4 Starting vdsm
 Host server4 Installation failed. Unexpected connection termination.
 
 I attached an excerpt from the engine.log, which shows that an ssh
 commands fails, but I don't know why.
 
 manual ssh connections to the node from the management node work just fine.
 
 The node has the IP 10.0.1.4 in the log.
 
 Any help would be appreciated.
 
 Regards
 
 Sven
 
 PS:
 The same Node works fine with oVirt management 3.2.
 We then did use the reinstall feature from the node
 iso to redeploy the node to the management 3.3 server.
 
 
 
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users
 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] ComputeNode installation failed ovirt 3.3

2013-10-10 Thread Sven Kieske
Hi,

additional testing revealed that we are able to
add an Host based on CentOS 6.4 minimal (full blown host)

so it can't be a general setup problem or problem with our
infrastructure.

It must be related to the 2.6.1 node.
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] ComputeNode installation failed ovirt 3.3

2013-10-10 Thread Alon Bar-Lev


- Original Message -
 From: Sven Kieske s.kie...@mittwald.de
 To: users@ovirt.org
 Sent: Wednesday, October 9, 2013 12:22:23 PM
 Subject: [Users] ComputeNode installation failed ovirt 3.3
 
 Hi,
 
 we have successfully deployed ovirt engine 3.3.
 However, when adding a node based on ovirt node 2.6.1.
 we get the following error after registration completed successfully
 and we try to activate the host via webadmin:
 
 Installing Host server4 Starting vdsm
 Host server4 Installation failed. Unexpected connection termination.
 
 I attached an excerpt from the engine.log, which shows that an ssh
 commands fails, but I don't know why.
 
 manual ssh connections to the node from the management node work just fine.
 
 The node has the IP 10.0.1.4 in the log.

You are trying to update not deploy... and you missing the iso image?

2013-10-09 10:50:51,423 ERROR [org.ovirt.engine.core.bll.GetoVirtISOsQuery] 
(ajp--127.0.0.1-8702-11) ovirt ISOs directory not found. Search in: 
/usr/share/ovirt-node-iso


 
 Any help would be appreciated.
 
 Regards
 
 Sven
 
 PS:
 The same Node works fine with oVirt management 3.2.
 We then did use the reinstall feature from the node
 iso to redeploy the node to the management 3.3 server.
 
 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users
 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] ComputeNode installation failed ovirt 3.3

2013-10-10 Thread Sven Kieske
No,

we installed the 2.6.1 Node via CD/ISO over Network and this error
occurs after registration to the management server, when you try to
acknowledge the host via webadmin UI.

The Datacenter has no ISO-Domain at this point in the setup.
Is this a problem?

I noticed this error in the log, but it shouldn't affect acknowledging
a host, should it?

On 10.10.2013 10:05, Alon Bar-Lev wrote:
 
 
 - Original Message -
 From: Sven Kieske s.kie...@mittwald.de
 To: users@ovirt.org
 Sent: Wednesday, October 9, 2013 12:22:23 PM
 Subject: [Users] ComputeNode installation failed ovirt 3.3

 Hi,

 we have successfully deployed ovirt engine 3.3.
 However, when adding a node based on ovirt node 2.6.1.
 we get the following error after registration completed successfully
 and we try to activate the host via webadmin:

 Installing Host server4 Starting vdsm
 Host server4 Installation failed. Unexpected connection termination.

 I attached an excerpt from the engine.log, which shows that an ssh
 commands fails, but I don't know why.

 manual ssh connections to the node from the management node work just fine.

 The node has the IP 10.0.1.4 in the log.
 
 You are trying to update not deploy... and you missing the iso image?
 
 2013-10-09 10:50:51,423 ERROR [org.ovirt.engine.core.bll.GetoVirtISOsQuery] 
 (ajp--127.0.0.1-8702-11) ovirt ISOs directory not found. Search in: 
 /usr/share/ovirt-node-iso
 
 

 Any help would be appreciated.

 Regards

 Sven

 PS:
 The same Node works fine with oVirt management 3.2.
 We then did use the reinstall feature from the node
 iso to redeploy the node to the management 3.3 server.

 ___
 Users mailing list
 Users@ovirt.org
 http://lists.ovirt.org/mailman/listinfo/users

 
 
 
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] ComputeNode installation failed ovirt 3.3

2013-10-10 Thread Fabian Deutsch
Am Mittwoch, den 09.10.2013, 09:22 + schrieb Sven Kieske:
 Hi,
 
 we have successfully deployed ovirt engine 3.3.
 However, when adding a node based on ovirt node 2.6.1.
 we get the following error after registration completed successfully
 and we try to activate the host via webadmin:
 
 Installing Host server4 Starting vdsm
 Host server4 Installation failed. Unexpected connection termination.
 
 I attached an excerpt from the engine.log, which shows that an ssh
 commands fails, but I don't know why.
 
 manual ssh connections to the node from the management node work just fine.
 
 The node has the IP 10.0.1.4 in the log.
 
 Any help would be appreciated.

Hey Sven,

as said on IRC, I'd recommend to try a draft build form [0], they
contain some fixes for the 3.0 branch.
2.6. might be to old and not contain all the stuff that is needed by
engines setup script.

Greetings
fabian

--
[0] http://fedorapeople.org/~fabiand/node/

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] ComputeNode installation failed ovirt 3.3

2013-10-10 Thread Fabian Deutsch
Am Donnerstag, den 10.10.2013, 04:05 -0400 schrieb Alon Bar-Lev:
 
 - Original Message -
  From: Sven Kieske s.kie...@mittwald.de
  To: users@ovirt.org
  Sent: Wednesday, October 9, 2013 12:22:23 PM
  Subject: [Users] ComputeNode installation failed ovirt 3.3
  
  Hi,
  
  we have successfully deployed ovirt engine 3.3.
  However, when adding a node based on ovirt node 2.6.1.
  we get the following error after registration completed successfully
  and we try to activate the host via webadmin:
  
  Installing Host server4 Starting vdsm
  Host server4 Installation failed. Unexpected connection termination.
  
  I attached an excerpt from the engine.log, which shows that an ssh
  commands fails, but I don't know why.
  
  manual ssh connections to the node from the management node work just fine.
  
  The node has the IP 10.0.1.4 in the log.
 
 You are trying to update not deploy... and you missing the iso image?
 
 2013-10-09 10:50:51,423 ERROR [org.ovirt.engine.core.bll.GetoVirtISOsQuery] 
 (ajp--127.0.0.1-8702-11) ovirt ISOs directory not found. Search in: 
 /usr/share/ovirt-node-iso

Hey Alon,

besides this error we also saw this:

 /var/log/ovirt-engine/engine.log:2013-10-10 08:35:27,016 ERROR
[org.ovirt.engine.core.utils.ssh.SSHDialog] (pool-6-thread-4) SSH stderr
during command root@10.0.1.4:'umask 0077; MYTMP=$(mktemp -t
ovirt-XX); trap chmod -R u+rwX \${MYTMP}\  /dev/null 21;
rm -fr \${MYTMP}\  /dev/null 21 0; rm -fr ${MYTMP}  mkdir
${MYTMP}  tar --warning=no-timestamp -C ${MYTMP} -x 
${MYTMP}/setup DIALOG/dialect=str:machine
DIALOG/customization=bool:True': stderr: bash: line 1: 10803
Segmentation fault  ${MYTMP}/setup DIALOG/dialect=str:machine
DIALOG/customization=bool:True

which shows that there is some segfault.
Is there a way to enable debugging for this bash snippet?
E.g. to keep the MYTMP dir if something goes wrong? Or enable bash -x
(if setup is a bash script) to see what is being called.
To me it's a black box what's happening inside setup.

Greetings
fabian

___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


Re: [Users] ComputeNode installation failed ovirt 3.3

2013-10-10 Thread Sven Kieske
Hi,

just in case someone else runs into the same problem.

We were able to workaround this by using an alternate draft-iso image
kindly provided by fabian:

http://fedorapeople.org/~fabiand/node/ovirt-node-iso-3.0.1-1.0.201310020841draft.vdsm.el6.iso

if you want the fedora version it's:

http://fedorapeople.org/~fabiand/node/ovirt-node-iso-3.1.0-0.999.201310020841draft.vdsm.fc19.iso

We didn't test the fedora version.

This iso registers completly with Mangement 3.3 on the same host, no
more segfaults :-)

A big thank you goes out to fabian, who helped me debugging this.


Regards

Sven
___
Users mailing list
Users@ovirt.org
http://lists.ovirt.org/mailman/listinfo/users


[Users] ComputeNode installation failed ovirt 3.3

2013-10-09 Thread Sven Kieske
Hi,

we have successfully deployed ovirt engine 3.3.
However, when adding a node based on ovirt node 2.6.1.
we get the following error after registration completed successfully
and we try to activate the host via webadmin:

Installing Host server4 Starting vdsm
Host server4 Installation failed. Unexpected connection termination.

I attached an excerpt from the engine.log, which shows that an ssh
commands fails, but I don't know why.

manual ssh connections to the node from the management node work just fine.

The node has the IP 10.0.1.4 in the log.

Any help would be appreciated.

Regards

Sven

PS:
The same Node works fine with oVirt management 3.2.
We then did use the reinstall feature from the node
iso to redeploy the node to the management 3.3 server.
2013-10-09 10:50:49,697 INFO  [org.ovirt.engine.core.bll.InstallerMessages] (VdsDeploy) Installation 10.0.1.4: Starting vdsm
2013-10-09 10:50:49,725 INFO  [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector] (VdsDeploy) Correlation ID: 38f064c, Call Stack: null, Custom Event ID: -1, Message: Installing Host server4. Starting vdsm.
2013-10-09 10:50:51,423 ERROR [org.ovirt.engine.core.bll.GetoVirtISOsQuery] (ajp--127.0.0.1-8702-11) ovirt ISOs directory not found. Search in: /usr/share/ovirt-node-iso
2013-10-09 10:50:52,784 ERROR [org.ovirt.engine.core.bll.VdsDeploy] (VdsDeploy) Error during deploy dialog: java.io.IOException: Unexpected connection termination
at org.ovirt.otopi.dialog.MachineDialogParser.nextEvent(MachineDialogParser.java:388) [otopi.jar:]
at org.ovirt.otopi.dialog.MachineDialogParser.nextEvent(MachineDialogParser.java:405) [otopi.jar:]
at org.ovirt.engine.core.bll.VdsDeploy._threadMain(VdsDeploy.java:750) [bll.jar:]
at org.ovirt.engine.core.bll.VdsDeploy.access$1800(VdsDeploy.java:77) [bll.jar:]
at org.ovirt.engine.core.bll.VdsDeploy$45.run(VdsDeploy.java:893) [bll.jar:]
at java.lang.Thread.run(Thread.java:724) [rt.jar:1.7.0_25]

2013-10-09 10:50:52,785 ERROR [org.ovirt.engine.core.utils.ssh.SSHDialog] (pool-6-thread-32) SSH stderr during command root@10.0.1.4:'umask 0077; MYTMP=$(mktemp -t ovirt-XX); trap chmod -R u+rwX \${M
YTMP}\  /dev/null 21; rm -fr \${MYTMP}\  /dev/null 21 0; rm -fr ${MYTMP}  mkdir ${MYTMP}  tar --warning=no-timestamp -C ${MYTMP} -x   ${MYTMP}/setup DIALOG/dialect=str:machine DIALOG/cust
omization=bool:True': stderr: bash: line 1:  8635 Segmentation fault  ${MYTMP}/setup DIALOG/dialect=str:machine DIALOG/customization=bool:True

2013-10-09 10:50:52,787 ERROR [org.ovirt.engine.core.utils.ssh.SSHDialog] (pool-6-thread-32) SSH error running command root@10.0.1.4:'umask 0077; MYTMP=$(mktemp -t ovirt-XX); trap chmod -R u+rwX \${M
YTMP}\  /dev/null 21; rm -fr \${MYTMP}\  /dev/null 21 0; rm -fr ${MYTMP}  mkdir ${MYTMP}  tar --warning=no-timestamp -C ${MYTMP} -x   ${MYTMP}/setup DIALOG/dialect=str:machine DIALOG/cust
omization=bool:True': java.io.IOException: Command returned failure code 139 during SSH session 'root@10.0.1.4'
at org.ovirt.engine.core.utils.ssh.SSHClient.executeCommand(SSHClient.java:508) [utils.jar:]
at org.ovirt.engine.core.utils.ssh.SSHDialog.executeCommand(SSHDialog.java:311) [utils.jar:]
at org.ovirt.engine.core.bll.VdsDeploy.execute(VdsDeploy.java:1039) [bll.jar:]
at org.ovirt.engine.core.bll.InstallVdsCommand.installHost(InstallVdsCommand.java:192) [bll.jar:]
at org.ovirt.engine.core.bll.InstallVdsCommand.executeCommand(InstallVdsCommand.java:105) [bll.jar:]
at org.ovirt.engine.core.bll.ApproveVdsCommand.executeCommand(ApproveVdsCommand.java:49) [bll.jar:]
at org.ovirt.engine.core.bll.CommandBase.executeWithoutTransaction(CommandBase.java:1128) [bll.jar:]
at org.ovirt.engine.core.bll.CommandBase.executeActionInTransactionScope(CommandBase.java:1213) [bll.jar:]
at org.ovirt.engine.core.bll.CommandBase.runInTransaction(CommandBase.java:1871) [bll.jar:]
at org.ovirt.engine.core.utils.transaction.TransactionSupport.executeInSuppressed(TransactionSupport.java:174) [utils.jar:]
at org.ovirt.engine.core.utils.transaction.TransactionSupport.executeInScope(TransactionSupport.java:116) [utils.jar:]
at org.ovirt.engine.core.bll.CommandBase.execute(CommandBase.java:1231) [bll.jar:]
at org.ovirt.engine.core.bll.CommandBase.executeAction(CommandBase.java:365) [bll.jar:]
at org.ovirt.engine.core.bll.MultipleActionsRunner.executeValidatedCommand(MultipleActionsRunner.java:175) [bll.jar:]
at org.ovirt.engine.core.bll.MultipleActionsRunner.RunCommands(MultipleActionsRunner.java:156) [bll.jar:]
at org.ovirt.engine.core.bll.MultipleActionsRunner$1.run(MultipleActionsRunner.java:94) [bll.jar:]
at org.ovirt.engine.core.utils.threadpool.ThreadPoolUtil$InternalWrapperRunnable.run(ThreadPoolUtil.java:71) [utils.jar:]
at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:471)