Re: How to configure a new computing node?

2011-09-23 Thread Huang,Lei
Thanks to all of you for your help. Yes, Greg is right that my networks
were switched. It works now after I switched them back.

Many many thanks,
Lei


RE: Windows 7 SP1 boot hangs

2011-09-23 Thread Waldron, Michael H
This solution looks very promising. I installed this hotfix and so far, it 
doesn't seem to hang anymore. The image has reloaded at least 10 times now 
without any problems. Hopefully it's resolved.

Thanks for finding that, James.

Mike

Mike Waldron
Systems Specialist
ITS Research Computing
University of North Carolina at Chapel Hill
CB #3420, ITS Manning, Rm 2509
919-962-9778

From: James O'Dell [jod...@fullerton.edu]
Sent: Thursday, September 22, 2011 5:51 PM
To: vcl-user@incubator.apache.org
Subject: Re: Windows 7 SP1 boot hangs

Maybe it's just a Windows thing. They came out with a hotfix for it
anyway.


> Fix Windows 7 SP1 Slow Startup Due to Large Number of Restore Points
> http://news.softpedia.com/news/Fix-Windows-7-SP1-Slow-Startup-Due-to-Large-Number-of-Restore-Points-217883.shtml

> “This issue occurs because the boot plan for the ReadyBoot feature exceeds 
> the size limit of 512 kilobytes (KB). Each restore point creates a snapshot 
> of Windows that Volsnap.sys must validate during the startup process,” 
> Microsoft explained.

But it doesn't explain why sometimes it's fast and sometimes it's slow.

Maybe disabling ReadyBoot would help?


On 9/22/2011 1:48 PM, Waldron, Michael H wrote:
> James,
>
> Thanks for the suggestions. I've already increased the ssh timeout, however 
> this hang goes way beyond a reasonable amount of time, anywhere from 20-30 
> minutes.
>
> I'll test with your dhcp suggestion. Although again, my Win7 images that
> haven't been updated to SP1 are booting just fine, so I'm trying to figure
> out what changed in the OS with the SP1 update that's causing this. One time
> it might only spend 30 seconds at the Windows Starting screen, the next it
> might be 10 minutes.
>
>
> Mike Waldron
> Systems Specialist
> ITS Research Computing
> University of North Carolina at Chapel Hill
> CB #3420, ITS Manning, Rm 2509
> 919-962-9778
> 
> From: James O'Dell [jod...@fullerton.edu]
> Sent: Thursday, September 22, 2011 3:36 PM
> To: vcl-user@incubator.apache.org
> Subject: Re: Windows 7 SP1 boot hangs
>
>
> I've got the same setup, and run into what looks to be the same
> problem. ( Win7 booting taking so long that the ssh connection
> times out )
>
> I've done a couple things, and it seems to have gotten better.
>
> 1) adjust the ssh timeout from 5 to 10 minutes
>
>      /opt/vcl/lib/VCL/Module/OS.pm
>
>      '$ssh_response_timeout_seconds = 1200;'
>
>    Why is this hard coded anyway?
>
> 2) Turn off the WINS, and the netbios-over-tcp using settings in
>    the dhcp server to prevent the booting system from registering
>    with a WINS server, and from using NBoT.
>
> shared-network VCLGuestRDPnetwork {
>     ...
>     option netbios-name-servers noip;
>     if substring ( option vendor-class-identifier, 0, 8 ) = "MSFT 5.0" {
>         vendor-option-space MSFT;
>         # 1 = enable, 2 = disable - NetBIOS over TCP/IP:
>         option MSFT.nbt 2;
>     }
>     ...
> }
>
> 'noip' does not resolve to anything, which causes dhcp to clear the
> 'netbios-name-servers' (aka WINS) setting if it is globally set.
>
> Maybe this will help
>
> __Jim
>
> On 9/22/2011 11:53 AM, Waldron, Michael H wrote:
>> I've been running Windows 7 images on ESXi 4.1 hosts in our VCL without
>> problem. When I updated several of those Windows 7 images to service
>> pack 1, I'm seeing an issue where more times than not, the VM will hang
>> at the Windows Starting screen for up to 30 minutes. If I revert back to
>> the pre-SP1 image, it boots just fine.
>
>> What's maddening is that it's not consistent. Sometimes it will boot
>> normally, but many times there's this long hang while booting, which of
>> course causes the reservation to fail. This happens across different VMs
>> running on different ESXi hosts. The common factor is Windows 7 SP1.
>
>> Has anyone else seen this?
>
>> Mike Waldron
>> Systems Specialist
>> ITS Research Computing
>> University of North Carolina at Chapel Hill
>> CB #3420, ITS Manning, Rm 2509
>> 919-962-9778
>
>

- --
Jim O'Dell
Network Analyst
California State University Fullerton
Email: jod...@fullerton.edu
Phone: (657) 278-2256


Re: How to configure a new computing node?

2011-09-23 Thread Alexander Patterson
I am posting this on behalf of Greg Duhon

I have one more suggestion to add.  Verify that the networks are not
switched.  In other words, is the public network on the private
interface, or the private network on the public interface?  I can't tell
you how many times this one dinged us when we first started using VCL.
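
A minimal sketch of that check, under stated assumptions: the helper names
(iface_ipv4, addr_has_prefix, check_not_switched) and the example prefix
"10.1." are made up for illustration, and which of eth0/eth1 is meant to be
private depends on the site. It only compares each interface's address with
the prefix the private network is expected to use.

```shell
#!/bin/sh
# Print the first IPv4 address configured on an interface (needs iproute2).
iface_ipv4() {
    ip -4 -o addr show dev "$1" | awk '{ split($4, a, "/"); print a[1]; exit }'
}

# Succeed if address $1 starts with the dotted prefix $2.
# Keep the trailing dot in the prefix ("10.1.") so "10.10.x.x" does not match.
addr_has_prefix() {
    case $1 in
        "$2"*) return 0 ;;
        *) return 1 ;;
    esac
}

# $1 = address on the interface meant to be private
# $2 = address on the interface meant to be public
# $3 = expected private prefix (hypothetical example: "10.1.")
check_not_switched() {
    if addr_has_prefix "$1" "$3" && ! addr_has_prefix "$2" "$3"; then
        echo "ok"
    elif addr_has_prefix "$2" "$3" && ! addr_has_prefix "$1" "$3"; then
        echo "switched"
    else
        echo "unclear"
    fi
}

# Example use on the node, assuming eth0 should be private and eth1 public:
#   check_not_switched "$(iface_ipv4 eth0)" "$(iface_ipv4 eth1)" "10.1."
```

A result of "switched" would point at the cabling or the virtual switch
mapping rather than at the image.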

-Greg Duhon


Re: How to configure a new computing node?

2011-09-23 Thread James O'Dell
If you can't ping your gateway, it's either a network problem
or a firewall problem.

Try pinging the gateway, and then dumping your ARP table
('arp -an'). If you see the MAC address for the gateway in
your ARP table, it's probably a firewall issue. If the
MAC address isn't there, it's a network problem.
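
The rule of thumb above can be sketched as a small shell helper. This is
only an illustration: the function name classify_gateway_problem and the
placeholder address 192.0.2.1 are assumptions, and the matching relies on
the usual Linux 'arp -an' line format
("? (IP) at MAC [ether] on IFACE", or "<incomplete>" when ARP got no reply).

```shell
#!/bin/sh
# $1 = gateway IP address
# $2 = text of 'arp -an', captured right after trying to ping the gateway
classify_gateway_problem() {
    gateway_ip=$1
    arp_output=$2
    # Keep only the line for the gateway; the parentheses stop 10.0.0.1
    # from matching 10.0.0.10.
    entry=$(printf '%s\n' "$arp_output" | grep -F "($gateway_ip)" || true)
    if [ -n "$entry" ] && ! printf '%s\n' "$entry" | grep -q 'incomplete'; then
        # The gateway answered ARP, so layer 2 works: suspect a firewall.
        echo "firewall"
    else
        # No usable ARP entry: suspect cabling, VLAN, or interface mapping.
        echo "network"
    fi
}

# Example use on the node (192.0.2.1 is a placeholder gateway address):
#   ping -c 3 192.0.2.1
#   classify_gateway_problem 192.0.2.1 "$(arp -an)"
```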

__Jim



RE: How to configure a new computing node?

2011-09-23 Thread Huang,Lei
Hi Alex,

Thanks for your suggestions! The image works fine when it launches on
virtual hosts on existing nodes. The problem happens when it runs on a new
node, so I assume sshd works fine in the image itself.

I have set up the ssh key so that the management node can log in to the node
successfully. After I log in to the new node running VMware, I can see that
eth0 and eth1 have private and public IP addresses. However, I found that
I can't ping my gateway from the node. Are there any settings I need to change?

Thanks,
Lei



Re: How to configure a new computing node?

2011-09-23 Thread Alexander Patterson
Hello,

Did you check sshd on the image?
su - (log in as root on the image you are working on; enter the password when prompted)
Turn on sshd by typing /etc/init.d/sshd start
Go to /etc/init.d/, then type chkconfig sshd on

Have you made sure that you can ssh -i /etc/vcl/vcl.key <hostname or IP address> ?

I had this issue with ssh not kicking off before as well, and it was an
issue with the image itself.

Another helpful command to check would be ssh -vv (IP address)
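
The checks above can be collected into a short sketch. The helper name
vcl_ssh_command is hypothetical, and the flags (-l root -p 22) are copied
from the vcld log lines in this thread rather than from any VCL
documentation. The function only prints the command line, so it is safe to
try anywhere.

```shell
#!/bin/sh
VCL_KEY=/etc/vcl/vcl.key   # key path used throughout this thread

# Print the ssh command the management node would use to reach a host.
# $1 = hostname or IP address; $2 = optional extra ssh flag such as -vv
vcl_ssh_command() {
    echo "ssh ${2:+$2 }-i $VCL_KEY -l root -p 22 $1"
}

# On the image itself, as root (SysV init scripts, as on CentOS 5):
#   /etc/init.d/sshd start    # start sshd now
#   chkconfig sshd on         # start sshd on every boot
#   chkconfig --list sshd     # confirm the runlevels show "on"
#
# From the management node (vmguest-10 is the guest name seen in the log):
#   $(vcl_ssh_command vmguest-10) 'echo ok'
#   $(vcl_ssh_command vmguest-10 -vv) 'echo ok'   # verbose, for debugging
```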

Have you logged into your box running VMware to see if you are getting
a public and private ip address?


RE: How to configure a new computing node?

2011-09-23 Thread Huang,Lei
Sorry for asking again. I suspect that the problem may be related to the network
settings. I don't understand why sshd on the VM was not active. Is it possible that
the IP address of the VM is not correct? Any suggestions/hints would be very much
appreciated!

Thanks,
Lei

From: Huang,Lei [lhu...@pvamu.edu]
Sent: Thursday, September 22, 2011 10:36 AM
To: vcl-user@incubator.apache.org; aaron_pee...@ncsu.edu
Subject: Re: How to configure a new computing node?

Aaron,

  Thank you for your reply. I copied some log information below. It
looks like the VM was started, but sshd on the VM was not active. The
script looped and waited for half an hour and failed at the end. The image
works fine on existing blades. I wonder if there is some configuration of
the VM server that I didn't set correctly for the new node.

Thanks,
Lei

===

2011-09-21 23:31:27|14402|518:512|new|utils.pm:run_ssh_command(6180)|executing SSH command on CSB308:
|14402|518:512|new| /usr/bin/ssh -i /etc/vcl/vcl.key  -l root -p 22 -x CSB308 'vmware-cmd /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-base10-v0vmguest-10.vmx start' 2>&1
2011-09-21 23:31:31|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:31:31
2011-09-21 23:31:36|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:31:36
2011-09-21 23:31:40|14122|517:511|inuse|utils.pm:check_connection(1765)|checking for connection by admin on vmguest-2, attempt 27
2011-09-21 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH command on vmguest-2:
|14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key  -l root -p 22 -x vmguest-2 'netstat -an' 2>&1
2011-09-21 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH command on vmguest-2:
|14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key  -l root -p 22 -x vmguest-2 'who' 2>&1
2011-09-21 23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6262)|run_ssh_command output:
|14122|517:511|inuse| none
2011-09-21 23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6276)|SSH command executed on vmguest-2, returning (0, "none")
2011-09-21 23:31:41|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:31:41
2011-09-21 23:31:46|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:31:46
2011-09-21 23:31:48|14402|518:512|new|utils.pm:run_ssh_command(6262)|run_ssh_command output:
|14402|518:512|new| start() = 1
2011-09-21 23:31:48|14402|518:512|new|utils.pm:run_ssh_command(6276)|SSH command executed on CSB308, returning (0, "start() = 1")
2011-09-21 23:31:48|14402|518:512|new|vmware.pm:load(808)|started /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-base10-v0vmguest-10.vmx on CSB308
2011-09-21 23:31:48|14402|518:512|new|utils.pm:insertloadlog(4710)|inserted computer=17, startvm, started vm on CSB308

|14402|518:512|new| /usr/bin/ssh -i /etc/vcl/vcl.key  -l root -p 22 -x CSB308 'vmware-cmd /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-base10-v0vmguest-10.vmx getstate' 2>&1
2011-09-21 23:32:11|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:32:11
2011-09-21 23:32:16|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:32:16
2011-09-21 23:32:21|14122|517:511|inuse|utils.pm:check_connection(1765)|checking for connection by admin on vmguest-2, attempt 29
2011-09-21 23:32:21|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH command on vmguest-2:
|14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key  -l root -p 22 -x vmguest-2 'netstat -an' 2>&1
2011-09-21 23:32:21|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:32:21
2011-09-21 23:32:21|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH command on vmguest-2:
|14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key  -l root -p 22 -x vmguest-2 'who' 2>&1
2011-09-21 23:32:22|14122|517:511|inuse|utils.pm:run_ssh_command(6262)|run_ssh_command output:
|14122|517:511|inuse| none
2011-09-21 23:32:22|14122|517:511|inuse|utils.pm:run_ssh_command(6276)|SSH command executed on vmguest-2, returning (0, "none")
2011-09-21 23:32:26|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:32:26
2011-09-21 23:32:28|14402|518:512|new|utils.pm:run_ssh_command(6262)|run_ssh_command output:
|14402|518:512|new| getstate() = on
2011-09-21 23:32:28|14402|518:512|new|utils.pm:run_ssh_command(6276)|SSH command executed on CSB308, returning (0, "getstate() = on")
2011-09-21 23:32:28|14402|518:512|new|vmware.pm:load(831)|checking state of vm vmguest-10
2011-09-21 23:32:28|14402|518:512|new|utils.pm:insertloadlog(4710)|inserted computer=17, vmstage1, node has been turned on
2011-09-21 23:32:28|14402|518:512|new|vmware.pm:load(838)|stage1 completed vm vmguest-10 has been turned on
2011-09-21 23:32:28|14402|518:512|new