Re: How to configure a new computing node?
Thanks to all of you for your help. Yes, Greg is right that my networks were switched. It works now after I switched them back.

Many, many thanks,
Lei

On 9/23/11 12:32 PM, "Alexander Patterson" wrote:

>I am posting this on behalf of Greg Duhon
>
>I have one more suggestion to add. Verify that the networks are not
>switched. In other words, is the public network on the private
>interface or private network on the public interface. I can't tell
>you how many times this one dinged us when we first started using VCL.
>
>-Greg Duhon
>
>On Fri, Sep 23, 2011 at 9:32 AM, James O'Dell wrote:
>> If you can't ping your gateway, it's either a network problem,
>> or a firewall problem.
>>
>> Try pinging the gateway, and then dumping your arp table
>> ('arp -an'). If you see the mac address for the gateway in
>> your arp table, it's probably a firewall issue. If the
>> mac address isn't there, it's a network problem
>>
>> __Jim
>>
>>
>> On 9/23/2011 9:27 AM, Huang,Lei wrote:
>>> Hi Alex,
>>>
>>> Thanks for your suggestions! For the image, it works fine when it
>>> launches on virtual hosts on existing nodes. The problem happens
>>> when it runs on a new node. I assume the sshd works fine on the
>>> image itself.
>>>
>>> I have set up the ssh key to allow the management node to log in
>>> the node successfully. After I log into the new node running
>>> VMware, I can see that my network eth0 and eth1 have private and
>>> public ip address. However, I found that I can't ping my gateway
>>> from the node. Are there any settings I need to do?
>>>
>>> Thanks, Lei
>>>
>>> From: Alexander Patterson [alexander.patter...@csueastbay.edu]
>>> Sent: Friday, September 23, 2011 11:08 AM
>>> To: vcl-user@incubator.apache.org
>>> Cc: aaron_pee...@ncsu.edu
>>> Subject: Re: How to configure a new computing node?
>>>
>>> Hello,
>>>
>>> Did you check the following?
>>>
>>> su -  (log in as root on the image you are working on)
>>> Password:
>>> Turn on sshd by typing /etc/init.d/sshd start
>>> Go to /etc/init.d/ then type chkconfig sshd on
>>>
>>> Have you made sure that you can ssh -i /etc/vcl/vcl.key <hostname
>>> or IP address>?
>>>
>>> I had this issue with ssh not kicking off before as well and it was
>>> an issue with the image itself.
>>>
>>> Another helpful command to check would be ssh -vv (IP address)
>>>
>>> Have you logged into your box running VMware to see if you are
>>> getting a public and private ip address?
>>>
>>> On Fri, Sep 23, 2011 at 9:00 AM, Huang,Lei wrote:
>>>> Sorry for asking again. I suspect that the problem may be related
>>>> to network setting. I don't understand why sshd on VM was not
>>>> active. Is it possible that the IP address of VM is not correct?
>>>> Any suggestions/hints would be very much appreciated!
>>>>
>>>> Thanks, Lei
>>>>
>>>> From: Huang,Lei [lhu...@pvamu.edu]
>>>> Sent: Thursday, September 22, 2011 10:36 AM
>>>> To: vcl-user@incubator.apache.org; aaron_pee...@ncsu.edu
>>>> Subject: Re: How to configure a new computing node?
>>>>
>>>> Aaron,
>>>>
>>>> Thank you for your reply. I copied some log information as
>>>> follows. It looks like the VM was started, but sshd on the VM was
>>>> not active. The script looped and waited for half an hour and
>>>> failed at the end. The image works fine on existing blades. I
>>>> wonder if there is some configuration of VM server I didn't set
>>>> correctly for the new node.
Thanks, Lei === 2011-09-21 23:31:27|14402|518:512|new|utils.pm:run_ssh_command(6180)|executing SSH command on CSB308: |14402|518:512|new| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x CSB308 'vmware-cmd /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_ 5-ba >> se10-v0vmguest-10.vmx start' 2>&1 2011-09-21 23:31:31|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:31:31 2011-09-21 23:31:36|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:31:36 2011-09-21 23:31:40|14122|517:511|inuse|utils.pm:check_connection(1765)|checking for connection by admin on vmguest-2, attempt 27 2011-09-21 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH command on vmguest-2: |14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x vmguest-2 'netstat -an' 2>&1 2011-09-21 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH command on vmguest-2: |14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x vmguest-2 'who' 2>&1 2011-09-21 23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6262)|run_ssh_com mand >> output: |14122|517:511|inuse| none 2011-09-21 23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6276)|SSH command executed on vmguest-2, r
RE: Windows 7 SP1 boot hangs
This solution looks very promising. I installed this hotfix and so far, it doesn't seem to hang anymore. The image has reloaded at least 10 times now without any problems. Hopefully it's resolved. Thanks for finding that, James.

Mike

Mike Waldron
Systems Specialist
ITS Research Computing
University of North Carolina at Chapel Hill
CB #3420, ITS Manning, Rm 2509
919-962-9778

From: James O'Dell [jod...@fullerton.edu]
Sent: Thursday, September 22, 2011 5:51 PM
To: vcl-user@incubator.apache.org
Subject: Re: Windows 7 SP1 boot hangs

Maybe it's just a windows thing. They came out with a hotfix for it anyway.

> Fix Windows 7 SP1 Slow Startup Due to Large Number of Restore Points
> http://news.softpedia.com/news/Fix-Windows-7-SP1-Slow-Startup-Due-to-Large-Number-of-Restore-Points-217883.shtml
> “This issue occurs because the boot plan for the ReadyBoot feature exceeds
> the size limit of 512 kilobytes (KB). Each restore point creates a snapshot
> of Windows that Volsnap.sys must validate during the startup process,”
> Microsoft explained.

But it doesn't explain why sometimes it's fast and sometimes it's slow. Maybe disabling ReadyBoot would help?

On 9/22/2011 1:48 PM, Waldron, Michael H wrote:
> James,
>
> Thanks for the suggestions. I've already increased the ssh timeout, however
> this hang goes way beyond a reasonable amount of time, anywhere from 20-30
> minutes.
>
> I'll test with your dhcp suggestion. Although again, my Win7 images that
> haven't been updated to SP1 are booting just fine, so I'm trying to figure
> what got changed in the OS with the SP1 update that's causing this. One time
> it might only spend 30 seconds at the Windows Starting screen, the next it
> might be 10 minutes.
>
> Mike Waldron
>
> From: James O'Dell [jod...@fullerton.edu]
> Sent: Thursday, September 22, 2011 3:36 PM
> To: vcl-user@incubator.apache.org
> Subject: Re: Windows 7 SP1 boot hangs
>
> I've got the same setup, and run into what looks to be the same
> problem. ( Win7 booting taking so long that the ssh connection
> times out )
>
> I've done a couple things, and it seems to have gotten better.
>
> 1) adjust the ssh timeout from 5 to 10 minutes
>
>    /opt/vcl/lib/VCL/Module/OS.pm
>
>    '$ssh_response_timeout_seconds = 1200;'
>
>    Why is this hard coded anyway?
>
> 2) Turn off the WINS, and the netbios-over-tcp using settings in
>    the dhcp server to prevent the booting system from registering
>    with a WINS server, and from using NBoT.
>
> shared-network VCLGuestRDPnetwork {
>   ...
>   option netbios-name-servers noip;
>   if substring ( option vendor-class-identifier, 0, 8 ) = "MSFT 5.0" {
>     vendor-option-space MSFT;
>     # 1 = enable, 2 = disable - NetBIOS over TCP/IP:
>     option MSFT.nbt 2;
>   }
>   ...
> }
>
> 'noip' does not resolve to anything. Which causes dhcp to clear the
> 'netbios-name-servers' (aka WINS) setting if it is globally set.
>
> Maybe this will help
>
> __Jim
>
> On 9/22/2011 11:53 AM, Waldron, Michael H wrote:
>> I've been running Windows 7 images on ESXi 4.1 hosts in our VCL without
>> problem. When I updated several of those Windows 7 images to service
>> pack 1, I'm seeing an issue where more times than not, the VM will hang
>> at the Windows Starting screen for up to 30 minutes. If I revert back to
>> the pre-SP1 image, it boots just fine.
>
>> What's maddening is that it's not consistent. Sometimes it will boot
>> normally, but many times there's this long hang while booting, which of
>> course causes the reservation to fail. This happens across different VMs
>> running on different ESXi hosts.
>> The common factor is Windows 7 SP1.
>
>> Has anyone else seen this?
>
>> Mike Waldron

--
Jim O'Dell
Network Analyst
California State University Fullerton
Email: jod...@fullerton.edu
Phone: (657) 278-2256
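The reservations in this thread fail because the wait for SSH gives up after a fixed timeout while Windows is still booting. As a rough illustration of that kind of wait, here is a minimal Python sketch of a polling loop with a deadline. This is not VCL's code: `wait_until`, `probe`, and all parameter names are hypothetical, and the real logic lives in the Perl module mentioned above.

```python
import time
from typing import Callable


def wait_until(probe: Callable[[], bool],
               timeout_seconds: float,
               interval_seconds: float = 5.0,
               clock: Callable[[], float] = time.monotonic,
               sleep: Callable[[float], None] = time.sleep) -> bool:
    """Call probe() until it returns True or timeout_seconds have elapsed.

    probe is a hypothetical callable, for example "does the guest answer
    on its SSH port yet?". clock and sleep are injectable so the loop can
    be exercised without real waiting.
    """
    deadline = clock() + timeout_seconds
    while True:
        if probe():
            return True
        remaining = deadline - clock()
        if remaining <= 0:
            # Deadline reached and the last probe still failed.
            return False
        # Never sleep past the deadline.
        sleep(min(interval_seconds, remaining))


# Example (not executed here): give a slow-booting guest 20 minutes.
# ok = wait_until(lambda: guest_answers_ssh("vmguest-10"), 20 * 60)
# guest_answers_ssh is a placeholder you would have to supply.
```

Taking the timeout as a parameter rather than a constant is one way to address the "why is this hard coded" complaint, since the value can then differ per image.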
Re: How to configure a new computing node?
I am posting this on behalf of Greg Duhon

I have one more suggestion to add. Verify that the networks are not switched. In other words, is the public network on the private interface or private network on the public interface. I can't tell you how many times this one dinged us when we first started using VCL.

-Greg Duhon

On Fri, Sep 23, 2011 at 9:32 AM, James O'Dell wrote:
> If you can't ping your gateway, it's either a network problem,
> or a firewall problem.
>
> Try pinging the gateway, and then dumping your arp table
> ('arp -an'). If you see the mac address for the gateway in
> your arp table, it's probably a firewall issue. If the
> mac address isn't there, it's a network problem
>
> __Jim
>
>
> On 9/23/2011 9:27 AM, Huang,Lei wrote:
>> Hi Alex,
>>
>> Thanks for your suggestions! For the image, it works fine when it
>> launches on virtual hosts on existing nodes. The problem happens
>> when it runs on a new node. I assume the sshd works fine on the
>> image itself.
>>
>> I have set up the ssh key to allow the management node to log in
>> the node successfully. After I log into the new node running
>> VMware, I can see that my network eth0 and eth1 have private and
>> public ip address. However, I found that I can't ping my gateway
>> from the node. Are there any settings I need to do?
>>
>> Thanks, Lei
>>
>> From: Alexander Patterson [alexander.patter...@csueastbay.edu]
>> Sent: Friday, September 23, 2011 11:08 AM
>> To: vcl-user@incubator.apache.org
>> Cc: aaron_pee...@ncsu.edu
>> Subject: Re: How to configure a new computing node?
>>
>> Hello,
>>
>> Did you check the following?
>>
>> su -  (log in as root on the image you are working on)
>> Password:
>> Turn on sshd by typing /etc/init.d/sshd start
>> Go to /etc/init.d/ then type chkconfig sshd on
>>
>> Have you made sure that you can ssh -i /etc/vcl/vcl.key <hostname
>> or IP address>?
>>
>> I had this issue with ssh not kicking off before as well and it was
>> an issue with the image itself.
>> >> Another helpfull command to check would be ssh -vv (IP address) >> >> Have you logged into your box running VMware to see if you are >> getting a public and private ip address? >> >> On Fri, Sep 23, 2011 at 9:00 AM, Huang,Lei >> wrote: >>> Sorry for asking again. I suspect that the problem may be related >>> to network setting. I don't understand why ssd on VM was not >>> active. Is it possible that the IP address of VM is not correct? >>> Any suggestions/hints would be very much appreciated! >>> >>> Thanks, Lei From: >>> Huang,Lei [lhu...@pvamu.edu] Sent: Thursday, September 22, 2011 >>> 10:36 AM To: vcl-user@incubator.apache.org; >>> aaron_pee...@ncsu.edu Subject: Re: How to configure a new >>> computing node? >>> >>> Aaron, >>> >>> Thank you for your reply. I copied some log information as >>> follows. It looks like the VM was started, but sshd on the VM was >>> not active. The script looped and waited for half an hour and >>> failed at the end. The image works fine on existing blades. I >>> wonder if there is some configuration of VM server I didn't set >>> correctly for the new node. 
>>> >>> Thanks, Lei >>> >>> === >>> >>> 2011-09-21 >>> 23:31:27|14402|518:512|new|utils.pm:run_ssh_command(6180)|executing >>> SSH command on CSB308: |14402|518:512|new| /usr/bin/ssh -i >>> /etc/vcl/vcl.key -l root -p 22 -x CSB308 'vmware-cmd >>> /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba >>> >>> > se10-v0vmguest-10.vmx start' 2>&1 >>> 2011-09-21 23:31:31|3929|vcld:main(165)|lastcheckin time updated >>> for management node 1: 2011-09-21 23:31:31 2011-09-21 >>> 23:31:36|3929|vcld:main(165)|lastcheckin time updated for >>> management node 1: 2011-09-21 23:31:36 2011-09-21 >>> 23:31:40|14122|517:511|inuse|utils.pm:check_connection(1765)|checking >>> for connection by admin on vmguest-2, attempt 27 2011-09-21 >>> 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing >>> SSH command on vmguest-2: |14122|517:511|inuse| /usr/bin/ssh -i >>> /etc/vcl/vcl.key -l root -p 22 -x vmguest-2 'netstat -an' 2>&1 >>> 2011-09-21 >>> 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing >>> SSH command on vmguest-2: |14122|517:511|inuse| /usr/bin/ssh -i >>> /etc/vcl/vcl.key -l root -p 22 -x vmguest-2 'who' 2>&1 >>> 2011-09-21 >>> 23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6262)|run_ssh_command >>> >>> > output: >>> |14122|517:511|inuse| none 2011-09-21 >>> 23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6276)|SSH >>> command executed on vmguest-2, returning (0, "none") 2011-09-21 >>> 23:31:41|3929|vcld:main(165)|lastcheckin time updated for >>> management node 1: 2011-09-21 23:31:41 2011-09-21 >>> 23:31:46|3929|vcld:main(165)|lastcheckin time updated for >>> management node 1: 2011-09-21 23:31:46 2011-09-21 >>> 23:31:48|14402|518:512|new|utils.pm:run_ssh_command(6262)|run_ssh_comman
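Greg's check (is each network on the interface it is supposed to be on?) can be scripted once you know which subnet belongs on which interface. The following Python sketch is only an illustration: the function name, the interface-to-subnet mapping, and the example addresses are all assumptions, not values from this thread or from VCL.

```python
import ipaddress
from typing import Dict, List


def find_misplaced_interfaces(addresses: Dict[str, str],
                              expected: Dict[str, str]) -> List[str]:
    """Return the interfaces whose address is not in the subnet expected there.

    addresses maps interface name -> IP address actually configured.
    expected maps interface name -> subnet (CIDR) that should be on it.
    An interface with no address at all is also reported.
    """
    misplaced = []
    for name, subnet in expected.items():
        address = addresses.get(name)
        if address is None:
            misplaced.append(name)
            continue
        if ipaddress.ip_address(address) not in ipaddress.ip_network(subnet):
            misplaced.append(name)
    return misplaced


# Hypothetical layout: private network on eth0, public network on eth1.
EXPECTED = {"eth0": "10.0.0.0/16", "eth1": "192.0.2.0/24"}

# Correctly cabled node: nothing is reported.
print(find_misplaced_interfaces({"eth0": "10.0.1.5", "eth1": "192.0.2.10"}, EXPECTED))

# Switched networks: both interfaces are reported.
print(find_misplaced_interfaces({"eth0": "192.0.2.10", "eth1": "10.0.1.5"}, EXPECTED))
```

Both interfaces can have an address and the node can still be miscabled, which is why comparing against the expected subnet is more telling than just checking that eth0 and eth1 have addresses.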
Re: How to configure a new computing node?
If you can't ping your gateway, it's either a network problem, or a firewall problem.

Try pinging the gateway, and then dumping your arp table ('arp -an'). If you see the mac address for the gateway in your arp table, it's probably a firewall issue. If the mac address isn't there, it's a network problem.

__Jim

On 9/23/2011 9:27 AM, Huang,Lei wrote:
> Hi Alex,
>
> Thanks for your suggestions! For the image, it works fine when it
> launches on virtual hosts on existing nodes. The problem happens
> when it runs on a new node. I assume the sshd works fine on the
> image itself.
>
> I have set up the ssh key to allow the management node to log in
> the node successfully. After I log into the new node running
> VMware, I can see that my network eth0 and eth1 have private and
> public ip address. However, I found that I can't ping my gateway
> from the node. Are there any settings I need to do?
>
> Thanks, Lei
>
> From: Alexander Patterson [alexander.patter...@csueastbay.edu]
> Sent: Friday, September 23, 2011 11:08 AM
> To: vcl-user@incubator.apache.org
> Cc: aaron_pee...@ncsu.edu
> Subject: Re: How to configure a new computing node?
>
> Hello,
>
> Did you check the following?
>
> su -  (log in as root on the image you are working on)
> Password:
> Turn on sshd by typing /etc/init.d/sshd start
> Go to /etc/init.d/ then type chkconfig sshd on
>
> Have you made sure that you can ssh -i /etc/vcl/vcl.key <hostname
> or IP address>?
>
> I had this issue with ssh not kicking off before as well and it was
> an issue with the image itself.
>
> Another helpful command to check would be ssh -vv (IP address)
>
> Have you logged into your box running VMware to see if you are
> getting a public and private ip address?
>
> On Fri, Sep 23, 2011 at 9:00 AM, Huang,Lei wrote:
>> Sorry for asking again. I suspect that the problem may be related
>> to network setting. I don't understand why sshd on VM was not
>> active. Is it possible that the IP address of VM is not correct?
>> Any suggestions/hints would be very much appreciated! >> >> Thanks, Lei From: >> Huang,Lei [lhu...@pvamu.edu] Sent: Thursday, September 22, 2011 >> 10:36 AM To: vcl-user@incubator.apache.org; >> aaron_pee...@ncsu.edu Subject: Re: How to configure a new >> computing node? >> >> Aaron, >> >> Thank you for your reply. I copied some log information as >> follows. It looks like the VM was started, but sshd on the VM was >> not active. The script looped and waited for half an hour and >> failed at the end. The image works fine on existing blades. I >> wonder if there is some configuration of VM server I didn't set >> correctly for the new node. >> >> Thanks, Lei >> >> === >> >> 2011-09-21 >> 23:31:27|14402|518:512|new|utils.pm:run_ssh_command(6180)|executing >> SSH command on CSB308: |14402|518:512|new| /usr/bin/ssh -i >> /etc/vcl/vcl.key -l root -p 22 -x CSB308 'vmware-cmd >> /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba >> >> se10-v0vmguest-10.vmx start' 2>&1 >> 2011-09-21 23:31:31|3929|vcld:main(165)|lastcheckin time updated >> for management node 1: 2011-09-21 23:31:31 2011-09-21 >> 23:31:36|3929|vcld:main(165)|lastcheckin time updated for >> management node 1: 2011-09-21 23:31:36 2011-09-21 >> 23:31:40|14122|517:511|inuse|utils.pm:check_connection(1765)|checking >> for connection by admin on vmguest-2, attempt 27 2011-09-21 >> 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing >> SSH command on vmguest-2: |14122|517:511|inuse| /usr/bin/ssh -i >> /etc/vcl/vcl.key -l root -p 22 -x vmguest-2 'netstat -an' 2>&1 >> 2011-09-21 >> 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing >> SSH command on vmguest-2: |14122|517:511|inuse| /usr/bin/ssh -i >> /etc/vcl/vcl.key -l root -p 22 -x vmguest-2 'who' 2>&1 >> 2011-09-21 >> 23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6262)|run_ssh_command >> >> output: >> |14122|517:511|inuse| none 2011-09-21 >> 
23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6276)|SSH >> command executed on vmguest-2, returning (0, "none") 2011-09-21 >> 23:31:41|3929|vcld:main(165)|lastcheckin time updated for >> management node 1: 2011-09-21 23:31:41 2011-09-21 >> 23:31:46|3929|vcld:main(165)|lastcheckin time updated for >> management node 1: 2011-09-21 23:31:46 2011-09-21 >> 23:31:48|14402|518:512|new|utils.pm:run_ssh_command(6262)|run_ssh_command >> >> output: >> |14402|518:512|new| start() = 1 2011-09-21 >> 23:31:48|14402|518:512|new|utils.pm:run_ssh_command(6276)|SSH >> command executed on CSB308, returning (0, "start() = 1") >> 2011-09-21 >> 23:31:48|14402|518:512|new|vmware.pm:load(808)|started >> /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba >> >> se10-v0vmguest-10.vmx on CSB308 >> 2011-09-21 >> 23:31:48|14402|518:512|new|utils.pm:insertloadlog(4710)|inserted >> compute
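Jim's rule of thumb at the top of this message (gateway MAC present in the ARP table means look at the firewall, absent means look at the network) can be written down as a small Python sketch. It assumes the usual Linux net-tools `arp -an` output format, `? (IP) at MAC [ether] on IFACE`, with `<incomplete>` in place of the MAC when resolution failed. The function names and sample addresses are made up for illustration.

```python
import re
from typing import Optional

# Matches "(10.0.0.1) at 00:16:3e:aa:bb:cc"; an "<incomplete>" entry will not match.
_ARP_LINE = re.compile(
    r"\((?P<ip>[0-9.]+)\) at (?P<mac>(?:[0-9a-fA-F]{1,2}:){5}[0-9a-fA-F]{1,2})\b"
)


def gateway_mac(arp_output: str, gateway_ip: str) -> Optional[str]:
    """Return the gateway's MAC address from `arp -an` output, or None."""
    for line in arp_output.splitlines():
        match = _ARP_LINE.search(line)
        if match and match.group("ip") == gateway_ip:
            return match.group("mac").lower()
    return None


def classify_ping_failure(arp_output: str, gateway_ip: str) -> str:
    """Apply the heuristic: MAC seen -> firewall, MAC missing -> network."""
    if gateway_mac(arp_output, gateway_ip) is not None:
        return "probably a firewall issue"
    return "probably a network problem"


# Made-up sample outputs, captured after a failed ping to the gateway.
resolved = "? (10.0.0.1) at 00:16:3e:aa:bb:cc [ether] on eth0"
unresolved = "? (10.0.0.1) at <incomplete> on eth0"

print(classify_ping_failure(resolved, "10.0.0.1"))
print(classify_ping_failure(unresolved, "10.0.0.1"))
```

The heuristic works because ARP resolution happens at layer 2: if the gateway answered the ARP request, the cabling and VLAN are fine and something is dropping the ICMP traffic instead.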
RE: How to configure a new computing node?
Hi Alex,

Thanks for your suggestions! For the image, it works fine when it launches on virtual hosts on existing nodes. The problem happens when it runs on a new node. I assume the sshd works fine on the image itself.

I have set up the ssh key to allow the management node to log in to the node successfully. After I log into the new node running VMware, I can see that my network interfaces eth0 and eth1 have private and public IP addresses. However, I found that I can't ping my gateway from the node. Are there any settings I need to do?

Thanks,
Lei

From: Alexander Patterson [alexander.patter...@csueastbay.edu]
Sent: Friday, September 23, 2011 11:08 AM
To: vcl-user@incubator.apache.org
Cc: aaron_pee...@ncsu.edu
Subject: Re: How to configure a new computing node?

Hello,

Did you check the following?

su -  (log in as root on the image you are working on)
Password:
Turn on sshd by typing /etc/init.d/sshd start
Go to /etc/init.d/ then type chkconfig sshd on

Have you made sure that you can ssh -i /etc/vcl/vcl.key <hostname or IP address>?

I had this issue with ssh not kicking off before as well and it was an issue with the image itself.

Another helpful command to check would be ssh -vv (IP address)

Have you logged into your box running VMware to see if you are getting a public and private ip address?

On Fri, Sep 23, 2011 at 9:00 AM, Huang,Lei wrote:
> Sorry for asking again. I suspect that the problem may be related to network
> setting. I don't understand why sshd on VM was not active. Is it possible that
> the IP address of VM is not correct? Any suggestions/hints would be very much
> appreciated!
>
> Thanks,
> Lei
>
> From: Huang,Lei [lhu...@pvamu.edu]
> Sent: Thursday, September 22, 2011 10:36 AM
> To: vcl-user@incubator.apache.org; aaron_pee...@ncsu.edu
> Subject: Re: How to configure a new computing node?
>
> Aaron,
>
> Thank you for your reply. I copied some log information as follows. It
> looks like the VM was started, but sshd on the VM was not active.
The > script looped and waited for half an hour and failed at the end. The image > works fine on existing blades. I wonder if there is some configuration of > VM server I didn't set correctly for the new node. > > Thanks, > Lei > > === > > 2011-09-21 > 23:31:27|14402|518:512|new|utils.pm:run_ssh_command(6180)|executing SSH > command on CSB308: > |14402|518:512|new| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x > CSB308 'vmware-cmd > /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba > se10-v0vmguest-10.vmx start' 2>&1 > 2011-09-21 23:31:31|3929|vcld:main(165)|lastcheckin time updated for > management node 1: 2011-09-21 23:31:31 > 2011-09-21 23:31:36|3929|vcld:main(165)|lastcheckin time updated for > management node 1: 2011-09-21 23:31:36 > 2011-09-21 > 23:31:40|14122|517:511|inuse|utils.pm:check_connection(1765)|checking for > connection by admin on vmguest-2, attempt 27 > 2011-09-21 > 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH > command on vmguest-2: > |14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x > vmguest-2 'netstat -an' 2>&1 > 2011-09-21 > 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH > command on vmguest-2: > |14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x > vmguest-2 'who' 2>&1 > 2011-09-21 > 23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6262)|run_ssh_command > output: > |14122|517:511|inuse| none > 2011-09-21 23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6276)|SSH > command executed on vmguest-2, returning (0, "none") > 2011-09-21 23:31:41|3929|vcld:main(165)|lastcheckin time updated for > management node 1: 2011-09-21 23:31:41 > 2011-09-21 23:31:46|3929|vcld:main(165)|lastcheckin time updated for > management node 1: 2011-09-21 23:31:46 > 2011-09-21 > 23:31:48|14402|518:512|new|utils.pm:run_ssh_command(6262)|run_ssh_command > output: > |14402|518:512|new| start() = 1 > 2011-09-21 
23:31:48|14402|518:512|new|utils.pm:run_ssh_command(6276)|SSH > command executed on CSB308, returning (0, "start() = 1") > 2011-09-21 23:31:48|14402|518:512|new|vmware.pm:load(808)|started > /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba > se10-v0vmguest-10.vmx on CSB308 > 2011-09-21 > 23:31:48|14402|518:512|new|utils.pm:insertloadlog(4710)|inserted > computer=17, startvm, started vm on CSB308 > > |14402|518:512|new| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x > CSB308 'vmware-cmd > /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba > se10-v0vmguest-10.vmx getstate' 2>&1 > 2011-09-21 23:32:11|3929|vcld:main(165)|lastcheckin time updated for > management node 1: 2011-09-21 23:32:11 > 2011-09-21 23:32:16|3929|vcld:main(165)|lastcheckin time updated for > management node 1: 2011-09-21 23:32:16 > 2011-09-21 > 23:32:21|14122|517:511|inuse|utils.pm:check_connection(1765)|checking for > connect
Re: How to configure a new computing node?
Hello,

Did you check the following?

su -  (log in as root on the image you are working on)
Password:
Turn on sshd by typing /etc/init.d/sshd start
Go to /etc/init.d/ then type chkconfig sshd on

Have you made sure that you can ssh -i /etc/vcl/vcl.key <hostname or IP address>?

I had this issue with ssh not kicking off before as well and it was an issue with the image itself.

Another helpful command to check would be ssh -vv (IP address)

Have you logged into your box running VMware to see if you are getting a public and private ip address?

On Fri, Sep 23, 2011 at 9:00 AM, Huang,Lei wrote:
> Sorry for asking again. I suspect that the problem may be related to network
> setting. I don't understand why sshd on VM was not active. Is it possible that
> the IP address of VM is not correct? Any suggestions/hints would be very much
> appreciated!
>
> Thanks,
> Lei
>
> From: Huang,Lei [lhu...@pvamu.edu]
> Sent: Thursday, September 22, 2011 10:36 AM
> To: vcl-user@incubator.apache.org; aaron_pee...@ncsu.edu
> Subject: Re: How to configure a new computing node?
>
> Aaron,
>
> Thank you for your reply. I copied some log information as follows. It
> looks like the VM was started, but sshd on the VM was not active. The
> script looped and waited for half an hour and failed at the end. The image
> works fine on existing blades. I wonder if there is some configuration of
> VM server I didn't set correctly for the new node.
> > Thanks, > Lei > > === > > 2011-09-21 > 23:31:27|14402|518:512|new|utils.pm:run_ssh_command(6180)|executing SSH > command on CSB308: > |14402|518:512|new| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x > CSB308 'vmware-cmd > /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba > se10-v0vmguest-10.vmx start' 2>&1 > 2011-09-21 23:31:31|3929|vcld:main(165)|lastcheckin time updated for > management node 1: 2011-09-21 23:31:31 > 2011-09-21 23:31:36|3929|vcld:main(165)|lastcheckin time updated for > management node 1: 2011-09-21 23:31:36 > 2011-09-21 > 23:31:40|14122|517:511|inuse|utils.pm:check_connection(1765)|checking for > connection by admin on vmguest-2, attempt 27 > 2011-09-21 > 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH > command on vmguest-2: > |14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x > vmguest-2 'netstat -an' 2>&1 > 2011-09-21 > 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH > command on vmguest-2: > |14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x > vmguest-2 'who' 2>&1 > 2011-09-21 > 23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6262)|run_ssh_command > output: > |14122|517:511|inuse| none > 2011-09-21 23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6276)|SSH > command executed on vmguest-2, returning (0, "none") > 2011-09-21 23:31:41|3929|vcld:main(165)|lastcheckin time updated for > management node 1: 2011-09-21 23:31:41 > 2011-09-21 23:31:46|3929|vcld:main(165)|lastcheckin time updated for > management node 1: 2011-09-21 23:31:46 > 2011-09-21 > 23:31:48|14402|518:512|new|utils.pm:run_ssh_command(6262)|run_ssh_command > output: > |14402|518:512|new| start() = 1 > 2011-09-21 23:31:48|14402|518:512|new|utils.pm:run_ssh_command(6276)|SSH > command executed on CSB308, returning (0, "start() = 1") > 2011-09-21 23:31:48|14402|518:512|new|vmware.pm:load(808)|started > 
/install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba > se10-v0vmguest-10.vmx on CSB308 > 2011-09-21 > 23:31:48|14402|518:512|new|utils.pm:insertloadlog(4710)|inserted > computer=17, startvm, started vm on CSB308 > > |14402|518:512|new| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x > CSB308 'vmware-cmd > /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba > se10-v0vmguest-10.vmx getstate' 2>&1 > 2011-09-21 23:32:11|3929|vcld:main(165)|lastcheckin time updated for > management node 1: 2011-09-21 23:32:11 > 2011-09-21 23:32:16|3929|vcld:main(165)|lastcheckin time updated for > management node 1: 2011-09-21 23:32:16 > 2011-09-21 > 23:32:21|14122|517:511|inuse|utils.pm:check_connection(1765)|checking for > connection by admin on vmguest-2, attempt 29 > 2011-09-21 > 23:32:21|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH > command on vmguest-2: > |14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x > vmguest-2 'netstat -an' 2>&1 > 2011-09-21 23:32:21|3929|vcld:main(165)|lastcheckin time updated for > management node 1: 2011-09-21 23:32:21 > 2011-09-21 > 23:32:21|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH > command on vmguest-2: > |14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x > vmguest-2 'who' 2>&1 > 2011-09-21 > 23:32:22|14122|517:511|inuse|utils.pm:run_ssh_command(6262)|run_ssh_command > output: > |14122|517:511|inuse| none > 2011-09-21 23:32:22|14122|517:511|inuse|utils.pm:run_ssh_command(6276)|SSH > command executed on v
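To repeat by hand the SSH check that the management node performs, it helps to use exactly the options that appear in the quoted log (`-i /etc/vcl/vcl.key -l root -p 22 -x`), optionally with the `-vv` suggested at the top of this message. The Python sketch below only assembles that command line; `build_ssh_command` is a hypothetical helper, not part of VCL, and the defaults are simply copied from the log.

```python
import shlex
from typing import List


def build_ssh_command(host: str,
                      remote_command: str,
                      key_path: str = "/etc/vcl/vcl.key",
                      user: str = "root",
                      port: int = 22,
                      verbose: bool = False) -> List[str]:
    """Assemble an ssh argument list shaped like the ones in the vcld log."""
    argv = ["/usr/bin/ssh", "-i", key_path, "-l", user, "-p", str(port), "-x"]
    if verbose:
        # -vv prints the connection and authentication steps for debugging.
        argv.append("-vv")
    argv += [host, remote_command]
    return argv


# Print a copy-and-paste form of the command for a guest from the log.
print(shlex.join(build_ssh_command("vmguest-2", "netstat -an")))
```

The list form can be handed to `subprocess.run` unchanged, which avoids shell quoting problems when the remote command contains spaces.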
RE: How to configure a new computing node?
Sorry for asking again. I suspect that the problem may be related to the network settings. I don't understand why sshd on the VM was not active. Is it possible that the IP address of the VM is not correct? Any suggestions/hints would be very much appreciated!

Thanks,
Lei

From: Huang,Lei [lhu...@pvamu.edu]
Sent: Thursday, September 22, 2011 10:36 AM
To: vcl-user@incubator.apache.org; aaron_pee...@ncsu.edu
Subject: Re: How to configure a new computing node?

Aaron,

Thank you for your reply. I copied some log information as follows. It looks like the VM was started, but sshd on the VM was not active. The script looped and waited for half an hour and failed at the end. The image works fine on existing blades. I wonder if there is some configuration of VM server I didn't set correctly for the new node.

Thanks,
Lei

===

2011-09-21 23:31:27|14402|518:512|new|utils.pm:run_ssh_command(6180)|executing SSH command on CSB308:
|14402|518:512|new| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x CSB308 'vmware-cmd /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-base10-v0vmguest-10.vmx start' 2>&1
2011-09-21 23:31:31|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:31:31
2011-09-21 23:31:36|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:31:36
2011-09-21 23:31:40|14122|517:511|inuse|utils.pm:check_connection(1765)|checking for connection by admin on vmguest-2, attempt 27
2011-09-21 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH command on vmguest-2:
|14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x vmguest-2 'netstat -an' 2>&1
2011-09-21 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH command on vmguest-2:
|14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x vmguest-2 'who' 2>&1
2011-09-21 23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6262)|run_ssh_command output:
|14122|517:511|inuse| none
2011-09-21 23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6276)|SSH command executed on vmguest-2, returning (0, "none")
2011-09-21 23:31:41|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:31:41
2011-09-21 23:31:46|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:31:46
2011-09-21 23:31:48|14402|518:512|new|utils.pm:run_ssh_command(6262)|run_ssh_command output:
|14402|518:512|new| start() = 1
2011-09-21 23:31:48|14402|518:512|new|utils.pm:run_ssh_command(6276)|SSH command executed on CSB308, returning (0, "start() = 1")
2011-09-21 23:31:48|14402|518:512|new|vmware.pm:load(808)|started /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-base10-v0vmguest-10.vmx on CSB308
2011-09-21 23:31:48|14402|518:512|new|utils.pm:insertloadlog(4710)|inserted computer=17, startvm, started vm on CSB308

|14402|518:512|new| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x CSB308 'vmware-cmd /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-base10-v0vmguest-10.vmx getstate' 2>&1
2011-09-21 23:32:11|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:32:11
2011-09-21 23:32:16|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:32:16
2011-09-21 23:32:21|14122|517:511|inuse|utils.pm:check_connection(1765)|checking for connection by admin on vmguest-2, attempt 29
2011-09-21 23:32:21|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH command on vmguest-2:
|14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x vmguest-2 'netstat -an' 2>&1
2011-09-21 23:32:21|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:32:21
2011-09-21 23:32:21|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH command on vmguest-2:
|14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x vmguest-2 'who' 2>&1
2011-09-21 23:32:22|14122|517:511|inuse|utils.pm:run_ssh_command(6262)|run_ssh_command output:
|14122|517:511|inuse| none
2011-09-21 23:32:22|14122|517:511|inuse|utils.pm:run_ssh_command(6276)|SSH command executed on vmguest-2, returning (0, "none")
2011-09-21 23:32:26|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:32:26
2011-09-21 23:32:28|14402|518:512|new|utils.pm:run_ssh_command(6262)|run_ssh_command output:
|14402|518:512|new| getstate() = on
2011-09-21 23:32:28|14402|518:512|new|utils.pm:run_ssh_command(6276)|SSH command executed on CSB308, returning (0, "getstate() = on")
2011-09-21 23:32:28|14402|518:512|new|vmware.pm:load(831)|checking state of vm vmguest-10
2011-09-21 23:32:28|14402|518:512|new|utils.pm:insertloadlog(4710)|inserted computer=17, vmstage1, node has been turned on
2011-09-21 23:32:28|14402|518:512|new|vmware.pm:load(838)|stage1 completed vm vmguest-10 has been turned on
2011-09-21 23:32:28|14402|518:512|new
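Log excerpts like the one above interleave two reservations (518:512 and 517:511) with the daemon's own check-in lines, so it can help to split entries into fields before reading them. The Python sketch below does that under stated assumptions: the field meanings (process ID, an ID pair, state, source location) are inferred from the excerpt rather than taken from VCL documentation, and all function and key names are made up.

```python
import re
from typing import Dict, List, Optional

_TIMESTAMP = re.compile(r"^(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2})\|(.*)$")
_ID_PAIR = re.compile(r"^\d+:\d+$")
_ATTEMPT = re.compile(r"attempt (\d+)")


def parse_entry(line: str) -> Optional[Dict[str, str]]:
    """Split one timestamped entry into fields; None for anything else.

    Two shapes appear in the excerpt:
      timestamp|pid|id:id|state|file:sub(line)|message
      timestamp|pid|file:sub(line)|message          (daemon check-ins)
    Continuation lines (starting with "|") and cut-off lines yield None.
    """
    match = _TIMESTAMP.match(line.strip())
    if match is None:
        return None
    timestamp, rest = match.groups()
    parts = rest.split("|")
    if len(parts) >= 2 and _ID_PAIR.match(parts[1]):
        if len(parts) < 5:
            return None  # truncated entry
        pid, ids, state, location = parts[:4]
        message = "|".join(parts[4:])
    elif len(parts) >= 3:
        pid, location = parts[:2]
        ids, state = "", ""
        message = "|".join(parts[2:])
    else:
        return None
    return {"timestamp": timestamp, "pid": pid, "ids": ids,
            "state": state, "location": location, "message": message}


def connection_attempts(lines: List[str]) -> List[int]:
    """Collect the attempt numbers reported by check_connection entries."""
    attempts = []
    for line in lines:
        entry = parse_entry(line)
        if entry and "check_connection" in entry["location"]:
            found = _ATTEMPT.search(entry["message"])
            if found:
                attempts.append(int(found.group(1)))
    return attempts
```

Filtering parsed entries on the ID pair, for example keeping only those whose `ids` is `518:512`, leaves just the lines for the new reservation on CSB308.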