Re: Problem with ssh on the vmguest machine
It would be helpful if you could provide some more information. Did you have to manually start the Cygwin SSHD service while logged in as another user? Did it display a credentials error when you first tried to start it? If so, what username is the service configured to use when the VM comes up?

Nothing in the code should delete the root account. It would also be helpful to know which accounts are listed when you run net user.

Regards,
Andy

Kiran N wrote:

Hello all,

I have my Windows 2003 image configured with Cygwin. There is a root account, and a password has been set for it. I even updated the vcld.conf file with the password. When I make a reservation, the vmguest (vcl01) machine starts, but the reservation fails when an SSH connection to it is attempted.

I tried running

/usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x vcl01 'uname -s' 2>&1

from the command line, and it says "Host key authentication failed."

When I start Cygwin on that vmguest machine, I see that there is no root account, so I infer that is why the above SSH command fails. Do I need to add the root account again on every vmguest machine? Please help!

Part of the log file is below.

2010-04-14 12:41:28|20587|34:34|reload|utils.pm:run_ssh_command(6167)|executing SSH command on vcl01:
|20587|34:34|reload| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x vcl01 'uname -s' 2>&1
2010-04-14 12:41:31|22231|vcld:main(165)|lastcheckin time updated for management node 1: 2010-04-14 12:41:31
2010-04-14 12:41:36|22231|vcld:main(165)|lastcheckin time updated for management node 1: 2010-04-14 12:41:36
|20587|34:34|reload| WARNING
|20587|34:34|reload| 2010-04-14 12:41:36|20587|34:34|reload|utils.pm:run_ssh_command(6239)|attempt 1/3: failed to execute SSH command on vcl01: uname -s, exit status: 255, SSH exits with the exit status of the remote command or with 255 if an error occurred, output:
|20587|34:34|reload| ssh output (uname -s): Permission denied, please try again.
|20587|34:34|reload| ssh output (uname -s): Permission denied, please try again.
|20587|34:34|reload| ssh output (uname -s): Received disconnect from 10.0.0.10: 2: Too many authentication failures for root
|20587|34:34|reload| ( 0) utils.pm, notify (line: 727)
|20587|34:34|reload| (-1) utils.pm, run_ssh_command (line: 6239)
|20587|34:34|reload| (-2) utils.pm, _sshd_status (line: 2745)
|20587|34:34|reload| (-3) vmware.pm, load (line: 847)
|20587|34:34|reload| (-4) new.pm, reload_image (line: 665)
|20587|34:34|reload| (-5) new.pm, process (line: 266)
|20587|34:34|reload| (-6) vcld, make_new_child (line: 594)
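When reading a log excerpt like the one above, it can help to pull out only the "ssh output" lines and count how often each message occurs. The following is a minimal sketch in Python, not part of VCL; it assumes one log entry per line, and the function name summarize_ssh_failures is made up for illustration. The sample text is copied from the excerpt in this thread.

```python
import re
from collections import Counter

# Matches the "ssh output (<command>): <message>" lines seen in the excerpt.
SSH_OUTPUT = re.compile(r"ssh output \((?P<command>[^)]*)\):\s*(?P<message>.+)$")


def summarize_ssh_failures(log_text):
    """Count each distinct SSH output message in a vcld.log excerpt.

    Hypothetical helper: assumes one log entry per line.
    """
    counts = Counter()
    for line in log_text.splitlines():
        match = SSH_OUTPUT.search(line)
        if match:
            counts[match.group("message").strip()] += 1
    return counts


# Lines copied from the log excerpt quoted in this thread.
sample = """\
|20587|34:34|reload| WARNING
|20587|34:34|reload| ssh output (uname -s): Permission denied, please try again.
|20587|34:34|reload| ssh output (uname -s): Permission denied, please try again.
|20587|34:34|reload| ssh output (uname -s): Received disconnect from 10.0.0.10: 2: Too many authentication failures for root
"""

for message, count in summarize_ssh_failures(sample).most_common():
    print(f"{count}x {message}")
```

Repeated "Permission denied" messages followed by "Too many authentication failures for root" suggest the server rejected every credential offered for root, which is consistent with the root account being missing or the key not being accepted.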
Re: working on JIRA issues for 2.2 release
On to the next round:

VCL-325: removing a subimage from a cluster can result in all subimages getting removed
VCL-206: Alphabetize list of virtual hosts
VCL-125: Add ability to control whether or not users have root/administrator access
VCL-272: block request form

Again, work on these is pretty straightforward, so I won't bother describing it.

Josh

On Monday April 12, 2010, Josh Thompson wrote:

I've finished all of the previously listed issues. Now, I'm going to start on

VCL-277: deleting user group that owns a resource group
VCL-250: authentication for XML RPC X-User doesn't properly check to see if no authtype was found
VCL-241: detailed loading summary does not compute times correctly for future reservations that are preloaded
VCL-220: VMs in the available state which aren't assigned to a VM host may be reserved
VCL-244: Statistics page - show number of failed loads per image

Josh

On Wednesday April 07, 2010, Josh Thompson wrote:

I've been getting some work done on the frontend issues for the 2.2 release the last few days. I just remembered that we're supposed to be emailing this list before we start work on any issues, stating which ones and adding some comments on how to resolve them.

I'm about to start on the following issues (in no particular order). Most of them are pretty straightforward, so I'll only add comments for the more complicated ones.

VCL-93: LDAP part of a login to fail silently on errors when Shibboleth authentication is used
VCL-233: all calls to simpleAddRequest need to check for all return codes
VCL-32: modify XMLRPCaddRequest to allow an end time to be specified
  - with this one I'll also add XMLRPCeditRequest that will allow the end time to be modified
VCL-299: shib users may have incorrect full name and email address
VCL-189: cannot add a computer in maintenance state
VCL-138: remove antiquated vcldquery support
VCL-303: tomaintenance reservations inserting wrong image revision
VCL-300: shib users always have email notices turned off
VCL-148: Add management node to current reservations view for ADMIN_DEVELOPER
VCL-155: Decrease initial Current Reservation page refresh interval

Josh

--
Josh Thompson
Systems Programmer
Advanced Computing | VCL Developer
North Carolina State University

josh_thomp...@ncsu.edu
919-515-5323

my GPG/PGP key can be found at pgp.mit.edu
Re: Only one vm working
Hi Terry,

Is the image booting on vmguest-2 and 3 but SSH is failing, or is it not booting at all?

If it isn't booting, check the computer.drivetype values for the VMs. I came across an issue with this last week with another pilot attempting to create a Linux VMware base image. If the value differs among the VMs, try swapping sda/hda and see what happens.

If the image is booting but SSH isn't responding, check the MAC addresses and IP addresses that are assigned to the VMs. If a VM isn't receiving an IP address, do the private MAC addresses match dhcpd.conf and /etc/hosts? Also, check the VM host to make sure you don't have multiple instances of a VM using the same MAC address.

I'm not sure what's causing the "Failed to resolve given hostname" error. I'm guessing this is coming from the nmap command. Was this error listed in vcld.log or did you see it somewhere else? Please provide some lines leading up to this error if it's from the log.

Regards,
Andy

Terry McGuire wrote:

Hi Andy (and anyone else following along here).

I've been doing a lot of poking around, and, long story short, I can now (for the first time ever) successfully book and log into the Windows image (yay!) but, annoyingly, only with a single one of the vm guest computers I've configured.

While stumbling around in the dark, I decided to try setting up a Linux base image as well as the Windows one. The process went much quicker, but, unfortunately, it seems to be getting hung up in a similar place to the Windows image. That's not the interesting thing, though. When I created the Linux image, I created a new vm guest to run it on (vmguest-2). When I got tired of playing with the Linux image, I switched back to the Windows image, and, to my amazement, it worked! And then I realized that it was loading on vmguest-2. It still didn't work on vmguest-1. I created yet another vm, vmguest-3, but it won't work on that one either. Only vmguest-2. I can't quite figure out what's special about it. I even swapped the private IP addresses, so vmguest-1 had vmguest-2's address, with the same result. (And, with the wiki down at the moment, I can't get to the Linux base image documentation to see if there was something special about how I made the vm in the first place.)

As well, the errors I get are different on vmguest-1 and 3. On 1, it can't ssh into the machine, as before. On 3, it starts giving me these:

Failed to resolve given hostname/IP: vmguest-3. Note that you can't use '/mask' AND '1-4,7,100-' style IP ranges
WARNING: No targets were specified, so 0 hosts scanned.

To my newbie eyes, all three vm computers are configured as identically in the vcl computers tables as possible under the circumstances.

Another thing (though probably not related): the machines all come up with 512MB of memory, but I've set them to have 1024MB. Clearly, I'm missing some config info somewhere.

At this point it seems I have a useful situation for continued debugging: a working setup, but only for the Windows image, and only for a single VM. There's *gotta* be a way to figure out what's the difference making the difference. I'm not worrying about the Linux image right now. I figure, once I get Windows images running properly, I'll have a much easier time getting Linux working.

On a (related) side note, I see the list is getting much busier with newbies like me asking newbie questions. A mixed blessing? Obvious interest in the product, but a whole lot of support work for you, huh? Once I actually have a clue, I fully intend to start contributing back, to help with this situation.

Terry

On 7 Apr 2010, at 1418h, Andy Kurth wrote:

Is SSH working and is everything being processed by vcld to the point where you see the Connect button on the web page? If you are just manually running the scripts then RDP won't be available because the firewall port isn't open. vcld opens it later on in the process.

I have not seen this error before in the output from ipconfig called from configure_networking.vbs:

An internal error occurred: The file name is too long.

I'm wondering if a problem occurred obtaining the IP address. Can you run ipconfig /all manually, and does this error show up?

If SSH is working correctly on the private interface, then I'm guessing there is a routing table problem. There are no 129.x entries. This seems odd. Do any entries appear for 129.x in the routing table if you run ipconfig /renew, then route print?

If vcld is completely loading the computer, then the errors that occur in configure_networking.vbs may not be the problem. The output from the log file where set_public_default_route is called will be helpful. The .vbs script attempts to set default routes but the vcld code does this again later on.

--
Andy Kurth
Virtual Computing Lab
Office of Information Technology
North Carolina State University
andy_ku...@ncsu.edu
919.513.4090
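The advice above about making sure no two VMs share a MAC address can be checked mechanically against dhcpd.conf. Below is a rough Python sketch, not part of VCL; it assumes simple, un-nested ISC dhcpd host blocks, and the function name find_duplicate_macs plus every MAC and IP address in the sample are invented for illustration.

```python
import re
from collections import defaultdict

# Matches simple "host <name> { ... }" blocks (no nested braces assumed).
HOST_BLOCK = re.compile(r"host\s+(?P<name>[\w.-]+)\s*\{(?P<body>[^}]*)\}", re.IGNORECASE)
# Matches the "hardware ethernet <mac>;" statement inside a host block.
MAC = re.compile(r"hardware\s+ethernet\s+(?P<mac>[0-9A-Fa-f:]{17})\s*;", re.IGNORECASE)


def find_duplicate_macs(dhcpd_conf_text):
    """Return {mac: [host names]} for MAC addresses used by more than one host."""
    hosts_by_mac = defaultdict(list)
    for block in HOST_BLOCK.finditer(dhcpd_conf_text):
        mac = MAC.search(block.group("body"))
        if mac:
            # Normalize case so 00:50:56:1A... and 00:50:56:1a... compare equal.
            hosts_by_mac[mac.group("mac").lower()].append(block.group("name"))
    return {mac: names for mac, names in hosts_by_mac.items() if len(names) > 1}


# Made-up sample; the addresses are illustrative, not taken from the thread.
sample_conf = """
host vmguest-1 { hardware ethernet 00:50:56:1a:01:01; fixed-address 10.0.0.11; }
host vmguest-2 { hardware ethernet 00:50:56:1a:01:02; fixed-address 10.0.0.12; }
host vmguest-3 {
    hardware ethernet 00:50:56:1A:01:01;
    fixed-address 10.0.0.13;
}
"""

for mac, names in find_duplicate_macs(sample_conf).items():
    print(f"{mac} is assigned to: {', '.join(names)}")
```

The same idea could be applied to the MAC addresses recorded for the VMs on the VM host, but where those are stored depends on the setup, so that part is left out here.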
Re: Problem with ssh on the vmguest machine
Thanks Andy!

I did have to manually start the Cygwin SSHD service, and I did not get any credentials error when it started up. It logged me in as watcher (so I guess that is the account it is configured to use when the VM starts up). When I checked for the root account, I couldn't find one in the /home directory. Also, the /etc/passwd files differ between the vmhost win2003 machine and the new vmguest (vcl01, which starts up when I make a reservation).

Below is the list of accounts:

Administrator
watcher
kaharris
maestro
sshd
cyg_server
Guest
mjohnston
SUPPORT_388945a0

Hope this info helps!

Thanks!
kiran

On Thu, Apr 15, 2010 at 9:56 AM, Andy Kurth andy_ku...@ncsu.edu wrote:

It would be helpful if you could provide some more information. Did you have to manually start the Cygwin SSHD service while logged in as another user? Did it display a credentials error when you first tried to start it? If so, what username is the service configured to use when the VM comes up?

Nothing in the code should delete the root account. It would also be helpful to know which accounts are listed when you run net user.

Regards,
Andy

--
Thanks,
Kiran
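The /etc/passwd comparison mentioned in this message can be narrowed down to account names, which makes a missing root entry easy to spot. This is a small Python sketch under assumptions: both files use the usual colon-separated passwd layout, the helper names are hypothetical, and the sample entries are invented rather than taken from the machines in this thread.

```python
def account_names(passwd_text):
    """Return the set of account names (first colon-separated field) in passwd text."""
    names = set()
    for line in passwd_text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        names.add(line.split(":", 1)[0])
    return names


def missing_accounts(reference_passwd, other_passwd):
    """Accounts present in the reference file but absent from the other one."""
    return sorted(account_names(reference_passwd) - account_names(other_passwd))


# Invented sample contents; real Cygwin entries carry more fields than shown.
image_passwd = """\
Administrator:unused:500:513::/home/Administrator:/bin/bash
root:unused:1003:513::/home/root:/bin/bash
sshd:unused:1004:513::/var/empty:/bin/false
"""
guest_passwd = """\
Administrator:unused:500:513::/home/Administrator:/bin/bash
sshd:unused:1004:513::/var/empty:/bin/false
"""

print(missing_accounts(image_passwd, guest_passwd))
```

In practice the two strings would be read from copies of the files taken from the image and from the loaded vmguest.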