Re: RHEL5 Base Linux Image Creation
On Wed, Jul 14, 2010 at 09:43:26AM -0400, Hartl, Gerhard L. wrote:
> I'm attempting to create a base Linux image and keep getting stuck in the
> same spot. The image is capturing properly, but when Linux is going
> through post load, something breaks ssh. I don't understand why
> "/etc/init.d/ext_sshd stop" would halt ssh on the private interface?!?

I ran into this same problem. The issue is the RHEL5 init script for
sshd; when asked to "stop", it runs killproc on sshd (the command
string). This kills both the private and public SSH daemons.

I modified the RHEL script to instead call 'killproc -p' on the PID file,
not the command name:

> diff /tmp/ext_sshd /etc/init.d/sshd
30,31c30
< PID_FILE=/var/run/ext_sshd.pid
< OPTIONS='-f /etc/ssh/external_sshd_config'
---
> PID_FILE=/var/run/sshd.pid
124,125c123,124
< if [ -f $PID_FILE ]; then
< killproc -p $PID_FILE
---
> if [ -n "`pidfileofproc $SSHD`" ] ; then
> killproc $SSHD

-- 
Scott M. Sorrentino
CIT Systems & Operations, Cornell University
705 Rhodes Hall // (607) 254-8535
GnuPG fingerprint: 6E30 0B83 43F8 CF8B 3B44 7DBE 6AAE DFC9 1DE6 8C1C
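For anyone who wants to see the idea outside the RHEL init functions, here is a rough sketch in plain sh of stopping a daemon by its PID file instead of by command name. It is only an illustration and not the RHEL killproc implementation: stop_by_pidfile is a hypothetical helper name, and the path in the usage comment is simply the PID_FILE value from the diff.

```shell
#!/bin/sh
# Hypothetical helper (not part of RHEL or VCL): stop only the process
# whose PID is recorded in the given PID file. Other processes that
# share the same command name (e.g. a second sshd) are left alone.
stop_by_pidfile() {
    pidfile=$1

    # No PID file means we do not know which process to stop.
    [ -f "$pidfile" ] || return 1

    pid=$(cat "$pidfile")

    # Refuse anything that is not a plain number.
    case $pid in
        ''|*[!0-9]*) return 1 ;;
    esac

    # Send SIGTERM to exactly that PID, then drop the stale PID file.
    kill "$pid" 2>/dev/null || return 1
    rm -f "$pidfile"
}

# Usage (path taken from the diff above):
#   stop_by_pidfile /var/run/ext_sshd.pid
```

Killing by name matches every process called sshd, which is why the private daemon went down too. Killing by PID file touches only the daemon that wrote that file.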
RHEL5 Base Linux Image Creation
I'm attempting to create a base Linux image and keep getting stuck in the
same spot. The image is capturing properly, but when Linux is going
through post load, something breaks ssh. I don't understand why
"/etc/init.d/ext_sshd stop" would halt ssh on the private interface?!?

2010-07-14 09:21:30|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 seconds, waiting for computer to start SSH. Try 1
2010-07-14 09:21:35|26430|vcld:main(165)|lastcheckin time updated for management node 2: 2010-07-14 09:21:35
2010-07-14 09:21:36|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 seconds, waiting for computer to start SSH. Try 2
2010-07-14 09:21:40|26430|vcld:main(165)|lastcheckin time updated for management node 2: 2010-07-14 09:21:40
2010-07-14 09:21:41|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 seconds, waiting for computer to start SSH. Try 3
2010-07-14 09:21:45|26430|vcld:main(165)|lastcheckin time updated for management node 2: 2010-07-14 09:21:45
2010-07-14 09:21:46|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 seconds, waiting for computer to start SSH. Try 4
2010-07-14 09:21:50|26430|vcld:main(165)|lastcheckin time updated for management node 2: 2010-07-14 09:21:50
2010-07-14 09:21:51|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 seconds, waiting for computer to start SSH. Try 5
2010-07-14 09:21:55|26430|vcld:main(165)|lastcheckin time updated for management node 2: 2010-07-14 09:21:55
2010-07-14 09:21:56|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 seconds, waiting for computer to start SSH. Try 6
2010-07-14 09:22:00|26430|vcld:main(165)|lastcheckin time updated for management node 2: 2010-07-14 09:22:00
2010-07-14 09:22:02|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 seconds, waiting for computer to start SSH. Try 7
2010-07-14 09:22:05|26430|vcld:main(165)|lastcheckin time updated for management node 2: 2010-07-14 09:22:05
2010-07-14 09:22:07|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 seconds, waiting for computer to start SSH. Try 8
2010-07-14 09:22:10|26430|vcld:main(165)|lastcheckin time updated for management node 2: 2010-07-14 09:22:10
2010-07-14 09:22:12|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 seconds, waiting for computer to start SSH. Try 9
2010-07-14 09:22:15|26430|vcld:main(165)|lastcheckin time updated for management node 2: 2010-07-14 09:22:15
2010-07-14 09:22:17|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 seconds, waiting for computer to start SSH. Try 10
2010-07-14 09:22:20|26430|vcld:main(165)|lastcheckin time updated for management node 2: 2010-07-14 09:22:20
2010-07-14 09:22:23|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 seconds, waiting for computer to start SSH. Try 11
2010-07-14 09:22:25|26430|vcld:main(165)|lastcheckin time updated for management node 2: 2010-07-14 09:22:25
2010-07-14 09:22:28|28645|68:72|reload|utils.pm:run_ssh_command(6180)|executing SSH command on vmwarerhel54-base33-v0:
|28645|68:72|reload| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x vmwarerhel54-base33-v0 'uname -s' 2>&1
2010-07-14 09:22:28|28645|68:72|reload|utils.pm:run_ssh_command(6262)|run_ssh_command output:
|28645|68:72|reload| Linux
2010-07-14 09:22:28|28645|68:72|reload|utils.pm:run_ssh_command(6276)|SSH command executed on vmwarerhel54-base33-v0, returning (0, "Linux")
2010-07-14 09:22:28|28645|68:72|reload|esxthin.pm:load(595)|vmwarerhel54-base33-v0 now has active sshd running
2010-07-14 09:22:28|28645|68:72|reload|utils.pm:insertloadlog(4710)|inserted computer=516, dynamicDHCPaddress, collecting dynamic IP address for node
2010-07-14 09:22:28|28645|68:72|reload|esxthin.pm:load(618)|Attempting to query vmclient for its public IP...
2010-07-14 09:22:28|28645|68:72|reload|utils.pm:getdynamicaddress(1605)|PrivateIP address for vmwarerhel54-base33-v0 collected 192.168.131.254
2010-07-14 09:22:28|28645|68:72|reload|utils.pm:run_ssh_command(6180)|executing SSH command on vmwarerhel54-base33-v0:
|28645|68:72|reload| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x vmwarerhel54-base33-v0 '/sbin/ifconfig |grep inet' 2>&1
2010-07-14 09:22:29|28645|68:72|reload|utils.pm:run_ssh_command(6262)|run_ssh_command output:
|28645|68:72|reload| inet addr:192.168.131.254 Bcast:192.168.131.255 Mask:255.255.254.0
|28645|68:72|reload| inet6 addr: fe80::250:56ff:fe00:3da/64 Scope:Link
|28645|68:72|reload| inet addr:128.82.130.171 Bcast:128.82.131.255 Mask:255.255.254.0
|28645|68:72|reload| inet6 addr: fe80::250:56ff:fe00:3db/64 Scope:Link
|28645|68:72|reload| inet addr:127.0.0.1 Mask:255.0.0.0
|28645|68:72|reload| inet6 addr: ::1/128 Scope:Host
2010-07-14 09:22:29|28645|68:72|reload|utils.pm:run_ssh_command(6276)|SSH command executed on vmwarerhel54-base33-v0, returning (0, "inet addr:192.168.131.254 Bcas...")
2010-07-14 09:22:29|28645|68:72|reload|utils.pm:getdyn
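
The log above shows the management node retrying an SSH check every 5 seconds until 'uname -s' succeeds. To reproduce that check by hand from the management node while debugging, something like the following might help. It is a sketch under assumptions, not VCL's own code: retry is a hypothetical helper, and the ssh command in the usage comment is copied from the log.

```shell
#!/bin/sh
# Hypothetical helper (not VCL code): run a command up to MAX times,
# sleeping DELAY seconds between failed attempts.
# Returns 0 as soon as the command succeeds, 1 if every attempt fails.
retry() {
    max=$1
    delay=$2
    shift 2

    try=1
    while [ "$try" -le "$max" ]; do
        if "$@"; then
            return 0
        fi
        echo "Try $try failed" >&2
        # Only sleep if another attempt is coming.
        if [ "$try" -lt "$max" ]; then
            sleep "$delay"
        fi
        try=$((try + 1))
    done
    return 1
}

# Usage (command copied from the vcld.log excerpt above):
#   retry 12 5 /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x \
#       vmwarerhel54-base33-v0 'uname -s'
```

Running this right before and right after "/etc/init.d/ext_sshd stop" on the guest should show whether the private sshd stays reachable.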
Linux Image
I've read through much of the mailing lists and found
https://cwiki.apache.org/confluence/display/VCL/Create+a+Linux+Base+Image

I've installed:
1) webnode
2) managementnode, having managementnode.id=1, installpath=/usr/local/vclimages
3) vmnode1 (VMware Server 1.0.10), having computer.id=1

I modeled my setup after
https://cwiki.apache.org/confluence/display/VCL/Create+a+Linux+Base+Image
and did a bunch of inserts. Instead of getting logs on vmwarelinux, I've got

"|21106|1:9|image| 2010-05-10 14:23:54|21106|1:9|image|image.pm:reservation_failed(390)|winxp-base1-v0 image creation failed"

in vcld.log.

The ssh from the managementnode to vmnode1 seems to be working; I saw
"currentimage.txt" in /root/. It contains references to winxp-base1-v0.

I'm still trying to get past
https://cwiki.apache.org/confluence/display/VCL/Create+a+Linux+Base+Image

Thanks

Details of INSERT done:

INSERT INTO `OS`
  ( `id`, `name`, `prettyname`, `type`, `installtype`, `sourcepath`, `moduleid`)
VALUES
  (34, 'vmwarelinux', 'VMware Generic Linux', 'linux', 'vmware','vmware_images',2);

INSERT INTO `vcl`.`image`
  ( `id` , `name` , `prettyname` , `ownerid` , `platformid` , `OSid` ,
    `imagemetaid` , `minram` , `minprocnumber` , `minprocspeed` ,
    `minnetwork` , `maxconcurrent` , `reloadtime` , `deleted` , `test` ,
    `lastupdate` , `forcheckout` , `maxinitialtime` , `project` , `size` ,
    `architecture` , `description` , `usage` , `basedoffrevisionid` )
VALUES
  ( '10' , 'vmwarelinux-base8-v1', 'No Apps (Linux)', '1', '1', '34',
    NULL, '512', '1', '1024',
    '100', NULL , '5', '0', '0',
    NOW(), '1', '0', 'vcl', '1500',
    'x86', NULL , NULL , '0' );

INSERT INTO `vcl`.`imagerevision`
  ( `id` , `imageid` , `revision` , `userid` , `datecreated` , `deleted` ,
    `datedeleted` , `production` , `comments` , `imagename` )
VALUES
  ( '10', '10', '1', '1', NOW(), '0',
    NULL , '1', NULL , 'vmwarelinux-base8-v1' );

## select * from resourcetype;
INSERT INTO `vcl`.`resource`
  ( `id` , `resourcetypeid` , `subid` )
VALUES
  ( NULL , '13', '10' );

INSERT INTO `vcl`.`request`
  ( `id` , `stateid` , `userid` , `laststateid` , `logid` , `forimaging` ,
    `test` , `preload` , `start` , `end` , `daterequested` , `datemodified` )
VALUES
  ( '1' , '16', '1', '16', '', '1',
    '0', '0', NOW( ) , TIMESTAMPADD(MINUTE, 120, NOW()), NOW( ) , NULL );

INSERT INTO `vcl`.`reservation`
  ( `id` , `requestid` , `computerid` , `imageid` , `imagerevisionid` ,
    `managementnodeid` , `remoteIP` , `lastcheck` , `pw` )
VALUES
  ( NULL , '1', '1', '10', '1',
    '1', NULL , NULL , NULL );

vcld.log

2010-05-10 12:27:40|20800|1:6|image|utils.pm:mail(1348)|SUCCESS -- Sending mail To: , VCL -- NOTICE FAILED Image Creation No Apps (WinXP)
2010-05-10 12:27:40|20800|1:6|image|utils.pm:update_request_state(2186)|request 1 state updated to: maintenance, laststate to: image
2010-05-10 12:27:40|20800|1:6|image|image.pm:reservation_failed(442)|request state set to maintenance, laststate to image
2010-05-10 12:27:40|20800|1:6|image|utils.pm:update_computer_state(2228)|computer 1 state updated to: maintenance
2010-05-10 12:27:40|20800|1:6|image|image.pm:reservation_failed(450)|vmnode1 state set to maintenance
2010-05-10 12:27:40|20800|1:6|image|image.pm:reservation_failed(457)|exiting
2010-05-10 12:27:40|20800|1:6|image|State.pm:DESTROY(905)|destructor called, ref($self)=VCL::image
2010-05-10 12:27:40|20800|1:6|image|utils.pm:delete_computerloadlog_reservation(7551)|removing computerloadlog entries matching loadstate = begin
2010-05-10 12:27:40|20800|1:6|image|utils.pm:delete_computerloadlog_reservation(7598)|deleted rows from computerloadlog for reservation id=6
2010-05-10 12:27:40|20800|1:6|image|State.pm:DESTROY(912)|removed computerloadlog rows with loadstate=begin for reservation
2010-05-10 12:27:40|20800|1:6|image|State.pm:DESTROY(924)|number of database handles state process created: 1
2010-05-10 12:27:40|20800|1:6|image|State.pm:DESTROY(933)|process has a database handle stored in $ENV{dbh}, attempting disconnect
2010-05-10 12:27:40|20800|1:6|image|State.pm:DESTROY(935)|$ENV{dbh}: database disconnect successful
2010-05-10 12:27:40|20800|1:6|image|State.pm:DESTROY(949)|VCL::image process 20800 exiting
2010-05-10 12:27:40|20783|vcld:REAPER(744)|VCL process exited for reservation 6
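
The excerpt above only covers the cleanup after the failure, so the first error for that reservation is probably further up in vcld.log. Judging from the lines quoted here, the third pipe-separated field looks like request:reservation (1:6 for request 1, reservation 6). Assuming that reading is right, a small filter like this could pull out just one reservation's entries. vcld_filter is a hypothetical name, and the log path in the usage comment is a placeholder.

```shell
#!/bin/sh
# Hypothetical helper (not part of VCL): read vcld.log on stdin and print
# the timestamp, module and message of every entry whose third
# pipe-separated field equals the given request:reservation id.
# Assumes the layout seen in the excerpts:
#   timestamp|pid|request:reservation|state|module(line)|message
vcld_filter() {
    awk -F'|' -v id="$1" '
        $3 == id {
            # Re-join fields 5..NF so messages containing "|" stay intact.
            msg = $5
            for (i = 6; i <= NF; i++)
                msg = msg "|" $i
            print $1, msg
        }'
}

# Usage (replace the path with wherever your vcld.log lives):
#   vcld_filter 1:6 < /path/to/vcld.log | head -50
```

Continuation lines that start with "|" carry the id in their fourth field instead, so this filter skips them.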