Re: RHEL5 Base Linux Image Creation

2010-07-14 Thread Scott M. Sorrentino

On Wed, Jul 14, 2010 at 09:43:26AM -0400, Hartl, Gerhard L. wrote:
>I'm attempting to create a base Linux image and keep getting stuck in the
>same spot.  The image is capturing properly, but when Linux is going
>through post load, something breaks ssh.  I don't understand why
>"/etc/init.d/ext_sshd stop" would halt ssh on the private interface?!?

I ran into this same problem.  The issue is the RHEL5 init script for sshd:
when asked to "stop", it runs killproc on the sshd command name rather than
on a specific PID.  Both daemons run the same sshd binary, so this kills both
the private and the public SSH daemon.

I modified the ext_sshd copy of the RHEL script to call 'killproc -p' on its
own PID file instead of on the command name:

> diff /tmp/ext_sshd /etc/init.d/sshd 
30,31c30
< PID_FILE=/var/run/ext_sshd.pid
< OPTIONS='-f /etc/ssh/external_sshd_config'
---
> PID_FILE=/var/run/sshd.pid
124,125c123,124
<   if [ -f $PID_FILE ]; then
<   killproc -p $PID_FILE
---
>   if [ -n "`pidfileofproc $SSHD`" ] ; then
>   killproc $SSHD
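
For anyone reading this outside of RHEL, the idea behind the change can be sketched in plain shell. This is only an illustration, not the RHEL script itself: the function name stop_by_pidfile is hypothetical, and it uses plain kill as a stand-in for 'killproc -p', which comes from /etc/init.d/functions. The point is that only the PID recorded in the given file is signalled, so a second daemon started from the same binary is left alone.

```shell
#!/bin/sh
# Hypothetical helper (not part of RHEL): stop only the process whose PID is
# recorded in a PID file, instead of every process matching a command name.
stop_by_pidfile() {
    pid_file=$1

    # No PID file means there is nothing we can safely stop.
    [ -f "$pid_file" ] || return 1

    # Read the first line and make sure it is a plain number.
    pid=$(head -n 1 "$pid_file")
    case $pid in
        ''|*[!0-9]*) return 1 ;;
    esac

    # Send TERM to that one process only, then clean up the stale file.
    kill "$pid" 2>/dev/null || return 1
    rm -f "$pid_file"
}
```

With the PID file from the diff above, the call would look like 'stop_by_pidfile /var/run/ext_sshd.pid', assuming the external daemon actually writes its PID to that path.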



-- 
Scott M. Sorrentino 
CIT Systems & Operations, Cornell University
705 Rhodes Hall // (607) 254-8535
GnuPG fingerprint: 6E30 0B83 43F8 CF8B 3B44  7DBE 6AAE DFC9 1DE6 8C1C



RHEL5 Base Linux Image Creation

2010-07-14 Thread Hartl, Gerhard L.
I'm attempting to create a base Linux image and keep getting stuck in the same 
spot.  The image is capturing properly, but when Linux is going through post 
load, something breaks ssh.  I don't understand why "/etc/init.d/ext_sshd stop" 
would halt ssh on the private interface?!?



2010-07-14 09:21:30|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 
seconds, waiting for computer to start SSH. Try 1
2010-07-14 09:21:35|26430|vcld:main(165)|lastcheckin time updated for 
management node 2: 2010-07-14 09:21:35
2010-07-14 09:21:36|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 
seconds, waiting for computer to start SSH. Try 2
2010-07-14 09:21:40|26430|vcld:main(165)|lastcheckin time updated for 
management node 2: 2010-07-14 09:21:40
2010-07-14 09:21:41|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 
seconds, waiting for computer to start SSH. Try 3
2010-07-14 09:21:45|26430|vcld:main(165)|lastcheckin time updated for 
management node 2: 2010-07-14 09:21:45
2010-07-14 09:21:46|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 
seconds, waiting for computer to start SSH. Try 4
2010-07-14 09:21:50|26430|vcld:main(165)|lastcheckin time updated for 
management node 2: 2010-07-14 09:21:50
2010-07-14 09:21:51|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 
seconds, waiting for computer to start SSH. Try 5
2010-07-14 09:21:55|26430|vcld:main(165)|lastcheckin time updated for 
management node 2: 2010-07-14 09:21:55
2010-07-14 09:21:56|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 
seconds, waiting for computer to start SSH. Try 6
2010-07-14 09:22:00|26430|vcld:main(165)|lastcheckin time updated for 
management node 2: 2010-07-14 09:22:00
2010-07-14 09:22:02|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 
seconds, waiting for computer to start SSH. Try 7
2010-07-14 09:22:05|26430|vcld:main(165)|lastcheckin time updated for 
management node 2: 2010-07-14 09:22:05
2010-07-14 09:22:07|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 
seconds, waiting for computer to start SSH. Try 8
2010-07-14 09:22:10|26430|vcld:main(165)|lastcheckin time updated for 
management node 2: 2010-07-14 09:22:10
2010-07-14 09:22:12|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 
seconds, waiting for computer to start SSH. Try 9
2010-07-14 09:22:15|26430|vcld:main(165)|lastcheckin time updated for 
management node 2: 2010-07-14 09:22:15
2010-07-14 09:22:17|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 
seconds, waiting for computer to start SSH. Try 10
2010-07-14 09:22:20|26430|vcld:main(165)|lastcheckin time updated for 
management node 2: 2010-07-14 09:22:20
2010-07-14 09:22:23|28645|68:72|reload|esxthin.pm:load(607)|going to sleep 5 
seconds, waiting for computer to start SSH. Try 11
2010-07-14 09:22:25|26430|vcld:main(165)|lastcheckin time updated for 
management node 2: 2010-07-14 09:22:25
2010-07-14 09:22:28|28645|68:72|reload|utils.pm:run_ssh_command(6180)|executing 
SSH command on vmwarerhel54-base33-v0:
|28645|68:72|reload| /usr/bin/ssh -i /etc/vcl/vcl.key  -l root -p 22 -x 
vmwarerhel54-base33-v0 'uname -s' 2>&1
2010-07-14 
09:22:28|28645|68:72|reload|utils.pm:run_ssh_command(6262)|run_ssh_command 
output:
|28645|68:72|reload| Linux
2010-07-14 09:22:28|28645|68:72|reload|utils.pm:run_ssh_command(6276)|SSH 
command executed on vmwarerhel54-base33-v0, returning (0, "Linux")
2010-07-14 
09:22:28|28645|68:72|reload|esxthin.pm:load(595)|vmwarerhel54-base33-v0 now has 
active sshd running
2010-07-14 09:22:28|28645|68:72|reload|utils.pm:insertloadlog(4710)|inserted 
computer=516, dynamicDHCPaddress, collecting dynamic IP address for node
2010-07-14 09:22:28|28645|68:72|reload|esxthin.pm:load(618)|Attempting to query 
vmclient for its public IP...
2010-07-14 
09:22:28|28645|68:72|reload|utils.pm:getdynamicaddress(1605)|PrivateIP address 
for vmwarerhel54-base33-v0 collected 192.168.131.254
2010-07-14 09:22:28|28645|68:72|reload|utils.pm:run_ssh_command(6180)|executing 
SSH command on vmwarerhel54-base33-v0:
|28645|68:72|reload| /usr/bin/ssh -i /etc/vcl/vcl.key  -l root -p 22 -x 
vmwarerhel54-base33-v0 '/sbin/ifconfig |grep inet' 2>&1
2010-07-14 
09:22:29|28645|68:72|reload|utils.pm:run_ssh_command(6262)|run_ssh_command 
output:
|28645|68:72|reload| inet addr:192.168.131.254 Bcast:192.168.131.255 
Mask:255.255.254.0
|28645|68:72|reload| inet6 addr: fe80::250:56ff:fe00:3da/64 Scope:Link
|28645|68:72|reload| inet addr:128.82.130.171 Bcast:128.82.131.255 
Mask:255.255.254.0
|28645|68:72|reload| inet6 addr: fe80::250:56ff:fe00:3db/64 Scope:Link
|28645|68:72|reload| inet addr:127.0.0.1 Mask:255.0.0.0
|28645|68:72|reload| inet6 addr: ::1/128 Scope:Host
2010-07-14 09:22:29|28645|68:72|reload|utils.pm:run_ssh_command(6276)|SSH 
command executed on vmwarerhel54-base33-v0, returning (0, "inet 
addr:192.168.131.254 Bcas...")
2010-07-14 09:22:29|28645|68:72|reload|utils.pm:getdyn

Linux Image

2010-05-10 Thread My LinuxHAList
I've read through much of the mailing lists and found
https://cwiki.apache.org/confluence/display/VCL/Create+a+Linux+Base+Image

I've installed:
1) webnode
2) managementnode, having managementnode.id=1,
installpath=/usr/local/vclimages
3) vmnode1 (VMware Server 1.0.10), having computer.id=1

I modeled my setup after
https://cwiki.apache.org/confluence/display/VCL/Create+a+Linux+Base+Image
and did a bunch of inserts.

Instead of log entries for vmwarelinux, I get
"|21106|1:9|image| 2010-05-10
14:23:54|21106|1:9|image|image.pm:reservation_failed(390)|winxp-base1-v0
image creation failed" in vcld.log

SSH from the managementnode to vmnode1 seems to be working; I saw
"currentimage.txt" in /root/. It contains references to winxp-base1-v0.

I'm still trying to get past
https://cwiki.apache.org/confluence/display/VCL/Create+a+Linux+Base+Image

Thanks

-

Details of the INSERTs done:

INSERT INTO `OS` (
`id`, `name`, `prettyname`,
`type`, `installtype`, `sourcepath`, `moduleid`) VALUES
(34, 'vmwarelinux', 'VMware Generic Linux',
'linux', 'vmware','vmware_images',2);


INSERT INTO `vcl`.`image` (
`id` ,
`name` ,
`prettyname` ,
`ownerid` ,
`platformid` ,
`OSid` ,
`imagemetaid` ,
`minram` ,
`minprocnumber` ,
`minprocspeed` ,
`minnetwork` ,
`maxconcurrent` ,
`reloadtime` ,
`deleted` ,
`test` ,
`lastupdate` ,
`forcheckout` ,
`maxinitialtime` ,
`project` ,
`size` ,
`architecture` ,
`description` ,
`usage` ,
`basedoffrevisionid`
)
VALUES (
'10' , 'vmwarelinux-base8-v1', 'No Apps (Linux)', '1', '1', '34', NULL,
'512', '1', '1024', '100', NULL , '5', '0', '0', NOW(), '1', '0', 'vcl',
'1500',
'x86', NULL , NULL , '0'
);

INSERT INTO `vcl`.`imagerevision` (
`id` ,
`imageid` ,
`revision` ,
`userid` ,
`datecreated` ,
`deleted` ,
`datedeleted` ,
`production` ,
`comments` ,
`imagename`
)
VALUES (
'10', '10', '1', '1', NOW(), '0', NULL , '1', NULL , 'vmwarelinux-base8-v1'
);

## select * from resourcetype;
INSERT INTO `vcl`.`resource` (
`id` ,
`resourcetypeid` ,
`subid`
)
VALUES (
NULL , '13', '10'
);

INSERT INTO `vcl`.`request` (
`id` ,
`stateid` ,
`userid` ,
`laststateid` ,
`logid` ,
`forimaging` ,
`test` ,
`preload` ,
`start` ,
`end` ,
`daterequested` ,
`datemodified`
)
VALUES (
'1' , '16', '1', '16', '', '1', '0', '0', NOW( ) , TIMESTAMPADD(MINUTE, 120,
NOW()), NOW( ) , NULL
);


INSERT INTO `vcl`.`reservation` (
`id` ,
`requestid` ,
`computerid` ,
`imageid` ,
`imagerevisionid` ,
`managementnodeid` ,
`remoteIP` ,
`lastcheck` ,
`pw`
)
VALUES (
NULL , '1', '1', '10', '1', '1', NULL , NULL , NULL
);



vcld.log
2010-05-10 12:27:40|20800|1:6|image|utils.pm:mail(1348)|SUCCESS -- Sending
mail To: , VCL -- NOTICE FAILED Image Creation No Apps (WinXP)
2010-05-10 12:27:40|20800|1:6|image|utils.pm:update_request_state(2186)|request
1 state updated to: maintenance, laststate to: image
2010-05-10 12:27:40|20800|1:6|image|image.pm:reservation_failed(442)|request
state set to maintenance, laststate to image
2010-05-10 
12:27:40|20800|1:6|image|utils.pm:update_computer_state(2228)|computer
1 state updated to: maintenance
2010-05-10 12:27:40|20800|1:6|image|image.pm:reservation_failed(450)|vmnode1
state set to maintenance
2010-05-10 12:27:40|20800|1:6|image|image.pm:reservation_failed(457)|exiting
2010-05-10 12:27:40|20800|1:6|image|State.pm:DESTROY(905)|destructor called,
ref($self)=VCL::image
2010-05-10 
12:27:40|20800|1:6|image|utils.pm:delete_computerloadlog_reservation(7551)|removing
computerloadlog entries matching loadstate = begin
2010-05-10 
12:27:40|20800|1:6|image|utils.pm:delete_computerloadlog_reservation(7598)|deleted
rows from computerloadlog for reservation id=6
2010-05-10 12:27:40|20800|1:6|image|State.pm:DESTROY(912)|removed
computerloadlog rows with loadstate=begin for reservation
2010-05-10 12:27:40|20800|1:6|image|State.pm:DESTROY(924)|number of database
handles state process created: 1
2010-05-10 12:27:40|20800|1:6|image|State.pm:DESTROY(933)|process has a
database handle stored in $ENV{dbh}, attempting disconnect
2010-05-10 12:27:40|20800|1:6|image|State.pm:DESTROY(935)|$ENV{dbh}:
database disconnect successful
2010-05-10 12:27:40|20800|1:6|image|State.pm:DESTROY(949)|VCL::image process
20800 exiting
2010-05-10 12:27:40|20783|vcld:REAPER(744)|VCL process exited for
reservation 6