Hello, Ok, I actually have tried many times to create a base image. I had it with NIC 1 in the private network and NIC 2 in the public previously.
Each time, when VCL deletes my base image and copies it back to the VM server, the networks on the nics are REVERSED. When I start with nic 1 in Private and nic 2 in Public, it ends up with nic 1 in Pub and nic 2 in Priv. When I start with nic 1 in Public and nic 2 in Private, it ends up with nic 1 in Priv and nic 2 in Pub. The VM host profile has Virtual Switch 0 in Public and Virtual Switch 1 in Private. The DHCP server is only listening on the private network. I see unauthorized dhcp requests come in for public network IP addresses when VCL swaps them. In the last attempt, I did something new: When the new image is brought up, I manually swapped the nics back before VCL times out waiting for the image to boot. At that time, it declares the image a success, but it should not need such manual intervention. Any clues why VCL decides to swap the networks ? Al Quiros From: Aaron Coburn <acob...@amherst.edu<mailto:acob...@amherst.edu>> Reply-To: <vcl-user@incubator.apache.org<mailto:vcl-user@incubator.apache.org>> Date: Wed, 2 Nov 2011 16:24:02 -0400 To: <vcl-user@incubator.apache.org<mailto:vcl-user@incubator.apache.org>> Subject: Re: Base Image creation - problems - PLEASE HELP Evelio, in our VCL setup and according to the docs on this page: https://cwiki.apache.org/confluence/display/VCL/Create+a+Windows+Base+Image you may want to try configuring your base image to use NIC 1 for the private network and NIC 2 for the public network. The sshd configuration script that you ran on your base image typically closes ssh access over the public network, but if the two networks are switched, you may have inadvertently cut off access over the private network. Aaron -- Aaron Coburn Systems Administrator and Programmer Academic Technology Services, Amherst College (413) 542-5451 acob...@amherst.edu<mailto:acob...@amherst.edu> On Nov 2, 2011, at 4:09 PM, Evelio Quiros wrote: Hello, Ok, I'm stumped. I continue to have issues creating a base image on our vcl setup. Here is my info: I have a single server running web, management node, and database. It has two interfaces, one on our private network, another on our public net. I have a single Vmware server, also with two interfaces. One interface, switch 0, is on our public net. It also has the VM management net. Another interface, switch 1, is on our private net. Each of these switches is labeled the same as in the Management node config. When I create the base image to capture, I create eth0 (nic1) on the public network, then I create eth1 (nic2) on the private network. I set them both to dhcp, and the dhcp server on the VCL server responds and gives it an address of an image in my database. I set the ssh key in the new image. I set the image in /etc/hosts so that the server can resolve it. I set the vcld to capture the image, It begins and copies the image to the management node directory, It deletes the image from the Vmware server, and re-creates the image All well and good. but then : When the recreated image comes up, The OTHER interface begins asking the dhcp server for addresses in the public network. The dhcp server denies it as non-authoritative, and the private interface never comes up. Even though it was able to dhcp previously. The image creation completes successfully, but the newly created image cannot connect to the vcl server. So, I eventually get a failure message from VCL. WHAT GIVES ??? Here is my error email: reservation failed on image6: process failed after trying to load or make available ------------------------------------------------------------------------ time: 2011-11-02 16:05:30 caller: State.pm:reservation_failed(213) ( 0) State.pm, reservation_failed (line: 213) (-1) new.pm, process (line: 298) (-2) vcld, make_new_child (line: 568) (-3) vcld, main (line: 346) ------------------------------------------------------------------------ management node: vcltst.fiu.edu<http://vcltst.fiu.edu> reservation PID: 29777 parent vcld PID: 23341 request ID: 11 reservation ID: 11 request state/laststate: reload/image request start time: 2011-11-02 15:53:56 request end time: 2011-11-02 16:23:56 for imaging: no log ID: none computer: image6 computer id: 7 computer type: virtualmachine computer eth0 MAC address: 00:50:56:00:10:0a computer eth1 MAC address: 00:50:56:00:10:0b computer private IP address: 10.0.0.9 computer public IP address: 10.106.128.59 computer in block allocation: no provisioning module: VCL::Module::Provisioning::VMware::VMware vm host: vcldell01 vm host ID: 1 vm host computer ID: 1 vm profile: VMware ESX - local storage vm profile VM path: /vmfs/volumes/datastore1 vm profile repository path: /images vm profile datastore path: /vmfs/volumes/datastore1 vm profile disk type: localdisk image: vmwarelinux-redhatbase10-v0 image display name: redhatbase image ID: 10 image revision ID: 10 image size: 16384 MB use Sysprep: yes root access: yes image owner ID: 1 image owner affiliation: Local image revision date created: 2011-11-02 15:51:12 image revision production: yes OS module: VCL::Module::OS::Linux user: vclreload user name: vcl reload user ID: 2 user affiliation: Local ------------------------------------------------------------------------ RECENT LOG ENTRIES FOR THIS PROCESS: 2011-11-02 16:01:20|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:01:20|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 24: code returned false, seconds elapsed/remaining: 360/240, sleeping for 15 seconds 2011-11-02 16:01:35|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 25: waiting for image6 to respond to SSH 2011-11-02 16:01:35|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:01:35|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 25: code returned false, seconds elapsed/remaining: 375/225, sleeping for 15 seconds 2011-11-02 16:01:50|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 26: waiting for image6 to respond to SSH 2011-11-02 16:01:51|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:01:51|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 26: code returned false, seconds elapsed/remaining: 391/209, sleeping for 15 seconds 2011-11-02 16:02:06|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 27: waiting for image6 to respond to SSH 2011-11-02 16:02:07|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:02:07|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 27: code returned false, seconds elapsed/remaining: 407/193, sleeping for 15 seconds 2011-11-02 16:02:22|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 28: waiting for image6 to respond to SSH 2011-11-02 16:02:22|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:02:22|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 28: code returned false, seconds elapsed/remaining: 422/178, sleeping for 15 seconds 2011-11-02 16:02:37|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 29: waiting for image6 to respond to SSH 2011-11-02 16:02:38|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:02:38|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 29: code returned false, seconds elapsed/remaining: 438/162, sleeping for 15 seconds 2011-11-02 16:02:53|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 30: waiting for image6 to respond to SSH 2011-11-02 16:02:54|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:02:54|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 30: code returned false, seconds elapsed/remaining: 454/146, sleeping for 15 seconds 2011-11-02 16:03:09|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 31: waiting for image6 to respond to SSH 2011-11-02 16:03:09|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:03:09|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 31: code returned false, seconds elapsed/remaining: 469/131, sleeping for 15 seconds 2011-11-02 16:03:24|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 32: waiting for image6 to respond to SSH 2011-11-02 16:03:25|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:03:25|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 32: code returned false, seconds elapsed/remaining: 485/115, sleeping for 15 seconds 2011-11-02 16:03:40|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 33: waiting for image6 to respond to SSH 2011-11-02 16:03:40|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:03:40|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 33: code returned false, seconds elapsed/remaining: 500/100, sleeping for 15 seconds 2011-11-02 16:03:55|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 34: waiting for image6 to respond to SSH 2011-11-02 16:03:56|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:03:56|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 34: code returned false, seconds elapsed/remaining: 516/84, sleeping for 15 seconds 2011-11-02 16:04:11|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 35: waiting for image6 to respond to SSH 2011-11-02 16:04:12|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:04:12|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 35: code returned false, seconds elapsed/remaining: 532/68, sleeping for 15 seconds 2011-11-02 16:04:27|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 36: waiting for image6 to respond to SSH 2011-11-02 16:04:27|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:04:27|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 36: code returned false, seconds elapsed/remaining: 547/53, sleeping for 15 seconds 2011-11-02 16:04:42|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 37: waiting for image6 to respond to SSH 2011-11-02 16:04:43|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:04:43|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 37: code returned false, seconds elapsed/remaining: 563/37, sleeping for 15 seconds 2011-11-02 16:04:58|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 38: waiting for image6 to respond to SSH 2011-11-02 16:04:59|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:04:59|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 38: code returned false, seconds elapsed/remaining: 579/21, sleeping for 15 seconds 2011-11-02 16:05:14|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 39: waiting for image6 to respond to SSH 2011-11-02 16:05:14|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:05:14|29777|11:11|reload|Module.pm:code_loop_timeout(755)|attempt 39: code returned false, seconds elapsed/remaining: 594/6, sleeping for 15 seconds 2011-11-02 16:05:29|29777|11:11|reload|Module.pm:code_loop_timeout(759)|attempt 40: waiting for image6 to respond to SSH 2011-11-02 16:05:30|29777|11:11|reload|OS.pm:is_ssh_responding(386)|image6 is NOT responding to SSH, ports 22 or 24 are both closed 2011-11-02 16:05:30|29777|11:11|reload|Module.pm:code_loop_timeout(767)|waiting for image6 to respond to SSH, code did not return true after waiting 600 seconds |29777|11:11|reload| ---- WARNING ---- |29777|11:11|reload| 2011-11-02 16:05:30|29777|11:11|reload|OS.pm:wait_for_response(465)|failed to connect to image6 via SSH after 600 seconds |29777|11:11|reload| ( 0) OS.pm, wait_for_response (line: 465) |29777|11:11|reload| (-1) Linux.pm, post_load (line: 224) |29777|11:11|reload| (-2) VMware.pm, load (line: 415) |29777|11:11|reload| (-3) new.pm, reload_image (line: 618) |29777|11:11|reload| (-4) new.pm, process (line: 248) |29777|11:11|reload| (-5) vcld, make_new_child (line: 568) |29777|11:11|reload| ---- WARNING ---- |29777|11:11|reload| 2011-11-02 16:05:30|29777|11:11|reload|Linux.pm:post_load(225)|image6 never responded to SSH |29777|11:11|reload| ( 0) Linux.pm, post_load (line: 225) |29777|11:11|reload| (-1) VMware.pm, load (line: 415) |29777|11:11|reload| (-2) new.pm, reload_image (line: 618) |29777|11:11|reload| (-3) new.pm, process (line: 248) |29777|11:11|reload| (-4) vcld, make_new_child (line: 568) |29777|11:11|reload| (-5) vcld, main (line: 346) |29777|11:11|reload| ---- WARNING ---- |29777|11:11|reload| 2011-11-02 16:05:30|29777|11:11|reload|VMware.pm:load(419)|failed to perform OS post-load tasks on VM image6 on VM host: vcldell01 |29777|11:11|reload| ( 0) VMware.pm, load (line: 419) |29777|11:11|reload| (-1) new.pm, reload_image (line: 618) |29777|11:11|reload| (-2) new.pm, process (line: 248) |29777|11:11|reload| (-3) vcld, make_new_child (line: 568) |29777|11:11|reload| (-4) vcld, main (line: 346) |29777|11:11|reload| ---- WARNING ---- |29777|11:11|reload| 2011-11-02 16:05:30|29777|11:11|reload|new.pm:reload_image(623)|vmwarelinux-redhatbase10-v0 failed to load on image6, returning |29777|11:11|reload| ( 0) new.pm, reload_image (line: 623) |29777|11:11|reload| (-1) new.pm, process (line: 248) |29777|11:11|reload| (-2) vcld, make_new_child (line: 568) |29777|11:11|reload| (-3) vcld, main (line: 346) 2011-11-02 16:05:30|29777|11:11|reload|utils.pm:insertloadlog(3875)|inserted computer=7, loadimagefailed, vmwarelinux-redhatbase10-v0 failed to load on image6 |29777|11:11|reload| ---- WARNING ---- |29777|11:11|reload| 2011-11-02 16:05:30|29777|11:11|reload|new.pm:process(295)|failed to load image6 with vmwarelinux-redhatbase10-v0 |29777|11:11|reload| ( 0) new.pm, process (line: 295) |29777|11:11|reload| (-1) vcld, make_new_child (line: 568) |29777|11:11|reload| (-2) vcld, main (line: 346) 2011-11-02 16:05:30|29777|11:11|reload|DataStructure.pm:get_computer_state_name(2305)|attempting to retrieve current state of computer image6 from the database 2011-11-02 16:05:30|29777|11:11|reload|DataStructure.pm:get_computer_state_name(2336)|retrieved current state of computer image6 from the database: reloading 2011-11-02 16:05:30|29777|11:11|reload|DataStructure.pm:_automethod(834)|data structure updated: $self->request_data->{reservation}{11}{computer}{state}{name} |29777|11:11|reload| computer_state_name = reloading 2011-11-02 16:05:30|29777|11:11|reload|DataStructure.pm:get_computer_private_ip_address(1581)|attempting to retrieve private IP address for computer: image6 2011-11-02 16:05:30|29777|11:11|reload|DataStructure.pm:get_computer_private_ip_address(1585)|retrieved contents of /etc/hosts on this management node, contains 10 lines 2011-11-02 16:05:30|29777|11:11|reload|DataStructure.pm:get_computer_private_ip_address(1645)|returning IP address from /etc/hosts file: 10.0.0.9 2011-11-02 16:05:30|29777|11:11|reload|utils.pm:is_inblockrequest(6163)|zero rows were returned from database select 2011-11-02 16:05:30|29777|11:11|reload|DataStructure.pm:get_image_affiliation_name(2035)|image owner id: 1 2011-11-02 16:05:30|29777|11:11|reload|utils.pm:getnewdbh(2709)|database requested (information_schema) does not match handle stored in $ENV{dbh} (vcl2:localhost) 2011-11-02 16:05:30|29777|11:11|reload|utils.pm:getnewdbh(2760)|database handle stored in $ENV{dbh} 2011-11-02 16:05:30|29777|11:11|reload|DataStructure.pm:retrieve_user_data(1352)|attempting to retrieve and store data for user: user.id = '1' 2011-11-02 16:05:30|29777|11:11|reload|utils.pm:getnewdbh(2709)|database requested (vcl2) does not match handle stored in $ENV{dbh} (information_schema:localhost) 2011-11-02 16:05:30|29777|11:11|reload|utils.pm:getnewdbh(2760)|database handle stored in $ENV{dbh} 2011-11-02 16:05:30|29777|11:11|reload|DataStructure.pm:retrieve_user_data(1415)|data has been retrieved for user: admin (id: 1)