The image is definitely being captured, and I can force the image to be used after it fails by manually undeleting it in the MySQL database.

Roger Herling
Sr. Desktop Administrator
Marist College
PH: 845-575-3347

From: "Waldron, Michael H" <mwald...@email.unc.edu>
To: "'vcl-dev@incubator.apache.org'" <vcl-dev@incubator.apache.org>
Date: 05/10/2010 04:01 PM
Subject: RE: Change image timeout

From these lines in the log, it's indicating that the capture hasn't started:

12:31:01|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes)
2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete
2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds

Notice the image size is 0 MB, meaning the partimage save isn't occurring. After 20 minutes of nothing being saved, it gives up. Can you watch the capture from the server console to verify that the PXE boot is running and that it's booting into the image capture mode? Do the log entries at the beginning of the imaging process show any errors?

Mike Waldron
Systems Specialist
ITS Research Computing
University of North Carolina at Chapel Hill
CB #3420, ITS Manning, Rm 2509
919-962-9778

-----Original Message-----
From: Roger Herling [mailto:roger.herl...@marist.edu]
Sent: Monday, May 10, 2010 3:25 PM
To: vcl-dev@incubator.apache.org
Subject: RE: Change image timeout

2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:_get_image_repository_path(3271)|returning repository path for vcl.is.marist.edu: /storage/x86/image/x86
2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes)
2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete
2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds
2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1239)|attempt 38/40: image copy not complete, sleeping for 30 seconds
2010-05-10 12:31:06|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:06
2010-05-10 12:31:11|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:11
2010-05-10 12:31:16|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:16
2010-05-10 12:31:21|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:21
2010-05-10 12:31:26|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:26
2010-05-10 12:31:31|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:31
2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_nodeset_option(1697)|node4 in image image-x86-winvista-Windows7FacStaff49-v0-all state
2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1246)|nodeset status for node4: image image-x86-winvista-Windows7FacStaff49-v0-all
2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1254)|checking size of winvista-Windows7FacStaff49-v0
2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:get_image_size(3109)|getting size of image: winvista-Windows7FacStaff49-v0
2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_get_image_repository_path(3175)|management node identifier argument was not specified
2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_get_image_repository_path(3197)|attempting to determine repository path for image on vcl.is.marist.edu:
|9225|144:178|image| image id: 49
|9225|144:178|image| OS name: winvista
|9225|144:178|image| OS type: windows
|9225|144:178|image| OS install type: partimage
|9225|144:178|image| OS source path: image
|9225|144:178|image| architecture: x86
2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_get_image_repository_path(3271)|returning repository path for vcl.is.marist.edu: /storage/x86/image/x86
2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes)
2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete
2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds
2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1239)|attempt 39/40: image copy not complete, sleeping for 30 seconds
2010-05-10 12:31:36|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:36
2010-05-10 12:31:41|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:41
2010-05-10 12:31:46|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:46
2010-05-10 12:31:51|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:51
2010-05-10 12:31:56|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:56
2010-05-10 12:32:01|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:32:01
2010-05-10 12:32:02|9225|144:178|image|xCAT.pm:_nodeset_option(1697)|node4 in image image-x86-winvista-Windows7FacStaff49-v0-all state
2010-05-10 12:32:02|9225|144:178|image|xCAT.pm:capture_monitor(1246)|nodeset status for node4: image image-x86-winvista-Windows7FacStaff49-v0-all
2010-05-10 12:32:02|9225|144:178|image|xCAT.pm:capture_monitor(1254)|checking size of winvista-Windows7FacStaff49-v0
2010-05-10 12:32:02|9225|144:178|image|xCAT.pm:get_image_size(3109)|getting size of image: winvista-Windows7FacStaff49-v0
2010-05-10 12:32:02|9225|144:178|image|xCAT.pm:_get_image_repository_path(3175)|management node identifier argument was not specified
2010-05-10 12:32:02|9225|144:178|image|xCAT.pm:_get_image_repository_path(3197)|attempting to determine repository path for image on vcl.is.marist.edu:
|9225|144:178|image| image id: 49
|9225|144:178|image| OS name: winvista
|9225|144:178|image| OS type: windows
|9225|144:178|image| OS install type: partimage
|9225|144:178|image| OS source path: image
|9225|144:178|image| architecture: x86
2010-05-10 12:32:02|9225|144:178|image|xCAT.pm:_get_image_repository_path(3271)|returning repository path for vcl.is.marist.edu: /storage/x86/image/x86
2010-05-10 12:32:02|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes)
2010-05-10 12:32:02|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete
|9225|144:178|image| ---- WARNING ----
|9225|144:178|image| 2010-05-10 12:32:02|9225|144:178|image|xCAT.pm:capture_monitor(1278)|image copy timed out, waited 20 minutes, nodeset status for node4 never changed to boot: image image-x86-winvista-Windows7FacStaff49-v0-all
|9225|144:178|image| ( 0) utils.pm, notify (line: 737)
|9225|144:178|image| (-1) xCAT.pm, capture_monitor (line: 1278)
|9225|144:178|image| (-2) xCAT.pm, capture (line: 1183)
|9225|144:178|image| (-3) image.pm, process (line: 162)
|9225|144:178|image| (-4) vcld, make_new_child (line: 594)
|9225|144:178|image| (-5) vcld, main (line: 341)
|9225|144:178|image| ---- WARNING ----
|9225|144:178|image| 2010-05-10 12:32:02|9225|144:178|image|xCAT.pm:capture(1187)|problem occurred while monitoring image capture
|9225|144:178|image| ( 0) utils.pm, notify (line: 737)
|9225|144:178|image| (-1) xCAT.pm, capture (line: 1187)
|9225|144:178|image| (-2) image.pm, process (line: 162)
|9225|144:178|image| (-3) vcld, make_new_child (line: 594)
|9225|144:178|image| (-4) vcld, main (line: 341)
|9225|144:178|image| ---- WARNING ----
|9225|144:178|image| 2010-05-10 12:32:02|9225|144:178|image|image.pm:process(166)|winvista-Windows7FacStaff49-v0 image failed to be captured by provisioning module
|9225|144:178|image| ( 0) utils.pm, notify (line: 737)
|9225|144:178|image| (-1) image.pm, process (line: 166)
|9225|144:178|image| (-2) vcld, make_new_child (line: 594)
|9225|144:178|image| (-3) vcld, main (line: 341)
2010-05-10 12:32:04|9225|144:178|image|utils.pm:mail(1348)|SUCCESS -- Sending mail To: , PROBLEM -- image.pm
|9225|144:178|image| ---- CRITICAL ----
|9225|144:178|image| 2010-05-10 12:32:02|9225|144:178|image|image.pm:reservation_failed(390)|winvista-Windows7FacStaff49-v0 image creation failed
|9225|144:178|image| ( 0) utils.pm, notify (line: 737)
|9225|144:178|image| (-1) image.pm, reservation_failed (line: 390)
|9225|144:178|image| (-2) image.pm, process (line: 167)
|9225|144:178|image| (-3) vcld, make_new_child (line: 594)
|9225|144:178|image| (-4) vcld, main (line: 341)
2010-05-10 12:32:04|9225|144:178|image|utils.pm:mail(1348)|SUCCESS -- Sending mail To: r...@localhost, VCL -- NOTICE DELAY Image Creation Windows7FacStaff
2010-05-10 12:32:05|9225|144:178|image|utils.pm:mail(1348)|SUCCESS -- Sending mail To: , VCL -- NOTICE FAILED Image Creation Windows7FacStaff
2010-05-10 12:32:05|9225|144:178|image|utils.pm:update_request_state(2186)|request 144 state updated to: maintenance, laststate to: image
2010-05-10 12:32:05|9225|144:178|image|image.pm:reservation_failed(442)|request state set to maintenance, laststate to image
2010-05-10 12:32:05|9225|144:178|image|utils.pm:update_computer_state(2228)|computer 6 state updated to: maintenance
2010-05-10 12:32:05|9225|144:178|image|image.pm:reservation_failed(450)|node4 state set to maintenance
2010-05-10 12:32:05|9225|144:178|image|image.pm:reservation_failed(457)|exiting
2010-05-10 12:32:05|9225|144:178|image|State.pm:DESTROY(905)|destructor called, ref($self)=VCL::image
2010-05-10 12:32:05|9225|144:178|image|utils.pm:delete_computerloadlog_reservation(7551)|removing computerloadlog entries matching loadstate = begin
2010-05-10 12:32:05|9225|144:178|image|utils.pm:delete_computerloadlog_reservation(7598)|deleted rows from computerloadlog for reservation id=178
2010-05-10 12:32:05|9225|144:178|image|State.pm:DESTROY(912)|removed computerloadlog rows with loadstate=begin for reservation
2010-05-10 12:32:05|9225|144:178|image|State.pm:DESTROY(924)|number of database handles state process created: 1
2010-05-10 12:32:05|9225|144:178|image|State.pm:DESTROY(933)|process has a database handle stored in $ENV{dbh}, attempting disconnect
2010-05-10 12:32:05|9225|144:178|image|State.pm:DESTROY(935)|$ENV{dbh}: database disconnect successful
2010-05-10 12:32:05|9225|144:178|image|State.pm:DESTROY(949)|VCL::image process 9225 exiting
2010-05-10 12:32:05|7448|vcld:REAPER(744)|VCL process exited for reservation 178

Roger Herling
Sr. Desktop Administrator
Marist College
PH: 845-575-3347

From: "Waldron, Michael H" <mwald...@email.unc.edu>
To: "'vcl-dev@incubator.apache.org'" <vcl-dev@incubator.apache.org>
Date: 05/10/2010 02:50 PM
Subject: RE: Change image timeout

Roger,

By image uploads, do you mean capturing a new image? From what I've observed with ours, as long as the image file size is increasing on the management node, it assumes the capture is still proceeding and continues to check. Do you have vcld.log output you can provide from when the problem occurs?

Mike Waldron
Systems Specialist
ITS Research Computing
University of North Carolina at Chapel Hill
CB #3420, ITS Manning, Rm 2509
919-962-9778

-----Original Message-----
From: Roger Herling [mailto:roger.herl...@marist.edu]
Sent: Monday, May 10, 2010 11:28 AM
To: vcl-dev@incubator.apache.org
Subject: Change image timeout

Where do I need to make changes to the scripts to increase the image capture wait time, restore time, etc.? I am having issues with image uploads taking longer than allowed.

Roger Herling
Sr. Desktop Administrator
Marist College
PH: 845-575-3347
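
Editor's note for readers of this thread: the log messages above suggest a polling loop of roughly the following shape. This is only a sketch in Python, not the Perl code in xCAT.pm. The function and parameter names are hypothetical. The 40 attempts and the 30-second sleep are taken from the "attempt 38/40" and "sleeping for 30 seconds" log lines, and finishing when the nodeset status becomes "boot" is inferred from the timeout warning. Resetting the attempt counter while the image size grows is an assumption based on Mike's description, not something the log confirms.

```python
from typing import Callable


def monitor_capture(
    get_image_size: Callable[[], int],       # hypothetical: image size in bytes
    get_nodeset_status: Callable[[], str],   # hypothetical: e.g. "image" or "boot"
    sleep: Callable[[float], None],          # injected so the loop can be tested
    max_attempts: int = 40,                  # from "attempt 38/40" in the log
    interval_seconds: int = 30,              # from "sleeping for 30 seconds"
) -> bool:
    """Return True once the node reports "boot", False if the wait times out."""
    last_size = get_image_size()
    attempt = 0
    while attempt < max_attempts:
        attempt += 1
        status = get_nodeset_status()
        size = get_image_size()
        if status == "boot":
            # The node has left image mode, so the capture is treated as done.
            return True
        if size > last_size:
            # Assumption: a growing image means the capture is still running,
            # so the timeout window starts over.
            attempt = 0
        last_size = size
        sleep(interval_seconds)
    # With the defaults and no growth: 40 attempts x 30 s = 20 minutes.
    return False
```

With a size that never grows and a status that stays "image", as in Roger's log, this sketch sleeps 40 times for 30 seconds each and then gives up, which matches the "waited 20 minutes" warning.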