Re: Change image timeout
xCAT puts that in the site table which is on default xCAT 1.3 installations at /opt/xcat/etc/site On Mon, May 17, 2010 at 10:35 AM, Roger Herling roger.herl...@marist.eduwrote: Where do I set that? Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Andy Kurth andy_ku...@ncsu.edu To: vcl-dev@incubator.apache.org Date: 05/14/2010 10:03 AM Subject: Re: Change image timeout I'm assuming you're using xCAT 1.3. Would it be possible to configure xCAT to store the images in /opt/image/x86? The VCL xCAT code really needs the image path to contain a 3rd component because it determines where the images reside as follows: managementnode.installpath/OS.sourcepath/image.architecture If you can configure xCAT to use /opt/image/x86, then managementnode.installpath should be set to /opt and I believe the path should be constructed correctly. Hope this helps, Andy Roger Herling wrote: xCAT root path is /opt/xcat according to the vcld log but the images are actuially store in /opt/x86 Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Andy Kurth andy_ku...@ncsu.edu To: vcl-dev@incubator.apache.org Date: 05/11/2010 10:49 AM Subject: Re: Change image timeout I believe the problem is related to the Install Path setting for your management node: Management Nodes Edit Management Node Information Edit Based on the paths being used, I'm guessing yours is set to 'storage/x86'. Where does the partimage .gz file reside? If it actually resides in '/storage/image/x86' then change the Install Path setting to 'storage'. The .tmpl file residing in this directory may indicate another problem with the xCAT root path. Where is xCAT installed and which version of xCAT are you using? Search your log file for 'xCAT root path found:'. What path does this line contain? Regards, Andy Roger Herling wrote: [r...@vcl ~]# du -c /storage/x86/image/x86/winvista-Windows7FacStaff49-v0* 4 /storage/x86/image/x86/winvista-Windows7FacStaff49-v0.tmpl 4 total [r...@vcl ~]# ls -l /storage/x86/image/x86/winvista-Windows7FacStaff49-v0* -rw-r--r-- 1 root root 352 May 10 12:09 /storage/x86/image/x86/winvista-Windows7FacStaff49-v0.tmpl Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Andy Kurth andy_ku...@ncsu.edu To: vcl-dev@incubator.apache.org Date: 05/11/2010 09:06 AM Subject: Re: Change image timeout The command that is being executed to check the size should be: du -c /storage/x86/image/x86/winvista-Windows7FacStaff49-v0* Try running this. What output does it generate? The last line should look like: total The code appears to be finding the 'total' line but is either not parsing it correctly, not calculating the size correctly, or the du command isn't correct. Also, it would be helpful if you include the output from: ls -l /storage/x86/image/x86/winvista-Windows7FacStaff49-v0* -Andy Waldron, Michael H wrote: So you never answered whether you watched the capture occurring from the node console. Can you watch the console and see the capture running while monitoring the log that shows the capture file is size 0? What about any other error messages for the rest of the log capture for this imaging request? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 4:18 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout yup Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Waldron, Michael H mwald...@email.unc.edu To: 'vcl-dev@incubator.apache.org' vcl-dev@incubator.apache.org Date: 05/10/2010 04:17 PM Subject: RE: Change image timeout So you can do an ls -l /storage/x86/image/x86/winvista-Windows7FacStaff49-v0 and it shows some size? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 4:03 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout The image is definately being captured and I can force the image to be used after it fails by manually undeleting it from the mysql database Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Waldron, Michael H mwald...@email.unc.edu To: 'vcl-dev@incubator.apache.org' vcl-dev@incubator.apache.org Date: 05/10/2010 04:01 PM Subject: RE: Change image timeout
Re: Change image timeout
The command that is being executed to check the size should be: du -c /storage/x86/image/x86/winvista-Windows7FacStaff49-v0* Try running this. What output does it generate? The last line should look like: total The code appears to be finding the 'total' line but is either not parsing it correctly, not calculating the size correctly, or the du command isn't correct. Also, it would be helpful if you include the output from: ls -l /storage/x86/image/x86/winvista-Windows7FacStaff49-v0* -Andy Waldron, Michael H wrote: So you never answered whether you watched the capture occurring from the node console. Can you watch the console and see the capture running while monitoring the log that shows the capture file is size 0? What about any other error messages for the rest of the log capture for this imaging request? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 4:18 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout yup Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Waldron, Michael H mwald...@email.unc.edu To: 'vcl-dev@incubator.apache.org' vcl-dev@incubator.apache.org Date: 05/10/2010 04:17 PM Subject: RE: Change image timeout So you can do an ls -l /storage/x86/image/x86/winvista-Windows7FacStaff49-v0 and it shows some size? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 4:03 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout The image is definately being captured and I can force the image to be used after it fails by manually undeleting it from the mysql database Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Waldron, Michael H mwald...@email.unc.edu To: 'vcl-dev@incubator.apache.org' vcl-dev@incubator.apache.org Date: 05/10/2010 04:01 PM Subject: RE: Change image timeout From these lines in the log, it's indicating that the capture hasn't started: 12:31:01|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes) 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds Notice the image size is 0 MB, meaning the partimage save isn't occurring. After 20 minutes of nothing being saved, it gives up. Can you watch the capture from the server console to verify that the PXE boot is running and that it's booting into the image capture mode? Do the log entries at the beginning of the imaging process show any errors? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 3:25 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout 010-05-10 12:31:01|9225|144:178|image|xCAT.pm:_get_image_repository_path(3271)|returning repository path for vcl.is.marist.edu: /storage/x86/image/x86 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes) 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1239)|attempt 38/40: image copy not complete, sleeping for 30 seconds 2010-05-10 12:31:06|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:06 2010-05-10 12:31:11|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:11 2010-05-10 12:31:16|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:16 2010-05-10 12:31:21|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:21 2010-05-10 12:31:26|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:26 2010-05-10 12:31:31|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:31 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_nodeset_option(1697)|node4 in image image-x86-winvista-Windows7FacStaff49-v0-all state 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1246)|nodeset status for node4: image image-x86-winvista-Windows7FacStaff49-v0-all 2010-05-10 12:31:32|9225|144:178|image
Re: Change image timeout
[r...@vcl ~]# du -c /storage/x86/image/x86/winvista-Windows7FacStaff49-v0* 4 /storage/x86/image/x86/winvista-Windows7FacStaff49-v0.tmpl 4 total [r...@vcl ~]# ls -l /storage/x86/image/x86/winvista-Windows7FacStaff49-v0* -rw-r--r-- 1 root root 352 May 10 12:09 /storage/x86/image/x86/winvista-Windows7FacStaff49-v0.tmpl Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Andy Kurth andy_ku...@ncsu.edu To: vcl-dev@incubator.apache.org Date: 05/11/2010 09:06 AM Subject: Re: Change image timeout The command that is being executed to check the size should be: du -c /storage/x86/image/x86/winvista-Windows7FacStaff49-v0* Try running this. What output does it generate? The last line should look like: total The code appears to be finding the 'total' line but is either not parsing it correctly, not calculating the size correctly, or the du command isn't correct. Also, it would be helpful if you include the output from: ls -l /storage/x86/image/x86/winvista-Windows7FacStaff49-v0* -Andy Waldron, Michael H wrote: So you never answered whether you watched the capture occurring from the node console. Can you watch the console and see the capture running while monitoring the log that shows the capture file is size 0? What about any other error messages for the rest of the log capture for this imaging request? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 4:18 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout yup Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Waldron, Michael H mwald...@email.unc.edu To: 'vcl-dev@incubator.apache.org' vcl-dev@incubator.apache.org Date: 05/10/2010 04:17 PM Subject: RE: Change image timeout So you can do an ls -l /storage/x86/image/x86/winvista-Windows7FacStaff49-v0 and it shows some size? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 4:03 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout The image is definately being captured and I can force the image to be used after it fails by manually undeleting it from the mysql database Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Waldron, Michael H mwald...@email.unc.edu To: 'vcl-dev@incubator.apache.org' vcl-dev@incubator.apache.org Date: 05/10/2010 04:01 PM Subject: RE: Change image timeout From these lines in the log, it's indicating that the capture hasn't started: 12:31:01|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes) 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds Notice the image size is 0 MB, meaning the partimage save isn't occurring. After 20 minutes of nothing being saved, it gives up. Can you watch the capture from the server console to verify that the PXE boot is running and that it's booting into the image capture mode? Do the log entries at the beginning of the imaging process show any errors? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 3:25 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout 010-05-10 12:31:01|9225|144:178|image|xCAT.pm:_get_image_repository_path(3271)|returning repository path for vcl.is.marist.edu: /storage/x86/image/x86 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes) 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1239)|attempt 38/40: image copy not complete, sleeping for 30 seconds 2010-05-10 12:31:06|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:06 2010-05-10 12:31:11|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:11 2010-05-10 12:31:16|7448|vcld:main(165)|lastcheckin time updated for management node 1
Re: Change image timeout
I believe the problem is related to the Install Path setting for your management node: Management Nodes Edit Management Node Information Edit Based on the paths being used, I'm guessing yours is set to 'storage/x86'. Where does the partimage .gz file reside? If it actually resides in '/storage/image/x86' then change the Install Path setting to 'storage'. The .tmpl file residing in this directory may indicate another problem with the xCAT root path. Where is xCAT installed and which version of xCAT are you using? Search your log file for 'xCAT root path found:'. What path does this line contain? Regards, Andy Roger Herling wrote: [r...@vcl ~]# du -c /storage/x86/image/x86/winvista-Windows7FacStaff49-v0* 4 /storage/x86/image/x86/winvista-Windows7FacStaff49-v0.tmpl 4 total [r...@vcl ~]# ls -l /storage/x86/image/x86/winvista-Windows7FacStaff49-v0* -rw-r--r-- 1 root root 352 May 10 12:09 /storage/x86/image/x86/winvista-Windows7FacStaff49-v0.tmpl Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Andy Kurth andy_ku...@ncsu.edu To: vcl-dev@incubator.apache.org Date: 05/11/2010 09:06 AM Subject: Re: Change image timeout The command that is being executed to check the size should be: du -c /storage/x86/image/x86/winvista-Windows7FacStaff49-v0* Try running this. What output does it generate? The last line should look like: total The code appears to be finding the 'total' line but is either not parsing it correctly, not calculating the size correctly, or the du command isn't correct. Also, it would be helpful if you include the output from: ls -l /storage/x86/image/x86/winvista-Windows7FacStaff49-v0* -Andy Waldron, Michael H wrote: So you never answered whether you watched the capture occurring from the node console. Can you watch the console and see the capture running while monitoring the log that shows the capture file is size 0? What about any other error messages for the rest of the log capture for this imaging request? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 4:18 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout yup Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Waldron, Michael H mwald...@email.unc.edu To: 'vcl-dev@incubator.apache.org' vcl-dev@incubator.apache.org Date: 05/10/2010 04:17 PM Subject: RE: Change image timeout So you can do an ls -l /storage/x86/image/x86/winvista-Windows7FacStaff49-v0 and it shows some size? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 4:03 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout The image is definately being captured and I can force the image to be used after it fails by manually undeleting it from the mysql database Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Waldron, Michael H mwald...@email.unc.edu To: 'vcl-dev@incubator.apache.org' vcl-dev@incubator.apache.org Date: 05/10/2010 04:01 PM Subject: RE: Change image timeout From these lines in the log, it's indicating that the capture hasn't started: 12:31:01|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes) 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds Notice the image size is 0 MB, meaning the partimage save isn't occurring. After 20 minutes of nothing being saved, it gives up. Can you watch the capture from the server console to verify that the PXE boot is running and that it's booting into the image capture mode? Do the log entries at the beginning of the imaging process show any errors? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 3:25 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout 010-05-10 12:31:01|9225|144:178|image|xCAT.pm:_get_image_repository_path(3271)|returning repository path for vcl.is.marist.edu: /storage/x86/image/x86 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes) 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete 2010-05-10 12:31:01|9225|144:178
RE: Change image timeout
|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete |9225|144:178|image| WARNING |9225|144:178|image| 2010-05-10 12:32:02|9225|144:178|image|xCAT.pm:capture_monitor(1278)|image copy timed out, waited 20 minutes, nodeset status for node4 never changed to boot: image image-x86-winvista-Windows7FacStaff49-v0-all |9225|144:178|image| ( 0) utils.pm, notify (line: 737) |9225|144:178|image| (-1) xCAT.pm, capture_monitor (line: 1278) |9225|144:178|image| (-2) xCAT.pm, capture (line: 1183) |9225|144:178|image| (-3) image.pm, process (line: 162) |9225|144:178|image| (-4) vcld, make_new_child (line: 594) |9225|144:178|image| (-5) vcld, main (line: 341) |9225|144:178|image| WARNING |9225|144:178|image| 2010-05-10 12:32:02|9225|144:178|image|xCAT.pm:capture(1187)|problem occurred while monitoring image capture |9225|144:178|image| ( 0) utils.pm, notify (line: 737) |9225|144:178|image| (-1) xCAT.pm, capture (line: 1187) |9225|144:178|image| (-2) image.pm, process (line: 162) |9225|144:178|image| (-3) vcld, make_new_child (line: 594) |9225|144:178|image| (-4) vcld, main (line: 341) |9225|144:178|image| WARNING |9225|144:178|image| 2010-05-10 12:32:02|9225|144:178|image|image.pm:process(166)|winvista-Windows7FacStaff49-v0 image failed to be captured by provisioning module |9225|144:178|image| ( 0) utils.pm, notify (line: 737) |9225|144:178|image| (-1) image.pm, process (line: 166) |9225|144:178|image| (-2) vcld, make_new_child (line: 594) |9225|144:178|image| (-3) vcld, main (line: 341) 2010-05-10 12:32:04|9225|144:178|image|utils.pm:mail(1348)|SUCCESS -- Sending mail To: , PROBLEM -- image.pm |9225|144:178|image| CRITICAL |9225|144:178|image| 2010-05-10 12:32:02|9225|144:178|image|image.pm:reservation_failed(390)|winvista-Windows7FacStaff49-v0 image creation failed |9225|144:178|image| ( 0) utils.pm, notify (line: 737) |9225|144:178|image| (-1) image.pm, reservation_failed (line: 390) |9225|144:178|image| (-2) image.pm, process (line: 167) |9225|144:178|image| (-3) vcld, make_new_child (line: 594) |9225|144:178|image| (-4) vcld, main (line: 341) 2010-05-10 12:32:04|9225|144:178|image|utils.pm:mail(1348)|SUCCESS -- Sending mail To: r...@localhost, VCL -- NOTICE DELAY Image Creation Windows7FacStaff 2010-05-10 12:32:05|9225|144:178|image|utils.pm:mail(1348)|SUCCESS -- Sending mail To: , VCL -- NOTICE FAILED Image Creation Windows7FacStaff 2010-05-10 12:32:05|9225|144:178|image|utils.pm:update_request_state(2186)|request 144 state updated to: maintenance, laststate to: image 2010-05-10 12:32:05|9225|144:178|image|image.pm:reservation_failed(442)|request state set to maintenance, laststate to image 2010-05-10 12:32:05|9225|144:178|image|utils.pm:update_computer_state(2228)|computer 6 state updated to: maintenance 2010-05-10 12:32:05|9225|144:178|image|image.pm:reservation_failed(450)|node4 state set to maintenance 2010-05-10 12:32:05|9225|144:178|image|image.pm:reservation_failed(457)|exiting 2010-05-10 12:32:05|9225|144:178|image|State.pm:DESTROY(905)|destructor called, ref($self)=VCL::image 2010-05-10 12:32:05|9225|144:178|image|utils.pm:delete_computerloadlog_reservation(7551)|removing computerloadlog entries matching loadstate = begin 2010-05-10 12:32:05|9225|144:178|image|utils.pm:delete_computerloadlog_reservation(7598)|deleted rows from computerloadlog for reservation id=178 2010-05-10 12:32:05|9225|144:178|image|State.pm:DESTROY(912)|removed computerloadlog rows with loadstate=begin for reservation 2010-05-10 12:32:05|9225|144:178|image|State.pm:DESTROY(924)|number of database handles state process created: 1 2010-05-10 12:32:05|9225|144:178|image|State.pm:DESTROY(933)|process has a database handle stored in $ENV{dbh}, attempting disconnect 2010-05-10 12:32:05|9225|144:178|image|State.pm:DESTROY(935)|$ENV{dbh}: database disconnect successful 2010-05-10 12:32:05|9225|144:178|image|State.pm:DESTROY(949)|VCL::image process 9225 exiting 2010-05-10 12:32:05|7448|vcld:REAPER(744)|VCL process exited for reservation 178 Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Waldron, Michael H mwald...@email.unc.edu To: 'vcl-dev@incubator.apache.org' vcl-dev@incubator.apache.org Date: 05/10/2010 02:50 PM Subject: RE: Change image timeout Roger, By image uploads do you mean capturing a new image? From what I've observed with ours, as long as the image file size is increasing on the management node, it assumes the capture is still proceeding and continues to check. Do you have vcld.log output you can provide from when the problem occurs? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 11:28 AM To: vcl-dev@incubator.apache.org Subject: Change image timeout Where do I need to make
RE: Change image timeout
From these lines in the log, it's indicating that the capture hasn't started: 12:31:01|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes) 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds Notice the image size is 0 MB, meaning the partimage save isn't occurring. After 20 minutes of nothing being saved, it gives up. Can you watch the capture from the server console to verify that the PXE boot is running and that it's booting into the image capture mode? Do the log entries at the beginning of the imaging process show any errors? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 3:25 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout 010-05-10 12:31:01|9225|144:178|image|xCAT.pm:_get_image_repository_path(3271)|returning repository path for vcl.is.marist.edu: /storage/x86/image/x86 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes) 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1239)|attempt 38/40: image copy not complete, sleeping for 30 seconds 2010-05-10 12:31:06|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:06 2010-05-10 12:31:11|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:11 2010-05-10 12:31:16|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:16 2010-05-10 12:31:21|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:21 2010-05-10 12:31:26|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:26 2010-05-10 12:31:31|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:31 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_nodeset_option(1697)|node4 in image image-x86-winvista-Windows7FacStaff49-v0-all state 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1246)|nodeset status for node4: image image-x86-winvista-Windows7FacStaff49-v0-all 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1254)|checking size of winvista-Windows7FacStaff49-v0 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:get_image_size(3109)|getting size of image: winvista-Windows7FacStaff49-v0 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_get_image_repository_path(3175)|management node identifier argument was not specified 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_get_image_repository_path(3197)|attempting to determine repository path for image on vcl.is.marist.edu: |9225|144:178|image| image id: 49 |9225|144:178|image| OS name: winvista |9225|144:178|image| OS type: windows |9225|144:178|image| OS install type: partimage |9225|144:178|image| OS source path: image |9225|144:178|image| architecture: x86 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_get_image_repository_path(3271)|returning repository path for vcl.is.marist.edu: /storage/x86/image/x86 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes) 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1239)|attempt 39/40: image copy not complete, sleeping for 30 seconds 2010-05-10 12:31:36|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:36 2010-05-10 12:31:41|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:41 2010-05-10 12:31:46|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:46 2010-05-10 12:31:51|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:51 2010-05-10 12:31:56|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:56 2010-05-10 12:32:01|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:32:01 2010-05-10 12:32:02|9225|144:178|image|xCAT.pm:_nodeset_option(1697)|node4 in image image-x86-winvista-Windows7FacStaff49-v0-all state 2010-05-10 12:32:02|9225|144:178|image|xCAT.pm:capture_monitor(1246)|nodeset status for node4: image image-x86-winvista
RE: Change image timeout
The image is definately being captured and I can force the image to be used after it fails by manually undeleting it from the mysql database Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Waldron, Michael H mwald...@email.unc.edu To: 'vcl-dev@incubator.apache.org' vcl-dev@incubator.apache.org Date: 05/10/2010 04:01 PM Subject: RE: Change image timeout From these lines in the log, it's indicating that the capture hasn't started: 12:31:01|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes) 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds Notice the image size is 0 MB, meaning the partimage save isn't occurring. After 20 minutes of nothing being saved, it gives up. Can you watch the capture from the server console to verify that the PXE boot is running and that it's booting into the image capture mode? Do the log entries at the beginning of the imaging process show any errors? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 3:25 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout 010-05-10 12:31:01|9225|144:178|image|xCAT.pm:_get_image_repository_path(3271)|returning repository path for vcl.is.marist.edu: /storage/x86/image/x86 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes) 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1239)|attempt 38/40: image copy not complete, sleeping for 30 seconds 2010-05-10 12:31:06|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:06 2010-05-10 12:31:11|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:11 2010-05-10 12:31:16|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:16 2010-05-10 12:31:21|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:21 2010-05-10 12:31:26|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:26 2010-05-10 12:31:31|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:31 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_nodeset_option(1697)|node4 in image image-x86-winvista-Windows7FacStaff49-v0-all state 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1246)|nodeset status for node4: image image-x86-winvista-Windows7FacStaff49-v0-all 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1254)|checking size of winvista-Windows7FacStaff49-v0 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:get_image_size(3109)|getting size of image: winvista-Windows7FacStaff49-v0 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_get_image_repository_path(3175)|management node identifier argument was not specified 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_get_image_repository_path(3197)|attempting to determine repository path for image on vcl.is.marist.edu: |9225|144:178|image| image id: 49 |9225|144:178|image| OS name: winvista |9225|144:178|image| OS type: windows |9225|144:178|image| OS install type: partimage |9225|144:178|image| OS source path: image |9225|144:178|image| architecture: x86 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_get_image_repository_path(3271)|returning repository path for vcl.is.marist.edu: /storage/x86/image/x86 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes) 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1239)|attempt 39/40: image copy not complete, sleeping for 30 seconds 2010-05-10 12:31:36|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:36 2010-05-10 12:31:41|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:41 2010-05-10 12:31:46|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:46 2010-05-10 12:31:51|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:51 2010-05-10 12:31:56|7448|vcld:main(165)|lastcheckin time updated for management
RE: Change image timeout
So you never answered whether you watched the capture occurring from the node console. Can you watch the console and see the capture running while monitoring the log that shows the capture file is size 0? What about any other error messages for the rest of the log capture for this imaging request? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 4:18 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout yup Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Waldron, Michael H mwald...@email.unc.edu To: 'vcl-dev@incubator.apache.org' vcl-dev@incubator.apache.org Date: 05/10/2010 04:17 PM Subject: RE: Change image timeout So you can do an ls -l /storage/x86/image/x86/winvista-Windows7FacStaff49-v0 and it shows some size? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 4:03 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout The image is definately being captured and I can force the image to be used after it fails by manually undeleting it from the mysql database Roger Herling Sr. Desktop Administrator Marist College PH: 845-575-3347 From: Waldron, Michael H mwald...@email.unc.edu To: 'vcl-dev@incubator.apache.org' vcl-dev@incubator.apache.org Date: 05/10/2010 04:01 PM Subject: RE: Change image timeout From these lines in the log, it's indicating that the capture hasn't started: 12:31:01|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes) 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds Notice the image size is 0 MB, meaning the partimage save isn't occurring. After 20 minutes of nothing being saved, it gives up. Can you watch the capture from the server console to verify that the PXE boot is running and that it's booting into the image capture mode? Do the log entries at the beginning of the imaging process show any errors? Mike Waldron Systems Specialist ITS Research Computing University of North Carolina at Chapel Hill CB #3420, ITS Manning, Rm 2509 919-962-9778 -Original Message- From: Roger Herling [mailto:roger.herl...@marist.edu] Sent: Monday, May 10, 2010 3:25 PM To: vcl-dev@incubator.apache.org Subject: RE: Change image timeout 010-05-10 12:31:01|9225|144:178|image|xCAT.pm:_get_image_repository_path(3271)|returning repository path for vcl.is.marist.edu: /storage/x86/image/x86 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:get_image_size(3146)|returning image size: 0 MB (4 bytes) 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1258)|image size is the same: 0=0, copy may be complete 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1238)|image copy not complete, sleeping for 30 seconds 2010-05-10 12:31:01|9225|144:178|image|xCAT.pm:capture_monitor(1239)|attempt 38/40: image copy not complete, sleeping for 30 seconds 2010-05-10 12:31:06|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:06 2010-05-10 12:31:11|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:11 2010-05-10 12:31:16|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:16 2010-05-10 12:31:21|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:21 2010-05-10 12:31:26|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:26 2010-05-10 12:31:31|7448|vcld:main(165)|lastcheckin time updated for management node 1: 2010-05-10 12:31:31 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_nodeset_option(1697)|node4 in image image-x86-winvista-Windows7FacStaff49-v0-all state 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1246)|nodeset status for node4: image image-x86-winvista-Windows7FacStaff49-v0-all 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:capture_monitor(1254)|checking size of winvista-Windows7FacStaff49-v0 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:get_image_size(3109)|getting size of image: winvista-Windows7FacStaff49-v0 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_get_image_repository_path(3175)|management node identifier argument was not specified 2010-05-10 12:31:32|9225|144:178|image|xCAT.pm:_get_image_repository_path(3197)|attempting to determine repository path for image on vcl.is.marist.edu: |9225|144:178|image| image id: 49 |9225|144:178|image| OS