Looks like that did the trick. Thanks! --Travis
On Apr 10, 2011, at 11:19 PM, David Anderson wrote: > OK, I finally tracked this down. > Please update and try again. > -- David > > On 10-Apr-2011 2:06 PM, Travis Desell wrote: >> There's the following in our config.xml >> >> <max_wus_in_progress> >> 3 >> </max_wus_in_progress> >> <max_wus_in_progress_gpu> >> 3 >> </max_wus_in_progress_gpu> >> <max_ncpus> >> 16 >> </max_ncpus> >> <max_ngpus> >> 4 >> </max_ngpus> >> >> >> >> On Apr 10, 2011, at 1:05 PM, David Anderson wrote: >> >>> The line >>> >>> > 2011-04-09 18:25:36.2284 [PID=7716 ] [quota] GPU: base 3 scaled 0 >>> >>> says that there is no GPU limit. >>> Where do you specify a GPU limit? >>> -- David >>> >>> On 09-Apr-2011 3:31 PM, Travis Desell wrote: >>>> So it looks like something weird might be going on with the scheduler. It >>>> looks >>>> like it's just ignoring the GPU limit. >>>> >>>> >>>> 2011-04-09 18:25:36.2280 [PID=7716 ] Request: [USER#60490] [HOST#124171] >>>> [IP >>>> 65.41.108.102] client 6.10.58 >>>> 2011-04-09 18:25:36.2284 [PID=7716 ] [quota] [HOST#124171] [HAV#171] >>>> Resetting >>>> n_jobs_today >>>> 2011-04-09 18:25:36.2284 [PID=7716 ] [quota] [HOST#124171] [HAV#172] >>>> Resetting >>>> n_jobs_today >>>> 2011-04-09 18:25:36.2284 [PID=7716 ] [quota] max jobs per RPC: 200 >>>> 2011-04-09 18:25:36.2284 [PID=7716 ] [quota] Overall limit on jobs in >>>> progress: >>>> 2011-04-09 18:25:36.2284 [PID=7716 ] [quota] CPU: base 3 scaled 12 >>>> 2011-04-09 18:25:36.2284 [PID=7716 ] [quota] GPU: base 3 scaled 0 >>>> 2011-04-09 18:25:36.2284 [PID=7716 ] [send] Not using matchmaker >>>> scheduling; Not >>>> using EDF sim >>>> 2011-04-09 18:25:36.2284 [PID=7716 ] [send] CPU: req 0.00 sec, 0.00 >>>> instances; >>>> est delay 0.00 >>>> 2011-04-09 18:25:36.2284 [PID=7716 ] [send] ATI: req 288000.58 sec, 0.67 >>>> instances; est delay 0.00 >>>> 2011-04-09 18:25:36.2284 [PID=7716 ] [send] work_req_seconds: 0.00 secs >>>> 2011-04-09 18:25:36.2284 [PID=7716 ] [send] available disk 99.66 GB, >>>> work_buf_min 0 >>>> 2011-04-09 18:25:36.2285 [PID=7716 ] [send] active_frac 0.999381 on_frac >>>> 0.920469 >>>> 2011-04-09 18:25:36.2285 [PID=7716 ] [send] [AV#171] not reliable; cons >>>> valid >>>> 0 < 10 >>>> 2011-04-09 18:25:36.2285 [PID=7716 ] [send] set_trust: cons valid 0 < 10, >>>> don't >>>> use single replication >>>> 2011-04-09 18:25:36.2285 [PID=7716 ] [send] [AV#172] not reliable; cons >>>> valid >>>> 0 < 10 >>>> 2011-04-09 18:25:36.2285 [PID=7716 ] [send] set_trust: cons valid 0 < 10, >>>> don't >>>> use single replication >>>> 2011-04-09 18:25:36.2302 [PID=7716 ] [version] looking for version of >>>> milkyway >>>> 2011-04-09 18:25:36.2304 [PID=7716 ] [version] [AV#267] Skipping CPU >>>> version - >>>> user prefs say no CPUs >>>> 2011-04-09 18:25:36.2304 [PID=7716 ] [version] [AV#261] Skipping CPU >>>> version - >>>> user prefs say no CPUs >>>> 2011-04-09 18:25:36.2304 [PID=7716 ] [version] ati14 ATI app projected >>>> 1755.77G >>>> peak 0.00G 0.050 CPUs >>>> 2011-04-09 18:25:36.2304 [PID=7716 ] [version] [AV#290] (ati14) using >>>> unscaled >>>> projected flops: 1755.77G >>>> 2011-04-09 18:25:36.2304 [PID=7716 ] [version] Best version of app >>>> milkyway is >>>> [AV#290] (1755.77 GFLOPS) >>>> 2011-04-09 18:25:36.2304 [PID=7716 ] [send] est delay 0, skipping deadline >>>> check >>>> 2011-04-09 18:25:36.2315 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2323 [PID=7716 ] [send] est. duration for WU 9670: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2323 [PID=7716 ] [HOST#124171] Sending [RESULT#10902 >>>> de_separation_13_3s_free_1_6670_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2329 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2329 [PID=7716 ] [send] est. duration for WU 9671: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2329 [PID=7716 ] [send] [WU#9671] meets deadline: >>>> 18.33 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2335 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2339 [PID=7716 ] [send] est. duration for WU 9671: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2339 [PID=7716 ] [HOST#124171] Sending [RESULT#10903 >>>> de_separation_13_3s_free_1_6671_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2346 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2346 [PID=7716 ] [send] est. duration for WU 9672: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2346 [PID=7716 ] [send] [WU#9672] meets deadline: >>>> 36.66 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2351 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2355 [PID=7716 ] [send] est. duration for WU 9672: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2355 [PID=7716 ] [HOST#124171] Sending [RESULT#10904 >>>> de_separation_13_3s_free_1_6672_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2363 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2363 [PID=7716 ] [send] est. duration for WU 9673: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2363 [PID=7716 ] [send] [WU#9673] meets deadline: >>>> 54.99 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2370 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2374 [PID=7716 ] [send] est. duration for WU 9673: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2374 [PID=7716 ] [HOST#124171] Sending [RESULT#10905 >>>> de_separation_13_3s_free_1_6673_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2378 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2378 [PID=7716 ] [send] est. duration for WU 9674: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2378 [PID=7716 ] [send] [WU#9674] meets deadline: >>>> 73.32 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2384 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2389 [PID=7716 ] [send] est. duration for WU 9674: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2389 [PID=7716 ] [HOST#124171] Sending [RESULT#10906 >>>> de_separation_13_3s_free_1_6674_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2400 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2400 [PID=7716 ] [send] est. duration for WU 9675: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2400 [PID=7716 ] [send] [WU#9675] meets deadline: >>>> 91.65 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2408 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2411 [PID=7716 ] [send] est. duration for WU 9675: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2411 [PID=7716 ] [HOST#124171] Sending [RESULT#10907 >>>> de_separation_13_3s_free_1_6675_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2416 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2416 [PID=7716 ] [send] est. duration for WU 9676: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2416 [PID=7716 ] [send] [WU#9676] meets deadline: >>>> 109.98 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2421 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2425 [PID=7716 ] [send] est. duration for WU 9676: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2425 [PID=7716 ] [HOST#124171] Sending [RESULT#10908 >>>> de_separation_13_3s_free_1_6676_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2431 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2431 [PID=7716 ] [send] est. duration for WU 9660: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2432 [PID=7716 ] [send] [WU#9660] meets deadline: >>>> 128.30 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2438 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2442 [PID=7716 ] [send] est. duration for WU 9660: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2442 [PID=7716 ] [HOST#124171] Sending [RESULT#10892 >>>> de_separation_13_3s_free_1_6660_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2452 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2452 [PID=7716 ] [send] est. duration for WU 9661: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2452 [PID=7716 ] [send] [WU#9661] meets deadline: >>>> 146.63 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2460 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2483 [PID=7716 ] [send] est. duration for WU 9661: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2484 [PID=7716 ] [HOST#124171] Sending [RESULT#10893 >>>> de_separation_13_3s_free_1_6661_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2508 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2508 [PID=7716 ] [send] est. duration for WU 9662: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2508 [PID=7716 ] [send] [WU#9662] meets deadline: >>>> 164.96 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2517 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2524 [PID=7716 ] [send] est. duration for WU 9662: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2525 [PID=7716 ] [HOST#124171] Sending [RESULT#10894 >>>> de_separation_13_3s_free_1_6662_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2531 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2531 [PID=7716 ] [send] est. duration for WU 9663: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2531 [PID=7716 ] [send] [WU#9663] meets deadline: >>>> 183.29 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2538 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2542 [PID=7716 ] [send] est. duration for WU 9663: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2542 [PID=7716 ] [HOST#124171] Sending [RESULT#10895 >>>> de_separation_13_3s_free_1_6663_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2546 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2546 [PID=7716 ] [send] est. duration for WU 9639: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2546 [PID=7716 ] [send] [WU#9639] meets deadline: >>>> 201.62 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2551 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2555 [PID=7716 ] [send] est. duration for WU 9639: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2555 [PID=7716 ] [HOST#124171] Sending [RESULT#10871 >>>> de_separation_13_3s_free_1_6639_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2559 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2559 [PID=7716 ] [send] est. duration for WU 9640: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2559 [PID=7716 ] [send] [WU#9640] meets deadline: >>>> 219.95 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2564 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2568 [PID=7716 ] [send] est. duration for WU 9640: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2568 [PID=7716 ] [HOST#124171] Sending [RESULT#10872 >>>> de_separation_13_3s_free_1_6640_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2572 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2572 [PID=7716 ] [send] est. duration for WU 9641: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2572 [PID=7716 ] [send] [WU#9641] meets deadline: >>>> 238.28 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2578 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2582 [PID=7716 ] [send] est. duration for WU 9641: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2582 [PID=7716 ] [HOST#124171] Sending [RESULT#10873 >>>> de_separation_13_3s_free_1_6641_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2585 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2585 [PID=7716 ] [send] est. duration for WU 9642: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2585 [PID=7716 ] [send] [WU#9642] meets deadline: >>>> 256.61 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2590 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2594 [PID=7716 ] [send] est. duration for WU 9642: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2594 [PID=7716 ] [HOST#124171] Sending [RESULT#10874 >>>> de_separation_13_3s_free_1_6642_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2597 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2598 [PID=7716 ] [send] est. duration for WU 9643: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2598 [PID=7716 ] [send] [WU#9643] meets deadline: >>>> 274.94 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2603 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2606 [PID=7716 ] [send] est. duration for WU 9643: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2606 [PID=7716 ] [HOST#124171] Sending [RESULT#10875 >>>> de_separation_13_3s_free_1_6643_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2610 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2611 [PID=7716 ] [send] est. duration for WU 9644: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2611 [PID=7716 ] [send] [WU#9644] meets deadline: >>>> 293.27 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2615 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2625 [PID=7716 ] [send] est. duration for WU 9644: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2626 [PID=7716 ] [HOST#124171] Sending [RESULT#10876 >>>> de_separation_13_3s_free_1_6644_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2663 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2663 [PID=7716 ] [send] est. duration for WU 9645: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2663 [PID=7716 ] [send] [WU#9645] meets deadline: >>>> 311.60 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2673 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2677 [PID=7716 ] [send] est. duration for WU 9645: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2678 [PID=7716 ] [HOST#124171] Sending [RESULT#10877 >>>> de_separation_13_3s_free_1_6645_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2681 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2682 [PID=7716 ] [send] est. duration for WU 9646: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2682 [PID=7716 ] [send] [WU#9646] meets deadline: >>>> 329.93 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2688 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2692 [PID=7716 ] [send] est. duration for WU 9646: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2692 [PID=7716 ] [HOST#124171] Sending [RESULT#10878 >>>> de_separation_13_3s_free_1_6646_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2695 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2695 [PID=7716 ] [send] est. duration for WU 9647: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2696 [PID=7716 ] [send] [WU#9647] meets deadline: >>>> 348.25 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2701 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2704 [PID=7716 ] [send] est. duration for WU 9647: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2704 [PID=7716 ] [HOST#124171] Sending [RESULT#10879 >>>> de_separation_13_3s_free_1_6647_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2708 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2708 [PID=7716 ] [send] est. duration for WU 9648: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2708 [PID=7716 ] [send] [WU#9648] meets deadline: >>>> 366.58 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2713 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2716 [PID=7716 ] [send] est. duration for WU 9648: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2716 [PID=7716 ] [HOST#124171] Sending [RESULT#10880 >>>> de_separation_13_3s_free_1_6648_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2720 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2720 [PID=7716 ] [send] est. duration for WU 9649: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2720 [PID=7716 ] [send] [WU#9649] meets deadline: >>>> 384.91 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2725 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2728 [PID=7716 ] [send] est. duration for WU 9649: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2728 [PID=7716 ] [HOST#124171] Sending [RESULT#10881 >>>> de_separation_13_3s_free_1_6649_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2732 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2732 [PID=7716 ] [send] est. duration for WU 9650: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2732 [PID=7716 ] [send] [WU#9650] meets deadline: >>>> 403.24 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2739 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2742 [PID=7716 ] [send] est. duration for WU 9650: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2742 [PID=7716 ] [HOST#124171] Sending [RESULT#10882 >>>> de_separation_13_3s_free_1_6650_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2745 [PID=7716 ] [version] returning cached version: >>>> [AV#290] >>>> 2011-04-09 18:25:36.2745 [PID=7716 ] [send] est. duration for WU 9651: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2745 [PID=7716 ] [send] [WU#9651] meets deadline: >>>> 421.57 + >>>> 18.33 < 691200 >>>> 2011-04-09 18:25:36.2751 [PID=7716 ] [send] Sending app_version milkyway 1 >>>> 57 >>>> ati14; projected 1755.77 GFLOPS >>>> 2011-04-09 18:25:36.2754 [PID=7716 ] [send] est. duration for WU 9651: >>>> unscaled >>>> 16.86 scaled 18.33 >>>> 2011-04-09 18:25:36.2754 [PID=7716 ] [HOST#124171] Sending [RESULT#10883 >>>> de_separation_13_3s_free_1_6651_1302387007_0] (est. dur. 18.33 seconds) >>>> 2011-04-09 18:25:36.2763 [PID=7716 ] Sending reply to [HOST#124171]: 24 >>>> results, >>>> delay req 61.00 >>>> 2011-04-09 18:25:36.2767 [PID=7716 ] Scheduler ran 0.062 seconds >>>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> On Apr 8, 2011, at 4:07 AM, David Anderson wrote: >>>> >>>>> Are those results in fact in the DB? >>>>> -- David >>>>> >>>>> On 07-Apr-2011 11:39 PM, Travis Desell wrote: >>>>>> And also a lot of >>>>>> >>>>>> 2011-04-08 01:02:18.1660 [PID=88759] [CRITICAL] [HOST#233686] [RESULT#? >>>>>> de_separation_13_3s_fix20_1_53007_1302061461_0] reported result not in DB >>>>>> 2011-04-08 01:02:18.1660 [PID=88759] [CRITICAL] [HOST#233686] [RESULT#? >>>>>> de_separation_13_3s_fix20_1_53006_1302061461_0] reported result not in DB >>>>>> 2011-04-08 01:02:18.1660 [PID=88759] [CRITICAL] [HOST#233686] [RESULT#? >>>>>> de_separation_13_3s_fix20_1_53005_1302061461_0] reported result not in DB >>>>>> 2011-04-08 01:02:18.1660 [PID=88759] [CRITICAL] [HOST#233686] [RESULT#? >>>>>> de_separation_13_3s_fix20_1_53003_1302061461_0] reported result not in DB >>>>>> 2011-04-08 01:02:18.1661 [PID=88759] [CRITICAL] [HOST#233686] [RESULT#? >>>>>> de_separation_13_3s_fix20_1_53001_1302061461_0] reported result not in DB >>>>>> 2011-04-08 01:02:18.1661 [PID=88759] [CRITICAL] [HOST#233686] [RESULT#? >>>>>> de_separation_10_3s_fix20_1_52992_1302061461_0] reported result not in DB >>>>>> 2011-04-08 01:02:18.1661 [PID=88759] [CRITICAL] [HOST#233686] [RESULT#? >>>>>> de_separation_10_3s_fix20_1_52990_1302061461_0] reported result not in DB >>>>>> 2011-04-08 01:02:18.1661 [PID=88759] [CRITICAL] [HOST#233686] [RESULT#? >>>>>> de_separation_10_3s_fix20_1_52989_1302061461_0] reported result not in DB >>>>>> 2011-04-08 01:02:18.1661 [PID=88759] [CRITICAL] [HOST#233686] [RESULT#? >>>>>> de_separation_10_3s_fix20_1_52988_1302061461_0] reported result not in DB >>>>>> >>>>>> >>>>>> On Apr 8, 2011, at 2:12 AM, David Anderson wrote: >>>>>> >>>>>>> Putting <debug_quota/> in your config.xml >>>>>>> enables various log messages that may shed light on things. >>>>>>> -- David >>>>>>> >>>>>>> On 07-Apr-2011 10:56 PM, Travis Desell wrote: >>>>>>>> Any reason why a client could get significantly more tasks than they >>>>>>>> should >>>>>>>> be allowed? >>>>>>>> >>>>>>>> Ex.: >>>>>>>> >>>>>>>> http://milkyway.cs.rpi.edu/milkyway/show_host_detail.php?hostid=27103 >>>>>>>> http://milkyway.cs.rpi.edu/milkyway/show_host_detail.php?hostid=162154 >>>>>>>> >>>>>>>> With max_ncpus set to 32 and max_ngpus set to 8, and 6 >>>>>>>> max_workunits_in_progress, max_workunits_in_progress_gpu? >>>>>>>> >>>>>>>> >>>>>>>> ---------------------------------------------------------------------------------------------------------- >>>>>>>> Travis Desell<deselt @ cs.rpi.edu <http://cs.rpi.edu> >>>>>>>> <http://cs.rpi.edu> >>>>>>>> <http://cs.rpi.edu>> >>>>>>>> 1-518-867-1054 >>>>>>>> Adjunct Professor& Postdoctoral Research Assistant >>>>>>>> Rensselaer Polytechnic Institute, 110 8th Street, Troy NY 12180, USA >>>>>>>> http://www.cs.rpi.edu/~deselt/ >>>>>>>> MilkyWay@Home ( http://milkyway.cs.rpi.edu/ ) >>>>>>>> DNA@Home ( http://dnahome.cs.rpi.edu/ ) >>>>>>>> Worldwide Computing Laboratory ( http://wcl.cs.rpi.edu/ ) >>>>>>>> ---------------------------------------------------------------------------------------------------------- >>>>>>>> >>>>>>>> _______________________________________________ >>>>>>>> boinc_projects mailing list >>>>>>>> [email protected] >>>>>>>> <mailto:[email protected]> >>>>>>>> <mailto:[email protected]> >>>>>>>> <mailto:[email protected]> >>>>>>>> http://lists.ssl.berkeley.edu/mailman/listinfo/boinc_projects >>>>>>>> To unsubscribe, visit the above URL and >>>>>>>> (near bottom of page) enter your email address. >>>>>>> _______________________________________________ >>>>>>> boinc_projects mailing list >>>>>>> [email protected] <mailto:[email protected]> >>>>>>> <mailto:[email protected]> >>>>>>> <mailto:[email protected]> >>>>>>> http://lists.ssl.berkeley.edu/mailman/listinfo/boinc_projects >>>>>>> To unsubscribe, visit the above URL and >>>>>>> (near bottom of page) enter your email address. >>>>>> >>>>>> ---------------------------------------------------------------------------------------------------------- >>>>>> Travis Desell <deselt @ cs.rpi.edu <http://cs.rpi.edu> >>>>>> <http://cs.rpi.edu> >>>>>> <http://cs.rpi.edu/>> >>>>>> 1-518-867-1054 >>>>>> Adjunct Professor & Postdoctoral Research Assistant >>>>>> Rensselaer Polytechnic Institute, 110 8th Street, Troy NY 12180, USA >>>>>> http://www.cs.rpi.edu/~deselt/ >>>>>> MilkyWay@Home ( http://milkyway.cs.rpi.edu/ ) >>>>>> DNA@Home ( http://dnahome.cs.rpi.edu/ ) >>>>>> Worldwide Computing Laboratory ( http://wcl.cs.rpi.edu/ ) >>>>>> ---------------------------------------------------------------------------------------------------------- >>>>>> >>>> >>>> ---------------------------------------------------------------------------------------------------------- >>>> Travis Desell <deselt @ cs.rpi.edu <http://cs.rpi.edu> >>>> <http://cs.rpi.edu/>> >>>> 1-518-867-1054 >>>> Adjunct Professor & Postdoctoral Research Assistant >>>> Rensselaer Polytechnic Institute, 110 8th Street, Troy NY 12180, USA >>>> http://www.cs.rpi.edu/~deselt/ >>>> MilkyWay@Home ( http://milkyway.cs.rpi.edu/ ) >>>> DNA@Home ( http://dnahome.cs.rpi.edu/ ) >>>> Worldwide Computing Laboratory ( http://wcl.cs.rpi.edu/ ) >>>> ---------------------------------------------------------------------------------------------------------- >>>> >> >> ---------------------------------------------------------------------------------------------------------- >> Travis Desell <deselt @ cs.rpi.edu <http://cs.rpi.edu/>> 1-518-867-1054 >> Adjunct Professor & Postdoctoral Research Assistant >> Rensselaer Polytechnic Institute, 110 8th Street, Troy NY 12180, USA >> http://www.cs.rpi.edu/~deselt/ >> MilkyWay@Home ( http://milkyway.cs.rpi.edu/ ) >> DNA@Home ( http://dnahome.cs.rpi.edu/ ) >> Worldwide Computing Laboratory ( http://wcl.cs.rpi.edu/ ) >> ---------------------------------------------------------------------------------------------------------- >> ---------------------------------------------------------------------------------------------------------- Travis Desell <deselt @ cs.rpi.edu> 1-518-867-1054 Adjunct Professor & Postdoctoral Research Assistant Rensselaer Polytechnic Institute, 110 8th Street, Troy NY 12180, USA http://www.cs.rpi.edu/~deselt/ MilkyWay@Home ( http://milkyway.cs.rpi.edu/ ) DNA@Home ( http://dnahome.cs.rpi.edu/ ) Worldwide Computing Laboratory ( http://wcl.cs.rpi.edu/ ) ---------------------------------------------------------------------------------------------------------- _______________________________________________ boinc_dev mailing list [email protected] http://lists.ssl.berkeley.edu/mailman/listinfo/boinc_dev To unsubscribe, visit the above URL and (near bottom of page) enter your email address.
