I have a host that is refusing to fetch work even though it does not have enough work to last until the next connection. (I have the network connection turned on for debugging.)
Quick analysis:

Computation times remaining:
  Edges 34:40, Edges 34:40, Edges 34:40, Edges 34:40,
  Drug discovery 38:33,
  EON 1:51:26, EON 1:51:54, EON 1:53:16, EON 1:58:40,
  SETI 3:20:30, WCG 8:22:15, CPDN 49:22:16

NOTE: all projects have a resource share of 100 except for CPDN, which has a resource share of 1000. The numbers were written down while the system was running, and the estimates for EON are falling rapidly.

It is a 4 CPU system. If the tasks are run in the worst packing into CPUs, it looks like:
  CPU0: 34:40 + 38:33 + 1:58:40 = 3:11:53
  CPU1: 34:40 + 1:51:26 + 3:20:30 = 5:46:36
  CPU2: 34:40 + 1:51:54 + 8:22:15 = 10:48:49
  CPU3: 34:40 + 1:53:16 + 49:22:26 = 51:50:22

Best packing is:
  CPU0: 49:22:26 = 49:22:26
  CPU1: 8:22:15 = 8:22:15
  CPU2: 3:20:30 + 1:51:26 + 38:33 + 34:40 + 34:40 = 6:59:49
  CPU3: 1:58:40 + 1:53:16 + 1:51:54 + 34:40 + 34:40 = 6:53:10

Worst case, one CPU goes idle 20:48:07 before the end of the specified connect interval (24 hours), and about 52.2 hours of CPU time are lost across all CPUs. Best case, one CPU goes idle 17:06:50 before the end of the connect interval, and about 49.7 hours of CPU time are lost.

Having any safety margin in the CPU time estimates for work fetch tends to waste CPU time, because the client believes it has more work than it really does. Not having any safety margin in the CPU time estimates for CPU scheduling will also tend to waste CPU time, due to late work and the resend of work that is then required. The two calculations probably need to be separated.
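For anyone who wants to recheck the packing arithmetic, here is a minimal Python sketch. It is not BOINC code. The helper names (parse, fmt, idle_times, total_lost) are made up for illustration, the 24 hour connect interval is taken from "work_buf min 86400" in the log, and the two packings are copied from the per-CPU lines above.

```python
from datetime import timedelta

# Assumption: the connect interval is the 86400 s work buffer shown in the log.
CONNECT_INTERVAL = timedelta(hours=24)


def parse(text):
    """Convert 'h:mm:ss' or 'mm:ss' into a timedelta."""
    parts = [int(p) for p in text.split(":")]
    while len(parts) < 3:
        parts.insert(0, 0)  # pad missing hours with zero
    hours, minutes, seconds = parts
    return timedelta(hours=hours, minutes=minutes, seconds=seconds)


def fmt(delta):
    """Format a timedelta as h:mm:ss, allowing more than 24 hours."""
    total = int(delta.total_seconds())
    return f"{total // 3600}:{total % 3600 // 60:02d}:{total % 60:02d}"


def idle_times(packing):
    """Return a (busy, idle) pair per CPU; idle is clamped at zero."""
    result = []
    for tasks in packing:
        busy = sum((parse(t) for t in tasks), timedelta())
        idle = max(CONNECT_INTERVAL - busy, timedelta())
        result.append((busy, idle))
    return result


def total_lost(packing):
    """Total CPU time left idle before the end of the connect interval."""
    return sum((idle for _, idle in idle_times(packing)), timedelta())


# Packings as written in the per-CPU lines. The CPDN task is 49:22:26 there
# and 49:22:16 in the task list; either way that CPU stays busy past 24 h.
WORST = [
    ["34:40", "38:33", "1:58:40"],
    ["34:40", "1:51:26", "3:20:30"],
    ["34:40", "1:51:54", "8:22:15"],
    ["34:40", "1:53:16", "49:22:26"],
]
BEST = [
    ["49:22:26"],
    ["8:22:15"],
    ["3:20:30", "1:51:26", "38:33", "34:40", "34:40"],
    ["1:58:40", "1:53:16", "1:51:54", "34:40", "34:40"],
]

if __name__ == "__main__":
    for name, packing in (("worst", WORST), ("best", BEST)):
        print(name)
        for cpu, (busy, idle) in enumerate(idle_times(packing)):
            print(f"  CPU{cpu}: busy {fmt(busy)}  idle {fmt(idle)}")
        print(f"  lost CPU time: {fmt(total_lost(packing))}")
```

Running it prints the busy and idle time per CPU for both packings, plus the total CPU time that would sit idle inside the connect interval. It only re-adds the numbers above; it does not try to model how the client actually schedules tasks.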
Log follows:

7608 9/15/2011 10:06:17 AM [work_fetch] work fetch start
7609 9/15/2011 10:06:17 AM [rr_sim] start: work_buf min 86400 additional 0 total 86400 on_frac 0.999 active_frac 1.000
7610 climateprediction.net 9/15/2011 10:06:17 AM [rr_sim] 0.00: starting hadcm3n_ya6r_1900_40_007346061_0 (1.00 CPU)
7611 SETI@home 9/15/2011 10:06:17 AM [rr_sim] 0.00: starting ap_16jl11ab_B0_P1_00228_20110912_21323.wu_0 (1.00 CPU)
7612 World Community Grid 9/15/2011 10:06:17 AM [rr_sim] 0.00: starting faah24549_ZINC02913842_x2IEN_wtHIV_00_0 (1.00 CPU)
7613 DrugDiscovery@Home 9/15/2011 10:06:17 AM [rr_sim] 0.00: starting autodock_ga_run_10_bt_1ijy_w_md1_Autodock.pdb_lig_25592_ChemDiv_8007-0_ts_1315489568464883000_3 (1.00 CPU)
7614 EDGeS@Home 9/15/2011 10:06:17 AM [rr_sim] 0.00: starting wu_3_fd888020-6cee-41a0-b77e-cf13e6776f1c_201109151520_0 (1.00 CPU)
7615 EDGeS@Home 9/15/2011 10:06:17 AM [rr_sim] 0.00: starting wu_3_f46cbdb5-dc92-4abf-aa57-e12bc2492682_201109151520_0 (1.00 CPU)
7616 EDGeS@Home 9/15/2011 10:06:17 AM [rr_sim] 0.00: starting wu_3_ca38a265-60b4-4466-a7db-02ac658b204a_201109151520_0 (1.00 CPU)
7617 EDGeS@Home 9/15/2011 10:06:17 AM [rr_sim] 0.00: starting wu_3_be9a739e-8523-4eb2-a6c1-b9a7f71f1ae6_201109151520_0 (1.00 CPU)
7618 eon2 9/15/2011 10:06:17 AM [rr_sim] 0.00: starting 1737419861_93706_0 (1.00 CPU)
7619 eon2 9/15/2011 10:06:17 AM [rr_sim] 0.00: starting 1737419861_93705_0 (1.00 CPU)
7620 eon2 9/15/2011 10:06:17 AM [rr_sim] 0.00: starting 1737419861_93690_0 (1.00 CPU)
7621 eon2 9/15/2011 10:06:17 AM [rr_sim] 0.00: starting 1737419861_93689_0 (1.00 CPU)
7622 DrugDiscovery@Home 9/15/2011 10:06:17 AM [rr_sim] 0.00: autodock_ga_run_10_bt_1ijy_w_md1_Autodock.pdb_lig_25592_ChemDiv_8007-0_ts_1315489568464883000_3 finishes after 14191.18 (5163.42G/0.36G)
7623 EDGeS@Home 9/15/2011 10:06:17 AM [rr_sim] 14191.18: wu_3_fd888020-6cee-41a0-b77e-cf13e6776f1c_201109151520_0 finishes after 35358.20 (3353.11G/0.09G)
7624 EDGeS@Home 9/15/2011 10:06:17 AM [rr_sim] 49549.38: wu_3_f46cbdb5-dc92-4abf-aa57-e12bc2492682_201109151520_0 finishes after 0.00 (0.00G/0.13G)
7625 EDGeS@Home 9/15/2011 10:06:17 AM [rr_sim] 49549.38: wu_3_ca38a265-60b4-4466-a7db-02ac658b204a_201109151520_0 finishes after 0.00 (0.00G/0.19G)
7626 EDGeS@Home 9/15/2011 10:06:17 AM [rr_sim] 49549.38: wu_3_be9a739e-8523-4eb2-a6c1-b9a7f71f1ae6_201109151520_0 finishes after 0.00 (0.00G/0.38G)
7627 SETI@home 9/15/2011 10:06:17 AM [rr_sim] 49549.38: ap_16jl11ab_B0_P1_00228_20110912_21323.wu_0 finishes after 14796.68 (35012.94G/2.37G)
7628 eon2 9/15/2011 10:06:17 AM [rr_sim] 64346.06: 1737419861_93705_0 finishes after 21918.36 (723.18G/0.03G)
7629 eon2 9/15/2011 10:06:17 AM [rr_sim] 1737419861_93705_0 misses deadline by 65795.60
7630 eon2 9/15/2011 10:06:17 AM [rr_sim] 86264.42: 1737419861_93689_0 finishes after 1477.12 (64.98G/0.04G)
7631 eon2 9/15/2011 10:06:17 AM [rr_sim] 1737419861_93689_0 misses deadline by 67272.71
7632 eon2 9/15/2011 10:06:17 AM [rr_sim] 87741.53: 1737419861_93690_0 finishes after 1951.98 (128.81G/0.07G)
7633 eon2 9/15/2011 10:06:17 AM [rr_sim] 1737419861_93690_0 misses deadline by 69224.69
7634 eon2 9/15/2011 10:06:17 AM [rr_sim] 89693.51: 1737419861_93706_0 finishes after 4653.55 (614.16G/0.13G)
7635 eon2 9/15/2011 10:06:17 AM [rr_sim] 1737419861_93706_0 misses deadline by 73878.24
7636 climateprediction.net 9/15/2011 10:06:17 AM [rr_sim] 94347.06: hadcm3n_ya6r_1900_40_007346061_0 finishes after 83643.69 (190314.24G/2.28G)
7637 World Community Grid 9/15/2011 10:06:17 AM [rr_sim] 177990.75: faah24549_ZINC02913842_x2IEN_wtHIV_00_0 finishes after 13743.08 (30627.33G/2.23G)
7638 9/15/2011 10:06:17 AM [work_fetch] ------- start work fetch state -------
7639 9/15/2011 10:06:17 AM [work_fetch] target work buffer: 86400.00 + 0.00 sec
7640 9/15/2011 10:06:17 AM [work_fetch] CPU: shortfall 0.00 nidle 0.00 saturated 89693.51 busy 3760.92 RS fetchable 9162.00 runnable 2450.00
7641 ABC@home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 80.09051 prio 0.00686 backoff dt 0.00 int 0.00
7642 wanless2 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.00 rec 0.00000 prio 0.01033 backoff dt 0.00 int 0.00 (comm deferred)
7643 AlmereGrid Boinc Grid 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.00 rec 0.00000 prio 0.00010 backoff dt 0.00 int 0.00
7644 rosetta@home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 71.13325 prio 0.00725 backoff dt 0.00 int 0.00
7645 DrugDiscovery@Home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 5.29681 prio 0.00959 backoff dt 0.00 int 600.00
7646 Poem@Home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 1189.39277 prio -0.04124 backoff dt 0.00 int 0.00
7647 Leiden Classical 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 149.09013 prio 0.00387 backoff dt 0.00 int 0.00
7648 Evo@Home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.00 rec 0.00000 prio 0.01033 backoff dt 0.00 int 0.00 (master fetch pending) (comm deferred)
7649 Collatz Conjecture 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 172.68379 prio 0.00285 backoff dt 0.00 int 0.00
7650 The Lattice Project 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 70.39430 prio 0.00728 backoff dt 0.00 int 0.00
7651 Biochemical Library 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 0.00000 prio 0.01033 backoff dt 0.00 int 600.00
7652 boincsimap 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 44.76316 prio 0.00839 backoff dt 0.00 int 600.00
7653 BURP 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.00 rec 0.00000 prio 0.00103 backoff dt 0.00 int 0.00
7654 CAS@home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 70.10303 prio 0.00729 backoff dt 0.00 int 0.00
7655 superlinkattechnion 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 0.00000 prio 0.01033 backoff dt 0.00 int 600.00
7656 climateprediction.net 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.22 rec 8642.65977 prio -0.20877 backoff dt 0.00 int 0.00
7657 CPDN Beta 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.22 rec 8294.10872 prio -0.15300 backoff dt 0.00 int 0.00
7658 SLinCA@Home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 208.02376 prio 0.00131 backoff dt 0.00 int 0.00
7659 DNETC@HOME 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 73.73280 prio 0.00714 backoff dt 0.00 int 0.00
7660 Docking 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 1251.59395 prio -0.04394 backoff dt 0.00 int 0.00
7661 Einstein@Home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 80.59482 prio 0.00684 backoff dt 0.00 int 0.00
7662 eon2 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 71.40509 prio 0.00606 backoff dt 0.00 int 0.00
7663 NFS@Home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 70.13783 prio 0.00729 backoff dt 0.00 int 0.00
7664 gerasim@home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 70.73829 prio 0.00727 backoff dt 0.00 int 0.00
7665 Goldbach's Conjecture Project 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.00 rec 0.00000 prio 0.00010 backoff dt 0.00 int 0.00
7666 EDGeS@Home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 69.09304 prio 0.00547 backoff dt 0.00 int 0.00
7667 BOINC alpha test 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.00 rec 0.00000 prio 0.00103 backoff dt 0.00 int 0.00 (master fetch pending) (comm deferred)
7668 LHC@home 1.0 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 0.00000 prio 0.01033 backoff dt 0.00 int 600.00
7669 Mersenne@home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 69.81570 prio 0.00731 backoff dt 0.00 int 0.00
7670 Milkyway@home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 73.07594 prio 0.00716 backoff dt 0.00 int 0.00
7671 MindModeling@Beta 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 0.00128 prio 0.01033 backoff dt 0.00 int 600.00
7672 orbit@home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.11 rec 0.00000 prio 0.10334 backoff dt 0.00 int 600.00
7673 Pirates@Home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.00 rec 0.00000 prio 0.00052 backoff dt 0.00 int 0.00 (comm deferred)
7674 QMC@HOME 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 82.55454 prio 0.00675 backoff dt 0.00 int 0.00
7675 ralph@home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 35.61421 prio 0.00879 backoff dt 0.00 int 600.00
7676 ibercivis 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 70.45988 prio 0.00728 backoff dt 0.00 int 0.00
7677 SETI@home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 103.50527 prio -0.00881 backoff dt 0.00 int 0.00
7678 SETI@home Beta Test 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 76.76016 prio 0.00701 backoff dt 0.00 int 0.00
7679 Spinhenge@home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 103.41247 prio 0.00585 backoff dt 0.00 int 0.00
7680 sudoku 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 72.83700 prio 0.00718 backoff dt 0.00 int 0.00
7681 SZTAKI Desktop Grid 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 72.84690 prio 0.00717 backoff dt 0.00 int 0.00
7682 Virtual Prairie 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.00 rec 0.00000 prio 0.01033 backoff dt 0.00 int 600.00 (comm deferred)
7683 chess960@home alpha 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.00 rec 0.00000 prio 0.01033 backoff dt 0.00 int 0.00 (comm deferred)
7684 Cosmology@Home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 116.42118 prio 0.00529 backoff dt 0.00 int 0.00
7685 DistributedDataMining 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 73.11039 prio 0.00716 backoff dt 0.00 int 0.00
7686 Enigma@Home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 75.08063 prio 0.00708 backoff dt 0.00 int 0.00
7687 Magnetism at home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 0.00000 prio 0.01033 backoff dt 0.00 int 600.00
7688 GPUGRID 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 0.00000 prio 0.01033 backoff dt 0.00 int 600.00
7689 malariacontrol.net 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 79.71588 prio 0.00688 backoff dt 0.00 int 0.00
7690 primaboinca 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 75.14229 prio 0.00708 backoff dt 0.00 int 0.00
7691 PrimeGrid 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 697.31231 prio -0.01991 backoff dt 0.00 int 0.00
7692 yoyo@home 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 82.20575 prio 0.00677 backoff dt 0.00 int 0.00
7693 RNA World 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 183.72140 prio 0.00237 backoff dt 0.00 int 0.00
7694 uFluids 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.00 rec 0.00000 prio 0.01033 backoff dt 0.00 int 0.00 (master fetch pending) (comm deferred)
7695 World Community Grid 9/15/2011 10:06:17 AM [work_fetch] CPU: fetch share 0.01 rec 231.34823 prio -0.01162 backoff dt 0.00 int 0.00
7696 ABC@home 9/15/2011 10:06:17 AM [work_fetch] REC 80.090507
7697 wanless2 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7698 AlmereGrid Boinc Grid 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7699 rosetta@home 9/15/2011 10:06:17 AM [work_fetch] REC 71.133250
7700 DrugDiscovery@Home 9/15/2011 10:06:17 AM [work_fetch] REC 5.296810
7701 Poem@Home 9/15/2011 10:06:17 AM [work_fetch] REC 1189.392775
7702 Leiden Classical 9/15/2011 10:06:17 AM [work_fetch] REC 149.090131
7703 Evo@Home 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7704 Collatz Conjecture 9/15/2011 10:06:17 AM [work_fetch] REC 172.683794
7705 The Lattice Project 9/15/2011 10:06:17 AM [work_fetch] REC 70.394296
7706 Biochemical Library 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7707 boincsimap 9/15/2011 10:06:17 AM [work_fetch] REC 44.763162
7708 BURP 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7709 CAS@home 9/15/2011 10:06:17 AM [work_fetch] REC 70.103033
7710 superlinkattechnion 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7711 climateprediction.net 9/15/2011 10:06:17 AM [work_fetch] REC 8642.659766
7712 CPDN Beta 9/15/2011 10:06:17 AM [work_fetch] REC 8294.108718
7713 SLinCA@Home 9/15/2011 10:06:17 AM [work_fetch] REC 208.023757
7714 DNETC@HOME 9/15/2011 10:06:17 AM [work_fetch] REC 73.732796
7715 Docking 9/15/2011 10:06:17 AM [work_fetch] REC 1251.593953
7716 Einstein@Home 9/15/2011 10:06:17 AM [work_fetch] REC 80.594819
7717 eon2 9/15/2011 10:06:17 AM [work_fetch] REC 71.405092
7718 NFS@Home 9/15/2011 10:06:17 AM [work_fetch] REC 70.137829
7719 gerasim@home 9/15/2011 10:06:17 AM [work_fetch] REC 70.738289
7720 Goldbach's Conjecture Project 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7721 EDGeS@Home 9/15/2011 10:06:17 AM [work_fetch] REC 69.093042
7722 BOINC alpha test 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7723 LHC@home 1.0 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7724 Mersenne@home 9/15/2011 10:06:17 AM [work_fetch] REC 69.815698
7725 Milkyway@home 9/15/2011 10:06:17 AM [work_fetch] REC 73.075941
7726 MindModeling@Beta 9/15/2011 10:06:17 AM [work_fetch] REC 0.001283
7727 orbit@home 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7728 Pirates@Home 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7729 QMC@HOME 9/15/2011 10:06:17 AM [work_fetch] REC 82.554544
7730 ralph@home 9/15/2011 10:06:17 AM [work_fetch] REC 35.614214
7731 ibercivis 9/15/2011 10:06:17 AM [work_fetch] REC 70.459879
7732 SETI@home 9/15/2011 10:06:17 AM [work_fetch] REC 103.505274
7733 SETI@home Beta Test 9/15/2011 10:06:17 AM [work_fetch] REC 76.760157
7734 Spinhenge@home 9/15/2011 10:06:17 AM [work_fetch] REC 103.412475
7735 sudoku 9/15/2011 10:06:17 AM [work_fetch] REC 72.837001
7736 SZTAKI Desktop Grid 9/15/2011 10:06:17 AM [work_fetch] REC 72.846900
7737 Virtual Prairie 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7738 chess960@home alpha 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7739 Cosmology@Home 9/15/2011 10:06:17 AM [work_fetch] REC 116.421185
7740 DistributedDataMining 9/15/2011 10:06:17 AM [work_fetch] REC 73.110390
7741 Enigma@Home 9/15/2011 10:06:17 AM [work_fetch] REC 75.080628
7742 Magnetism at home 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7743 GPUGRID 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7744 malariacontrol.net 9/15/2011 10:06:17 AM [work_fetch] REC 79.715880
7745 primaboinca 9/15/2011 10:06:17 AM [work_fetch] REC 75.142285
7746 PrimeGrid 9/15/2011 10:06:17 AM [work_fetch] REC 697.312315
7747 yoyo@home 9/15/2011 10:06:17 AM [work_fetch] REC 82.205747
7748 RNA World 9/15/2011 10:06:17 AM [work_fetch] REC 183.721396
7749 uFluids 9/15/2011 10:06:17 AM [work_fetch] REC 0.000000
7750 World Community Grid 9/15/2011 10:06:17 AM [work_fetch] REC 231.348231
7751 9/15/2011 10:06:17 AM [work_fetch] ------- end work fetch state -------
7752 9/15/2011 10:06:17 AM [work_fetch] No project chosen for work fetch
7753 FreeHAL@home 9/15/2011 10:06:19 AM [checkpoint] result fh_nci_0_38780211_59_0 checkpointed

jm7
