> Date: Sat, 22 Dec 2012 17:50:55 +0300 > From: "Raistmer" <[email protected]> > Subject: Re: [boinc_dev] Unrecoverable 197 (0xc5) > EXIT_TIME_LIMIT_EXCEEDED... > To: "David Anderson" <[email protected]>, > <[email protected]>, "Josef W. Segur" <[email protected]> >> If there were a way for a user to reset the host averages for an >> individual app_version, that would be much better. Something like a >> "reset" button by each app_version on a host's application details page, >> with a confirm/cancel dialog explaining the action, might be suitable. >> (The "reset" would of course have to be only available to the account >> owner). > It will not fix design flaw IMHO. BOINC main design (and BOINC devs always > insisted on that) - to allow automatic operation. Such "way for user" would > be workaround, but not proper fix. BOINC should not kill task that makes > progress. Period. The single point when such task can be killed - when it > besides deadline already. Then kill and recompute estimate to avoid work > allocation, if needed. > If any BOINC estimates tell that task too slow - adapt estimates. If task > completion progress ticks - DON'T KILL task. That's quite simple. Not a good choice for the RNA World project, which has a forum thread devoted to asking for deadline extensions when their workunits are still running near their original deadlines and are expected to continue running past the deadlines (often a month or more past the original deadline). Currently, extensions are often granted on the server, but the client cannot see this change; the client can only keep running past the deadline, and let the client computer's user decide whether to kill the workunit.
If any versions of the BOINC client will, by default, kill workunits that reach their deadlines without completion, they also need to allow a way for the workunit to specify that this should not be done, or should not be done without first asking and receiving permission from the server. This method should be chosen so that clients too old to have a method of killing workunits at the deadline will simply ignore requests not to kill them. Also, the task completion ticks for the RNA World project are from a wrapper program that watches the programs doing the work; the longest program doing the work can easily run for months with no checkpoints and no signals to the wrapper program before it reaches the point of producing some output, and then it is almost finished. _______________________________________________ boinc_dev mailing list [email protected] http://lists.ssl.berkeley.edu/mailman/listinfo/boinc_dev To unsubscribe, visit the above URL and (near bottom of page) enter your email address.
