Get the list of open files (ls -l /proc/$(pidof boinc)/fd) when that happens. Does the client die after that last fopen() failure? Maybe you could write a script to log the open file list every few minutes.
-- Nicolás 2013/5/16 "Steffen Möller" <[email protected]>: > Dear all, > > every few months I get an error like the one below (taken from the > stdoutdae.txt) the report too many open files. This is see for about three > years on several Linux machines, I only recall such with many cores (12 or > 24), though, Opterons and Xeons alike. Is anything jumping at you where to > look? > > Cheers, > > Steffen > > 16-May-2013 16:58:33 [World Community Grid] Sending scheduler request: To > fetch work. > 16-May-2013 16:58:33 [World Community Grid] Requesting new tasks for CPU > 16-May-2013 16:58:36 [World Community Grid] Scheduler request completed: got > 0 new tasks > 16-May-2013 16:58:36 [World Community Grid] No tasks sent > 16-May-2013 16:58:36 [World Community Grid] No tasks are available for The > Clean Energy Project - Phase 2 > 16-May-2013 16:58:36 [World Community Grid] No tasks are available for the > applications you have selected. > 16-May-2013 16:58:42 [Einstein@Home] Sending scheduler request: To fetch work. > 16-May-2013 16:58:42 [Einstein@Home] Reporting 4 completed tasks > 16-May-2013 16:58:42 [Einstein@Home] Requesting new tasks for CPU > 16-May-2013 16:58:46 [Einstein@Home] Scheduler request completed: got 1 new > tasks > 16-May-2013 17:15:53 [Einstein@Home] Sending scheduler request: To fetch work. > 16-May-2013 17:15:53 [Einstein@Home] Requesting new tasks for CPU > 16-May-2013 17:15:56 [Einstein@Home] Scheduler request completed: got 1 new > tasks > 16-May-2013 17:30:11 [World Community Grid] Can't get task disk usage: > opendir() failed > 16-May-2013 17:30:11 [Einstein@Home] Can't get task disk usage: opendir() > failed > 16-May-2013 17:30:11 [Einstein@Home] Can't get task disk usage: opendir() > failed > 16-May-2013 17:30:11 [Einstein@Home] Can't get task disk usage: opendir() > failed > 16-May-2013 17:30:11 [Einstein@Home] Can't get task disk usage: opendir() > failed > 16-May-2013 17:30:11 [Einstein@Home] Can't get task disk usage: opendir() > failed > 16-May-2013 17:30:11 [Einstein@Home] Can't get task disk usage: opendir() > failed > 16-May-2013 17:30:11 [Einstein@Home] Can't get task disk usage: opendir() > failed > 16-May-2013 17:30:11 [Einstein@Home] Can't get task disk usage: opendir() > failed > 16-May-2013 17:32:31 [Einstein@Home] read_stderr_file(): malloc() failed > 16-May-2013 17:32:31 [Einstein@Home] Computation for task > LATeah0024U_80.0_500_-4.66e-10_1 finished > 16-May-2013 17:32:31 [Einstein@Home] md5_file failed for > projects/einstein.phys.uwm.edu/einstein_S6BucketLVE_1.04_i686-pc-linux-gnu__SSE2: > fopen() failed > 16-May-2013 17:32:31 [---] Can't open client_state_next.xml: fopen() failed > 16-May-2013 17:32:31 [---] Couldn't write state file: fopen() failed; giving > up _______________________________________________ boinc_dev mailing list [email protected] http://lists.ssl.berkeley.edu/mailman/listinfo/boinc_dev To unsubscribe, visit the above URL and (near bottom of page) enter your email address.
