On 27 September 2017 at 22:49, Juha Sointusalo <juha.sointus...@gmail.com>
wrote:

> I installed 7.8.2 a few minutes ago to make sure I can see the bug with
> that version. It will take about one hour to get results.
>

Yep, it's the bug in 7.8.2.

A Rosetta task exits and client tries to clean the slot directory:

27-Sep-2017 22:50:15 [---] [slot] cleaning out slots/5: handle_exited_app()
27-Sep-2017 22:50:15 [---] [slot] removed file
slots/5/boinc_checkpoint_count.txt
...
27-Sep-2017 22:50:17 [---] [slot] failed to remove file
slots/5/stderrgfx.txt: Error 32
27-Sep-2017 22:50:17 [---] [slot] removed file slots/5/stdout.txt
27-Sep-2017 22:50:17 [---] [slot] removed file slots/5/t000_.200.3mers.gz
27-Sep-2017 22:50:17 [---] [slot] removed file slots/5/t000_.200.9mers.gz
27-Sep-2017 22:50:17 [---] [slot] removed file slots/5/t000_.fasta
27-Sep-2017 22:50:17 [Rosetta@home] Computation for task
rb_09_27_71535_120475__t000__ab_robetta_IGNORE_THE_REST_516294_956_0
finished

Starting a new task in slot 5 even though it wasn't clean:

27-Sep-2017 22:50:17 [Rosetta@home] [slot] assigning slot 5 to
tj_9_24_juncA_X_1na0A_DHR54_l2_t3_t3_fragments_abinitio_SAVE_ALL_OUT_515776_399_0
27-Sep-2017 22:50:19 [---] [slot] removed file slots/5/init_data.xml
27-Sep-2017 22:50:19 [Rosetta@home] setup_file:
projects/boinc.bakerlab.org_rosetta/minirosetta_3.73_windows_x86_64.exe
(input)
...
27-Sep-2017 22:50:19 [Rosetta@home] [cpu_sched] Restarting task
tj_9_24_juncA_X_1na0A_DHR54_l2_t3_t3_fragments_abinitio_SAVE_ALL_OUT_515776_399_0
using minirosetta version 373 in slot 5
27-Sep-2017 22:50:20 [Rosetta@home] [sched_op] Reason: Unrecoverable error
for task
tj_9_24_juncA_X_1na0A_DHR54_l2_t3_t3_fragments_abinitio_SAVE_ALL_OUT_515776_399_0
27-Sep-2017 22:50:20 [---] [slot] cleaning out slots/5: handle_exited_app()
27-Sep-2017 22:50:20 [---] [slot] removed file slots/5/boinc_lockfile
27-Sep-2017 22:50:20 [---] [slot] removed file slots/5/default.out.gz
27-Sep-2017 22:50:20 [---] [slot] removed file slots/5/graphics_app
27-Sep-2017 22:50:20 [---] [slot] removed file slots/5/Helvetica.txf
27-Sep-2017 22:50:20 [---] [slot] removed file slots/5/init_data.xml
27-Sep-2017 22:50:20 [---] [slot] removed file
slots/5/minirosetta_3.73_windows_x86_64.exe
27-Sep-2017 22:50:20 [---] [slot] removed file
slots/5/minirosetta_database.zip
27-Sep-2017 22:50:20 [---] [slot] removed file slots/5/stderr.txt
27-Sep-2017 22:50:20 [---] [slot] failed to remove file
slots/5/stderrgfx.txt: Error 32
27-Sep-2017 22:50:20 [---] [slot] removed file slots/5/stdout.txt
27-Sep-2017 22:50:20 [---] [slot] removed file
slots/5/tj_9_24_juncA_X_1na0A_DHR54_l2_t3_t3_fragments_fold_data.zip
27-Sep-2017 22:50:20 [Rosetta@home] Computation for task
tj_9_24_juncA_X_1na0A_DHR54_l2_t3_t3_fragments_abinitio_SAVE_ALL_OUT_515776_399_0
finished
27-Sep-2017 22:50:20 [Rosetta@home] Output file
tj_9_24_juncA_X_1na0A_DHR54_l2_t3_t3_fragments_abinitio_SAVE_ALL_OUT_515776_399_0_r2138012187_0
for task
tj_9_24_juncA_X_1na0A_DHR54_l2_t3_t3_fragments_abinitio_SAVE_ALL_OUT_515776_399_0
absent

Starting one more task, still using slot 5:

27-Sep-2017 22:50:20 [Rosetta@home] [slot] assigning slot 5 to
ProBinder_S_15_fragments_fold_SAVE_ALL_OUT_516202_914_0
27-Sep-2017 22:50:22 [---] [slot] removed file slots/5/init_data.xml
27-Sep-2017 22:50:22 [Rosetta@home] setup_file:
projects/boinc.bakerlab.org_rosetta/minirosetta_3.73_windows_intelx86.exe
(input)
...
27-Sep-2017 22:50:22 [Rosetta@home] [cpu_sched] Restarting task
ProBinder_S_15_fragments_fold_SAVE_ALL_OUT_516202_914_0 using minirosetta
version 373 in slot 5
27-Sep-2017 22:50:23 [Rosetta@home] [sched_op] Deferring communication for
00:03:49
27-Sep-2017 22:50:23 [Rosetta@home] [sched_op] Reason: Unrecoverable error
for task ProBinder_S_15_fragments_fold_SAVE_ALL_OUT_516202_914_0
27-Sep-2017 22:50:23 [---] [slot] cleaning out slots/5: handle_exited_app()
27-Sep-2017 22:50:23 [---] [slot] removed file slots/5/boinc_lockfile
27-Sep-2017 22:50:23 [---] [slot] removed file slots/5/default.out.gz
27-Sep-2017 22:50:23 [---] [slot] removed file slots/5/graphics_app
27-Sep-2017 22:50:23 [---] [slot] removed file slots/5/Helvetica.txf
27-Sep-2017 22:50:23 [---] [slot] removed file slots/5/init_data.xml
27-Sep-2017 22:50:23 [---] [slot] removed file
slots/5/minirosetta_3.73_windows_intelx86.exe
27-Sep-2017 22:50:23 [---] [slot] removed file
slots/5/minirosetta_database.zip
27-Sep-2017 22:50:23 [---] [slot] removed file
slots/5/ProBinder_S_15_fragments_data.zip
27-Sep-2017 22:50:23 [---] [slot] removed file slots/5/stderr.txt
27-Sep-2017 22:50:23 [---] [slot] failed to remove file
slots/5/stderrgfx.txt: Error 32
27-Sep-2017 22:50:23 [---] [slot] removed file slots/5/stdout.txt
27-Sep-2017 22:50:23 [Rosetta@home] Computation for task
ProBinder_S_15_fragments_fold_SAVE_ALL_OUT_516202_914_0 finished
27-Sep-2017 22:50:23 [Rosetta@home] Output file
ProBinder_S_15_fragments_fold_SAVE_ALL_OUT_516202_914_0_r907963939_0 for
task ProBinder_S_15_fragments_fold_SAVE_ALL_OUT_516202_914_0 absent

Fortunately that's all I had cached.

In working client this is how it looks like:

A Rosetta task exits and client tries to clean the slot:

27-Sep-2017 20:53:01 [---] [slot] cleaning out slots/5: handle_exited_app()
27-Sep-2017 20:53:01 [---] [slot] removed file slots/5/00001.200.3mers
27-Sep-2017 20:53:01 [---] [slot] removed file slots/5/00001.200.9mers
27-Sep-2017 20:53:01 [---] [slot] removed file slots/5/00001.pdb
27-Sep-2017 20:53:01 [---] [slot] removed file
slots/5/boinc_checkpoint_count.txt
27-Sep-2017 20:53:01 [---] [slot] removed file slots/5/boinc_finish_called
...
27-Sep-2017 20:53:03 [---] [slot] removed file slots/5/stderr.txt
27-Sep-2017 20:53:03 [---] [slot] failed to remove file
slots/5/stderrgfx.txt: Error 32
27-Sep-2017 20:53:03 [---] [slot] removed file slots/5/stdout.txt
27-Sep-2017 20:53:03 [Rosetta@home] Computation for task
ProBinder_S_526_fragments_fold_SAVE_ALL_OUT_516241_807_0 finished

The client goes on to start a new task. Slot 5 is the first unused:

27-Sep-2017 20:53:03 [---] [slot] cleaning out slots/5: get_free_slot()
27-Sep-2017 20:53:03 [---] [slot] failed to remove file
slots/5/stderrgfx.txt: Error 32
27-Sep-2017 20:53:03 [Rosetta@home] [slot] failed to clean out dir:
unlink() failed

The client couldn't clean slot 5 and tries the next free slot which in this
case is slot 7:

27-Sep-2017 20:53:03 [---] [slot] cleaning out slots/7: get_free_slot()
27-Sep-2017 20:53:03 [Rosetta@home] [slot] assigning slot 7 to
rb_09_27_71504_120474__t000__ab_robetta_IGNORE_THE_REST_516293_2819_1
27-Sep-2017 20:53:04 [---] [slot] removed file slots/7/init_data.xml
27-Sep-2017 20:53:04 [Rosetta@home] setup_file:
projects/boinc.bakerlab.org_rosetta/minirosetta_3.73_windows_x86_64.exe
(input)
...
27-Sep-2017 20:53:04 [Rosetta@home] Starting task
rb_09_27_71504_120474__t000__ab_robetta_IGNORE_THE_REST_516293_2819_1
27-Sep-2017 20:53:04 [Rosetta@home] [cpu_sched] Starting task
rb_09_27_71504_120474__t000__ab_robetta_IGNORE_THE_REST_516293_2819_1 using
minirosetta version 373 in slot 7

And everything works right.

-Juha
_______________________________________________
boinc_dev mailing list
boinc_dev@ssl.berkeley.edu
https://lists.ssl.berkeley.edu/mailman/listinfo/boinc_dev
To unsubscribe, visit the above URL and
(near bottom of page) enter your email address.

Reply via email to