Hi,
Garrick Staples wrote (2007/07/03 4:51):
07/02 12:54:26 INFO: checkpointing node 'p4-6'
07/02 12:54:26 INFO: checkpointing node 'p4-7'
...
07/02 12:54:26 INFO: checkpointing node 'pd4-13'
07/02 12:54:26 INFO: checkpointing node '5958.jasmine'
07/02 12:54:26 INFO: checkpointing node '5959.jasmine'
...
07/02 12:54:26 INFO: checkpointing node '6044.jasmine'
This looks like an old bug in the pbs client libraries that was fixed
years ago. Maui would issue a pbs_statnode() call, the data read had a
particular timeout, and the data would still be on the wire for the next
call to pbs_statjob().
OK, I see.
You didn't say the version, but I assume an old version of TORQUE.
Update your TORQUE and rebuild Maui after installing the updated TORQUE
(updating Maui is not required for this particular bug).
Hmm, I'm using TORQUE 2.1.7 (not so old, isn't it?).
Anyway, I'll update TORQUE and check this phenomenon.
Thank you very much.
Heiga ZEN (Byung Ha CHUN)
--
------------------------------------------------
Heiga ZEN (in Japanese pronunciation)
Byung Ha CHUN (in Korean pronunciation)
Department of Computer Science and Engineering
Nagoya Institute of Technology
Gokiso-cho, Showa-ku, Nagoya 466-8555 Japan
http://www.sp.nitech.ac.jp/~zen
------------------------------------------------
_______________________________________________
mauiusers mailing list
[email protected]
http://www.supercluster.org/mailman/listinfo/mauiusers