On 03/13/2010 02:55 AM, Joseph Bester wrote:
> Weird. I'll have to try that and see where it goes wrong.

Hi Joseph,

Thanks for looking into it.

>> > I only get:
>> > 
>> > [INFO] Enter globus_l_pbs_increase_buffer
>> > [INFO] Exit globus_l_pbs_increase_buffer
>> > 
>> > and the output stops there.   Any idea what's wrong?
> That seems an odd place for the output to stop. Any chance the
> globus-scheduler-event-generator program is built threaded? I think I
> see a deadlock condition that can occur in it if that's the case.

I did compile both non-threaded and threaded version with:

make gram5-pbs
make gram5-pbs-thr

And when I look at the memory map of the
globus-scheduler-event-generator process, I see it's linked against
gcc64dbgpthr libraries.

And with strace, I see the process is stuck in a futex syscall.

And when I connect to the running process with gdb, I get the following
stack-trace:

> (gdb) where
> #0  0x0000003b1100d2e4 in __lll_lock_wait () from /lib64/libpthread.so.0
> #1  0x0000003b11008c3a in _L_lock_1034 () from /lib64/libpthread.so.0
> #2  0x0000003b11008afc in pthread_mutex_lock () from /lib64/libpthread.so.0
> #3  0x00002b1598c184d3 in globus_mutex_lock (mut=0x2b1596aad920) at 
> globus_thread_pthreads.c:827
> #4  0x00002b15968ac963 in globus_scheduler_event_generator_get_timestamp 
> (timestamp=0x7fffa46e9a08) at globus_scheduler_event_generator.c:382
> #5  0x00002aaaaaaae2e1 in globus_l_pbs_module_activate () at 
> seg_pbs_module.c:265
> #6  0x00002b1598c0e01e in globus_module_activate_proxy 
> (module_descriptor=0x2aaaaacb35e0, deactivate_cb=0x2b1598c1f2f8 
> <globus_l_extension_deactivate_proxy>, 
>     user_arg=0xbd4f8d0) at globus_module.c:215
> #7  0x00002b1598c1fd0a in globus_extension_activate (extension_name=0xbd4ecc0 
> "globus_seg_pbs") at globus_extension.c:630
> #8  0x00002b15968acb3c in globus_scheduler_event_generator_load_module 
> (module_name=0x7fffa46eba72 "pbs") at globus_scheduler_event_generator.c:465
> #9  0x00000000004020e5 in main (argc=5, argv=0x7fffa46e9c48) at main.c:172

Does that give any clue?

Thanks for your help!

Cheers,
Vladimir


-- 
Vladimir Mencl, Ph.D.
E-Research Services and Systems Consultant
BlueFern Supercomputing Services
University of Canterbury
Private Bag 4800
Christchurch 8140
New Zealand

http://www.bluefern.canterbury.ac.nz
mailto:[email protected]
Phone: +64 3 364 3012
Mobile: +64 21 997 352
Fax: +64 3 364 2332

Reply via email to