> On 1 Jun 2020, at 11:43, Crocker, Deborah <[email protected]> wrote: > > Is this sufficient? Again, this server has a light load and we don't think > we saw the problem, although I do note that the CPU usage seems pretty high > for such a light load.
All threads are idle except thread 1 that is checking if there are new connections. I don't see anything obviously wrong here ... :( > > > Thread 26 (Thread 0x7f0600d32700 (LWP 11330)): > #0 0x00007f06420a09a3 in select () > at ../sysdeps/unix/syscall-template.S:81 > #1 0x00007f06452e0649 in DS_Sleep () > at /usr/lib64/dirsrv/libslapd.so.0 > #2 0x00007f063a136bf7 in deadlock_threadmain () > at /usr/lib64/dirsrv/plugins/libback-ldbm.so > #3 0x00007f064305dc5b in _pt_root (arg=0x557b3b2f5b00) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #4 0x00007f06429fdea5 in start_thread (arg=0x7f0600d32700) > at pthread_create.c:307 > #5 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 25 (Thread 0x7f0600531700 (LWP 11331)): > #0 0x00007f06420a09a3 in select () > at ../sysdeps/unix/syscall-template.S:81 > #1 0x00007f06452e0649 in DS_Sleep () > at /usr/lib64/dirsrv/libslapd.so.0 > #2 0x00007f063a13a7c7 in checkpoint_threadmain () > at /usr/lib64/dirsrv/plugins/libback-ldbm.so > #3 0x00007f064305dc5b in _pt_root (arg=0x557b3b2f59e0) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #4 0x00007f06429fdea5 in start_thread (arg=0x7f0600531700) > at pthread_create.c:307 > #5 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 24 (Thread 0x7f05ffd30700 (LWP 11332)): > #0 0x00007f06420a09a3 in select () > at ../sysdeps/unix/syscall-template.S:81 > #1 0x00007f06452e0649 in DS_Sleep () > at /usr/lib64/dirsrv/libslapd.so.0 > #2 0x00007f063a136e47 in trickle_threadmain () > at /usr/lib64/dirsrv/plugins/libback-ldbm.so > #3 0x00007f064305dc5b in _pt_root (arg=0x557b3b2f5c20) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #4 0x00007f06429fdea5 in start_thread (arg=0x7f05ffd30700) > at pthread_create.c:307 > #5 0x00007f06420a98dd in clone () > ---Type <return> to continue, or q <return> to quit--- > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 23 (Thread 0x7f05ff52f700 (LWP 11333)): > #0 0x00007f06420a09a3 in select () > at ../sysdeps/unix/syscall-template.S:81 > #1 0x00007f06452e0649 in DS_Sleep () > at /usr/lib64/dirsrv/libslapd.so.0 > #2 0x00007f063a1319f7 in perf_threadmain () > at /usr/lib64/dirsrv/plugins/libback-ldbm.so > #3 0x00007f064305dc5b in _pt_root (arg=0x557b3b2f5440) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #4 0x00007f06429fdea5 in start_thread (arg=0x7f05ff52f700) > at pthread_create.c:307 > #5 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 22 (Thread 0x7f05fed2e700 (LWP 11334)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55e900, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x00007f06452ccf58 in slapi_wait_condvar () > at /usr/lib64/dirsrv/libslapd.so.0 > #3 0x00007f063abe515e in cos_cache_wait_on_change () > at /usr/lib64/dirsrv/plugins/libcos-plugin.so > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040b680) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05fed2e700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 21 (Thread 0x7f05fe52d700 (LWP 11335)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55ebc0, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x00007f06452ccf58 in slapi_wait_condvar () > at /usr/lib64/dirsrv/libslapd.so.0 > #3 0x00007f06382041fd in roles_cache_wait_on_change () > at /usr/lib64/dirsrv/plugins/libroles-plugin.so > ---Type <return> to continue, or q <return> to quit--- > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040b440) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05fe52d700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 20 (Thread 0x7f05fdd2c700 (LWP 11336)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b404d3c40, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x00007f06452ccf58 in slapi_wait_condvar () > at /usr/lib64/dirsrv/libslapd.so.0 > #3 0x00007f06382041fd in roles_cache_wait_on_change () > at /usr/lib64/dirsrv/plugins/libroles-plugin.so > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040b320) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05fdd2c700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 19 (Thread 0x7f05fd52b700 (LWP 11337)): > #0 0x00007f0642a01de2 in pthread_cond_timedwait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_timedwait.S:238 > #1 0x00007f0643057d07 in pt_TimedWait (cv=cv@entry=0x557b3b55ec08, > ml=0x557b4055c160, timeout=timeout@entry=30000) > at ../../../nspr/pr/src/pthreads/ptsynch.c:258 > #2 0x00007f06430581ee in PR_WaitCondVar (cvar=0x557b3b55ec00, timeout=30000) > at ../../../nspr/pr/src/pthreads/ptsynch.c:387 > #3 0x0000557b3a0be208 in housecleaning () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040b0e0) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05fd52b700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 18 (Thread 0x7f05fcd2a700 (LWP 11338)): > ---Type <return> to continue, or q <return> to quit--- > #0 0x00007f0642a01de2 in pthread_cond_timedwait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_timedwait.S:238 > #1 0x00007f0643057d07 in pt_TimedWait (cv=cv@entry=0x557b3b61fa08, > ml=0x557b3b6709a0, timeout=timeout@entry=10000) > at ../../../nspr/pr/src/pthreads/ptsynch.c:258 > #2 0x00007f06430581ee in PR_WaitCondVar (cvar=0x557b3b61fa00, timeout=10000) > at ../../../nspr/pr/src/pthreads/ptsynch.c:387 > #3 0x00007f064526ef23 in eq_loop () > at /usr/lib64/dirsrv/libslapd.so.0 > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040b200) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05fcd2a700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 17 (Thread 0x7f05fc529700 (LWP 11339)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040ad80) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05fc529700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 16 (Thread 0x7f05fbd28700 (LWP 11340)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040b7a0) > ---Type <return> to continue, or q <return> to quit--- > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05fbd28700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 15 (Thread 0x7f05fb527700 (LWP 11341)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040b8c0) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05fb527700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 14 (Thread 0x7f05fad26700 (LWP 11342)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040be60) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05fad26700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 13 (Thread 0x7f05fa525700 (LWP 11343)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > ---Type <return> to continue, or q <return> to quit--- > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040bd40) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05fa525700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 12 (Thread 0x7f05f9d24700 (LWP 11344)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040bc20) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05f9d24700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 11 (Thread 0x7f05f9523700 (LWP 11345)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040bb00) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05f9523700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 10 (Thread 0x7f05f8d22700 (LWP 11346)): > ---Type <return> to continue, or q <return> to quit--- > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040b9e0) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05f8d22700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 9 (Thread 0x7f05f8521700 (LWP 11347)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040a120) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05f8521700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 8 (Thread 0x7f05f7d20700 (LWP 11348)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040a240) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05f7d20700) > at pthread_create.c:307 > ---Type <return> to continue, or q <return> to quit--- > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 7 (Thread 0x7f05f751f700 (LWP 11349)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040a360) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05f751f700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 6 (Thread 0x7f05f6d1e700 (LWP 11350)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040a5a0) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05f6d1e700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 5 (Thread 0x7f05f651d700 (LWP 11351)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > ---Type <return> to continue, or q <return> to quit--- > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040a480) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05f651d700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 4 (Thread 0x7f05f5d1c700 (LWP 11352)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040a7e0) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05f5d1c700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 3 (Thread 0x7f05f551b700 (LWP 11353)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040a900) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05f551b700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 2 (Thread 0x7f05f4d1a700 (LWP 11354)): > #0 0x00007f0642a01a35 in pthread_cond_wait@@GLIBC_2.3.2 () > at ../nptl/sysdeps/unix/sysv/linux/x86_64/pthread_cond_wait.S:185 > #1 0x00007f0643058270 in PR_WaitCondVar (cvar=0x557b3b55fd80, > timeout=---Type <return> to continue, or q <return> to quit--- > 4294967295) at ../../../nspr/pr/src/pthreads/ptsynch.c:385 > #2 0x0000557b3a0b32fe in [IDLE THREAD] connection_wait_for_new_work () > #3 0x0000557b3a0b4941 in connection_threadmain () > #4 0x00007f064305dc5b in _pt_root (arg=0x557b4040a6c0) > at ../../../nspr/pr/src/pthreads/ptthread.c:201 > #5 0x00007f06429fdea5 in start_thread (arg=0x7f05f4d1a700) > at pthread_create.c:307 > #6 0x00007f06420a98dd in clone () > at ../sysdeps/unix/sysv/linux/x86_64/clone.S:111 > > Thread 1 (Thread 0x7f064594c940 (LWP 11328)): > #0 0x00007f064209ec3d in poll () > at ../sysdeps/unix/syscall-template.S:81 > #1 0x00007f0643059ba7 in poll (__timeout=250, __nfds=123, > __fds=0x557b404ae400) at /usr/include/bits/poll2.h:46 > #2 0x00007f0643059ba7 in _pr_poll_with_poll (pds=0x557b40bb8000, npds=123, > timeout=<optimized out>) > at ../../../nspr/pr/src/pthreads/ptio.c:4023 > #3 0x0000557b3a0ba149 in slapd_daemon () > #4 0x0000557b3a0abb15 in main () > (gdb) > (gdb) > > Deborah Crocker, PhD > Systems Engineer III > Office of Information Technology > The University of Alabama > Box 870346 > Tuscaloosa, AL 36587 > Office 205-348-3758 | Fax 205-348-9393 > [email protected] > > > -----Original Message----- > From: William Brown <[email protected]> > Sent: Sunday, May 31, 2020 8:00 PM > To: [email protected] > Subject: [EXTERNAL] [389-users] Re: Re: new server setup hanging > > > >> On 1 Jun 2020, at 10:54, Crocker, Deborah <[email protected]> wrote: >> >> We had to roll it back. There is one host running with it but the load is so >> light we never saw a problem. We think it was a known bug, maybe this: >> >> https://pagure.io/389-ds-base/issue/50329 > > That issue is fixed in 1.3.9, you are running 1.3.10, so that seems unlikely? > >> >> Do you want any info off the running host? > > As before, I'd need to see a gdb -p "pid" and `thread apply all bt` thanks. > When you gdb -p <pid>, it will tell you a command of what debug info packages > you need to install and how. You should install those before you run the gdb > commands. It will cause the server to "pause" when you attach gdb btw. > >> >> We'll now probably move into the 1.4.x trees. Any advice on which is the >> most stable? > > We try to make sure they are all stable - if you are using Red Hat/CentOS or > SLE/Suse Leap, then whatever 389-ds version are in those platforms is the > "best maintained" for that platform, and we'll resolve issues in them etc. I > think that's 1.4.2.x or 1.4.3.x from the top of my head at the moment for > RHEL 8.x and SLE 15.x. > >> >> -----Original Message----- >> From: William Brown <[email protected]> >> Sent: Sunday, May 31, 2020 6:25 PM >> To: [email protected] >> Subject: [EXTERNAL] [389-users] Re: new server setup hanging >> >> Hey there, >> >> >> We need to see the pstacks from all threads to really determine the cause >> here. Can you send us a complete read out? >> >> gdb -p "pid" >> >> thread apply all bt >> >> >> It'd be great if you can install debug info too to help. >> >> Thanks, >> >> >>> On 31 May 2020, at 05:09, Crocker, Deborah <[email protected]> wrote: >>> >>> Some more information from a coworker: >>> >>> Yeah, this sounds like an LDAP server bug. I haven't figured out what to >>> look at to pin it down, but when it's slow to connect, I can see with >>> strace that the primary thread hasn't called accept() yet for the >>> connection I'm trying to open. Once it does, the whole thing goes very >>> quickly, and I usually see a burst of other connections accepted and >>> handled at the same time. >>> >>> Deborah Crocker, PhD >>> Systems Engineer III >>> Office of Information Technology >>> The University of Alabama >>> Box 870346 >>> Tuscaloosa, AL 36587 >>> Office 205-348-3758 | Fax 205-348-9393 [email protected] >>> >>> >>> -----Original Message----- >>> From: Crocker, Deborah <[email protected]> >>> Sent: Saturday, May 30, 2020 2:08 PM >>> To: General discussion list for the 389 Directory server project. >>> <[email protected]> >>> Subject: [EXTERNAL] [389-users] new server setup hanging >>> >>> We're trying to move into our new server setup. We have one that seems to >>> be fine under a load but when we bring the next we're having trouble with >>> it hanging. The second does have more clients (and different) so there >>> could be something about what a client is doing. Here is the server: >>> 389-Directory/1.3.10.1 B2020.133.1625 Installed from EPEL, running on >>> CentOS Linux release 7.8.2003 >>> >>> And here is the pstack output listing the only thread that is not idle. Can >>> anyone tell me what is going on? >>> >>> Thread 44 (Thread 0x7f858e9b3700 (LWP 2515)): >>> #0 0x00007f860a90fe02 in slapi_atomic_load_32 () at >>> /usr/lib64/dirsrv/libslapd.so.0 >>> #1 0x00007f860a8d4e8e in slapi_get_mapping_tree_node_by_dn () at >>> /usr/lib64/dirsrv/libslapd.so.0 >>> #2 0x00007f860a8d5179 in slapi_be_select () at >>> /usr/lib64/dirsrv/libslapd.so.0 >>> #3 0x00007f860a9296a0 in vattr_test_filter () at >>> /usr/lib64/dirsrv/libslapd.so.0 >>> #4 0x00007f860a8b6ec4 in slapi_vattr_filter_test_ext_internal () at >>> /usr/lib64/dirsrv/libslapd.so.0 >>> #5 0x00007f860a8b7ba6 in slapi_vattr_filter_test_ext () at >>> /usr/lib64/dirsrv/libslapd.so.0 >>> #6 0x00007f8600a99e02 in acl__resource_match_aci () at >>> /usr/lib64/dirsrv/plugins/libacl-plugin.so >>> #7 0x00007f8600a9b280 in acl_access_allowed () at >>> /usr/lib64/dirsrv/plugins/libacl-plugin.so >>> #8 0x00007f8600aae9f7 in acl_access_allowed_main () at >>> /usr/lib64/dirsrv/plugins/libacl-plugin.so >>> #9 0x00007f860a8f0cbc in plugin_call_acl_plugin () at >>> /usr/lib64/dirsrv/libslapd.so.0 >>> #10 0x00007f860a8b638d in test_filter_access () at >>> /usr/lib64/dirsrv/libslapd.so.0 >>> #11 0x00007f860a8b6fb5 in slapi_vattr_filter_test_ext_internal () at >>> /usr/lib64/dirsrv/libslapd.so.0 >>> #12 0x00007f860a8b6d31 in slapi_vattr_filter_test_ext_internal () at >>> /usr/lib64/dirsrv/libslapd.so.0 >>> #13 0x00007f860a8b7ba6 in slapi_vattr_filter_test_ext () at >>> /usr/lib64/dirsrv/libslapd.so.0 >>> #14 0x00007f85ff7c0df1 in ldbm_back_next_search_entry_ext () at >>> /usr/lib64/dirsrv/plugins/libback-ldbm.so >>> #15 0x00007f860a8deca6 in send_results_ext.constprop.5 () at >>> /usr/lib64/dirsrv/libslapd.so.0 >>> #16 0x00007f860a8e0e09 in op_shared_search () at >>> /usr/lib64/dirsrv/libslapd.so.0 >>> #17 0x0000557410dd3c0e in do_search () >>> #18 0x0000557410dc198a in connection_threadmain () >>> #19 0x00007f86086a0c5b in _pt_root () at /lib64/libnspr4.so >>> #20 0x00007f8608040ea5 in start_thread () at /lib64/libpthread.so.0 >>> #21 0x00007f86076ec8dd in clone () at /lib64/libc.so.6 >>> >>> Deborah Crocker, PhD >>> Systems Engineer III >>> Office of Information Technology >>> The University of Alabama >>> Box 870346 >>> Tuscaloosa, AL 36587 >>> Office 205-348-3758 | Fax 205-348-9393 [email protected] >>> >>> >>> -----Original Message----- >>> From: William Brown <[email protected]> >>> Sent: Wednesday, May 27, 2020 5:43 PM >>> To: [email protected] >>> Subject: [EXTERNAL] [389-users] Re: Re: Re: Advice to bring new >>> servers into production >>> >>> >>> >>>> On 27 May 2020, at 23:20, Crocker, Deborah <[email protected]> wrote: >>>> >>>> Thanks - I think we have enough ideas in here to get this going. One last >>>> question: >>>> If replication is set up through the host name - how often does the >>>> directory server do a DNS look up, or does it do it once on startup (or >>>> creation of the rep agreement)? >>> >>> I "think" it's every time it initiates the new connection - but remember, >>> for replication, that *is* quite different to a client doing a search, so >>> I'd be pretty careful about this. IMO you should be standing up your >>> replacement servers in parallel, joining them all, moving the IP's then >>> decomission the old servers. Alternately, you'll need an outage window to >>> shutdown your old servers, export the ldif, and then import and bring up >>> the new ones. >>> >>> I think having "IP's are a limited resource" really does make this >>> whole process much much harder than it needs to be for you ... :( >>> >>>> >>>> -----Original Message----- >>>> From: William Brown <[email protected]> >>>> Sent: Tuesday, May 26, 2020 10:48 PM >>>> To: [email protected] >>>> Subject: [EXTERNAL] [389-users] Re: Re: Advice to bring new servers >>>> into production >>>> >>>> There are a few options. The best would be a load balancer which has the >>>> ip's so that it's transparent to your LDAP servers where they are. >>>> >>>> But also as mentioned, the virtual IP's honestly is the best way. Linux >>>> can have multiple IP's on an interface so you can just have two IP's on >>>> one interface, andthat's the best way to do this. >>>> >>>> Alternately, don't rely on the IP, lower your DNS ttl's to a very short >>>> time, change the DNS A/AAAA records, and then do it that way. >>>> >>>> >>>> >>>>> On 27 May 2020, at 06:17, Crocker, Deborah <[email protected]> wrote: >>>>> >>>>> I’d like not to take up two ip addresses per host indefinitely. We have >>>>> re-IP’d our hosts before so I know we can to do this but it was during a >>>>> downtime when everything was restarted. Just trying to get away with not >>>>> restarting the masters. >>>>> >>>>> Deborah Crocker, PhD >>>>> Systems Engineer III >>>>> Office of Information Technology >>>>> The University of Alabama >>>>> Box 870346 >>>>> Tuscaloosa, AL 36587 >>>>> Office 205-348-3758 | Fax 205-348-9393 [email protected] >>>>> >>>>> From: Leo Pleiman <[email protected]> >>>>> Sent: Tuesday, May 26, 2020 3:08 PM >>>>> To: General discussion list for the 389 Directory server project. >>>>> <[email protected]> >>>>> Subject: [EXTERNAL] [389-users] Re: Advice to bring new servers >>>>> into production >>>>> >>>>> My experience has been that the replicas and consumers have a unique id, >>>>> more than just an IP address which creates the trust relationship with >>>>> the master. If your goal is to simply maintain an IP so your clients >>>>> don't have to be repointed, I would build each new LDAP host and >>>>> replication agreement, and then as you decommission the old hosts use >>>>> their IP address as a virtual IP address on the replacement host. It >>>>> would take a quick restart od the LDAP service to start a listener on the >>>>> virtual Ip address. >>>>> >>>>> >>>>> Leo Pleiman >>>>> Senior System Engineer >>>>> Direct 202-787-3622 >>>>> Cell 410-688-3873 >>>>> >>>>> >>>>> >>>>> On Tue, May 26, 2020 at 3:57 PM Crocker, Deborah <[email protected]> wrote: >>>>> We have a setup with 2 multi-masters and 3 consumers. We are now building >>>>> new host and want to put them in place ultimately at the same IP address >>>>> as the original ones. I need some advice on how to do this quickly and >>>>> cleanly. >>>>> >>>>> To add a new consumer the idea now is to set it up and set up >>>>> replications agreements from each master using consumer DNS name (don't >>>>> start continuous replication yet). After initializing new consumer from >>>>> one master - turn off old consumer, remove old consumer agreement from >>>>> each master, and re-IP new consumer. Do we need to restart masters to >>>>> re-read DNS or will it pick that up when it starts the next replication? >>>>> Is this the best way to do this? >>>>> >>>>> Thanks >>>>> >>>>> Deborah Crocker, PhD >>>>> Systems Engineer III >>>>> Office of Information Technology >>>>> The University of Alabama >>>>> Box 870346 >>>>> Tuscaloosa, AL 36587 >>>>> Office 205-348-3758 | Fax 205-348-9393 [email protected] >>>>> >>>>> _______________________________________________ >>>>> 389-users mailing list -- [email protected] To >>>>> unsubscribe send an email to >>>>> [email protected] >>>>> Fedora Code of Conduct: >>>>> https://docs.fedoraproject.org/en-US/project/code-of-conduct/ >>>>> List Guidelines: >>>>> https://fedoraproject.org/wiki/Mailing_list_guidelines >>>>> List Archives: >>>>> https://lists.fedoraproject.org/archives/list/[email protected] >>>>> a p r oject.org _______________________________________________ >>>>> 389-users mailing list -- [email protected] To >>>>> unsubscribe send an email to >>>>> [email protected] >>>>> Fedora Code of Conduct: >>>>> https://docs.fedoraproject.org/en-US/project/code-of-conduct/ >>>>> List Guidelines: >>>>> https://fedoraproject.org/wiki/Mailing_list_guidelines >>>>> List Archives: >>>>> https://lists.fedoraproject.org/archives/list/[email protected] >>>>> a >>>>> p >>>>> r >>>>> oject.org >>>> >>>> — >>>> Sincerely, >>>> >>>> William Brown >>>> >>>> Senior Software Engineer, 389 Directory Server SUSE Labs >>>> _______________________________________________ >>>> 389-users mailing list -- [email protected] To >>>> unsubscribe send an email to [email protected] >>>> Fedora Code of Conduct: >>>> https://docs.fedoraproject.org/en-US/project/code-of-conduct/ >>>> List Guidelines: >>>> https://fedoraproject.org/wiki/Mailing_list_guidelines >>>> List Archives: >>>> https://lists.fedoraproject.org/archives/list/[email protected] >>>> p r oject.org _______________________________________________ >>>> 389-users mailing list -- [email protected] To >>>> unsubscribe send an email to [email protected] >>>> Fedora Code of Conduct: >>>> https://docs.fedoraproject.org/en-US/project/code-of-conduct/ >>>> List Guidelines: >>>> https://fedoraproject.org/wiki/Mailing_list_guidelines >>>> List Archives: >>>> https://lists.fedoraproject.org/archives/list/[email protected] >>>> p >>>> r >>>> oject.org >>> >>> — >>> Sincerely, >>> >>> William Brown >>> >>> Senior Software Engineer, 389 Directory Server SUSE Labs >>> _______________________________________________ >>> 389-users mailing list -- [email protected] To >>> unsubscribe send an email to [email protected] >>> Fedora Code of Conduct: >>> https://docs.fedoraproject.org/en-US/project/code-of-conduct/ >>> List Guidelines: >>> https://fedoraproject.org/wiki/Mailing_list_guidelines >>> List Archives: >>> https://lists.fedoraproject.org/archives/list/[email protected] >>> r oject.org _______________________________________________ >>> 389-users mailing list -- [email protected] To >>> unsubscribe send an email to [email protected] >>> Fedora Code of Conduct: >>> https://docs.fedoraproject.org/en-US/project/code-of-conduct/ >>> List Guidelines: >>> https://fedoraproject.org/wiki/Mailing_list_guidelines >>> List Archives: >>> https://lists.fedoraproject.org/archives/list/[email protected] >>> r oject.org _______________________________________________ >>> 389-users mailing list -- [email protected] To >>> unsubscribe send an email to [email protected] >>> Fedora Code of Conduct: >>> https://docs.fedoraproject.org/en-US/project/code-of-conduct/ >>> List Guidelines: >>> https://fedoraproject.org/wiki/Mailing_list_guidelines >>> List Archives: >>> https://lists.fedoraproject.org/archives/list/[email protected] >>> r >>> oject.org >> >> — >> Sincerely, >> >> William Brown >> >> Senior Software Engineer, 389 Directory Server SUSE Labs >> _______________________________________________ >> 389-users mailing list -- [email protected] To >> unsubscribe send an email to [email protected] >> Fedora Code of Conduct: >> https://docs.fedoraproject.org/en-US/project/code-of-conduct/ >> List Guidelines: >> https://fedoraproject.org/wiki/Mailing_list_guidelines >> List Archives: >> https://lists.fedoraproject.org/archives/list/[email protected] >> oject.org _______________________________________________ >> 389-users mailing list -- [email protected] To >> unsubscribe send an email to [email protected] >> Fedora Code of Conduct: >> https://docs.fedoraproject.org/en-US/project/code-of-conduct/ >> List Guidelines: >> https://fedoraproject.org/wiki/Mailing_list_guidelines >> List Archives: >> https://lists.fedoraproject.org/archives/list/[email protected] >> oject.org > > — > Sincerely, > > William Brown > > Senior Software Engineer, 389 Directory Server SUSE Labs > _______________________________________________ > 389-users mailing list -- [email protected] To unsubscribe > send an email to [email protected] > Fedora Code of Conduct: > https://docs.fedoraproject.org/en-US/project/code-of-conduct/ > List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines > List Archives: > https://lists.fedoraproject.org/archives/list/[email protected] > _______________________________________________ > 389-users mailing list -- [email protected] > To unsubscribe send an email to [email protected] > Fedora Code of Conduct: > https://docs.fedoraproject.org/en-US/project/code-of-conduct/ > List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines > List Archives: > https://lists.fedoraproject.org/archives/list/[email protected] — Sincerely, William Brown Senior Software Engineer, 389 Directory Server SUSE Labs _______________________________________________ 389-users mailing list -- [email protected] To unsubscribe send an email to [email protected] Fedora Code of Conduct: https://docs.fedoraproject.org/en-US/project/code-of-conduct/ List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines List Archives: https://lists.fedoraproject.org/archives/list/[email protected]
