On Tue, Feb 15, 2011 at 10:28 AM, Ian Kent <ra...@themaw.net> wrote: > On Mon, 2011-02-14 at 21:28 -0800, Mike Marion wrote: >> On Mon, Feb 14, 2011 at 07:37:01PM -0800, Ian Kent wrote: >> >> > That is kernel revision and autofs revision? >> >> 2.6.16.60-0.59.1 (Sles10 sp3 with an updated, but not bleeding edge, patch). >> autofs 5.0.5 with most of the patches up to a couple months ago. It's hard >> to >> get exacts because it's a PTF from Novell (we really pushed them to upgrade >> to >> 5.0.5) but it should be pretty much equal to the patch they just released for >> sle 11 sp1 that they're recommending as they default going forward. > > That make it hard, as you know. > > But I wouldn't mind spending a bit of time on it, if you can also. > > Let's assume that it's a user space problem for now.
Here are the call traces for all automount processes on the kernel side: PID: 4017 TASK: ffff8104202a9080 CPU: 4 COMMAND: "automount" #0 [ffff81040426fc58] schedule at ffffffff802efc13 #1 [ffff81040426fd30] schedule_timeout at ffffffff802f0748 #2 [ffff81040426fd80] futex_wait_abstime at ffffffff8014b703 #3 [ffff81040426fe60] do_futex at ffffffff8014bb33 #4 [ffff81040426ff20] sys_futex at ffffffff8014c2ed #5 [ffff81040426ff80] system_call at ffffffff8010ae16 PID: 4018 TASK: ffff81041cefd810 CPU: 2 COMMAND: "automount" #0 [ffff810404211d68] schedule at ffffffff802efc13 #1 [ffff810404211e40] schedule_timeout at ffffffff802f07b4 #2 [ffff810404211e90] futex_wait_abstime at ffffffff8014b703 #3 [ffff810404211f70] futex_wait_restart at ffffffff8014b7d2 #4 [ffff810404211f80] system_call at ffffffff8010ae16 PID: 4019 TASK: ffff810420ab8080 CPU: 5 COMMAND: "automount" #0 [ffff810404213c58] schedule at ffffffff802efc13 #1 [ffff810404213d30] schedule_timeout at ffffffff802f0748 #2 [ffff810404213d80] futex_wait_abstime at ffffffff8014b703 #3 [ffff810404213e60] do_futex at ffffffff8014bb33 #4 [ffff810404213f20] sys_futex at ffffffff8014c2ed #5 [ffff810404213f80] system_call at ffffffff8010ae16 PID: 4022 TASK: ffff81042055b7d0 CPU: 5 COMMAND: "automount" #0 [ffff810404c71c58] schedule at ffffffff802efc13 #1 [ffff810404c71d30] schedule_timeout at ffffffff802f0748 #2 [ffff810404c71d80] futex_wait_abstime at ffffffff8014b703 #3 [ffff810404c71e60] do_futex at ffffffff8014bb33 #4 [ffff810404c71f20] sys_futex at ffffffff8014c2ed #5 [ffff810404c71f80] system_call at ffffffff8010ae16 PID: 4025 TASK: ffff81041cefd080 CPU: 2 COMMAND: "automount" #0 [ffff8104001d7da8] schedule at ffffffff802efc13 #1 [ffff8104001d7e80] schedule_timeout at ffffffff802f07b4 #2 [ffff8104001d7ed0] do_sys_poll at ffffffff80199526 #3 [ffff8104001d7f60] sys_poll at ffffffff80199640 #4 [ffff8104001d7f80] system_call at ffffffff8010ae16 PID: 4026 TASK: ffff81041e1bd080 CPU: 7 COMMAND: "automount" #0 [ffff8104001f7c58] schedule at ffffffff802efc13 #1 [ffff8104001f7d30] schedule_timeout at ffffffff802f0748 #2 [ffff8104001f7d80] futex_wait_abstime at ffffffff8014b703 #3 [ffff8104001f7e60] do_futex at ffffffff8014bb33 #4 [ffff8104001f7f20] sys_futex at ffffffff8014c2ed #5 [ffff8104001f7f80] system_call at ffffffff8010ae16 PID: 6851 TASK: ffff81013c9187d0 CPU: 6 COMMAND: "automount" #0 [ffff810304f97c58] schedule at ffffffff802efc13 #1 [ffff810304f97d30] schedule_timeout at ffffffff802f0748 #2 [ffff810304f97d80] futex_wait_abstime at ffffffff8014b703 #3 [ffff810304f97e60] do_futex at ffffffff8014bb33 #4 [ffff810304f97f20] sys_futex at ffffffff8014c2ed #5 [ffff810304f97f80] system_call at ffffffff8010ae16 PID: 7060 TASK: ffff810188ef5080 CPU: 4 COMMAND: "automount" #0 [ffff810319ebdc58] schedule at ffffffff802efc13 #1 [ffff810319ebdd30] schedule_timeout at ffffffff802f0748 #2 [ffff810319ebdd80] futex_wait_abstime at ffffffff8014b703 #3 [ffff810319ebde60] do_futex at ffffffff8014bb33 #4 [ffff810319ebdf20] sys_futex at ffffffff8014c2ed #5 [ffff810319ebdf80] system_call at ffffffff8010ae16 PID: 20702 TASK: ffff810230be5080 CPU: 7 COMMAND: "automount" #0 [ffff81026c80dc18] schedule at ffffffff802efc13 #1 [ffff81026c80dcf0] autofs4_wait at ffffffff8831e5b2 [autofs4] #2 [ffff81026c80deb0] autofs4_expire_multi at ffffffff8831ed06 [autofs4] #3 [ffff81026c80dee0] do_ioctl at ffffffff80198acd #4 [ffff81026c80df00] vfs_ioctl at ffffffff80198d2b #5 [ffff81026c80df40] sys_ioctl at ffffffff80198da5 #6 [ffff81026c80df80] system_call at ffffffff8010ae16 They look regular to me, do you see something suspect? And here are the call traces from the user land daemon: Thread 9 (Thread 4017): #0 0x00002b56e465d6a8 in __lll_mutex_lock_wait () from /lib64/libpthread.so.0 #1 0x00002b56e46599fb in _L_mutex_lock_92 () from /lib64/libpthread.so.0 #2 0x00002b56e4659455 in pthread_mutex_lock () from /lib64/libpthread.so.0 #3 0x00005555555746cd in master_mutex_lock () at master.c:49 #4 0x000055555555d260 in do_hup_signal (master=0x55555568d010, age=1296063258) at automount.c:1276 #5 0x0000555555560bd3 in statemachine (arg=<value optimized out>) at automount.c:1354 #6 main (arg=<value optimized out>) at automount.c:2142 Thread 8 (Thread 20702): #0 0x00002b56e4dd62a7 in brk () from /lib64/libc.so.6 #1 0x0000555555577dfe in expire (logopt=2, cmd=<value optimized out>, fd=21, ioctlfd=21, path=0x55555569ca20 "/usr2", arg=0x41c27ef4) at dev-ioctl-lib.c:657 #2 0x0000555555577ebe in ioctl_expire (logopt=21, ioctlfd=-1, path=0x55555569ca20 "/usr2", when=0) at dev-ioctl-lib.c:701 #3 0x0000555555561e4e in expire_proc_indirect (arg=<value optimized out>) at indirect.c:545 #4 0x00002b56e4657193 in start_thread () from /lib64/libpthread.so.0 #5 0x00002b56e4ddcdfd in sysctl () from /lib64/libc.so.6 #6 0x0000000000000000 in ?? () Thread 7 (Thread 7060): #0 0x00002b56e465ac77 in pthread_rwlock_wrlock () from /lib64/libpthread.so.0 #1 0x00005555555752ea in master_source_writelock (entry=<value optimized out>) at master.c:527 #2 0x0000555555575f8f in master_add_map_source (entry=0x5555556a10b0, type=0x0, format=0x0, age=1296059657, argc=1, argv=<value optimized out>) at master.c:191 #3 0x0000555555579ee3 in master_parse_entry (buffer=<value optimized out>, default_timeout=86400, logging=<value optimized out>, age=1296059657) at master_parse.y:823 #4 0x00002aaaaaab83fe in lookup_read_master (master=<value optimized out>, age=1296059657, context=<value optimized out>) at lookup_ldap.c:1625 #5 0x0000555555569052 in do_read_master (master=0x55555568d010, type=<value optimized out>, age=1296059657) at lookup.c:96 #6 0x000055555556aa3c in lookup_nss_read_master (master=0x55555568d010, age=1296059657) at lookup.c:229 #7 0x0000555555575c28 in master_read_master (master=0x55555568d010, age=1296059657, readall=1) at master.c:831 #8 0x000055555555d844 in do_read_master (arg=<value optimized out>) at automount.c:1259 #9 0x00002b56e4657193 in start_thread () from /lib64/libpthread.so.0 #10 0x00002b56e4ddcdfd in sysctl () from /lib64/libc.so.6 #11 0x0000000000000000 in ?? () Thread 6 (Thread 6851): #0 0x00002b56e465aa3d in pthread_rwlock_rdlock () from /lib64/libpthread.so.0 #1 0x000055555556deb6 in cache_readlock (mc=0x55555568e5b8) at cache.c:60 #2 0x000055555556baff in do_readmap (arg=<value optimized out>) at state.c:479 #3 0x00002b56e4657193 in start_thread () from /lib64/libpthread.so.0 #4 0x00002b56e4ddcdfd in sysctl () from /lib64/libc.so.6 #5 0x0000000000000000 in ?? () Thread 5 (Thread 4026): #0 0x00002b56e465d6a8 in __lll_mutex_lock_wait () from /lib64/libpthread.so.0 #1 0x00002b56e46599fb in _L_mutex_lock_92 () from /lib64/libpthread.so.0 #2 0x00002b56e4659455 in pthread_mutex_lock () from /lib64/libpthread.so.0 #3 0x00005555555746cd in master_mutex_lock () at master.c:49 #4 0x0000555555560ff9 in handle_packet_missing_indirect (ap=0x55555569c940, pkt=0x41823ec0) at indirect.c:808 #5 0x000055555555fa32 in handle_packet (ap=<value optimized out>) at automount.c:1026 #6 handle_mounts (ap=<value optimized out>) at automount.c:1551 #7 0x00002b56e4657193 in start_thread () from /lib64/libpthread.so.0 #8 0x00002b56e4ddcdfd in sysctl () from /lib64/libc.so.6 #9 0x0000000000000000 in ?? () Thread 4 (Thread 4025): #0 0x00002b56e4dd4b26 in fts_read () from /lib64/libc.so.6 #1 0x00007fff37314e80 in ?? () #2 0x0000000000000000 in ?? () Thread 3 (Thread 4022): #0 0x00002b56e465ac77 in pthread_rwlock_wrlock () from /lib64/libpthread.so.0 #1 0x00005555555752ea in master_source_writelock (entry=<value optimized out>) at master.c:527 #2 0x00005555555631af in handle_packet_missing_direct (ap=0x5555556a11c0, pkt=0x40821ec0) at direct.c:1296 #3 0x000055555555fa1d in handle_packet (ap=<value optimized out>) at automount.c:1029 #4 handle_mounts (ap=<value optimized out>) at automount.c:1551 #5 0x00002b56e4657193 in start_thread () from /lib64/libpthread.so.0 #6 0x00002b56e4ddcdfd in sysctl () from /lib64/libc.so.6 #7 0x0000000000000000 in ?? () Thread 2 (Thread 4019): #0 0x00002b56e465b326 in pthread_cond_wait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #1 0x000055555556c8b3 in st_queue_handler (arg=<value optimized out>) at state.c:1104 #2 0x00002b56e4657193 in start_thread () from /lib64/libpthread.so.0 #3 0x00002b56e4ddcdfd in sysctl () from /lib64/libc.so.6 #4 0x0000000000000000 in ?? () Thread 1 (Thread 4018): #0 0x00002b56e465b517 in pthread_cond_timedwait@@GLIBC_2.3.2 () from /lib64/libpthread.so.0 #1 0x0000555555573762 in alarm_handler (arg=<value optimized out>) at alarm.c:206 #2 0x00002b56e4657193 in start_thread () from /lib64/libpthread.so.0 #3 0x00002b56e4ddcdfd in sysctl () from /lib64/libc.so.6 #4 0x0000000000000000 in ?? () > Could you post a debug log somewhere and the current sles patched source > somewhere. Unfortunately I don't have the full debug logs yet, so I'll leave this to Mike. > If you could spend a little time trying to compare the > patches in the sles package to the ones in the Rawhide autofs repo that > would be a big help too. The package includes all upstream patches released so far (ie, up to autofs-5.0.5-fix-submount-shutdown-wait.patch), minus the following ones: autofs-5.0.4 - use intr option as hosts mount default (configuration change) autofs-5.0.5 - add dump maps option (recent feature) autofs-5.0.5 - add external bind method (recent feature) autofs-5.0.5 - fix ext4 fsck at mount (no ext4 on SLES 10) Thanks, Leonardo _______________________________________________ autofs mailing list autofs@linux.kernel.org http://linux.kernel.org/mailman/listinfo/autofs