On 09/22/2016 04:02 AM, Frank Filz wrote: >> I have pushed rc7 with Matt's c++ compile changes and one final patch from >> Daniel G. >> >> Please have at it. I'd like to get as many FSAL's verified against rc7 by >> 9:00 AM PDT Thursday. At that time, unless some major fire has erupted, I >> will tag V2.4.0 and push that so Kaleb can get on with his work to include >> V2.4.0. > > Hmm, centos-ci is showing a failure in the Cthon04 lock tests. I ran NFS v3 > lock tests and got a pass.
I ran the cthon04 tests (using FSAL_GLUSTER) on v3 and v4 mounts. They seem to pass. But if I run in a loop, sometimes (very much spurious - hit only once) ganesha process seems to crash. One of the bt seen is (gdb) bt #0 0x00007f570da9fa98 in __GI_raise (sig=sig@entry=6) at ../sysdeps/unix/sysv/linux/raise.c:55 #1 0x00007f570daa169a in __GI_abort () at abort.c:89 #2 0x00007f570dae2e1a in __libc_message (do_abort=do_abort@entry=2, fmt=fmt@entry=0x7f570dbf5a00 "*** Error in `%s': %s: 0x%s ***\n") at ../sysdeps/posix/libc_fatal.c:175 #3 0x00007f570dae91e4 in malloc_printerr (action=<optimized out>, str=0x7f570dbf5a48 "corrupted double-linked list (not small)", ptr=<optimized out>, ar_ptr=<optimized out>) at malloc.c:5000 #4 0x00007f570daebd5a in _int_free (av=0x7f5650000020, p=<optimized out>, have_lock=0) at malloc.c:4008 #5 0x00007f570daeebcc in __GI___libc_free (mem=<optimized out>) at malloc.c:2962 #6 0x000000000044ad9b in gsh_free_size (p=0x7f56500a3290, n=1360) at /home/guest/Documents/workspace/nfs-ganesha/src/include/abstract_mem.h:287 #7 0x00007f570f4000b9 in mem_free (p=0x7f56500a3290, n=1360) at /home/guest/Documents/workspace/nfs-ganesha/src/libntirpc/ntirpc/rpc/types.h:208 #8 0x00007f570f4006c3 in free_rpc_msg (msg=0x7f56500a3290) at /home/guest/Documents/workspace/nfs-ganesha/src/libntirpc/src/svc.c:254 #9 0x00000000004e7ba5 in nfs_dupreq_rele (req=0x7f56500a0ad8, func=0x54a190 <nfs4_func_desc+48>) at /home/guest/Documents/workspace/nfs-ganesha/src/RPCAL/nfs_dupreq.c:1257 #10 0x000000000044a50a in nfs_rpc_execute (reqdata=0x7f56500a0ab0) at /home/guest/Documents/workspace/nfs-ganesha/src/MainNFSD/nfs_worker_thread.c:1405 and another one (I lost the core) , but it was mdcache_lru_get->mdcache_lru_clean -> fsal_close()->close() . In FSAL_GLUSTER()->file_close(), below assert was hit - assert(obj_hdl->type == REGULAR_FILE); The obj_hdl->type was a large number and did not have any of the defined macros value. I tried reproducing, but haven't hit it again (neither of the above crashes). But I would like to check if the above check is valid in FSALs close() routines ? I see this check in vfs_close() as well. But I assume mdcache_lru_clean could be called on an obj_hdl of any type but not restricted to REGULAR_FILE. Could you please confirm? Thanks, Soumya > > If there is something broken here, we really should try to fix ASAP. > > Thanks > > Frank > > > > --- > This email has been checked for viruses by Avast antivirus software. > https://www.avast.com/antivirus > > > ------------------------------------------------------------------------------ > _______________________________________________ > Nfs-ganesha-devel mailing list > Nfs-ganesha-devel@lists.sourceforge.net > https://lists.sourceforge.net/lists/listinfo/nfs-ganesha-devel > ------------------------------------------------------------------------------ _______________________________________________ Nfs-ganesha-devel mailing list Nfs-ganesha-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/nfs-ganesha-devel