On 10/27/17 6:43 PM, Frank Filz wrote:
Ganesha startup and shutdown seem to wander between fast and slow and,
in the case of shutdown, sometimes don't finish cleanly...
I'm glad to see the startup time has recently improved, it was annoying for
a while.
Think that was mostly Malahal?
Also, my 2.6 UDP code uses separate send and receive buffers. That
might have helped a little during startup (and for anybody running NFSv3
over UDP at any time). The original TIRPC library code uses one common
buffer for all UDP calls and replies, so it was blocked waiting a lot of
the time.
Shutdown had been horrible, but it would eventually shut down; then it was
improved. And now it's horrible again.
Since the RDMA v3 work, the svc_work_pool took 240 seconds to shut down
(an old constant from the former thrd_pool code). But it didn't run
unless you were testing RDMA.
In 2.6, I added that pool for every transport, and decreased the timeout
to 31 seconds. I recently added a quicker 1 second shutdown wait loop
to signal the pool, and that usually runs twice. So 2 seconds.
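
For anyone following along, the wait loop described above can be sketched
roughly as below. This is only an illustration in Python, not the ntirpc
code (which is C); the names worker, shutdown_pool, wake and stop are
hypothetical. The 1 second interval and 31 second timeout are the values
mentioned above.

```python
import threading
import time


def worker(wake, stop):
    """Hypothetical pool worker: sleeps on the condition until told to stop."""
    with wake:
        while not stop.is_set():
            wake.wait()


def shutdown_pool(workers, wake, stop, interval=1.0, timeout=31.0):
    """Signal the pool once per interval until every worker has exited.

    Returns the number of signal rounds used, or -1 if the overall
    timeout expires while workers are still alive.
    """
    stop.set()
    deadline = time.monotonic() + timeout
    rounds = 0
    while any(w.is_alive() for w in workers):
        if time.monotonic() >= deadline:
            return -1
        rounds += 1
        # Wake every sleeping worker so it can notice the stop flag.
        with wake:
            wake.notify_all()
        # Give the workers up to one interval to exit before signalling again.
        round_end = time.monotonic() + interval
        for w in workers:
            w.join(max(0.0, round_end - time.monotonic()))
    return rounds


if __name__ == "__main__":
    wake = threading.Condition()
    stop = threading.Event()
    pool = [threading.Thread(target=worker, args=(wake, stop), daemon=True)
            for _ in range(4)]
    for t in pool:
        t.start()
    print("signal rounds:", shutdown_pool(pool, wake, stop))
```

Each round returns as soon as the workers have exited, so a full interval is
only spent when some worker is slow to notice the signal.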
If anyone has any thoughts on how to stabilize these so they don't
constantly have ups and downs, that would be really nice...
What do your logs say?
Mine weren't showing anything bad:
25/10/2017 13:59:47 : epoch 59f0d10c : simpson91 : ganesha.nfsd-28060[Admin] do_shutdown :MAIN :EVENT :NFS EXIT: stopping NFS service
...
25/10/2017 13:59:50 : epoch 59f0d10c : simpson91 : ganesha.nfsd-28060[Admin] do_shutdown :THREAD :EVENT :Worker threads successfully shut down.
...
25/10/2017 13:59:50 : epoch 59f0d10c : simpson91 : ganesha.nfsd-28060[Admin] rpc :TIRPC :DEBUG :work_pool_shutdown() "svc_" 7
...
25/10/2017 13:59:51 : epoch 59f0d10c : simpson91 : ganesha.nfsd-28060[Admin] rpc :TIRPC :DEBUG :work_pool_shutdown() "svc_" 1
...
25/10/2017 13:59:52 : epoch 59f0d10c : simpson91 : ganesha.nfsd-28060[Admin] do_shutdown :MAIN :EVENT :Destroying the FSAL system.
...
25/10/2017 13:59:52 : epoch 59f0d10c : simpson91 : ganesha.nfsd-28060[Admin] do_shutdown :MAIN :EVENT :FSAL system destroyed.
25/10/2017 13:59:52 : epoch 59f0d10c : simpson91 : ganesha.nfsd-28060[main] nfs_start :MAIN :EVENT :NFS EXIT: regular exit
25/10/2017 13:59:52 : epoch 59f0d10c : simpson91 : ganesha.nfsd-28060[main] fs_clean_old_recov_dir_impl :CLIENT ID :EVENT :Failed to open old v4 recovery dir (/home/bill/rdma/install/var/lib/nfs/ganesha/v4old/node0), errno=2
3 seconds to unregister and shut down all the fridge threads.
2 seconds for ntirpc worker threads.
5 seconds total?
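
The arithmetic can be checked straight from the timestamps. A minimal
sketch in Python, assuming the day/month/year format shown in the log;
the EVENTS list and the phase_seconds helper are hypothetical names, and
the entries are copied from the log excerpt above.

```python
from datetime import datetime

# (timestamp, message) pairs copied from the log excerpt above.
EVENTS = [
    ("25/10/2017 13:59:47", "NFS EXIT: stopping NFS service"),
    ("25/10/2017 13:59:50", "Worker threads successfully shut down."),
    ("25/10/2017 13:59:52", "Destroying the FSAL system."),
]


def phase_seconds(events):
    """Return the whole seconds elapsed between consecutive log events."""
    times = [datetime.strptime(ts, "%d/%m/%Y %H:%M:%S") for ts, _ in events]
    return [int((b - a).total_seconds()) for a, b in zip(times, times[1:])]


phases = phase_seconds(EVENTS)
print(phases, sum(phases))  # prints: [3, 2] 5
```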
_______________________________________________
Nfs-ganesha-devel mailing list
Nfs-ganesha-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/nfs-ganesha-devel