Single or multiple active MDS? Were there any "Client xxx failing to
respond to capability release" health warnings?
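
For collecting debug information if this happens again, a minimal sketch
follows. It assumes the `ceph` CLI is installed, that the `ceph daemon`
calls are run on the host where the MDS (or the ceph-fuse client) is
running, and that the MDS name and client admin socket path are passed
in by you. The helper names `debug_commands` and `collect` are
hypothetical, not part of Ceph.

```python
import subprocess


def debug_commands(mds_name, client_asok=None):
    """Build the list of diagnostic commands (hypothetical helper).

    mds_name:    name of the active MDS daemon (assumption: supplied by you).
    client_asok: optional path to a ceph-fuse admin socket on this host.
    """
    cmds = [
        # Cluster-wide view: health warnings and MDS ranks/states.
        ["ceph", "health", "detail"],
        ["ceph", "fs", "status"],
        # MDS admin socket: stuck requests and client sessions.
        ["ceph", "daemon", f"mds.{mds_name}", "dump_ops_in_flight"],
        ["ceph", "daemon", f"mds.{mds_name}", "session", "ls"],
    ]
    if client_asok:
        # ceph-fuse admin socket: pending MDS requests and sessions.
        cmds += [
            ["ceph", "daemon", client_asok, "mds_requests"],
            ["ceph", "daemon", client_asok, "mds_sessions"],
        ]
    return cmds


def collect(mds_name, client_asok=None, runner=subprocess.run):
    """Run each command and return a dict of command line -> stdout."""
    results = {}
    for cmd in debug_commands(mds_name, client_asok):
        proc = runner(cmd, capture_output=True, text=True)
        results[" ".join(cmd)] = proc.stdout
    return results
```

Saving the returned output to a file before evicting or rebooting the
client should preserve the state of the stuck requests for later
analysis.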

On Mon, May 28, 2018 at 10:38 PM, Oliver Freyermuth
<[email protected]> wrote:
> Dear Cephalopodians,
>
> we just had a "lockup" of many MDS requests, and also trimming fell behind, 
> for over 2 days.
> One of the clients (all ceph-fuse 12.2.5 on CentOS 7.5) was in status 
> "currently failed to authpin local pins". Metadata pool usage did grow by 10 
> GB in those 2 days.
>
> Rebooting the node to force a client eviction solved the issue, and now 
> metadata usage is down again, and all stuck requests were processed quickly.
>
> Is there any idea on what could cause something like that? On the client, there
> was no CPU load, but many processes waiting for cephfs to respond.
> Syslog did not yield anything. It only affected one user and his user directory.
>
> If there are no ideas: How can I collect good debug information in case this 
> happens again?
>
> Cheers,
>         Oliver
>
>
> _______________________________________________
> ceph-users mailing list
> [email protected]
> http://lists.ceph.com/listinfo.cgi/ceph-users-ceph.com
>