What happens to the metrics when an NFS mount is in hung state? I believe 
that if a NFS is in hung state, the node_exporter will stop scraping the 
metrics until the mount become normal again, because if node_exporter keeps 
trying to scrape metrics from a hung mount, it will eventually lead to 
node_exporter going down itself? I am right here? 
And if I am, then can we safely say that whenever node_exporter stop giving 
metrics for a NFS mount, it is necessarily because the mount is in hung 
state?

On Tuesday, March 3, 2020 at 10:23:10 PM UTC+5:30, Matthias Rampke wrote:
>
> The trouble is that the only sure way to know if MFS hangs is to try and 
> use it. For one, the node exporter is not a health prober per se, bur more 
> importantly, when NFS does hang, this leaves a thread permanently stuck in 
> an uninterruptible syscall. Very soon the exporter would run out of threads 
> and stop working altogether. For the same reason, do not put textfile 
> metrics files on NFS.
>
> I believe the only way to do this safely is a separate check script/loop. 
> Don't start the script with cron, you need to make sure that once it is 
> stuck it stops trying until it is unstuck again, or you will fill up your 
> process table with stuck processes that cannot be killed.
>
> /MR
>
>
> On Tue, 3 Mar 2020, 16:54 Yagyansh S. Kumar, <[email protected] 
> <javascript:>> wrote:
>
>> Hello experts.
>> I want to check if the NFS is hanged(i.e whether it is accessible from 
>> the server or not, and if yes then what is the response time it is 
>> getting). I have already enabled the nfs and nfsd collectors, but haven't 
>> found any that can accurately tell me every time the NFS hangs. Any help 
>> would be appreciated. 
>> Thanks in advance.
>>
>> -- 
>> You received this message because you are subscribed to the Google Groups 
>> "Prometheus Developers" group.
>> To unsubscribe from this group and stop receiving emails from it, send an 
>> email to [email protected] <javascript:>
>> .
>> To view this discussion on the web visit 
>> https://groups.google.com/d/msgid/prometheus-developers/1b166f35-fdb0-44fe-9fda-294fcde864f5%40googlegroups.com
>>  
>> <https://groups.google.com/d/msgid/prometheus-developers/1b166f35-fdb0-44fe-9fda-294fcde864f5%40googlegroups.com?utm_medium=email&utm_source=footer>
>> .
>>
>

-- 
You received this message because you are subscribed to the Google Groups 
"Prometheus Developers" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/prometheus-developers/545809e6-689a-4d6f-ac9e-20e93f5ba7e8%40googlegroups.com.

Reply via email to