On 3/14/20 10:01 PM, Yagyansh S. Kumar wrote:
> Also, since you mentioned hanging network filesystem, is there any
> way/logic to find out whether my NFS mount is hanged on a machine or
> not? I have busted my ass on getting this result, must have tried more
> than 50 things but still have nothing in this matter.
> In our setup we use a lot of NFS and some of the mounts are really
> critical. All these shared NFS mounts are taken from a 3rd party vendor
> and due to network lag or IP mismatch or 10 other reasons, the NFS ends
> up being hanged on a machine or two. I need to know whenever this
> happens. Anything that can be done here?

I think I would aim for using the regular node_filesystem_device_error
metric nowadays, which is basically the Statfs sucess status.

In earlier node_exporter times, a hung nfs mount could easily prevent
node_exporter from working reliably, which is why we still have nfs
excluded via --collector.filesystem.ignored-fs-types. However, since
#997 [1] this should have been improved. Therefore, I plan to give this
a go again.

Other than that, there are nfs client metrics, but I'm not sure if you
can derive a hung / not hung result from that.

I was about to link to another thread some weeks ago, but I just noticed
that it was started by you as well [2]. ;)

I think that Ben's suggestion is basically the same. Julien's approach
regarding separation of collector's into different jobs (in the same
mail thread) also sounded interesting.

Have you done some experiments with node_filesystem_device_error?

Kind regards,
Christian


[1] https://github.com/prometheus/node_exporter/pull/997
[2]
https://groups.google.com/d/msgid/prometheus-users/CABbyFmqMKQXYNOfdr7BeFA%3Dx%3D5fY%2Bk4EQ8oprL0Wh-8SNqmvoA%40mail.gmail.com?utm_medium=email&utm_source=footer

-- 
You received this message because you are subscribed to the Google Groups 
"Prometheus Users" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to [email protected].
To view this discussion on the web visit 
https://groups.google.com/d/msgid/prometheus-users/81404bf0-6cc3-f18e-59aa-4d186f1e03ad%40hoffmann-christian.info.

Reply via email to