After I left the systems alone overnight, the graphs appeared this morning.

-- 
Jeff White
HPC Systems Engineer
Information Technology Services - WSU

On 04/19/2016 04:45 PM, Jesse Becker wrote:
> Do you know if the metrics are actually being collected?
>
> An easy way to test is to use netcat or telnet to connect to the
> compute node with the nVidia card:
>
>    nc node123 8649
>
> That should dump a bunch of XML, and you can search that for the
> metrics generated by the plugin.  If you find them there, check the
> gmetad process similarly (note the different port!):
>
>    nc headnode 8651
>
> Also check the web frontend for the simple per-metric charts (the boring
> grey ones...). It's possible that the metrics are collected and sent, but
> not rendering properly for some reason.
>
>
>
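The two nc checks above can also be scripted. What follows is only a
minimal sketch, not part of Ganglia: it reads the XML dump from a port
and lists the metrics whose names contain a given substring. The helper
names fetch_xml and find_metrics are made up for illustration, and the
"gpu" search string is an assumption about how the plugin names its
metrics; adjust it to whatever the plugin actually reports.

```python
import socket
import xml.etree.ElementTree as ET


def fetch_xml(host, port, timeout=10.0):
    """Read everything the daemon sends until it closes the connection."""
    chunks = []
    with socket.create_connection((host, port), timeout=timeout) as sock:
        while True:
            data = sock.recv(65536)
            if not data:
                break
            chunks.append(data)
    return b"".join(chunks)


def find_metrics(xml_bytes, needle):
    """Return (host, metric name, value) for metric names containing needle."""
    root = ET.fromstring(xml_bytes)
    found = []
    # Walk every HOST element, so the same code handles gmond and gmetad dumps.
    for host in root.iter("HOST"):
        for metric in host.iter("METRIC"):
            name = metric.get("NAME", "")
            if needle in name:
                found.append((host.get("NAME"), name, metric.get("VAL")))
    return found


# Example use, with the host names and ports from the message above
# ("gpu" is an assumed substring, not a confirmed metric prefix):
#
#   for target, port in (("node123", 8649), ("headnode", 8651)):
#       hits = find_metrics(fetch_xml(target, port), "gpu")
#       print(target, port, hits if hits else "no matching metrics")
```

If the metrics show up on port 8649 but not on 8651, the problem is
probably between gmond and gmetad. If they show up on both, the web
frontend is the more likely place to look.
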
> On Tue, Apr 19, 2016 at 6:15 PM, Jeff White <jeff.wh...@wsu.edu> wrote:
>> I'm trying to get this nvidia module working on CentOS 7:
>>
>> https://github.com/ganglia/gmond_python_modules/tree/master/gpu/nvidia
>>
>> I did the following:
>>
>> * Installed CUDA on node but NOT on the Ganglia server
>>
>> * Installed nvidia-ml-py-7.352.0.tar.gz on both node and server
>>
>> * Put nvidia.py into /usr/lib64/ganglia/python_modules/ on both node and
>> server
>>
>> * Put nvidia.pyconf in /etc/ganglia/conf.d/ on the node only (and
>> verified gmond.conf includes that directory)
>>
>> * Put related .php files in /usr/share/ganglia/ on server only
>>
>> * Restarted everything everywhere
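The node-side part of the checklist above can be sanity-checked with a
few lines of Python. This is only a sketch under stated assumptions: the
two paths are the ones named in the list, pynvml is the module that
nvidia-ml-py installs, and the helper names are made up for
illustration. It should be run with the same Python interpreter that
gmond uses for its modules, which may differ from the default one.

```python
import os

# Files the checklist above places on the compute node.
NODE_FILES = [
    "/usr/lib64/ganglia/python_modules/nvidia.py",
    "/etc/ganglia/conf.d/nvidia.pyconf",
]


def missing_paths(paths):
    """Return the paths from the list that do not exist on this machine."""
    return [p for p in paths if not os.path.exists(p)]


def module_available(name):
    """Return True if this interpreter can import the named module."""
    try:
        __import__(name)
        return True
    except ImportError:
        return False


def preflight(paths=NODE_FILES, module="pynvml"):
    """Collect human-readable problems; an empty list means nothing was found."""
    problems = ["missing file: " + p for p in missing_paths(paths)]
    if not module_available(module):
        problems.append("cannot import module: " + module)
    return problems


# Example use on the compute node:
#
#   for line in preflight() or ["no problems found"]:
#       print(line)
```

An empty result only shows that the files and the module are in place.
It says nothing about whether gmond loaded the module or whether the
metrics reach gmetad.
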
>>
>> What did I do wrong?  Logs are not saying anything useful.  The graphs
>> just don't show up.  No error, nothing, just doesn't work.
>>
>> --
>> Jeff White
>> HPC Systems Engineer
>> Information Technology Services - WSU
>>
>>
>> _______________________________________________
>> Ganglia-general mailing list
>> Ganglia-general@lists.sourceforge.net
>> https://lists.sourceforge.net/lists/listinfo/ganglia-general
>
>

