[ 
https://issues.apache.org/jira/browse/MESOS-8248?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16798214#comment-16798214
 ] 

Jorge Machado commented on MESOS-8248:
--------------------------------------

[~klueska] I saw you userid in the comments for the gpu things. I would like to 
take on this task. I thinking of adding these attr to ResourceStatistics 
object: 
 
 
{code:java}
//GPU statistics for each container
optional int32 gpu_idx = 50;
optional string gpu_uuid = 51;
optional string device_name = 52;
optional uint64 gpu_memory_used_mb = 53;
optional uint64 gpu_memory_total_mb = 54;
optional double gpu_usage = 55;
optional int32 gpu_temperature = 56;
optional int32 gpu_frequency_MHz = 57;
optional int32 gpu_power_used_W = 58;{code}
 What do you think ? 

I still did not figure out how I can call  nvmlDeviceGetHandleByIndex, I'm 
assuming somewhere around  NvidiaGpuIsolatorProcess right ? when the method 
NvidiaGpuIsolatorProcess::usage get's called it is outside of the container it 
self right ? 

> Expose information about GPU assigned to a task
> -----------------------------------------------
>
>                 Key: MESOS-8248
>                 URL: https://issues.apache.org/jira/browse/MESOS-8248
>             Project: Mesos
>          Issue Type: Improvement
>          Components: containerization, gpu
>            Reporter: Karthik Anantha Padmanabhan
>            Priority: Major
>              Labels: GPU
>
> As a framework author I'd like information about the gpu that was assigned to 
> a task.
> `nvidia-smi` for example provides the following information GPU UUID, boardId 
> minor number etc. It would useful to expose this information when a task is 
> assigned to a GPU instance.
> This will make it possible to monitor resource usage for a task on GPU which 
> is not possible when



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to