Kevin Klues created MESOS-5555:
----------------------------------

             Summary: Update method of populating device entries for `/dev/nvidia-uvm`, etc.
                 Key: MESOS-5555
                 URL: https://issues.apache.org/jira/browse/MESOS-5555
             Project: Mesos
          Issue Type: Improvement
            Reporter: Kevin Klues
            Assignee: Kevin Klues


Currently, access to `/dev/nvidiactl` and `/dev/nvidia-uvm` is only granted to 
/ revoked from a container as GPUs are added to and removed from it. On some 
level, this makes sense because most jobs don't need access to these devices 
unless they are also using a GPU. However, there are cases when access to these 
files is appropriate even when not making use of a GPU, for example when running 
`nvidia-smi` to control the global state of the underlying NVIDIA driver.
    
We should add `/dev/nvidiactl` and `/dev/nvidia-uvm` to the default whitelist 
of devices to include in every container when the `gpu/nvidia` isolator is 
enabled. This will allow a container to run standard NVIDIA driver tools (such 
as `nvidia-smi`) even when no GPUs have been granted to it. In that case, these 
tools will report that no GPUs are installed instead of failing abnormally.
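
To make the proposed behavior concrete, here is a minimal sketch of the 
whitelist logic in Python. It is not taken from the Mesos source: the names 
`CONTROL_DEVICES` and `visible_devices` are hypothetical, and the per-GPU 
paths assume the usual `/dev/nvidia<N>` naming of the NVIDIA driver's device 
nodes.

```python
# Hypothetical sketch of the proposal; these names are not Mesos APIs.

# Control devices that would be whitelisted for every container
# whenever the `gpu/nvidia` isolator is enabled.
CONTROL_DEVICES = ("/dev/nvidiactl", "/dev/nvidia-uvm")


def visible_devices(granted_gpus, isolator_enabled=True):
    """Return the set of NVIDIA device paths a container may access.

    granted_gpus: iterable of GPU minor numbers granted to the container
                  (assumed to map to /dev/nvidia<N>).
    """
    if not isolator_enabled:
        # Without the isolator, this sketch exposes no NVIDIA devices.
        return set()

    # Under the proposal, the control devices no longer depend on
    # whether any GPU has been granted.
    devices = set(CONTROL_DEVICES)

    # Per-GPU device nodes are still added only for granted GPUs.
    devices.update(f"/dev/nvidia{minor}" for minor in granted_gpus)
    return devices
```

With this logic, a container that has been granted no GPUs still sees both 
control devices, which is what would let `nvidia-smi` start and report that 
no GPUs are present.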



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
