[
https://issues.apache.org/jira/browse/MESOS-5448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Anindya Sinha updated MESOS-5448:
---------------------------------
Description:
When the master sends a CheckpointResourcesMessage to the agent, the agent
attempts to rmdir the persistent volume for a DESTROY operation (if it existed
before, and is no longer in the updated checkpoint in
CheckpointResourcesMessage).
If the slave restarts before the operation finishes, the disk space can be
leaked because a reattempt of a rmdir is not done (since the checkpoint is
already updated).
Subsequently, a CREATE on the same path could result in leaking of the data to
another framework (since the directory was not rm-ed) since the CREATE
operation is successful even if the root directory exists and the contents of
that directory is not empty.
was:
When the master sends a CheckpointResourcesMessage to the agent, the agent
attempts to rmdir the persistent volume (if it existed before, and is no longer
in the updated checkpoint in CheckpointResourcesMessage).
If the slave restarts before the operation finishes, the disk space can be
leaked because a reattempt of a rmdir is not done.
Subsequently, a CREATE on the same path could result in leaking of the data to
another framework (since the directory was not rm-ed).
> Persistent volume deletion on the agent should survive slave restart
> --------------------------------------------------------------------
>
> Key: MESOS-5448
> URL: https://issues.apache.org/jira/browse/MESOS-5448
> Project: Mesos
> Issue Type: Bug
> Components: general
> Reporter: Anindya Sinha
> Assignee: Anindya Sinha
> Labels: external-volumes, persistent-volumes
>
> When the master sends a CheckpointResourcesMessage to the agent, the agent
> attempts to rmdir the persistent volume for a DESTROY operation (if it
> existed before, and is no longer in the updated checkpoint in
> CheckpointResourcesMessage).
> If the slave restarts before the operation finishes, the disk space can be
> leaked because a reattempt of a rmdir is not done (since the checkpoint is
> already updated).
> Subsequently, a CREATE on the same path could result in leaking of the data
> to another framework (since the directory was not rm-ed) since the CREATE
> operation is successful even if the root directory exists and the contents of
> that directory is not empty.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)