[
https://issues.apache.org/jira/browse/SINGA-11?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14593571#comment-14593571
]
Anh Dinh commented on SINGA-11:
-------------------------------
does "recovery" here mean replacing the failed executor (a Singa process) with
another (new) executor? Does the new Singa process start with some checkpointed
states?
> Start SINGA using Mesos
> -----------------------
>
> Key: SINGA-11
> URL: https://issues.apache.org/jira/browse/SINGA-11
> Project: Singa
> Issue Type: New Feature
> Reporter: wangwei
>
> Mesos helps to mange resources in large clusters.
> This ticket is an initial integration of SINGA with Mesos, which aims to
> simply start SINGA through Mesos and run multiple SINGA tasks in the same
> cluster.
> The fully integration should include,
> 1. start SINGA by Mesos, including requesting processes, memory, CPU, etc.
> 2. detect failures and recovery through Mesos
> 3. TBD.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)