Hi folks,
Thanks for the feedback!
On Mon, Oct 17, 2016 at 12:44 PM, Zhitao Li wrote:
> +1 to both A to B.
>
> Do we plan to eventually drop non-checkpionted framework support (possibly
> in v2) and declare that all frameworks has to operate in this assumption?
I think
+1 to A and B
Aaron Carey
Production Engineer - Cloud Pipeline
Industrial Light & Magic
London
020 3751 9150
On 17 October 2016 at 00:38, Qian Zhang wrote:
> and requires operators to enable checkpointing on the slaves.
>
>
> Just curious why operator needs to enable
Qian,
Turns out the --checkpoint flag was made default and removed in Mesos 0.22.
On Sun, Oct 16, 2016 at 4:38 PM, Qian Zhang wrote:
> and requires operators to enable checkpointing on the slaves.
>
>
> Just curious why operator needs to enable checkpointing on the slaves
+1 to both A to B.
Do we plan to eventually drop non-checkpionted framework support (possibly
in v2) and declare that all frameworks has to operate in this assumption?
On Mon, Oct 17, 2016 at 1:36 AM, Aaron Carey wrote:
> +1 to A and B
>
> Aaron Carey
> Production Engineer -
>
> and requires operators to enable checkpointing on the slaves.
Just curious why operator needs to enable checkpointing on the slaves (I do
not see an agent flag for that), I think checkpointing should be enabled in
framework level rather than slave.
Thanks,
Qian Zhang
On Sun, Oct 16, 2016
+1 to A and B
Aurora has enabled checkpointing for years and requires operators to enable
checkpointing on the slaves.
On Sat, Oct 15, 2016 at 11:57 AM, Joris Van Remoortere
wrote:
> I'm in favor of A & B. I find it provides a better "first experience" to
> users.
> From
I'm in favor of A & B. I find it provides a better "first experience" to
users.
>From my experience you usually have to have an explicit reason to not want
to checkpoint. Most people assume the semantics provided by the checkpoint
behavior is default and it can be a frustrating experience for them
Hi folks,
I'd like input from individuals who currently use frameworks but do
not enable checkpointing.
Background: "checkpointing" is a parameter that can be enabled in
FrameworkInfo; if enabled, the agent will write the framework pid,
executor PIDs, and status updates to disk for any tasks
to be a
regression:
1. Slave re-detects leading Master.
2. Slave re-authenticates with Master.
3. Master sees slave as already activated, calls disconnect().
4. For non-checkpointing frameworks, this call to disconnect() assumes the
slave has exited, and will send TASK_LOST.
5. In the case where
/
Slave authentication retries can trigger TASK_LOST for non-checkpointing
frameworks.
Key: MESOS-1264
URL: https://issues.apache.org/jira/browse/MESOS-1264
Project
: https://reviews.apache.org/r/21017
Slave authentication retries can trigger TASK_LOST for non-checkpointing
frameworks.
Key: MESOS-1264
URL: https://issues.apache.org/jira
) disable the slave in the allocator so it makes no more offers, 2)
remove (non-checkpointing) frameworks, which sends the TASK_LOST message you
were seeing, and 3) remove all offers on that slave and have the allocator
recover those resources.
1) I definitely want the slave disabled
of you take a look here?
Slave authentication retries can trigger TASK_LOST for non-checkpointing
frameworks.
Key: MESOS-1264
URL: https://issues.apache.org/jira/browse/MESOS
Benjamin Mahler created MESOS-1264:
--
Summary: Slave authentication retries can trigger TASK_LOST for
non-checkpointing frameworks.
Key: MESOS-1264
URL: https://issues.apache.org/jira/browse/MESOS-1264
[
https://issues.apache.org/jira/browse/MESOS-1264?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Adam B reassigned MESOS-1264:
-
Assignee: Adam B
Slave authentication retries can trigger TASK_LOST for non-checkpointing
frameworks
15 matches
Mail list logo