[jira] [Commented] (MESOS-7921) process::EventQueue sometimes crashes

2017-09-01 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16151096#comment-16151096 ] Yan Xu commented on MESOS-7921: --- New failure on ASF CI: https://lists.apache.org/thread.htm

[jira] [Commented] (MESOS-6918) Prometheus exporter endpoints for metrics

2017-09-01 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16150848#comment-16150848 ] Yan Xu commented on MESOS-6918: --- I think [~bmahler]'s questions (and mine below) suggest we

[jira] [Commented] (MESOS-7921) process::EventQueue sometimes crashes

2017-08-31 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16149737#comment-16149737 ] Yan Xu commented on MESOS-7921: --- [~benjaminhindman] In the newly attached FetcherCacheTest

[jira] [Updated] (MESOS-7921) process::EventQueue sometimes crashes

2017-08-31 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7921: -- Attachment: FetcherCacheTest.CachedCustomOutputFileWithSubdirectory.log.txt > process::EventQueue sometimes cras

[jira] [Updated] (MESOS-7921) process::EventQueue sometimes crashes

2017-08-28 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7921: -- Description: The following segfault is found on [ASF|https://builds.apache.org/job/Mesos-Buildbot/BUILDTOOL=aut

[jira] [Updated] (MESOS-7921) process::EventQueue sometimes crashes

2017-08-28 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7921: -- Attachment: MesosContainerizerSlaveRecoveryTest.ResourceStatisticsFullLog.txt Attached the full log on ASF CI f

[jira] [Commented] (MESOS-7921) process::EventQueue sometimes crashes

2017-08-28 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7921?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16144134#comment-16144134 ] Yan Xu commented on MESOS-7921: --- [~benjaminhindman] [~bmahler] > process::EventQueue someti

[jira] [Created] (MESOS-7921) process::EventQueue sometimes crashes

2017-08-28 Thread Yan Xu (JIRA)
Yan Xu created MESOS-7921: - Summary: process::EventQueue sometimes crashes Key: MESOS-7921 URL: https://issues.apache.org/jira/browse/MESOS-7921 Project: Mesos Issue Type: Bug Components: l

[jira] [Updated] (MESOS-7895) ZK session timeout is unconfigurable in agent and scheduler drivers

2017-08-25 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7895: -- Shepherd: Yan Xu > ZK session timeout is unconfigurable in agent and scheduler drivers > ---

[jira] [Created] (MESOS-7915) cgroups::internal::Destroyer should not destroy nested cgroups in parallel.

2017-08-24 Thread Yan Xu (JIRA)
Yan Xu created MESOS-7915: - Summary: cgroups::internal::Destroyer should not destroy nested cgroups in parallel. Key: MESOS-7915 URL: https://issues.apache.org/jira/browse/MESOS-7915 Project: Mesos

[jira] [Commented] (MESOS-7888) Track fetcher task success and failures

2017-08-22 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7888?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16137547#comment-16137547 ] Yan Xu commented on MESOS-7888: --- Thanks, since it's not officially 1.4.0 yet I'll just leave

[jira] [Updated] (MESOS-6918) Prometheus exporter endpoints for metrics

2017-08-22 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6918?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-6918: -- Shepherd: Yan Xu > Prometheus exporter endpoints for metrics > - > >

[jira] [Commented] (MESOS-7907) `async` fails to accept mutable lambdas

2017-08-22 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7907?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16137121#comment-16137121 ] Yan Xu commented on MESOS-7907: --- [~mcypark] [~jpe...@apache.org] according to your slack cha

[jira] [Commented] (MESOS-6918) Prometheus exporter endpoints for metrics

2017-08-21 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6918?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16135996#comment-16135996 ] Yan Xu commented on MESOS-6918: --- [~klueska] [~bmahler] if you guys don't have cycles I can s

[jira] [Commented] (MESOS-1719) Master should persist active frameworks information

2017-08-10 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-1719?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16122328#comment-16122328 ] Yan Xu commented on MESOS-1719: --- [~adam-mesos] does this being labelled {{mesosphere}} mean

[jira] [Commented] (MESOS-7714) Fix agent downgrade for reservation refinement

2017-08-09 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16120473#comment-16120473 ] Yan Xu commented on MESOS-7714: --- [~mcypark] did you get a chance to work on this? > Fix age

[jira] [Commented] (MESOS-7215) Race condition on re-registration of non-partition-aware frameworks

2017-08-04 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16114745#comment-16114745 ] Yan Xu commented on MESOS-7215: --- Communicated over slack but yeah it's being worked on and a

[jira] [Commented] (MESOS-7714) Fix agent downgrade for reservation refinement

2017-07-31 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16107680#comment-16107680 ] Yan Xu commented on MESOS-7714: --- Yes we are. Thanks! Hope we can prioritize this one (possib

[jira] [Assigned] (MESOS-6489) Better support for containers that want to manage their own cgroup.

2017-07-31 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6489?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu reassigned MESOS-6489: - Assignee: Yan Xu (was: Anindya Sinha) > Better support for containers that want to manage their own cgro

[jira] [Updated] (MESOS-7215) Race condition on re-registration of non-partition-aware frameworks

2017-07-31 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7215: -- Shepherd: Yan Xu (was: Neil Conway) > Race condition on re-registration of non-partition-aware frameworks > ---

[jira] [Commented] (MESOS-7714) Fix agent downgrade for reservation refinement

2017-07-28 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16105960#comment-16105960 ] Yan Xu commented on MESOS-7714: --- I mean when we are not using new features, so this appears

[jira] [Commented] (MESOS-7714) Fix agent downgrade for reservation refinement

2017-07-28 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16105874#comment-16105874 ] Yan Xu commented on MESOS-7714: --- I see, but 1) is a real operational concern for upgrading t

[jira] [Commented] (MESOS-5116) Investigate supporting accounting only mode in XFS isolator

2017-07-27 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-5116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16104458#comment-16104458 ] Yan Xu commented on MESOS-5116: --- [~jpe...@apache.org] I think this is worth calling out in t

[jira] [Assigned] (MESOS-7831) Resource refinement is not applied to tasks in completed_frameworks.

2017-07-27 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu reassigned MESOS-7831: - Assignee: Yan Xu > Resource refinement is not applied to tasks in completed_frameworks. > ---

[jira] [Commented] (MESOS-7831) Resource refinement is not applied to tasks in completed_frameworks.

2017-07-27 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16103975#comment-16103975 ] Yan Xu commented on MESOS-7831: --- Similar to MESOS-7716. > Resource refinement is not applie

[jira] [Commented] (MESOS-7714) Fix agent downgrade for reservation refinement

2017-07-27 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16103626#comment-16103626 ] Yan Xu commented on MESOS-7714: --- Great. I guess I am just not clear on the mechanism to achi

[jira] [Comment Edited] (MESOS-7714) Fix agent downgrade for reservation refinement

2017-07-27 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16103505#comment-16103505 ] Yan Xu edited comment on MESOS-7714 at 7/27/17 5:06 PM: [~mcypark]

[jira] [Commented] (MESOS-7714) Fix agent downgrade for reservation refinement

2017-07-27 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7714?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16103505#comment-16103505 ] Yan Xu commented on MESOS-7714: --- [~mcypark] Just to be sure. This ticket is not for supporti

[jira] [Commented] (MESOS-5368) Consider introducing persistent agent ID

2017-07-26 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-5368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16102755#comment-16102755 ] Yan Xu commented on MESOS-5368: --- This ticket is not a duplicate of MESOS-6223. With MESOS-62

[jira] [Comment Edited] (MESOS-5368) Consider introducing persistent agent ID

2017-07-26 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-5368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16102755#comment-16102755 ] Yan Xu edited comment on MESOS-5368 at 7/27/17 5:57 AM: This ticke

[jira] [Comment Edited] (MESOS-5368) Consider introducing persistent agent ID

2017-07-26 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-5368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16102755#comment-16102755 ] Yan Xu edited comment on MESOS-5368 at 7/27/17 5:57 AM: This ticke

[jira] [Updated] (MESOS-7716) Mesos 1.2.0 agent crashes Mesos 1.4.0 master

2017-07-26 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7716: -- Summary: Mesos 1.2.0 agent crashes Mesos 1.4.0 master (was: Mesos 1.2.0 crashes Mesos 1.4.0 master) > Mesos 1.

[jira] [Commented] (MESOS-7832) Mesos master during failover may not re-add completed tasks from agents belonging to frameworks that have yet to reregister

2017-07-25 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7832?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16100772#comment-16100772 ] Yan Xu commented on MESOS-7832: --- /cc [~neilc] [~vinodkone] > Mesos master during failover m

[jira] [Created] (MESOS-7832) Mesos master during failover may not re-add completed tasks from agents belonging to frameworks that have yet to reregister

2017-07-25 Thread Yan Xu (JIRA)
Yan Xu created MESOS-7832: - Summary: Mesos master during failover may not re-add completed tasks from agents belonging to frameworks that have yet to reregister Key: MESOS-7832 URL: https://issues.apache.org/jira/browse/M

[jira] [Updated] (MESOS-7831) Resource refinement is not applied to tasks in completed_frameworks.

2017-07-25 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7831?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7831: -- Description: When an agent reregisters, the master [doesn't apply refinement to completed_frameworks|https://gi

[jira] [Created] (MESOS-7831) Resource refinement is not applied to tasks in completed_frameworks.

2017-07-25 Thread Yan Xu (JIRA)
Yan Xu created MESOS-7831: - Summary: Resource refinement is not applied to tasks in completed_frameworks. Key: MESOS-7831 URL: https://issues.apache.org/jira/browse/MESOS-7831 Project: Mesos Issue T

[jira] [Commented] (MESOS-7831) Resource refinement is not applied to tasks in completed_frameworks.

2017-07-25 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7831?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16100711#comment-16100711 ] Yan Xu commented on MESOS-7831: --- /cc [~mcypark] > Resource refinement is not applied to tas

[jira] [Commented] (MESOS-6406) Send latest status for partition-aware tasks when agent reregisters

2017-07-18 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6406?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16092266#comment-16092266 ] Yan Xu commented on MESOS-6406: --- The master should probably send updates about non-partition

[jira] [Updated] (MESOS-7753) `log.LearnedMessage` could be rejected due to being sent from '@0.0.0.0:0'

2017-07-18 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7753: -- Affects Version/s: 1.4.0 > `log.LearnedMessage` could be rejected due to being sent from '@0.0.0.0:0' >

[jira] [Commented] (MESOS-6223) Allow agents to re-register post a host reboot

2017-07-17 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16090881#comment-16090881 ] Yan Xu commented on MESOS-6223: --- https://reviews.apache.org/r/60925/ > Allow agents to re-r

[jira] [Assigned] (MESOS-7786) Create a special "no sender" PID.

2017-07-17 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu reassigned MESOS-7786: - Assignee: (was: Yan Xu) > Create a special "no sender" PID. > - > >

[jira] [Updated] (MESOS-6223) Allow agents to re-register post a host reboot

2017-07-17 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6223?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-6223: -- Shepherd: Yan Xu (was: Vinod Kone) > Allow agents to re-register post a host reboot > -

[jira] [Updated] (MESOS-6549) Asynchronous dir removal in agent GC

2017-07-17 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6549?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-6549: -- Shepherd: Yan Xu Target Version/s: 1.4.0 > Asynchronous dir removal in agent GC > --

[jira] [Commented] (MESOS-7753) `log.LearnedMessage` could be rejected due to being sent from '@0.0.0.0:0'

2017-07-17 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16090238#comment-16090238 ] Yan Xu commented on MESOS-7753: --- Eventually decided to address this ticket separately from M

[jira] [Assigned] (MESOS-7753) `log.LearnedMessage` could be rejected due to being sent from '@0.0.0.0:0'

2017-07-17 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu reassigned MESOS-7753: - Assignee: Yan Xu > `log.LearnedMessage` could be rejected due to being sent from '@0.0.0.0:0' > -

[jira] [Updated] (MESOS-7711) Master updates registry for reregistering agents even when they haven't been unreachable

2017-07-17 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7711: -- Shepherd: James Peach > Master updates registry for reregistering agents even when they haven't been > unreacha

[jira] [Updated] (MESOS-7786) Create a special "no sender" PID.

2017-07-14 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7786: -- Description: In libprocess we have this "fire and forget" messaging semantics with [process::post|https://githu

[jira] [Updated] (MESOS-7786) Create a special "no sender" PID.

2017-07-13 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7786: -- Summary: Create a special "no sender" PID. (was: Create a convention for a special "no sender" PID.) > Create

[jira] [Updated] (MESOS-7711) Master updates registry for reregistering agents even when they haven't been unreachable

2017-07-13 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7711: -- Description: During a master failover we observed many registry updates, on average _one per two agents_, as in

[jira] [Assigned] (MESOS-7711) Master updates registry for reregistering agents even when they haven't been unreachable

2017-07-13 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7711?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu reassigned MESOS-7711: - Assignee: Yan Xu > Master updates registry for reregistering agents even when they haven't been > unreac

[jira] [Commented] (MESOS-6223) Allow agents to re-register post a host reboot

2017-07-12 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16085192#comment-16085192 ] Yan Xu commented on MESOS-6223: --- {noformat:title=} commit 188109b63ea9cc0cdfe1fd616c744cb10d

[jira] [Assigned] (MESOS-7786) Create a convention for a special "no sender" PID.

2017-07-12 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu reassigned MESOS-7786: - Assignee: Yan Xu > Create a convention for a special "no sender" PID. > -

[jira] [Created] (MESOS-7786) Create a convention for special

2017-07-12 Thread Yan Xu (JIRA)
Yan Xu created MESOS-7786: - Summary: Create a convention for special Key: MESOS-7786 URL: https://issues.apache.org/jira/browse/MESOS-7786 Project: Mesos Issue Type: Bug Reporter: Yan Xu

[jira] [Updated] (MESOS-7786) Create a convention for a special "no sender" PID.

2017-07-12 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7786: -- Summary: Create a convention for a special "no sender" PID. (was: Create a convention for special ) > Create a

[jira] [Updated] (MESOS-7786) Create a convention for a special "no sender" PID.

2017-07-12 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7786?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7786: -- Component/s: libprocess > Create a convention for a special "no sender" PID. > -

[jira] [Updated] (MESOS-7769) libprocess initializes to bind to random port if --ip is not specified

2017-07-07 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7769: -- Description: When running current [HEAD|https://github.com/apache/mesos/commit/c90bea80486c089e933bef64aca341e4

[jira] [Updated] (MESOS-7769) libprocess initializes to bind to random port if --ip is not specified

2017-07-07 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7769: -- Summary: libprocess initializes to bind to random port if --ip is not specified (was: libprocess initializes to

[jira] [Created] (MESOS-7769) libprocess initializes to use random port if --ip is not specified

2017-07-07 Thread Yan Xu (JIRA)
Yan Xu created MESOS-7769: - Summary: libprocess initializes to use random port if --ip is not specified Key: MESOS-7769 URL: https://issues.apache.org/jira/browse/MESOS-7769 Project: Mesos Issue Typ

[jira] [Commented] (MESOS-7753) `log.LearnedMessage` could be rejected due to being sent from '@0.0.0.0:0'

2017-07-06 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16077282#comment-16077282 ] Yan Xu commented on MESOS-7753: --- The libprocess change is still separate though as I feel th

[jira] [Commented] (MESOS-7753) `log.LearnedMessage` could be rejected due to being sent from '@0.0.0.0:0'

2017-07-06 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16077276#comment-16077276 ] Yan Xu commented on MESOS-7753: --- This version of [post|https://github.com/apache/mesos/blob

[jira] [Comment Edited] (MESOS-7753) `log.LearnedMessage` could be rejected due to being sent from '@0.0.0.0:0'

2017-07-06 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16077276#comment-16077276 ] Yan Xu edited comment on MESOS-7753 at 7/6/17 10:25 PM: This versi

[jira] [Commented] (MESOS-7753) `log.LearnedMessage` could be rejected due to being sent from '@0.0.0.0:0'

2017-07-03 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16072972#comment-16072972 ] Yan Xu commented on MESOS-7753: --- A few things in consideration: - We cannot simply whitelis

[jira] [Updated] (MESOS-7753) `log.LearnedMessage` could be rejected due to being sent from '@0.0.0.0:0'

2017-07-03 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7753: -- Summary: `log.LearnedMessage` could be rejected due to being sent from '@0.0.0.0:0' (was: `log.LearnedMessage`

[jira] [Created] (MESOS-7753) `log.LearnedMessage` could be rejected due to being sent from '@:0.0.0.0:0'

2017-07-03 Thread Yan Xu (JIRA)
Yan Xu created MESOS-7753: - Summary: `log.LearnedMessage` could be rejected due to being sent from '@:0.0.0.0:0' Key: MESOS-7753 URL: https://issues.apache.org/jira/browse/MESOS-7753 Project: Mesos

[jira] [Updated] (MESOS-621) `HierarchicalAllocatorProcess::removeSlave` doesn't properly handle framework allocations/resources

2017-07-03 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-621: - Summary: `HierarchicalAllocatorProcess::removeSlave` doesn't properly handle framework allocations/resources (was:

[jira] [Assigned] (MESOS-621) HierarchicalAllocator::slaveRemoved doesn't properly handle framework allocations/resources

2017-07-03 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu reassigned MESOS-621: Assignee: Yan Xu > HierarchicalAllocator::slaveRemoved doesn't properly handle framework > allocations/reso

[jira] [Commented] (MESOS-621) HierarchicalAllocator::slaveRemoved doesn't properly handle framework allocations/resources

2017-07-03 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-621?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16072718#comment-16072718 ] Yan Xu commented on MESOS-621: -- I think this is still valuable and I am fixing this as part of

[jira] [Commented] (MESOS-7688) Improve master failover performance by reducing unnecessary agent retries.

2017-06-29 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16068755#comment-16068755 ] Yan Xu commented on MESOS-7688: --- [~dzhuk] sorry I missed this reply. yes it's ~1.3.0 (hence

[jira] [Commented] (MESOS-5396) After failover, master does not remove agents with same UPID.

2017-06-23 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-5396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16061604#comment-16061604 ] Yan Xu commented on MESOS-5396: --- As noted in MESOS-6223, even when MESOS-6223 is merged, the

[jira] [Comment Edited] (MESOS-6223) Allow agents to re-register post a host reboot

2017-06-23 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-6223?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15870821#comment-15870821 ] Yan Xu edited comment on MESOS-6223 at 6/23/17 10:36 PM: - >From my

[jira] [Commented] (MESOS-7688) Improve master failover performance by reducing unnecessary agent retries.

2017-06-22 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16059872#comment-16059872 ] Yan Xu commented on MESOS-7688: --- Some log summaries from us: For a 2min failover (time spe

[jira] [Commented] (MESOS-7688) Improve master failover performance by reducing unnecessary agent retries.

2017-06-22 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16059714#comment-16059714 ] Yan Xu commented on MESOS-7688: --- [~dzhuk] https://reviews.apache.org/r/60003/ worth a JIRA o

[jira] [Created] (MESOS-7711) Master updates registry for reregistering agents even when they haven't been unreachable

2017-06-22 Thread Yan Xu (JIRA)
Yan Xu created MESOS-7711: - Summary: Master updates registry for reregistering agents even when they haven't been unreachable Key: MESOS-7711 URL: https://issues.apache.org/jira/browse/MESOS-7711 Project: Mes

[jira] [Created] (MESOS-7710) Mesos agent registration retry backoff window always has a zero lower-bound

2017-06-22 Thread Yan Xu (JIRA)
Yan Xu created MESOS-7710: - Summary: Mesos agent registration retry backoff window always has a zero lower-bound Key: MESOS-7710 URL: https://issues.apache.org/jira/browse/MESOS-7710 Project: Mesos

[jira] [Comment Edited] (MESOS-7302) Support launching standalone containers.

2017-06-21 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16057962#comment-16057962 ] Yan Xu edited comment on MESOS-7302 at 6/21/17 6:19 PM: Thanks. So

[jira] [Commented] (MESOS-7302) Support launching standalone containers.

2017-06-21 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16057962#comment-16057962 ] Yan Xu commented on MESOS-7302: --- Thanks. So there's going to be an agent API with a {{Resour

[jira] [Commented] (MESOS-7302) Support launching standalone containers.

2017-06-21 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7302?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16057886#comment-16057886 ] Yan Xu commented on MESOS-7302: --- [~jieyu] [~kaysoky] Could you clarify on how resources cons

[jira] [Commented] (MESOS-7694) Discarding process::loop doesn't stop the loop

2017-06-20 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7694?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16056018#comment-16056018 ] Yan Xu commented on MESOS-7694: --- /cc [~benjaminhindman] [~bmahler] > Discarding process::lo

[jira] [Comment Edited] (MESOS-7639) Oversubscription could crash the master due to CHECK failure in the allocator

2017-06-19 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16054700#comment-16054700 ] Yan Xu edited comment on MESOS-7639 at 6/19/17 8:42 PM: I think yo

[jira] [Commented] (MESOS-7639) Oversubscription could crash the master due to CHECK failure in the allocator

2017-06-19 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7639?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16054700#comment-16054700 ] Yan Xu commented on MESOS-7639: --- I think you are right that you stopped seeing the crash bec

[jira] [Created] (MESOS-7694) Discarding process::loop doesn't stop the loop

2017-06-19 Thread Yan Xu (JIRA)
Yan Xu created MESOS-7694: - Summary: Discarding process::loop doesn't stop the loop Key: MESOS-7694 URL: https://issues.apache.org/jira/browse/MESOS-7694 Project: Mesos Issue Type: Bug Comp

[jira] [Updated] (MESOS-7650) Timer::cancel doesn't completely prevent spurious agent reregister loops

2017-06-12 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7650: -- Description: See https://reviews.apache.org/r/54909/ for the previous attempt to address this issue but Timer ca

[jira] [Commented] (MESOS-7651) Consider a more explicit way to bind reservations / volumes to a framework.

2017-06-09 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7651?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16044848#comment-16044848 ] Yan Xu commented on MESOS-7651: --- +1. Related to this is the headaches around the lifecycle o

[jira] [Updated] (MESOS-7650) Timer::cancel doesn't completely prevent spurious agent reregister loops

2017-06-09 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7650?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7650: -- Affects Version/s: 1.3.0 1.2.0 > Timer::cancel doesn't completely prevent spurious agent

[jira] [Created] (MESOS-7650) Timer::cancel doesn't completely prevent spurious agent reregister loops

2017-06-09 Thread Yan Xu (JIRA)
Yan Xu created MESOS-7650: - Summary: Timer::cancel doesn't completely prevent spurious agent reregister loops Key: MESOS-7650 URL: https://issues.apache.org/jira/browse/MESOS-7650 Project: Mesos Iss

[jira] [Created] (MESOS-7646) Create a Backoff abstraction

2017-06-08 Thread Yan Xu (JIRA)
Yan Xu created MESOS-7646: - Summary: Create a Backoff abstraction Key: MESOS-7646 URL: https://issues.apache.org/jira/browse/MESOS-7646 Project: Mesos Issue Type: Bug Components: stout

[jira] [Commented] (MESOS-7641) process::delay'd method may not be canceled by canceling the returned timer

2017-06-07 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16041910#comment-16041910 ] Yan Xu commented on MESOS-7641: --- [~bmahler] We can probably fix it by [dispatching|https://

[jira] [Commented] (MESOS-7641) process::delay'd method may not be canceled by canceling the returned timer

2017-06-07 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16041867#comment-16041867 ] Yan Xu commented on MESOS-7641: --- /cc [~bmahler] > process::delay'd method may not be cancel

[jira] [Created] (MESOS-7641) process::delay'd method may not be canceled by canceling the returned timer

2017-06-07 Thread Yan Xu (JIRA)
Yan Xu created MESOS-7641: - Summary: process::delay'd method may not be canceled by canceling the returned timer Key: MESOS-7641 URL: https://issues.apache.org/jira/browse/MESOS-7641 Project: Mesos

[jira] [Commented] (MESOS-7566) Master crash due to failed check in DRFSorter::remove

2017-06-07 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16041719#comment-16041719 ] Yan Xu commented on MESOS-7566: --- Filed MESOS-7639. > Master crash due to failed check in DR

[jira] [Created] (MESOS-7639) Oversubscription could crash the master due to CHECK failure in the allocator

2017-06-07 Thread Yan Xu (JIRA)
Yan Xu created MESOS-7639: - Summary: Oversubscription could crash the master due to CHECK failure in the allocator Key: MESOS-7639 URL: https://issues.apache.org/jira/browse/MESOS-7639 Project: Mesos

[jira] [Commented] (MESOS-7566) Master crash due to failed check in DRFSorter::remove

2017-06-06 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16039892#comment-16039892 ] Yan Xu commented on MESOS-7566: --- Will do. > Master crash due to failed check in DRFSorter::

[jira] [Commented] (MESOS-7566) Master crash due to failed check in DRFSorter::remove

2017-05-31 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7566?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16032442#comment-16032442 ] Yan Xu commented on MESOS-7566: --- Certain scenarios do seem problematic to me, e.g., - The a

[jira] [Updated] (MESOS-7507) Add a metric for the network size of replicas for the registry.

2017-05-16 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7507: -- Description: Maintaining quorum is an important aspect of the high availability of Mesos master but right now th

[jira] [Commented] (MESOS-7115) Agent should prefer LOG(FATAL) over EXIT().

2017-05-16 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16013315#comment-16013315 ] Yan Xu commented on MESOS-7115: --- {noformat:title=} commit 342aab64c60d8118468d17974ee0b863d3

[jira] [Commented] (MESOS-7507) Add a metric for the network size of replicas for the registry.

2017-05-15 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16011264#comment-16011264 ] Yan Xu commented on MESOS-7507: --- [~bmahler] [~jieyu] can you shepherd? > Add a metric for t

[jira] [Updated] (MESOS-7507) Add a metric for the network size of replicas for the registry.

2017-05-15 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu updated MESOS-7507: -- Description: Maintaining quorum is an important aspect of the high availability of Mesos master but right now th

[jira] [Assigned] (MESOS-7507) Add a metric for the network size of replicas for the registry.

2017-05-15 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yan Xu reassigned MESOS-7507: - Assignee: Yan Xu > Add a metric for the network size of replicas for the registry. >

[jira] [Created] (MESOS-7507) Add a metric for the network size of replicas for the registry.

2017-05-15 Thread Yan Xu (JIRA)
Yan Xu created MESOS-7507: - Summary: Add a metric for the network size of replicas for the registry. Key: MESOS-7507 URL: https://issues.apache.org/jira/browse/MESOS-7507 Project: Mesos Issue Type:

[jira] [Commented] (MESOS-7378) Build failure with missing gnu_dev_major and gnu_dev_minor symbols

2017-05-09 Thread Yan Xu (JIRA)
[ https://issues.apache.org/jira/browse/MESOS-7378?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16003783#comment-16003783 ] Yan Xu commented on MESOS-7378: --- Some potential workarounds. provides the following definit

<    1   2   3   4   5   6   7   8   9   >