[
https://issues.apache.org/jira/browse/MESOS-2531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Michał Kiędyś updated MESOS-2531:
---------------------------------
Description:
I have build Mesos from scratch using code available on GitHub, revision
#a12242b.
My Mesos cluster runs on MacOS and consists of one master and three slaves -
all running on the same computer but on different ports. ZooKeeper runs also on
the same computer.
Later on I compiled Marathon also using latest version from GitHub, revision
#6decf76. Marathon uses same ZooKeeper instance and successfully connects to
Mesos cluster.
After deploying simple application that runs {{sleep}} command for 120 seconds
and scaling that application to ten my Marathon crushed killed by JVM after
SIGSEGV in libmesos-0.23.0.dylib.
h4. Log
{noformat}
[2015-03-23 15:47:17,872] INFO Computed new deployment plan:
DeploymentPlan(2015-03-23T14:47:17.823Z, (Step(List(Scale(App(/bar, Some(sleep
120))), 10))))) (mesosphere.marathon.upgrade.DeploymentPlan$:263)
[2015-03-23 15:47:17,876] INFO Deployment acknowledged. Waiting to get
processed: DeploymentPlan(2015-03-23T14:47:17.823Z, (Step(List(Scale(App(/bar,
Some(sleep 120))), 10))))) (mesosphere.marathon.state.GroupManager:142)
[2015-03-23 15:47:17,877] INFO 127.0.0.1 - - [23/mar/2015:14:47:17 +0000]
"PUT /v2/apps//bar HTTP/1.1" 200 92 "http://127.0.0.1:8080/" "Mozilla/5.0
(Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML, like Gecko)
Chrome/41.0.2272.89 Safari/537.36" (mesosphere.chaos.http.ChaosRequestLog:15)
[2015-03-23 15:47:17,918] INFO 127.0.0.1 - - [23/mar/2015:14:47:17 +0000]
"GET /v2/apps//bar/versions HTTP/1.1" 200 68 "http://127.0.0.1:8080/"
"Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML,
like Gecko) Chrome/41.0.2272.89 Safari/537.36"
(mesosphere.chaos.http.ChaosRequestLog:15)
[2015-03-23 15:47:20,722] INFO 127.0.0.1 - - [23/mar/2015:14:47:20 +0000]
"GET /v2/apps HTTP/1.1" 200 592 "http://127.0.0.1:8080/" "Mozilla/5.0
(Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML, like Gecko)
Chrome/41.0.2272.89 Safari/537.36" (mesosphere.chaos.http.ChaosRequestLog:15)
[2015-03-23 15:47:20,782] INFO Received status update for task
bar.82501637-d16b-11e4-b7fa-aa4dda3d2dbb: TASK_RUNNING ()
(mesosphere.marathon.MarathonScheduler:149)
[2015-03-23 15:47:20,790] INFO 127.0.0.1 - - [23/mar/2015:14:47:20 +0000]
"GET /v2/deployments HTTP/1.1" 200 256 "http://127.0.0.1:8080/" "Mozilla/5.0
(Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML, like Gecko)
Chrome/41.0.2272.89 Safari/537.36" (mesosphere.chaos.http.ChaosRequestLog:15)
#
# A fatal error has been detected by the Java Runtime Environment:
#
# SIGSEGV (0xb) at pc=0x000000012ec946f7, pid=98294, tid=27651
#
# JRE version: Java(TM) SE Runtime Environment (8.0-b132) (build 1.8.0-b132)
# Java VM: Java HotSpot(TM) 64-Bit Server VM (25.0-b70 mixed mode bsd-amd64
compressed oops)
# Problematic frame:
# C [libmesos-0.23.0.dylib+0x7836f7]
process::Future<mesos::internal::state::Variable>::isFailed() const+0x17
#
# Failed to write core dump. Core dumps have been disabled. To enable core
dumping, try "ulimit -c unlimited" before starting Java again
#
# An error report file with more information is saved as:
# /Users/mkiedys/Downloads/MESOS/marathon/hs_err_pid98294.log
#
# If you would like to submit a bug report, please visit:
# http://bugreport.sun.com/bugreport/crash.jsp
# The crash happened outside the Java Virtual Machine in native code.
# See problematic frame for where to report the bug.
#
Abort trap: 6
{noformat}
h4. Java
java version "1.8.0"
Java(TM) SE Runtime Environment (build 1.8.0-b132)
Java HotSpot(TM) 64-Bit Server VM (build 25.0-b70, mixed mode)
h4. System Software Overview
- System Version: OS X 10.10.2 (14C109)
- Kernel Version: Darwin 14.1.0
- Secure Virtual Memory: Enabled
- Time since boot: 13 days 11:02
was:
I have build Mesos from scratch using code available on GitHub, revision
#a12242b.
My Mesos cluster runs on MacOS Yosemite and consists of one master and three
slaves - all running on the same computer but on different ports. ZooKeeper
runs also on the same computer.
Later on I compiled Marathon also using latest version from GitHub, revision
#6decf76. Marathon uses same ZooKeeper instance and successfully connects to
Mesos cluster.
After deploying simple application that runs sleep command for 120 seconds and
scaling that application to ten my Marathon died killed by JVM after SIGSEGV in
libmesos-0.23.0.dylib.
{noformat}
[2015-03-23 15:47:17,872] INFO Computed new deployment plan:
DeploymentPlan(2015-03-23T14:47:17.823Z, (Step(List(Scale(App(/bar, Some(sleep
120))), 10))))) (mesosphere.marathon.upgrade.DeploymentPlan$:263)
[2015-03-23 15:47:17,876] INFO Deployment acknowledged. Waiting to get
processed: DeploymentPlan(2015-03-23T14:47:17.823Z, (Step(List(Scale(App(/bar,
Some(sleep 120))), 10))))) (mesosphere.marathon.state.GroupManager:142)
[2015-03-23 15:47:17,877] INFO 127.0.0.1 - - [23/mar/2015:14:47:17 +0000]
"PUT /v2/apps//bar HTTP/1.1" 200 92 "http://127.0.0.1:8080/" "Mozilla/5.0
(Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML, like Gecko)
Chrome/41.0.2272.89 Safari/537.36" (mesosphere.chaos.http.ChaosRequestLog:15)
[2015-03-23 15:47:17,918] INFO 127.0.0.1 - - [23/mar/2015:14:47:17 +0000]
"GET /v2/apps//bar/versions HTTP/1.1" 200 68 "http://127.0.0.1:8080/"
"Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML,
like Gecko) Chrome/41.0.2272.89 Safari/537.36"
(mesosphere.chaos.http.ChaosRequestLog:15)
[2015-03-23 15:47:20,722] INFO 127.0.0.1 - - [23/mar/2015:14:47:20 +0000]
"GET /v2/apps HTTP/1.1" 200 592 "http://127.0.0.1:8080/" "Mozilla/5.0
(Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML, like Gecko)
Chrome/41.0.2272.89 Safari/537.36" (mesosphere.chaos.http.ChaosRequestLog:15)
[2015-03-23 15:47:20,782] INFO Received status update for task
bar.82501637-d16b-11e4-b7fa-aa4dda3d2dbb: TASK_RUNNING ()
(mesosphere.marathon.MarathonScheduler:149)
[2015-03-23 15:47:20,790] INFO 127.0.0.1 - - [23/mar/2015:14:47:20 +0000]
"GET /v2/deployments HTTP/1.1" 200 256 "http://127.0.0.1:8080/" "Mozilla/5.0
(Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML, like Gecko)
Chrome/41.0.2272.89 Safari/537.36" (mesosphere.chaos.http.ChaosRequestLog:15)
#
# A fatal error has been detected by the Java Runtime Environment:
#
# SIGSEGV (0xb) at pc=0x000000012ec946f7, pid=98294, tid=27651
#
# JRE version: Java(TM) SE Runtime Environment (8.0-b132) (build 1.8.0-b132)
# Java VM: Java HotSpot(TM) 64-Bit Server VM (25.0-b70 mixed mode bsd-amd64
compressed oops)
# Problematic frame:
# C [libmesos-0.23.0.dylib+0x7836f7]
process::Future<mesos::internal::state::Variable>::isFailed() const+0x17
#
# Failed to write core dump. Core dumps have been disabled. To enable core
dumping, try "ulimit -c unlimited" before starting Java again
#
# An error report file with more information is saved as:
# /Users/mkiedys/Downloads/MESOS/marathon/hs_err_pid98294.log
#
# If you would like to submit a bug report, please visit:
# http://bugreport.sun.com/bugreport/crash.jsp
# The crash happened outside the Java Virtual Machine in native code.
# See problematic frame for where to report the bug.
#
Abort trap: 6
{noformat}
Mesos #a12242b
Marathon #6decf76
java version "1.8.0"
Java(TM) SE Runtime Environment (build 1.8.0-b132)
Java HotSpot(TM) 64-Bit Server VM (build 25.0-b70, mixed mode)
System Software Overview:
System Version: OS X 10.10.2 (14C109)
Kernel Version: Darwin 14.1.0
Secure Virtual Memory: Enabled
Time since boot: 13 days 11:02
> Libmesos terminates JVM
> -----------------------
>
> Key: MESOS-2531
> URL: https://issues.apache.org/jira/browse/MESOS-2531
> Project: Mesos
> Issue Type: Bug
> Components: java api
> Affects Versions: 0.23.0
> Reporter: Michał Kiędyś
>
> I have build Mesos from scratch using code available on GitHub, revision
> #a12242b.
> My Mesos cluster runs on MacOS and consists of one master and three slaves -
> all running on the same computer but on different ports. ZooKeeper runs also
> on the same computer.
> Later on I compiled Marathon also using latest version from GitHub, revision
> #6decf76. Marathon uses same ZooKeeper instance and successfully connects to
> Mesos cluster.
> After deploying simple application that runs {{sleep}} command for 120
> seconds and scaling that application to ten my Marathon crushed killed by JVM
> after SIGSEGV in libmesos-0.23.0.dylib.
> h4. Log
> {noformat}
> [2015-03-23 15:47:17,872] INFO Computed new deployment plan:
> DeploymentPlan(2015-03-23T14:47:17.823Z, (Step(List(Scale(App(/bar,
> Some(sleep 120))), 10))))) (mesosphere.marathon.upgrade.DeploymentPlan$:263)
> [2015-03-23 15:47:17,876] INFO Deployment acknowledged. Waiting to get
> processed: DeploymentPlan(2015-03-23T14:47:17.823Z,
> (Step(List(Scale(App(/bar, Some(sleep 120))), 10)))))
> (mesosphere.marathon.state.GroupManager:142)
> [2015-03-23 15:47:17,877] INFO 127.0.0.1 - - [23/mar/2015:14:47:17 +0000]
> "PUT /v2/apps//bar HTTP/1.1" 200 92 "http://127.0.0.1:8080/" "Mozilla/5.0
> (Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML, like Gecko)
> Chrome/41.0.2272.89 Safari/537.36" (mesosphere.chaos.http.ChaosRequestLog:15)
> [2015-03-23 15:47:17,918] INFO 127.0.0.1 - - [23/mar/2015:14:47:17 +0000]
> "GET /v2/apps//bar/versions HTTP/1.1" 200 68 "http://127.0.0.1:8080/"
> "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML,
> like Gecko) Chrome/41.0.2272.89 Safari/537.36"
> (mesosphere.chaos.http.ChaosRequestLog:15)
> [2015-03-23 15:47:20,722] INFO 127.0.0.1 - - [23/mar/2015:14:47:20 +0000]
> "GET /v2/apps HTTP/1.1" 200 592 "http://127.0.0.1:8080/" "Mozilla/5.0
> (Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML, like Gecko)
> Chrome/41.0.2272.89 Safari/537.36" (mesosphere.chaos.http.ChaosRequestLog:15)
> [2015-03-23 15:47:20,782] INFO Received status update for task
> bar.82501637-d16b-11e4-b7fa-aa4dda3d2dbb: TASK_RUNNING ()
> (mesosphere.marathon.MarathonScheduler:149)
> [2015-03-23 15:47:20,790] INFO 127.0.0.1 - - [23/mar/2015:14:47:20 +0000]
> "GET /v2/deployments HTTP/1.1" 200 256 "http://127.0.0.1:8080/" "Mozilla/5.0
> (Macintosh; Intel Mac OS X 10_10_2) AppleWebKit/537.36 (KHTML, like Gecko)
> Chrome/41.0.2272.89 Safari/537.36" (mesosphere.chaos.http.ChaosRequestLog:15)
> #
> # A fatal error has been detected by the Java Runtime Environment:
> #
> # SIGSEGV (0xb) at pc=0x000000012ec946f7, pid=98294, tid=27651
> #
> # JRE version: Java(TM) SE Runtime Environment (8.0-b132) (build 1.8.0-b132)
> # Java VM: Java HotSpot(TM) 64-Bit Server VM (25.0-b70 mixed mode bsd-amd64
> compressed oops)
> # Problematic frame:
> # C [libmesos-0.23.0.dylib+0x7836f7]
> process::Future<mesos::internal::state::Variable>::isFailed() const+0x17
> #
> # Failed to write core dump. Core dumps have been disabled. To enable core
> dumping, try "ulimit -c unlimited" before starting Java again
> #
> # An error report file with more information is saved as:
> # /Users/mkiedys/Downloads/MESOS/marathon/hs_err_pid98294.log
> #
> # If you would like to submit a bug report, please visit:
> # http://bugreport.sun.com/bugreport/crash.jsp
> # The crash happened outside the Java Virtual Machine in native code.
> # See problematic frame for where to report the bug.
> #
> Abort trap: 6
> {noformat}
> h4. Java
> java version "1.8.0"
> Java(TM) SE Runtime Environment (build 1.8.0-b132)
> Java HotSpot(TM) 64-Bit Server VM (build 25.0-b70, mixed mode)
> h4. System Software Overview
> - System Version: OS X 10.10.2 (14C109)
> - Kernel Version: Darwin 14.1.0
> - Secure Virtual Memory: Enabled
> - Time since boot: 13 days 11:02
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)