Greetings,

I've been spending some time trying to get Mesos up and running on
Vagrant (a nice frontend for headless VirtualBox). I have the master set up
locally on 33.33.13.38:5050 and one slave set up on 33.33.13.39:5050. They're
able to communicate with each other, and the web display on the master
works. The problem is that the master keeps adding and removing the slave,
and sometimes it just crashes. The web interface never shows the slave
(maybe it is removed too quickly?). I'm not sure what to do at this point
and was hoping for some help. I'm using Mesos 0.10.

Here is the output from the master:

I0416 10:09:01.794397  2040 dominant_share_allocator.cpp:417] Performed
allocation for 0 slaves in 0.018916 milliseconds
I0416 10:09:02.099568  2038 master.cpp:906] Attempting to register slave on
vagrant-ubuntu.vagrantup.com at slave(1)@127.0.1.1:57599
I0416 10:09:02.100764  2038 master.cpp:1142] Master now considering a slave
at vagrant-ubuntu.vagrantup.com:57599 as active
I0416 10:09:02.101080  2038 master.cpp:1721] Adding slave
201304161008-16842879-5050-2023-56 at vagrant-ubuntu.vagrantup.com with
cpus=2; mem=979; ports=[31000-32000]
I0416 10:09:02.104706  2038 master.cpp:513] Slave
201304161008-16842879-5050-2023-56(vagrant-ubuntu.vagrantup.com)
disconnected
I0416 10:09:02.105237  2037 dominant_share_allocator.cpp:244] Added slave
201304161008-16842879-5050-2023-56 (vagrant-ubuntu.vagrantup.com) with
cpus=2; mem=979; ports=[31000-32000] (and cpus=2; mem=979;
ports=[31000-32000] available)
I0416 10:09:02.105865  2037 dominant_share_allocator.cpp:435] Performed
allocation for slave 201304161008-16842879-5050-2023-56 in 0.011817
milliseconds
I0416 10:09:02.106258  2037 dominant_share_allocator.cpp:269] Removed slave
201304161008-16842879-5050-2023-56
I0416 10:09:02.797294  2038 dominant_share_allocator.cpp:417] Performed
allocation for 0 slaves in 0.017615 milliseconds
I0416 10:09:03.101245  2040 master.cpp:906] Attempting to register slave on
vagrant-ubuntu.vagrantup.com at slave(1)@127.0.1.1:57599
I0416 10:09:03.102088  2040 master.cpp:1142] Master now considering a slave
at vagrant-ubuntu.vagrantup.com:57599 as active
I0416 10:09:03.103230  2040 master.cpp:1721] Adding slave
201304161008-16842879-5050-2023-57 at vagrant-ubuntu.vagrantup.com with
cpus=2; mem=979; ports=[31000-32000]
I0416 10:09:03.106045  2040 master.cpp:513] Slave
201304161008-16842879-5050-2023-57(vagrant-ubuntu.vagrantup.com)
disconnected
I0416 10:09:03.106202  2039 dominant_share_allocator.cpp:244] Added slave
201304161008-16842879-5050-2023-57 (vagrant-ubuntu.vagrantup.com) with
cpus=2; mem=979; ports=[31000-32000] (and cpus=2; mem=979;
ports=[31000-32000] available)
I0416 10:09:03.107240  2039 dominant_share_allocator.cpp:435] Performed
allocation for slave 201304161008-16842879-5050-2023-57 in 0.011276
milliseconds
I0416 10:09:03.107650  2039 dominant_share_allocator.cpp:269] Removed slave
201304161008-16842879-5050-2023-57
I0416 10:09:03.799612  2040 dominant_share_allocator.cpp:417] Performed
allocation for 0 slaves in 0.024916 milliseconds
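
In case it helps anyone skim the log, here is a rough Python sketch for counting the register/disconnect cycles. It is not part of Mesos: the function name `summarize` and the regular expressions are my own assumptions, based only on the master lines pasted above (rejoined after the mail client wrapped them). On those lines it reports that every registration attempt comes from 127.0.1.1, not from 33.33.13.39.

```python
import re

# Sample lines copied from the master output above, rejoined after mail wrapping.
SAMPLE = """\
I0416 10:09:02.099568  2038 master.cpp:906] Attempting to register slave on vagrant-ubuntu.vagrantup.com at slave(1)@127.0.1.1:57599
I0416 10:09:02.104706  2038 master.cpp:513] Slave 201304161008-16842879-5050-2023-56(vagrant-ubuntu.vagrantup.com) disconnected
I0416 10:09:03.101245  2040 master.cpp:906] Attempting to register slave on vagrant-ubuntu.vagrantup.com at slave(1)@127.0.1.1:57599
I0416 10:09:03.106045  2040 master.cpp:513] Slave 201304161008-16842879-5050-2023-57(vagrant-ubuntu.vagrantup.com) disconnected
"""

# Hypothetical patterns, written to match only the message shapes seen above.
REGISTER = re.compile(
    r"Attempting to register slave on (\S+) at \S+@(\d+\.\d+\.\d+\.\d+):(\d+)"
)
DISCONNECT = re.compile(r"Slave ([^\s(]+)\(([^)]+)\) disconnected")


def summarize(log_text):
    """Count registration attempts and disconnects, and collect source IPs."""
    addresses = set()
    register_attempts = 0
    disconnected_ids = []
    for line in log_text.splitlines():
        match = REGISTER.search(line)
        if match:
            register_attempts += 1
            addresses.add(match.group(2))  # IP the slave announced itself from
            continue
        match = DISCONNECT.search(line)
        if match:
            disconnected_ids.append(match.group(1))  # slave ID that dropped
    return {
        "register_attempts": register_attempts,
        "disconnects": len(disconnected_ids),
        "disconnected_ids": disconnected_ids,
        "addresses": sorted(addresses),
    }


if __name__ == "__main__":
    print(summarize(SAMPLE))
```

To run it against a real log, read the file into a string and pass it to `summarize` instead of `SAMPLE`; wrapped lines would need to be rejoined first.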

Here is the output from the slave:
I0416 10:19:46.207093  1867 main.cpp:123] Creating "process" isolation
module
I0416 10:19:46.209199  1867 main.cpp:131] Build: 2013-04-16 07:41:31 by
vagrant
I0416 10:19:46.209410  1867 main.cpp:132] Starting Mesos slave
I0416 10:19:46.210247  1883 slave.cpp:175] Slave started on 1)@
127.0.1.1:56701
I0416 10:19:46.210842  1883 slave.cpp:176] Slave resources: cpus=2;
mem=979; ports=[31000-32000]
I0416 10:19:46.213693  1883 slave.cpp:352] New master detected at
[email protected]:5050
Loading webui script at
'/home/vagrant/mesos-0.10.0/src/webui/slave/webui.py'
Bottle server starting up (using WSGIRefServer())...
Listening on http://0.0.0.0:8081/
Use Ctrl-C to quit.

Sometimes the master just quits with a fatal check failure.

Output from the master:
I0416 10:19:58.244128  2545 master.cpp:513] Slave
201304161019-16842879-5050-2531-12(vagrant-ubuntu.vagrantup.com)
disconnected
I0416 10:19:58.245954  2545 dominant_share_allocator.cpp:269] Removed slave
201304161019-16842879-5050-2531-12
F0416 10:19:58.719403  2549 process.cpp:1828] Check failed:
outgoing.count(s) > 0
*** Check failure stack trace: ***
    @     0x7f554933c0ad  google::LogMessage::Fail()
    @     0x7f554933e83f  google::LogMessage::SendToLog()
    @     0x7f554933bcab  google::LogMessage::Flush()
    @     0x7f554933f0cd  google::LogMessageFatal::~LogMessageFatal()
    @     0x7f5549227484  process::SocketManager::next()
    @     0x7f55492216bf  process::send_data()
    @     0x7f554937b9df  ev_invoke_pending
    @     0x7f554937fd14  ev_loop
    @     0x7f554922292c  process::serve()
    @     0x7f5548a9ae9a  start_thread
    @     0x7f5547fb5cbd  (unknown)


Additional output from the slave:
I0416 10:19:58.808632  1884 slave.cpp:1141] Process exited: @0.0.0.0:0
W0416 10:19:58.808785  1884 slave.cpp:1144] WARNING! Master disconnected!
Waiting for a new master to be elected.


-- 
John
