Hi Xiaoying, can you re-run with GLOG_v=2 set in your environment? This enables verbose logging, which may help us diagnose the problem.
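For reference, a minimal sketch of what that re-run could look like on the slave host. It assumes you start the slave from the Mesos build directory with bin/mesos-slave.sh (an assumption on my part, since your log shows "Starting Mesos slave") and reuses the master address from your message. The existence check is only there so the snippet fails politely if it is run from a different directory.

```shell
# Assumptions: run from the Mesos build directory on the slave host (hdfs2);
# the script name bin/mesos-slave.sh is assumed, adjust it to whatever you actually ran.
export GLOG_v=2   # glog reads this environment variable to set the verbose log level

if [ -x bin/mesos-slave.sh ]; then
  bin/mesos-slave.sh --master=192.168.1.130:5050
else
  echo "bin/mesos-slave.sh not found; run this from the Mesos build directory" >&2
fi
```

The same variable can be exported before starting the master if verbose output is wanted on that side as well.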
Thanks!

On Wed, Apr 17, 2013 at 12:50 AM, Xiaoying Zheng <[email protected]> wrote:
> hi,
>
> Without specifying IP at the slave node, the slave node itself kept
> connecting and disconnecting to the master. Thanks.
>
>
> 1. On the master node (192.168.1.130), we ran "bin/mesos-master.sh
> --ip=192.168.1.130".
>
> Log file created at: 2013/04/17 15:47:03
>
> Running on machine: master
> Log line format: [IWEF]mmdd hh:mm:ss.uuuuuu threadid file:line] msg
> I0417 15:47:03.479331 4381 logging.cpp:72] Logging to logs
> I0417 15:47:03.480342 4381 main.cpp:105] Build: 2013-01-22 20:32:32 by
> hadoop
> I0417 15:47:03.480383 4381 main.cpp:106] Starting Mesos master
> I0417 15:47:03.480571 4398 master.cpp:268] Master started on
> 192.168.1.130:5050
> I0417 15:47:03.480655 4398 master.cpp:283] Master ID:
> 201304171547-2113820480-5050-**4381
> I0417 15:47:03.482028 4398 master.cpp:483] Elected as master!
> I0417 15:47:03.493221 4400 webui_utils.cpp:45] Loading webui script at
> '/home/hadoop/mesos-0.9.0/src/**webui/master/webui.py'
> I0417 15:47:12.886644 4396 master.cpp:844] Attempting to register slave
> 201304171547-2113820480-5050-**4381-0 at [email protected]:36966
> I0417 15:47:12.886725 4396 master.cpp:1097] Master now considering a
> slave at hdfs2:36966 as active
> I0417 15:47:12.886754 4396 master.cpp:1633] Adding slave
> 201304171547-2113820480-5050-**4381-0 at hdfs2 with cpus=4; mem=968
> I0417 15:47:12.887498 4396 simple_allocator.cpp:69] Added slave
> 201304171547-2113820480-5050-**4381-0 with cpus=4; mem=968
> I0417 15:47:12.887603 4396 master.cpp:455] Slave
> 201304171547-2113820480-5050-**4381-0 disconnected
> I0417 15:47:12.887863 4396 simple_allocator.cpp:81] Removed slave
> 201304171547-2113820480-5050-**4381-0
> I0417 15:47:13.893245 4398 master.cpp:844] Attempting to register slave
> 201304171547-2113820480-5050-**4381-1 at [email protected]:36966
> I0417 15:47:13.893322 4398 master.cpp:1097] Master now considering a
> slave at hdfs2:36966 as active
> I0417 15:47:13.893344 4398 master.cpp:1633] Adding slave
> 201304171547-2113820480-5050-**4381-1 at hdfs2 with cpus=4; mem=968
> I0417 15:47:13.893545 4398 simple_allocator.cpp:69] Added slave
> 201304171547-2113820480-5050-**4381-1 with cpus=4; mem=968
> I0417 15:47:13.893630 4398 master.cpp:455] Slave
> 201304171547-2113820480-5050-**4381-1 disconnected
> I0417 15:47:13.893779 4398 simple_allocator.cpp:81] Removed slave
> 201304171547-2113820480-5050-**4381-1
> I0417 15:47:14.903282 4396 master.cpp:844] Attempting to register slave
> 201304171547-2113820480-5050-**4381-2 at [email protected]:36966
> I0417 15:47:14.903359 4396 master.cpp:1097] Master now considering a
> slave at hdfs2:36966 as active
> I0417 15:47:14.903381 4396 master.cpp:1633] Adding slave
> 201304171547-2113820480-5050-**4381-2 at hdfs2 with cpus=4; mem=968
> I0417 15:47:14.903586 4396 simple_allocator.cpp:69] Added slave
> 201304171547-2113820480-5050-**4381-2 with cpus=4; mem=968
> I0417 15:47:14.903662 4396 master.cpp:455] Slave
> 201304171547-2113820480-5050-**4381-2 disconnected
> I0417 15:47:14.903825 4396 simple_allocator.cpp:81] Removed slave
> 201304171547-2113820480-5050-**4381-2
> I0417 15:47:15.913488 4398 master.cpp:844] Attempting to register slave
> 201304171547-2113820480-5050-**4381-3 at [email protected]:36966
> I0417 15:47:15.913547 4398 master.cpp:1097] Master now considering a
> slave at hdfs2:36966 as active
> I0417 15:47:15.913568 4398 master.cpp:1633] Adding slave
> 201304171547-2113820480-5050-**4381-3 at hdfs2 with cpus=4; mem=968
> I0417 15:47:15.913749 4398 simple_allocator.cpp:69] Added slave
> 201304171547-2113820480-5050-**4381-3 with cpus=4; mem=968
> I0417 15:47:15.913833 4398 master.cpp:455] Slave
> 201304171547-2113820480-5050-**4381-3 disconnected
> I0417 15:47:15.914034 4398 simple_allocator.cpp:81] Removed slave
> 201304171547-2113820480-5050-**4381-3
> I0417 15:47:16.923637 4395 master.cpp:844] Attempting to register slave
> 201304171547-2113820480-5050-**4381-4 at [email protected]:36966
> I0417 15:47:16.923703 4395 master.cpp:1097] Master now considering a
> slave at hdfs2:36966 as active
> I0417 15:47:16.923727 4395 master.cpp:1633] Adding slave
> 201304171547-2113820480-5050-**4381-4 at hdfs2 with cpus=4; mem=968
> I0417 15:47:16.923910 4395 simple_allocator.cpp:69] Added slave
> 201304171547-2113820480-5050-**4381-4 with cpus=4; mem=968
> I0417 15:47:16.923984 4395 master.cpp:455] Slave
> 201304171547-2113820480-5050-**4381-4 disconnected
> I0417 15:47:16.924165 4395 simple_allocator.cpp:81] Removed slave
> 201304171547-2113820480-5050-**4381-4
> I0417 15:47:17.933886 4398 master.cpp:844] Attempting to register slave
> 201304171547-2113820480-5050-**4381-5 at [email protected]:36966
> I0417 15:47:17.934022 4398 master.cpp:1097] Master now considering a
> slave at hdfs2:36966 as active
> I0417 15:47:17.934849 4398 master.cpp:1633] Adding slave
> 201304171547-2113820480-5050-**4381-5 at hdfs2 with cpus=4; mem=968
> I0417 15:47:17.935056 4398 simple_allocator.cpp:69] Added slave
> 201304171547-2113820480-5050-**4381-5 with cpus=4; mem=968
> I0417 15:47:17.935143 4398 master.cpp:455] Slave
> 201304171547-2113820480-5050-**4381-5 disconnected
> I0417 15:47:17.935303 4398 simple_allocator.cpp:81] Removed slave
> 201304171547-2113820480-5050-**4381-5
> I0417 15:47:18.944039 4396 master.cpp:844] Attempting to register slave
> 201304171547-2113820480-5050-**4381-6 at [email protected]:36966
> I0417 15:47:18.944144 4396 master.cpp:1097] Master now considering a
> slave at hdfs2:36966 as active
> I0417 15:47:18.944172 4396 master.cpp:1633] Adding slave
> 201304171547-2113820480-5050-**4381-6 at hdfs2 with cpus=4; mem=968
> I0417 15:47:18.944496 4396 simple_allocator.cpp:69] Added slave
> 201304171547-2113820480-5050-**4381-6 with cpus=4; mem=968
> I0417 15:47:18.944571 4396 master.cpp:455] Slave
> 201304171547-2113820480-5050-**4381-6 disconnected
> I0417 15:47:18.944804 4396 simple_allocator.cpp:81] Removed slave
> 201304171547-2113820480-5050-**4381-6
> I0417 15:47:19.954175 4398 master.cpp:844] Attempting to register slave
> 201304171547-2113820480-5050-**4381-7 at [email protected]:36966
> I0417 15:47:19.954265 4398 master.cpp:1097] Master now considering a
> slave at hdfs2:36966 as active
> I0417 15:47:19.954294 4398 master.cpp:1633] Adding slave
> 201304171547-2113820480-5050-**4381-7 at hdfs2 with cpus=4; mem=968
> I0417 15:47:19.954597 4398 simple_allocator.cpp:69] Added slave
> 201304171547-2113820480-5050-**4381-7 with cpus=4; mem=968
> I0417 15:47:19.954680 4398 master.cpp:455] Slave
> 201304171547-2113820480-5050-**4381-7 disconnected
> I0417 15:47:19.954807 4398 simple_allocator.cpp:81] Removed slave
> 201304171547-2113820480-5050-**4381-7
>
>
> 2. On the slave node (192.168.1.132), we ran "bin/mesos-master.sh --master=
> 192.168.1.130:5050".
>
> Log file created at: 2013/04/17 15:45:44
>
> Running on machine: hdfs2
> Log line format: [IWEF]mmdd hh:mm:ss.uuuuuu threadid file:line] msg
> I0417 15:45:44.330966 2864 logging.cpp:72] Logging to logs
> I0417 15:45:44.332208 2864 main.cpp:111] Creating "process" isolation
> module
> I0417 15:45:44.332365 2864 main.cpp:119] Build: 2013-01-22 20:32:32 by
> hadoop
> I0417 15:45:44.332392 2864 main.cpp:120] Starting Mesos slave
> I0417 15:45:44.332712 2879 slave.cpp:191] Slave started on
> 127.0.1.1:36966
> I0417 15:45:44.332756 2879 slave.cpp:192] Slave resources: cpus=4; mem=968
> I0417 15:45:44.334221 2879 slave.cpp:357] New master detected at
> [email protected]:5050
> I0417 15:45:44.345507 2883 webui_utils.cpp:45] Loading webui script at
> '/home/hadoop/mesos-0.9.0/src/**webui/slave/webui.py'
> I0417 15:45:51.909643 2880 slave.cpp:1104] Process exited: @0.0.0.0:0
> W0417 15:45:51.909715 2880 slave.cpp:1107] WARNING! Master disconnected!
> Waiting for a new master to be elected.
