Chen Peng created SINGA-154:
-------------------------------
Summary: Program hangs when training in pseudo-distributed mode
Key: SINGA-154
URL: https://issues.apache.org/jira/browse/SINGA-154
Project: Singa
Issue Type: Bug
Reporter: Chen Peng
When training in pseudo-distributed mode (4 nodes in total, node0 to node3),
the program hangs with the following output
root@node0:~/incubator-singa/tool/mesos# ./scheduler
/root/incubator-singa/examples/cifar10/job.conf -
scheduler_conf ./scheduler.conf -singa_conf
/root/incubator-singa/conf/singa.conf
WARNING: Logging before InitGoogleLogging() is written to STDERR
I0323 21:53:18.856851 277 singa_scheduler.cc:413] Scheduler initialized
I0323 21:53:18.869976 277 singa_scheduler.cc:419] Starting SINGA framework...
I0323 21:53:18.878701 277 sched.cpp:157] Version: 0.22.0
I0323 21:53:18.888286 282 sched.cpp:254] New master detected at
[email protected]:5050
I0323 21:53:18.891774 282 sched.cpp:264] No credentials provided. Attempting to
register without aut
hentication
I0323 21:53:18.894948 283 sched.cpp:448] Framework registered with
20160323-214924-33558956-5050-144
-0000
I0323 21:53:18.896631 288 singa_scheduler.cc:356] n1 = 1 n2 = 1 ncpus = 2
I0323 21:53:18.896728 288 singa_scheduler.cc:356] n1 = 1 n2 = 1 ncpus = 2
I0323 21:53:18.896771 288 singa_scheduler.cc:356] n1 = 1 n2 = 1 ncpus = 2
The log in slave node is not human readable as
~U^@@@
^Enode3^Z^V
Dcpus^P@Z ^@@@@@@^@@2^A*^Z^U
Cmem^P@Z ^@@@@^@ަ@2^A*^Z^V
Ddisk^P@Z ^@@@@^@e2^A*^Z^X
^Eports^P^A"
^H^H~X[34m^A^P~@^A2^A*2&
$20160323-214924-33558956-5050-144-S28^A@�'
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)