Can you show us the the slave log and more of the master log? There should be a TASK_LOST somewhere within them.
On Wed, Sep 17, 2014 at 10:43 AM, Luyi Wang <[email protected]> wrote: > Have anyone experience TASK_LOST status for storm tasks on mesos. > > > I checked the stderr. Everything seems normal. > WARNING: Logging before InitGoogleLogging() is written to STDERR > I0917 00:21:36.164840 4831 fetcher.cpp:76] Fetching URI 'hdfs:// > 192.168.123.27/storm-mesos-0.9.2-incubating.tgz' > I0917 00:21:36.165225 4831 fetcher.cpp:105] Downloading resource from > 'hdfs://192.168.123.27/storm-mesos-0.9.2-incubating.tgz' to > '/tmp/mesos/slaves/20140915-185627-326871232-5050-8074-3/frameworks/20140915-230424-326871232-5050-13574-0000/executors/production-topology-1-1410913050/runs/53d06991-cb84-49d3-a530-f83efcf339e9/storm-mesos-0.9.2-incubating.tgz' > I0917 00:21:52.202791 4831 fetcher.cpp:64] Extracted resource > '/tmp/mesos/slaves/20140915-185627-326871232-5050-8074-3/frameworks/20140915-230424-326871232-5050-13574-0000/executors/production-topology-1-1410913050/runs/53d06991-cb84-49d3-a530-f83efcf339e9/storm-mesos-0.9.2-incubating.tgz' > into > '/tmp/mesos/slaves/20140915-185627-326871232-5050-8074-3/frameworks/20140915-230424-326871232-5050-13574-0000/executors/production-topology-1-1410913050/runs/53d06991-cb84-49d3-a530-f83efcf339e9' > I0917 00:21:52.206581 4831 fetcher.cpp:76] Fetching URI ' > http://mesos:45579/conf/storm.yaml' > I0917 00:21:52.206626 4831 fetcher.cpp:126] Downloading ' > http://mesos:45579/conf/storm.yaml' to > '/tmp/mesos/slaves/20140915-185627-326871232-5050-8074-3/frameworks/20140915-230424-326871232-5050-13574-0000/executors/production-topology-1-/runs/53d06991-cb84-49d3-a530-f83efcf339e9/storm.yaml' > I0917 00:22:01.829298 4984 exec.cpp:132] Version: 0.21.0 > I0917 00:22:01.832185 5006 exec.cpp:206] Executor registered on slave > 20140915-185627-326871232-5050-8074-3 > > > > > I also checked the mesos info log. Here is what it logged. > > I0917 00:19:59.020755 5145 master.cpp:3261] Executor > production-topology-1-1410913050 of framework > 20140915-230424-326871232-5050-13574-0000 on slave > 20140915-185627-326871232-5050-8074-2 at slave(1)@192.168.123.29:5051 ( > dev10-cdh5-03.int.dev10.smcl.pure-breeze.com) exited with status 0 > > > Any idea on this? > > > > > -Luyi. > > > >

