How many workers do you have in your topology? Andy Feng
Sent from my iPhone > On Mar 4, 2014, at 5:21 AM, "李家宏" <[email protected]> wrote: > > hi, all > > When I submit a topology to a storm cluster of 0.9.0.1, the following error > occurs: > ---------------------------------------------------------------------------------------------------------------------- > [INFO] Starting > 2014-03-04 20:24:13 o.a.z.ZooKeeper [INFO] Initiating client connection, > > connectString=10.207.52.82:2181,10.207.52.83:2181,10.207.52.84:2181sessionTimeout=20000 > watcher=com.netflix.curator.ConnectionState@796cefa8 > 2014-03-04 20:24:13 o.a.z.ClientCnxn [INFO] Opening socket connection to > server /10.207.52.83:2181 > 2014-03-04 20:24:13 o.a.z.ClientCnxn [INFO] Socket connection > established to > storm010207052083.cm3.tbsite.net/10.207.52.83:2181, initiating session > 2014-03-04 20:24:13 o.a.z.ClientCnxn [INFO] Session establishment > complete on server > storm010207052083.cm3.tbsite.net/10.207.52.83:2181, sessionid = > 0x2423f964207c973, negotiated timeout = 20000 > 2014-03-04 20:24:13 b.s.zookeeper [INFO] Zookeeper state update: > :connected:none > 2014-03-04 20:24:13 o.a.z.ZooKeeper [INFO] Session: 0x2423f964207c973 > closed > 2014-03-04 20:24:13 o.a.z.ClientCnxn [INFO] EventThread shut down > 2014-03-04 20:24:13 c.n.c.f.i.CuratorFrameworkImpl [INFO] Starting > 2014-03-04 20:24:13 o.a.z.ZooKeeper [INFO] Initiating client connection, > connectString=10.207.52.82:2181,10.207.52.83:2181, > 10.207.52.84:2181/tmp/storm-0.9.0.1 sessionTimeout=20000 > watcher=com.netflix.curator.ConnectionState@58f41393 > 2014-03-04 20:24:13 o.a.z.ClientCnxn [INFO] Opening socket connection to > server /10.207.52.82:2181 > 2014-03-04 20:24:13 o.a.z.ClientCnxn [INFO] Socket connection > established to > storm010207052082.cm3.tbsite.net/10.207.52.82:2181, initiating session > 2014-03-04 20:24:13 o.a.z.ClientCnxn [INFO] Session establishment > complete on server > storm010207052082.cm3.tbsite.net/10.207.52.82:2181, sessionid = > 0x1423f964209c65f, negotiated timeout = 20000 > 2014-03-04 20:24:14 b.s.m.TransportFactory [INFO] Storm peer transport > plugin:backtype.storm.messaging.netty.Context > 2014-03-04 20:24:14 b.s.m.n.Client [INFO] Reconnect ... [1] > 2014-03-04 20:24:14 b.s.m.n.Client [INFO] Reconnect ... [1] > 2014-03-04 20:24:14 b.s.m.n.Client [INFO] Reconnect ... [1] > 2014-03-04 20:24:14 b.s.m.n.Client [INFO] Reconnect ... [1] > 2014-03-04 20:24:14 b.s.m.n.Client [INFO] Reconnect ... [1] > 2014-03-04 20:24:14 b.s.m.n.Client [INFO] Reconnect ... [1] > 2014-03-04 20:24:14 b.s.m.n.Client [INFO] Reconnect ... [1] > 2014-03-04 20:24:14 b.s.m.n.Client [INFO] Reconnect ... [2] > 2014-03-04 20:24:14 b.s.m.n.Client [INFO] Reconnect ... [1] > 2014-03-04 20:24:14 b.s.m.n.Client [INFO] Reconnect ... [1] > 2014-03-04 20:24:14 b.s.m.n.Client [INFO] Reconnect ... [1] > 2014-03-04 20:24:14 b.s.d.worker [ERROR] Error on initialization of > server mk-worker > org.jboss.netty.channel.ChannelException: Failed to create a selector. > at > org.jboss.netty.channel.socket.nio.AbstractNioSelector.openSelector(AbstractNioSelector.java:337) > ~[netty-3.6.3.Final.jar:na] > at > org.jboss.netty.channel.socket.nio.AbstractNioSelector.(AbstractNioSelector.java:95) > ~[netty-3.6.3.Final.jar:na] > at > org.jboss.netty.channel.socket.nio.AbstractNioWorker.(AbstractNioWorker.java:51) > ~[netty-3.6.3.Final.jar:na] > at org.jboss.netty.channel.socket.nio.NioWorker.(NioWorker.java:45) > ~[netty-3.6.3.Final.jar:na] > at > org.jboss.netty.channel.socket.nio.NioWorkerPool.createWorker(NioWorkerPool.java:45) > ~[netty-3.6.3.Final.jar:na] > at > org.jboss.netty.channel.socket.nio.NioWorkerPool.createWorker(NioWorkerPool.java:28) > ~[netty-3.6.3.Final.jar:na] > at > org.jboss.netty.channel.socket.nio.AbstractNioWorkerPool.newWorker(AbstractNioWorkerPool.java:99) > ~[netty-3.6.3.Final.jar:na] > at > org.jboss.netty.channel.socket.nio.AbstractNioWorkerPool.init(AbstractNioWorkerPool.java:69) > ~[netty-3.6.3.Final.jar:na] > at > org.jboss.netty.channel.socket.nio.NioWorkerPool.(NioWorkerPool.java:39) > ~[netty-3.6.3.Final.jar:na] > at > org.jboss.netty.channel.socket.nio.NioWorkerPool.(NioWorkerPool.java:33) > ~[netty-3.6.3.Final.jar:na] > at > org.jboss.netty.channel.socket.nio.NioClientSocketChannelFactory.(NioClientSocketChannelFactory.java:152) > ~[netty-3.6.3.Final.jar:na] > at > org.jboss.netty.channel.socket.nio.NioClientSocketChannelFactory.(NioClientSocketChannelFactory.java:134) > ~[netty-3.6.3.Final.jar:na] > at backtype.storm.messaging.netty.Client.(Client.java:54) > ~[storm-netty-0.9.0.1.jar:na] > at backtype.storm.messaging.netty.Context.connect(Context.java:36) > ~[storm-netty-0.9.0.1.jar:na] > at > backtype.storm.daemon.worker$mk_refresh_connections$this__5827$iter__5834__5838$fn__5839.invoke(worker.clj:250) > ~[storm-core-0.9.0.1.jar:na] > at clojure.lang.LazySeq.sval(LazySeq.java:42) ~[clojure-1.4.0.jar:na] > at clojure.lang.LazySeq.seq(LazySeq.java:60) ~[clojure-1.4.0.jar:na] > at clojure.lang.Cons.next(Cons.java:39) ~[clojure-1.4.0.jar:na] > at clojure.lang.RT.next(RT.java:587) ~[clojure-1.4.0.jar:na] > at clojure.core$next.invoke(core.clj:64) ~[clojure-1.4.0.jar:na] > at clojure.core$dorun.invoke(core.clj:2726) ~[clojure-1.4.0.jar:na] > at clojure.core$doall.invoke(core.clj:2741) ~[clojure-1.4.0.jar:na] > at > backtype.storm.daemon.worker$mk_refresh_connections$this__5827.invoke(worker.clj:244) > ~[storm-core-0.9.0.1.jar:na] > at > backtype.storm.daemon.worker$fn__5882$exec_fn__1229__auto____5883.invoke(worker.clj:357) > ~[storm-core-0.9.0.1.jar:na] > at clojure.lang.AFn.applyToHelper(AFn.java:185) [clojure-1.4.0.jar:na] > at clojure.lang.AFn.applyTo(AFn.java:151) [clojure-1.4.0.jar:na] > at clojure.core$apply.invoke(core.clj:601) ~[clojure-1.4.0.jar:na] > at > backtype.storm.daemon.worker$fn__5882$mk_worker__5938.doInvoke(worker.clj:329) > [storm-core-0.9.0.1.jar:na] > at clojure.lang.RestFn.invoke(RestFn.java:512) [clojure-1.4.0.jar:na] > at backtype.storm.daemon.worker$_main.invoke(worker.clj:439) > [storm-core-0.9.0.1.jar:na] > at clojure.lang.AFn.applyToHelper(AFn.java:172) [clojure-1.4.0.jar:na] > at clojure.lang.AFn.applyTo(AFn.java:151) [clojure-1.4.0.jar:na] > at backtype.storm.daemon.worker.main(Unknown Source) > [storm-core-0.9.0.1.jar:na] > Caused by: java.io.IOException: Too many open files > at sun.nio.ch.IOUtil.initPipe(Native Method) ~[na:1.6.0_38] > at sun.nio.ch.EPollSelectorImpl.(EPollSelectorImpl.java:49) > ~[na:1.6.0_38] > at > sun.nio.ch.EPollSelectorProvider.openSelector(EPollSelectorProvider.java:18) > ~[na:1.6.0_38] > at java.nio.channels.Selector.open(Selector.java:209) ~[na:1.6.0_38] > at > org.jboss.netty.channel.socket.nio.AbstractNioSelector.openSelector(AbstractNioSelector.java:335) > ~[netty-3.6.3.Final.jar:na] > ... 32 common frames omitted > 2014-03-04 20:24:14 b.s.util [INFO] Halting process: ("Error on > initialization") > -------------------------------------------------------------------------------------------------------------------- > > This topology works fine with storm cluster of 0.8.0. > And: > ulimit -n => 131072; > sudo losf | grep java | wc -l => 5000 > it seems like opened fds do not reaching limits > > What's the problem ? > > Regards > > -- > > ====================================================== > > Gvain > > Email: [email protected]
