[ https://issues.apache.org/jira/browse/STORM-1915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15425463#comment-15425463 ]
Jungtaek Lim commented on STORM-1915: ------------------------------------- STORM-1934 is applied to 1.0.2 but I just forgot to close this. [~fogetti] Please try out 1.0.2 and see it works without the symptom, and reopen if it's not. Thanks! > Supervisor keeps restarting forever > ----------------------------------- > > Key: STORM-1915 > URL: https://issues.apache.org/jira/browse/STORM-1915 > Project: Apache Storm > Issue Type: Bug > Components: storm-core > Affects Versions: 1.0.1 > Environment: Linode 4GB running on KVM - Ubuntu 14.04 LTS > Reporter: Gergely Nagy > > While submitting a topology to a 20 node 40 worker strong cluster, the > supervisor keeps throwing errors and keeps restarting the workers it is > supervising. > For this reason the topology never starts, instead it keeps dancing by > reassigning the bolts and spouts forever. > I'd love to attach the logs here but I can't find any upload button in the > JIRA form. > The error basically says: > {code} > 2016-06-18 12:04:26.589 o.a.s.config [WARN] Failed to get worker user for . > #error { > :cause /home/fogetti/downloads/apache-storm-1.0.1/storm-local/workers-users > (Is a directory) > :via > [{:type java.io.FileNotFoundException > :message > /home/fogetti/downloads/apache-storm-1.0.1/storm-local/workers-users (Is a > directory) > :at [java.io.FileInputStream open0 FileInputStream.java -2]}] > :trace > [[java.io.FileInputStream open0 FileInputStream.java -2] > [java.io.FileInputStream open FileInputStream.java 195] > [java.io.FileInputStream <init> FileInputStream.java 138] > [clojure.java.io$fn__9189 invoke io.clj 229] > [clojure.java.io$fn__9102$G__9095__9109 invoke io.clj 69] > [clojure.java.io$fn__9201 invoke io.clj 258] > [clojure.java.io$fn__9102$G__9095__9109 invoke io.clj 69] > [clojure.java.io$fn__9163 invoke io.clj 165] > [clojure.java.io$fn__9115$G__9091__9122 invoke io.clj 69] > [clojure.java.io$reader doInvoke io.clj 102] > [clojure.lang.RestFn invoke RestFn.java 410] > [clojure.lang.AFn applyToHelper AFn.java 154] > [clojure.lang.RestFn applyTo RestFn.java 132] > [clojure.core$apply invoke core.clj 632] > [clojure.core$slurp doInvoke core.clj 6653] > [clojure.lang.RestFn invoke RestFn.java 410] > [org.apache.storm.config$get_worker_user invoke config.clj 239] > [org.apache.storm.daemon.supervisor$shutdown_worker invoke supervisor.clj > 281] > > [org.apache.storm.daemon.supervisor$kill_existing_workers_with_change_in_components > invoke supervisor.clj 536] > [org.apache.storm.daemon.supervisor$mk_synchronize_supervisor$this__9078 > invoke supervisor.clj 595] > [org.apache.storm.event$event_manager$fn__8630 invoke event.clj 40] > [clojure.lang.AFn run AFn.java 22] > [java.lang.Thread run Thread.java 745]]} > {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)