Eddy created STORM-2138:
---------------------------
Summary: java.io.FileNotFoundException: stormconf.ser does not
exist
Key: STORM-2138
URL: https://issues.apache.org/jira/browse/STORM-2138
Project: Apache Storm
Issue Type: Bug
Components: storm-core
Affects Versions: 1.0.2
Reporter: Eddy
We are seeing problems in our storm topology whereby all our workers crash.
The errors we see are
2016-10-07 09:49:33.599 o.a.s.d.supervisor [ERROR] Error on initialization of
server mk-supervisor
java.io.FileNotFoundException: File
'/opt/storm_local/supervisor/stormdist/production_2016_09_13-1-1475831938/stormconf.ser'
does not exist
at
org.apache.storm.shade.org.apache.commons.io.FileUtils.openInputStream(FileUtils.java:292)
at
org.apache.storm.shade.org.apache.commons.io.FileUtils.readFileToByteArray(FileUtils.java:1815)
at
org.apache.storm.config$read_supervisor_storm_conf_given_path.invoke(config.clj:142)
at
org.apache.storm.config$read_supervisor_storm_conf.invoke(config.clj:221)
at
org.apache.storm.daemon.supervisor$add_blob_references.invoke(supervisor.clj:495)
at
org.apache.storm.daemon.supervisor$fn__9307$exec_fn__2466__auto____9308.invoke(supervisor.clj:795)
at clojure.lang.AFn.applyToHelper(AFn.java:160)
at clojure.lang.AFn.applyTo(AFn.java:144)
at clojure.core$apply.invoke(core.clj:630)
at
org.apache.storm.daemon.supervisor$fn__9307$mk_supervisor__9352.doInvoke(supervisor.clj:763)
at clojure.lang.RestFn.invoke(RestFn.java:436)
at
org.apache.storm.daemon.supervisor$_launch.invoke(supervisor.clj:1200)
at org.apache.storm.daemon.supervisor$_main.invoke(supervisor.clj:1233)
at clojure.lang.AFn.applyToHelper(AFn.java:152)
at clojure.lang.AFn.applyTo(AFn.java:144)
at org.apache.storm.daemon.supervisor.main(Unknown Source)
2016-10-07 09:49:33.608 o.a.s.util [ERROR] Halting process: ("Error on
initialization")
java.lang.RuntimeException: ("Error on initialization")
at org.apache.storm.util$exit_process_BANG_.doInvoke(util.clj:341)
at clojure.lang.RestFn.invoke(RestFn.java:423)
at
org.apache.storm.daemon.supervisor$fn__9307$mk_supervisor__9352.doInvoke(supervisor.clj:763)
at clojure.lang.RestFn.invoke(RestFn.java:436)
at
org.apache.storm.daemon.supervisor$_launch.invoke(supervisor.clj:1200)
at org.apache.storm.daemon.supervisor$_main.invoke(supervisor.clj:1233)
at clojure.lang.AFn.applyToHelper(AFn.java:152)
at clojure.lang.AFn.applyTo(AFn.java:144)
at org.apache.storm.daemon.supervisor.main(Unknown Source)
2016-10-07 09:49:34.668 o.a.s.d.supervisor [INFO] Removing code for storm id
production_2016_09_13-1-1475831938
We have looked at https://github.com/apache/storm/pull/418 and
https://issues.apache.org/jira/browse/STORM-130, which both show the first
issue as being fixed - however we are still experiencing it in 1.0.2. The
changes from the fixing commit
(https://github.com/apache/storm/pull/418/commits/ccd28f8a356f468e66865fa9d9901b0a2628ec74)
don't seem to be in the current version of the file
(https://github.com/apache/storm/blob/v1.0.2/storm-core/src/clj/org/apache/storm/daemon/supervisor.clj).
We get this often when resubmitting a topology, and our only workaround is to
stop the topology, delete the whole /opt/storm_local directory (which is our
storm.local.dir) and resubmit the topology. Often, the workers seem to be
looking for stormconf.ser in the local directory of an old topology that isn't
even running at the time.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)