[ 
https://issues.apache.org/jira/browse/STORM-3784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Richard Zowalla closed STORM-3784.
----------------------------------
    Resolution: Cannot Reproduce

> my supervisor will shut down on 2:00 am everyday
> ------------------------------------------------
>
>                 Key: STORM-3784
>                 URL: https://issues.apache.org/jira/browse/STORM-3784
>             Project: Apache Storm
>          Issue Type: Bug
>          Components: storm-server
>    Affects Versions: 2.1.0
>         Environment: centos 7 x64
>            Reporter: Sunsy Sun
>            Priority: Major
>         Attachments: supervisor(1).log
>
>
> The cluster has one nimbus and two supervisors. One of the supervisors is 
> deployed along with nimbus.
> I deployed two topologies, PradarLinkTopology and PradarLogTopology.
> PradarLogTopology runs with 4 workers. PradarLinkTopology runs with 1 worker.
> At 2:00 am every day, all supervisors shut down. I haven't found the 
> reason.
> I tried cleaning up the status directory, but the problem still exists.
> This is my supervisor.log:
> {code:java}
> // code placeholder
> 2021-07-21 02:03:42.070 o.a.s.u.Utils Thread-17 [INFO] Worker Process dcae9231-4be4-4842-9ed0-988e1b8a2b28:Error occurred during initialization of VM
> 2021-07-21 02:03:42.070 o.a.s.u.Utils Thread-17 [INFO] Worker Process dcae9231-4be4-4842-9ed0-988e1b8a2b28:Error occurred during initialization of VM
> 2021-07-21 02:03:42.071 o.a.s.u.Utils Thread-17 [INFO] Worker Process dcae9231-4be4-4842-9ed0-988e1b8a2b28:java.lang.Error: Properties init: Could not determine current working directory.
> 2021-07-21 02:03:42.071 o.a.s.u.Utils Thread-17 [INFO] Worker Process dcae9231-4be4-4842-9ed0-988e1b8a2b28: at java.lang.System.initProperties(Native Method)
> 2021-07-21 02:03:42.071 o.a.s.u.Utils Thread-17 [INFO] Worker Process dcae9231-4be4-4842-9ed0-988e1b8a2b28: at java.lang.System.initializeSystemClass(System.java:1166)
> 2021-07-21 02:03:42.071 o.a.s.u.Utils Thread-17 [INFO] Worker Process dcae9231-4be4-4842-9ed0-988e1b8a2b28:
> 2021-07-21 02:03:42.323 o.a.s.d.s.BasicContainer SLOT_6702 [INFO] Removed Worker ID dcae9231-4be4-4842-9ed0-988e1b8a2b28
> 2021-07-21 02:03:42.329 o.a.s.d.s.Slot SLOT_6702 [INFO] STATE kill msInState: 68588 topo:PradarLogTopology-3-1626751922 worker:null -> empty msInState: 3
> 2021-07-21 02:03:42.329 o.a.s.d.s.Slot SLOT_6702 [INFO] SLOT 6702: Changing current assignment from LocalAssignment(topology_id:PradarLogTopology-3-1626751922, executors:[ExecutorInfo(task_start:4, task_end:4), ExecutorInfo(task_start:1, task_end:1)], resources:WorkerResources(mem_on_heap:256.0, mem_off_heap:0.0, cpu:20.0, shared_mem_on_heap:0.0, shared_mem_off_heap:0.0, resources:{offheap.memory.mb=0.0, onheap.memory.mb=256.0, cpu.pcore.percent=20.0}, shared_resources:{}), owner:root) to null
> 2021-07-21 02:03:42.353 o.a.s.d.s.Supervisor pool-10-thread-1 [WARN] Topology config is not localized yet...
> 2021-07-21 02:03:42.449 o.a.s.d.s.Slot SLOT_6700 [INFO] SLOT 6700 all processes are dead...
> 2021-07-21 02:03:42.449 o.a.s.d.s.Container SLOT_6700 [INFO] Cleaning up 8cbbfd6c-961b-482d-9175-cf9b79473808-172.26.137.86:b7963273-452a-43af-bc00-d814e0629f96
> 2021-07-21 02:03:42.450 o.a.s.d.s.Container SLOT_6700 [INFO] GET worker-user for b7963273-452a-43af-bc00-d814e0629f96
> 2021-07-21 02:03:42.450 o.a.s.d.s.AdvancedFSOps SLOT_6700 [INFO] Deleting path /data/apache-storm-2.1.0/status/workers/b7963273-452a-43af-bc00-d814e0629f96/pids/16326
> 2021-07-21 02:03:43.322 o.a.s.d.s.AdvancedFSOps SLOT_6701 [INFO] Deleting path /data/apache-storm-2.1.0/status/workers/26b5ffbd-08b6-46df-aa04-6b86f78b8ad8/pids
> 2021-07-21 02:03:43.322 o.a.s.d.s.AdvancedFSOps SLOT_6701 [INFO] Deleting path /data/apache-storm-2.1.0/status/workers/26b5ffbd-08b6-46df-aa04-6b86f78b8ad8/tmp
> 2021-07-21 02:03:45.209 o.a.s.d.s.BasicContainer Thread-17 [INFO] Worker Process dcae9231-4be4-4842-9ed0-988e1b8a2b28 exited with code: 1
> 2021-07-21 02:03:45.224 o.a.s.d.s.AdvancedFSOps SLOT_6701 [INFO] Deleting path /data/apache-storm-2.1.0/status/workers/26b5ffbd-08b6-46df-aa04-6b86f78b8ad8
> 2021-07-21 02:03:45.224 o.a.s.d.s.Supervisor pool-10-thread-7 [WARN] Topology config is not localized yet...
> 2021-07-21 02:03:45.224 o.a.s.d.s.Container SLOT_6701 [INFO] REMOVE worker-user 26b5ffbd-08b6-46df-aa04-6b86f78b8ad8
> 2021-07-21 02:03:45.224 o.a.s.d.s.AdvancedFSOps SLOT_6700 [INFO] Deleting path /data/apache-storm-2.1.0/status/workers/b7963273-452a-43af-bc00-d814e0629f96/heartbeats
> 2021-07-21 02:03:45.224 o.a.s.d.s.AdvancedFSOps SLOT_6701 [INFO] Deleting path /data/apache-storm-2.1.0/status/workers-users/26b5ffbd-08b6-46df-aa04-6b86f78b8ad8
> 2021-07-21 02:03:45.224 o.a.s.t.ProcessFunction pool-10-thread-7 [ERROR] Internal error processing sendSupervisorWorkerHeartbeat
> org.apache.storm.utils.WrappedNotAliveException: PradarLinkTopology-2-1626337413 does not appear to be alive, you should probably exit
>     at org.apache.storm.daemon.supervisor.Supervisor$1.sendSupervisorWorkerHeartbeat(Supervisor.java:442) ~[storm-server-2.1.0.jar:2.1.0]
>     at org.apache.storm.generated.Supervisor$Processor$sendSupervisorWorkerHeartbeat.getResult(Supervisor.java:374) ~[storm-client-2.1.0.jar:2.1.0]
>     at org.apache.storm.generated.Supervisor$Processor$sendSupervisorWorkerHeartbeat.getResult(Supervisor.java:353) ~[storm-client-2.1.0.jar:2.1.0]
>     at org.apache.storm.thrift.ProcessFunction.process(ProcessFunction.java:38) [storm-shaded-deps-2.1.0.jar:2.1.0]
>     at org.apache.storm.thrift.TBaseProcessor.process(TBaseProcessor.java:39) [storm-shaded-deps-2.1.0.jar:2.1.0]
>     at org.apache.storm.security.auth.SimpleTransportPlugin$SimpleWrapProcessor.process(SimpleTransportPlugin.java:174) [storm-client-2.1.0.jar:2.1.0]
>     at org.apache.storm.thrift.server.AbstractNonblockingServer$FrameBuffer.invoke(AbstractNonblockingServer.java:518) [storm-shaded-deps-2.1.0.jar:2.1.0]
>     at org.apache.storm.thrift.server.Invocation.run(Invocation.java:18) [storm-shaded-deps-2.1.0.jar:2.1.0]
>     at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) [?:1.8.0_201]
>     at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) [?:1.8.0_201]
>     at java.lang.Thread.run(Thread.java:748) [?:1.8.0_201]
> 2021-07-21 02:03:45.225 o.a.s.d.s.AdvancedFSOps SLOT_6700 [INFO] Deleting path /data/apache-storm-2.1.0/status/workers/b7963273-452a-43af-bc00-d814e0629f96/pids
> 2021-07-21 02:03:45.225 o.a.s.d.s.BasicContainer Thread-16 [INFO] Worker Process b7963273-452a-43af-bc00-d814e0629f96 exited with code: 254
> 2021-07-21 02:03:45.225 o.a.s.d.s.AdvancedFSOps SLOT_6700 [INFO] Deleting path /data/apache-storm-2.1.0/status/workers/b7963273-452a-43af-bc00-d814e0629f96/tmp
> 2021-07-21 02:03:45.226 o.a.s.d.s.AdvancedFSOps SLOT_6700 [INFO] Deleting path /data/apache-storm-2.1.0/status/workers/b7963273-452a-43af-bc00-d814e0629f96
> 2021-07-21 02:03:45.226 o.a.s.d.s.Container SLOT_6700 [INFO] REMOVE worker-user b7963273-452a-43af-bc00-d814e0629f96
> 2021-07-21 02:03:45.226 o.a.s.d.s.AdvancedFSOps SLOT_6700 [INFO] Deleting path /data/apache-storm-2.1.0/status/workers-users/b7963273-452a-43af-bc00-d814e0629f96
> 2021-07-21 02:03:45.227 o.a.s.d.s.BasicContainer SLOT_6701 [INFO] Removed Worker ID 26b5ffbd-08b6-46df-aa04-6b86f78b8ad8
> 2021-07-21 02:03:45.228 o.a.s.d.s.BasicContainer SLOT_6700 [INFO] Removed Worker ID b7963273-452a-43af-bc00-d814e0629f96
> 2021-07-21 02:03:45.229 o.a.s.d.s.Slot SLOT_6700 [INFO] STATE kill msInState: 81385 topo:PradarLogTopology-3-1626751922 worker:null -> empty msInState: 0
> 2021-07-21 02:03:45.229 o.a.s.d.s.Slot SLOT_6700 [INFO] SLOT 6700: Changing current assignment from LocalAssignment(topology_id:PradarLogTopology-3-1626751922, executors:[ExecutorInfo(task_start:3, task_end:3)], resources:WorkerResources(mem_on_heap:128.0, mem_off_heap:0.0, cpu:10.0, shared_mem_on_heap:0.0, shared_mem_off_heap:0.0, resources:{offheap.memory.mb=0.0, onheap.memory.mb=128.0, cpu.pcore.percent=10.0}, shared_resources:{}), owner:root) to null
> 2021-07-21 02:03:45.230 o.a.s.d.s.Slot SLOT_6701 [INFO] STATE kill-and-relaunch msInState: 95356 topo:PradarLogTopology-3-1626751922 worker:null -> waiting-for-blob-localization msInState: 1
> 2021-07-21 02:03:45.231 o.a.s.d.s.Slot SLOT_6701 [INFO] SLOT 6701: Changing current assignment from LocalAssignment(topology_id:PradarLogTopology-3-1626751922, executors:[ExecutorInfo(task_start:3, task_end:3)], resources:WorkerResources(mem_on_heap:128.0, mem_off_heap:0.0, cpu:10.0, shared_mem_on_heap:0.0, shared_mem_off_heap:0.0, resources:{offheap.memory.mb=0.0, onheap.memory.mb=128.0, cpu.pcore.percent=10.0}, shared_resources:{}), owner:root) to null
> 2021-07-21 02:03:45.231 o.a.s.d.s.Slot SLOT_6700 [INFO] STATE empty msInState: 2 -> waiting-for-blob-localization msInState: 0
> 2021-07-21 02:03:45.232 o.a.s.d.s.Slot SLOT_6701 [ERROR] Error when processing event
> java.io.FileNotFoundException: File '/data/apache-storm-2.1.0/status/supervisor/stormdist/PradarLinkTopology-4-1626751925/stormconf.ser' does not exist
>     at org.apache.storm.shade.org.apache.commons.io.FileUtils.openInputStream(FileUtils.java:297) ~[storm-shaded-deps-2.1.0.jar:2.1.0]
>     at org.apache.storm.shade.org.apache.commons.io.FileUtils.readFileToByteArray(FileUtils.java:1851) ~[storm-shaded-deps-2.1.0.jar:2.1.0]
>     at org.apache.storm.utils.ConfigUtils.readSupervisorStormConfGivenPath(ConfigUtils.java:303) ~[storm-client-2.1.0.jar:2.1.0]
>     at org.apache.storm.utils.ConfigUtils.readSupervisorStormConfImpl(ConfigUtils.java:464) ~[storm-client-2.1.0.jar:2.1.0]
>     at org.apache.storm.utils.ConfigUtils.readSupervisorStormConf(ConfigUtils.java:298) ~[storm-client-2.1.0.jar:2.1.0]
>     at org.apache.storm.localizer.AsyncLocalizer.getLocalResources(AsyncLocalizer.java:351) ~[storm-server-2.1.0.jar:2.1.0]
>     at org.apache.storm.localizer.AsyncLocalizer.releaseSlotFor(AsyncLocalizer.java:452) ~[storm-server-2.1.0.jar:2.1.0]
>     at org.apache.storm.daemon.supervisor.Slot.handleWaitingForBlobLocalization(Slot.java:440) ~[storm-server-2.1.0.jar:2.1.0]
>     at org.apache.storm.daemon.supervisor.Slot.stateMachineStep(Slot.java:228) ~[storm-server-2.1.0.jar:2.1.0]
>     at org.apache.storm.daemon.supervisor.Slot.run(Slot.java:931) [storm-server-2.1.0.jar:2.1.0]
> 2021-07-21 02:03:45.234 o.a.s.u.Utils SLOT_6701 [ERROR] Halting process: Error when processing an event
> java.lang.RuntimeException: Halting process: Error when processing an event
>     at org.apache.storm.utils.Utils.exitProcess(Utils.java:512) [storm-client-2.1.0.jar:2.1.0]
>     at org.apache.storm.daemon.supervisor.Slot.run(Slot.java:978) [storm-server-2.1.0.jar:2.1.0]
> 2021-07-21 02:03:45.235 o.a.s.d.s.BasicContainer SLOT_6700 [INFO] Created Worker ID 68102ac7-a341-4d84-b1aa-db0f72934f99
> 2021-07-21 02:03:45.236 o.a.s.d.s.Container SLOT_6700 [INFO] Setting up 8cbbfd6c-961b-482d-9175-cf9b79473808-172.26.137.86:68102ac7-a341-4d84-b1aa-db0f72934f99
> 2021-07-21 02:03:45.236 o.a.s.d.s.Container SLOT_6700 [INFO] GET worker-user for 68102ac7-a341-4d84-b1aa-db0f72934f99
> 2021-07-21 02:03:45.240 o.a.s.d.s.Container SLOT_6700 [INFO] SET worker-user 68102ac7-a341-4d84-b1aa-db0f72934f99 root
> 2021-07-21 02:03:45.241 o.a.s.d.s.Container SLOT_6700 [INFO] Creating symlinks for worker-id: 68102ac7-a341-4d84-b1aa-db0f72934f99 storm-id: PradarLogTopology-3-1626751922 for files(1): [resources]
> 2021-07-21 02:03:45.241 o.a.s.d.s.BasicContainer SLOT_6700 [INFO] Launching worker with assignment LocalAssignment(topology_id:PradarLogTopology-3-1626751922, executors:[ExecutorInfo(task_start:4, task_end:4), ExecutorInfo(task_start:1, task_end:1)], resources:WorkerResources(mem_on_heap:256.0, mem_off_heap:0.0, cpu:20.0, shared_mem_on_heap:0.0, shared_mem_off_heap:0.0, resources:{offheap.memory.mb=0.0, onheap.memory.mb=256.0, cpu.pcore.percent=20.0}, shared_resources:{}), owner:root) for this supervisor 8cbbfd6c-961b-482d-9175-cf9b79473808-172.26.137.86 on port 6700 with id 68102ac7-a341-4d84-b1aa-db0f72934f99
> 2021-07-21 02:03:45.243 o.a.s.d.s.BasicContainer SLOT_6700 [INFO] Launching worker with command: '/usr/local/java/bin/java' '-cp' '/data/apache-storm-2.1.0/lib-worker/*:/data/apache-storm-2.1.0/extlib/*:/data/apache-storm-2.1.0/conf:/data/apache-storm-2.1.0/status/supervisor/stormdist/PradarLogTopology-3-1626751922/stormjar.jar' '-Xmx64m' '-Dlogging.sensitivity=S3' '-Dlogfile.name=worker.log' '-Dstorm.home=/data/apache-storm-2.1.0' '-Dworkers.artifacts=/data/apache-storm-2.1.0/logs/workers-artifacts' '-Dstorm.id=PradarLogTopology-3-1626751922' '-Dworker.id=68102ac7-a341-4d84-b1aa-db0f72934f99' '-Dworker.port=6700' '-Dstorm.log.dir=/data/apache-storm-2.1.0/logs' '-DLog4jContextSelector=org.apache.logging.log4j.core.selector.BasicContextSelector' '-Dstorm.local.dir=/data/apache-storm-2.1.0/status' '-Dworker.memory_limit_mb=256' '-Dlog4j.configurationFile=/data/apache-storm-2.1.0/log4j2/worker.xml' 'org.apache.storm.LogWriter' '/usr/local/java/bin/java' '-server' '-Dlogging.sensitivity=S3' '-Dlogfile.name=worker.log' '-Dstorm.home=/data/apache-storm-2.1.0' '-Dworkers.artifacts=/data/apache-storm-2.1.0/logs/workers-artifacts' '-Dstorm.id=PradarLogTopology-3-1626751922' '-Dworker.id=68102ac7-a341-4d84-b1aa-db0f72934f99' '-Dworker.port=6700' '-Dstorm.log.dir=/data/apache-storm-2.1.0/logs' '-DLog4jContextSelector=org.apache.logging.log4j.core.selector.BasicContextSelector' '-Dstorm.local.dir=/data/apache-storm-2.1.0/status' '-Dworker.memory_limit_mb=256' '-Dlog4j.configurationFile=/data/apache-storm-2.1.0/log4j2/worker.xml' '-Xmx256m' '-XX:+PrintGCDetails' '-Xloggc:artifacts/gc.log' '-XX:+PrintGCDateStamps' '-XX:+PrintGCTimeStamps' '-XX:+UseGCLogFileRotation' '-XX:NumberOfGCLogFiles=10' '-XX:GCLogFileSize=1M' '-XX:+HeapDumpOnOutOfMemoryError' '-XX:HeapDumpPath=artifacts/heapdump' '-Xms2g' '-Xmx2g' '-XX:MaxDirectMemorySize=512m' '-XX:+HeapDumpOnOutOfMemoryError' '-XX:HeapDumpPath=java.hprof' '-XX:MetaspaceSize=256m' '-XX:MaxMetaspaceSize=256m' '-XX:-OmitStackTraceInFastThrow' '-Djava.library.path=/data/apache-storm-2.1.0/status/supervisor/stormdist/PradarLogTopology-3-1626751922/resources/Linux-amd64:/data/apache-storm-2.1.0/status/supervisor/stormdist/PradarLogTopology-3-1626751922/resources:/usr/local/lib:/opt/local/lib:/usr/lib:/usr/lib64' '-Dstorm.conf.file=' '-Dstorm.options=' '-Djava.io.tmpdir=/data/apache-storm-2.1.0/status/workers/68102ac7-a341-4d84-b1aa-db0f72934f99/tmp' '-cp' '/data/apache-storm-2.1.0/lib-worker/*:/data/apache-storm-2.1.0/extlib/*:/data/apache-storm-2.1.0/conf:/data/apache-storm-2.1.0/status/supervisor/stormdist/PradarLogTopology-3-1626751922/stormjar.jar' 'org.apache.storm.daemon.worker.Worker' 'PradarLogTopology-3-1626751922' '8cbbfd6c-961b-482d-9175-cf9b79473808-172.26.137.86' '6628' '6700' '68102ac7-a341-4d84-b1aa-db0f72934f99'.
> 2021-07-21 02:03:45.243 o.a.s.u.Utils Thread-5 [INFO] Halting after 1 seconds
> 2021-07-21 02:03:45.244 o.a.s.d.s.Supervisor Thread-6 [INFO] Shutting down supervisor 8cbbfd6c-961b-482d-9175-cf9b79473808-172.26.137.86
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
