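Hi Prasanna,
Maybe it is because there is very little free memory left on your system. I
found this in your log:
2018-01-04 11:58:31,002 INFO [BadQueryDetector] service.BadQueryDetector:160 :
System free memory less than 100 MB. 0 queries running.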
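
If you want to check how often the memory warning lines up with the HBase and ZooKeeper timeouts, a small script like the one below can count them in kylin.log. This is only a rough sketch: the three message patterns are copied from the log excerpt in this thread, while the function name `summarize_log` and the counter keys are my own and not part of Kylin.

```python
import re
import sys
from collections import Counter

# Message fragments copied from the kylin.log excerpt in this thread.
LOW_MEMORY = re.compile(r"System free memory less than \d+ MB")
CALL_TIMEOUT = re.compile(r"CallTimeoutException")
ZK_TIMEOUT = re.compile(r"Client session timed out")


def summarize_log(text):
    """Count lines with low-memory warnings, HBase call timeouts and
    ZooKeeper session timeouts in the given log text.

    The name and the counter keys are hypothetical, chosen for this sketch.
    """
    counts = Counter()
    for line in text.splitlines():
        if LOW_MEMORY.search(line):
            counts["low_memory"] += 1
        if CALL_TIMEOUT.search(line):
            counts["hbase_call_timeout"] += 1
        if ZK_TIMEOUT.search(line):
            counts["zk_session_timeout"] += 1
    return counts


if __name__ == "__main__" and len(sys.argv) > 1:
    # Usage: python summarize_kylin_log.py /path/to/kylin.log
    with open(sys.argv[1], encoding="utf-8", errors="replace") as f:
        print(dict(summarize_log(f.read())))
```

If the low-memory count is high around the time of the timeouts, that would support the idea that the Kylin host is short of memory; it does not prove it, so it is still worth checking the region server's own logs.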
------------------ Original ------------------
From: "Prasanna";<[email protected]>;
Date: Mon, Jan 8, 2018 02:09 PM
To: "user"<[email protected]>;
Subject: If hbase region server is down,then will down or not?
Hi all,
I have a small question about Kylin: if an HBase region server goes down, will
the Kylin service go down automatically, or will it keep working? I have been
facing this issue for a long time. Sometimes a region server in my cluster
goes down, and then my Kylin service goes down as well. I am attaching the
logs; can you please confirm what the root cause is?
Logs: kylin.log
----------------------
Thu Jan 04 14:27:44 GMT+08:00 2018,
RpcRetryingCaller{globalStartTime=1515047253026, pause=100, retries=1},
java.io.IOException: Call to ICCC-THBDPS03.EYEWAY.local/10.82.0.19:16020 failed
on local exception: org.apache.hadoop.hbase.ipc.CallTimeoutException: Call
id=475450, waitTime=11172, operationTimeout=10000 expired.
2018-01-04 11:58:03,998 INFO
[localhost-startStop-1-SendThread(ICCC-THBDPS03.EYEWAY.local:2181)]
zookeeper.ClientCnxn:1019 : Opening socket connection to server
ICCC-THBDPS03.EYEWAY.local/10.82.0.19:2181. Will not attempt to authenticate
using SASL (unknown error)
2018-01-04 11:58:03,999 INFO
[localhost-startStop-1-SendThread(ICCC-THBDPS03.EYEWAY.local:2181)]
zookeeper.ClientCnxn:864 : Socket connection established to
ICCC-THBDPS03.EYEWAY.local/10.82.0.19:2181, initiating session
2018-01-04 11:58:19,376 INFO
[localhost-startStop-1-SendThread(ICCC-THBDPS03.EYEWAY.local:2181)]
zookeeper.ClientCnxn:1140 : Client session timed out, have not heard from
server in 15377ms for sessionid 0x360bc7adc000011, closing socket connection
and attempting reconnect
2018-01-04 11:58:19,376 ERROR [pool-8-thread-1] dao.ExecutableDao:155 : error
get all Jobs:
org.apache.hadoop.hbase.client.RetriesExhaustedException: Failed after
attempts=1, exceptions:
Thu Jan 04 14:28:03 GMT+08:00 2018,
RpcRetryingCaller{globalStartTime=1515047272716, pause=100, retries=1},
java.io.IOException: Call to ICCC-THBDPS03.EYEWAY.local/10.82.0.19:16020 failed
on local exception: org.apache.hadoop.hbase.ipc.CallTimeoutException: Call
id=475451, waitTime=11282, operationTimeout=5000 expired.
at
org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:147)
at
org.apache.hadoop.hbase.client.ResultBoundedCompletionService$QueueingFuture.run(ResultBoundedCompletionService.java:65)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)
Caused by: java.io.IOException: Call to
ICCC-THBDPS03.EYEWAY.local/10.82.0.19:16020 failed on local exception:
org.apache.hadoop.hbase.ipc.CallTimeoutException: Call id=475451,
waitTime=11282, operationTimeout=5000 expired.
at
org.apache.hadoop.hbase.ipc.RpcClientImpl.wrapException(RpcClientImpl.java:1261)
at
org.apache.hadoop.hbase.ipc.RpcClientImpl.call(RpcClientImpl.java:1229)
at
org.apache.hadoop.hbase.ipc.AbstractRpcClient.callBlockingMethod(AbstractRpcClient.java:213)
at
org.apache.hadoop.hbase.ipc.AbstractRpcClient$BlockingRpcChannelImplementation.callBlockingMethod(AbstractRpcClient.java:287)
at
org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$BlockingStub.scan(ClientProtos.java:32651)
at
org.apache.hadoop.hbase.client.ScannerCallable.openScanner(ScannerCallable.java:372)
at
org.apache.hadoop.hbase.client.ScannerCallable.call(ScannerCallable.java:199)
at
org.apache.hadoop.hbase.client.ScannerCallable.call(ScannerCallable.java:62)
at
org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithoutRetries(RpcRetryingCaller.java:200)
at
org.apache.hadoop.hbase.client.ScannerCallableWithReplicas$RetryingRPC.call(ScannerCallableWithReplicas.java:356)
at
org.apache.hadoop.hbase.client.ScannerCallableWithReplicas$RetryingRPC.call(ScannerCallableWithReplicas.java:330)
at
org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:126)
... 4 more
Caused by: org.apache.hadoop.hbase.ipc.CallTimeoutException: Call id=475451,
waitTime=11282, operationTimeout=5000 expired.
at
org.apache.hadoop.hbase.ipc.Call.checkAndSetTimeout(Call.java:70)
at
org.apache.hadoop.hbase.ipc.RpcClientImpl.call(RpcClientImpl.java:1203)
... 14 more
2018-01-04 11:58:19,377 ERROR [pool-8-thread-1] execution.ExecutableManager:269
: error get All Job Ids
org.apache.kylin.job.exception.PersistentException:
org.apache.hadoop.hbase.client.RetriesExhaustedException: Failed after
attempts=1, exceptions:
Thu Jan 04 14:28:03 GMT+08:00 2018,
RpcRetryingCaller{globalStartTime=1515047272716, pause=100, retries=1},
java.io.IOException: Call to ICCC-THBDPS03.EYEWAY.local/10.82.0.19:16020 failed
on local exception: org.apache.hadoop.hbase.ipc.CallTimeoutException: Call
id=475451, waitTime=11282, operationTimeout=5000 expired.
at
org.apache.kylin.job.dao.ExecutableDao.getJobIds(ExecutableDao.java:156)
at
org.apache.kylin.job.execution.ExecutableManager.getAllJobIds(ExecutableManager.java:267)
at
org.apache.kylin.job.impl.threadpool.DefaultScheduler$FetcherRunner.run(DefaultScheduler.java:85)
at
java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
at
java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308)
at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180)
at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)
Caused by: org.apache.hadoop.hbase.client.RetriesExhaustedException: Failed
after attempts=1, exceptions:
Thu Jan 04 14:28:03 GMT+08:00 2018,
RpcRetryingCaller{globalStartTime=1515047272716, pause=100, retries=1},
java.io.IOException: Call to ICCC-THBDPS03.EYEWAY.local/10.82.0.19:16020 failed
on local exception: org.apache.hadoop.hbase.ipc.CallTimeoutException: Call
id=475451, waitTime=11282, operationTimeout=5000 expired.
at
org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:147)
at
org.apache.hadoop.hbase.client.ResultBoundedCompletionService$QueueingFuture.run(ResultBoundedCompletionService.java:65)
... 3 more
Caused by: java.io.IOException: Call to
ICCC-THBDPS03.EYEWAY.local/10.82.0.19:16020 failed on local exception:
org.apache.hadoop.hbase.ipc.CallTimeoutException: Call id=475451,
waitTime=11282, operationTimeout=5000 expired.
at
org.apache.hadoop.hbase.ipc.RpcClientImpl.wrapException(RpcClientImpl.java:1261)
at
org.apache.hadoop.hbase.ipc.RpcClientImpl.call(RpcClientImpl.java:1229)
at
org.apache.hadoop.hbase.ipc.AbstractRpcClient.callBlockingMethod(AbstractRpcClient.java:213)
at
org.apache.hadoop.hbase.ipc.AbstractRpcClient$BlockingRpcChannelImplementation.callBlockingMethod(AbstractRpcClient.java:287)
at
org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$BlockingStub.scan(ClientProtos.java:32651)
at
org.apache.hadoop.hbase.client.ScannerCallable.openScanner(ScannerCallable.java:372)
at
org.apache.hadoop.hbase.client.ScannerCallable.call(ScannerCallable.java:199)
at
org.apache.hadoop.hbase.client.ScannerCallable.call(ScannerCallable.java:62)
at
org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithoutRetries(RpcRetryingCaller.java:200)
at
org.apache.hadoop.hbase.client.ScannerCallableWithReplicas$RetryingRPC.call(ScannerCallableWithReplicas.java:356)
at
org.apache.hadoop.hbase.client.ScannerCallableWithReplicas$RetryingRPC.call(ScannerCallableWithReplicas.java:330)
at
org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:126)
... 4 more
Caused by: org.apache.hadoop.hbase.ipc.CallTimeoutException: Call id=475451,
waitTime=11282, operationTimeout=5000 expired.
at
org.apache.hadoop.hbase.ipc.Call.checkAndSetTimeout(Call.java:70)
at
org.apache.hadoop.hbase.ipc.RpcClientImpl.call(RpcClientImpl.java:1203)
... 14 more
2018-01-04 11:58:19,377 WARN [pool-8-thread-1] threadpool.DefaultScheduler:127
: Job Fetcher caught a exception java.lang.RuntimeException:
org.apache.kylin.job.exception.PersistentException:
org.apache.hadoop.hbase.client.RetriesExhaustedException: Failed after
attempts=1, exceptions:
Thu Jan 04 14:28:03 GMT+08:00 2018,
RpcRetryingCaller{globalStartTime=1515047272716, pause=100, retries=1},
java.io.IOException: Call to ICCC-THBDPS03.EYEWAY.local/10.82.0.19:16020 failed
on local exception: org.apache.hadoop.hbase.ipc.CallTimeoutException: Call
id=475451, waitTime=11282, operationTimeout=5000 expired.
2018-01-04 11:58:31,002 INFO [BadQueryDetector] service.BadQueryDetector:160 :
System free memory less than 100 MB. 0 queries running.
2018-01-04 11:58:39,176 INFO
[localhost-startStop-1-SendThread(ICCC-THBDPS01.EYEWAY.local:2181)]
zookeeper.ClientCnxn:1019 : Opening socket connection to server
ICCC-THBDPS01.EYEWAY.local/10.82.0.17:2181. Will not attempt to authenticate
using SASL (unknown error)
2018-01-04 11:58:39,488 ERROR [pool-8-thread-1] dao.ExecutableDao:155 : error
get all Jobs:
org.apache.hadoop.hbase.client.RetriesExhaustedException: Failed after
attempts=1, exceptions:
Thu Jan 04 14:28:39 GMT+08:00 2018,
RpcRetryingCaller{globalStartTime=1515047299715, pause=100, retries=1},
java.io.IOException: Call to ICCC-THBDPS03.EYEWAY.local/10.82.0.19:16020 failed
on local exception: org.apache.hadoop.hbase.ipc.CallTimeoutException: Call
id=475452, waitTime=8173, operationTimeout=5000 expired.
at
org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:147)
at
org.apache.hadoop.hbase.client.ResultBoundedCompletionService$QueueingFuture.run(ResultBoundedCompletionService.java:65)
at
java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
at
java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
at java.lang.Thread.run(Thread.java:745)
Caused by: java.io.IOException: Call to
ICCC-THBDPS03.EYEWAY.local/10.82.0.19:16020 failed on local exception:
org.apache.hadoop.hbase.ipc.CallTimeoutException: Call id=475452,
waitTime=8173, operationTimeout=5000 expired.
at
org.apache.hadoop.hbase.ipc.RpcClientImpl.wrapException(RpcClientImpl.java:1261)
at
org.apache.hadoop.hbase.ipc.RpcClientImpl.call(RpcClientImpl.java:1229)
at
org.apache.hadoop.hbase.ipc.AbstractRpcClient.callBlockingMethod(AbstractRpcClient.java:213)
at
org.apache.hadoop.hbase.ipc.AbstractRpcClient$BlockingRpcChannelImplementation.callBlockingMethod(AbstractRpcClient.java:287)
at
org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$BlockingStub.scan(ClientProtos.java:32651)
at
org.apache.hadoop.hbase.client.ScannerCallable.openScanner(ScannerCallable.java:372)
at
org.apache.hadoop.hbase.client.ScannerCallable.call(ScannerCallable.java:199)
at
org.apache.hadoop.hbase.client.ScannerCallable.call(ScannerCallable.java:62)
at
org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithoutRetries(RpcRetryingCaller.java:200)
at
org.apache.hadoop.hbase.client.ScannerCallableWithReplicas$RetryingRPC.call(ScannerCallableWithReplicas.java:356)
at
org.apache.hadoop.hbase.client.ScannerCallableWithReplicas$RetryingRPC.call(ScannerCallableWithReplicas.java:330)
at
org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:126)
... 4 more
Caused by: org.apache.hadoop.hbase.ipc.CallTimeoutException: Call id=475452,
waitTime=8173, operationTimeout=5000 expired.
at
org.apache.hadoop.hbase.ipc.Call.checkAndSetTimeout(Call.java:70)
at
org.apache.hadoop.hbase.ipc.RpcClientImpl.call(RpcClientImpl.java:1203)
... 14 more
2018-01-04 11:59:49,777 INFO
[localhost-startStop-1-SendThread(ICCC-THBDPSM01.EYEWAY.local:2181)]
zookeeper.ClientCnxn:1140 : Client session timed out, have not heard from
server in 40161ms for sessionid 0x360bc7adc000012, closing socket connection
and attempting reconnect
2018-01-04 11:59:31,385 INFO [BadQueryDetector] service.BadQueryDetector:160 :
System free memory less than 100 MB. 0 queries running.
2018-01-04 11:58:39,488 INFO
[localhost-startStop-1-SendThread(ICCC-THBDPS01.EYEWAY.local:2181)]
zookeeper.ClientCnxn:864 : Socket connection established to
ICCC-THBDPS01.EYEWAY.local/10.82.0.17:2181, initiating session