>
> 2014-08-14 21:35:16,740 WARN org.apache.hadoop.hbase.util.Sleeper: We
> slept
> 14912ms instead of 3000ms, this is likely due to a long garbage collecting
> pause and it's usually bad, see
> http://hbase.apache.org/book.html#trouble.rs.runtime.zkexpired


I would check your gc logs for long gc pauses.

On Fri, May 29, 2015 at 11:38 AM, rahul malviya <malviyarahul2...@gmail.com>
wrote:

> Hi All,
>
> In our cluster region server logs are filled with response too slow
> message. This is causing jobs to slow down. How can I debug what is the
> reason for this slowness.
>
> We have enabled short circuit reads and region server has 27GB RAM.
>
> Here is a trace when regionserver starts.
>
> Thu Aug 14 20:23:51 GMT 2014 Starting regionserver on nodex
> core file size          (blocks, -c) 0
> data seg size           (kbytes, -d) unlimited
> scheduling priority             (-e) 0
> file size               (blocks, -f) unlimited
> pending signals                 (-i) 966365
> max locked memory       (kbytes, -l) 64
> max memory size         (kbytes, -m) unlimited
> open files                      (-n) 32768
> pipe size            (512 bytes, -p) 8
> POSIX message queues     (bytes, -q) 819200
> real-time priority              (-r) 0
> stack size              (kbytes, -s) 10240
> cpu time               (seconds, -t) unlimited
> max user processes              (-u) 966365
> virtual memory          (kbytes, -v) unlimited
> file locks                      (-x) unlimited
> 2014-08-14 20:23:53,341 WARN org.apache.hadoop.conf.Configuration:
> fs.default.name is deprecated. Instead, use fs.defaultFS
> 2014-08-14 20:23:53,342 WARN org.apache.hadoop.conf.Configuration:
> mapred.task.id is deprecated. Instead, use mapreduce.task.attempt.id
> 2014-08-14 20:23:53,884 WARN org.apache.hadoop.conf.Configuration:
> slave.host.name is deprecated. Instead, use
> mapreduce.tasktracker.host.name
> 2014-08-14 20:24:03,999 WARN org.apache.hadoop.conf.Configuration:
> hadoop.native.lib is deprecated. Instead, use io.native.lib.available
> 2014-08-14 20:26:47,605 ERROR
> org.apache.hadoop.hbase.regionserver.metrics.SchemaMetrics: Inconsistent
> configuration. Previous configuration for using table name in metrics:
> true, new configuration: false
> 2014-08-14 20:28:23,491 WARN org.apache.hadoop.ipc.HBaseServer:
> (responseTooSlow):
> {"processingtimems":18725,"call":"next(-8041903839443097981, 10), rpc
> version=1, client version=29, methodsFingerPrint=-1368823753","client":"
> 17.170.176.248:58716
>
> ","starttimems":1408048084720,"queuetimems":0,"class":"HRegionServer","responsesize":5031595,"method":"next"}
> 2014-08-14 21:35:16,740 WARN org.apache.hadoop.hbase.util.Sleeper: We slept
> 14912ms instead of 3000ms, this is likely due to a long garbage collecting
> pause and it's usually bad, see
> http://hbase.apache.org/book.html#trouble.rs.runtime.zkexpired
> 2014-08-14 21:42:28,477 WARN org.apache.hadoop.ipc.HBaseServer:
> (responseTooSlow):
> {"processingtimems":16968,"call":"next(5487686201525374976, 10), rpc
> version=1, client version=29, methodsFingerPrint=-1368823753","client":"
> 17.170.176.249:36657
>
> ","starttimems":1408052531504,"queuetimems":0,"class":"HRegionServer","responsesize":1959532,"method":"next"}
> 2014-08-14 21:42:56,923 WARN org.apache.hadoop.ipc.HBaseServer:
> (responseTooSlow):
> {"processingtimems":10591,"call":"next(5487686201525374976, 10), rpc
> version=1, client version=29, methodsFingerPrint=-1368823753","client":"
> 17.170.176.249:40818
>
> ","starttimems":1408052566327,"queuetimems":1,"class":"HRegionServer","responsesize":2987578,"method":"next"}
> 2014-08-14 21:44:24,372 WARN org.apache.hadoop.ipc.HBaseServer:
> (responseTooSlow):
> {"processingtimems":10656,"call":"next(5487686201525374976, 10), rpc
> version=1, client version=29, methodsFingerPrint=-1368823753","client":"
> 17.170.176.249:41993
>
> ","starttimems":1408052653710,"queuetimems":1,"class":"HRegionServer","responsesize":3039779,"method":"next"}
> 2014-08-14 21:45:50,598 WARN org.apache.hadoop.ipc.HBaseServer:
> (responseTooSlow):
> {"processingtimems":12418,"call":"next(5487686201525374976, 10), rpc
> version=1, client version=29, methodsFingerPrint=-1368823753","client":"
> 17.170.176.249:45197
>
> ","starttimems":1408052738174,"queuetimems":10,"class":"HRegionServer","responsesize":2476903,"method":"next"}
> 2014-08-14 21:46:15,187 WARN org.apache.hadoop.ipc.HBaseServer:
> (responseTooSlow):
> {"processingtimems":23766,"call":"next(5487686201525374976, 10), rpc
> version=1, client version=29, methodsFingerPrint=-1368823753","client":"
> 17.170.176.249:49425
>
> ","starttimems":1408052751414,"queuetimems":0,"class":"HRegionServer","responsesize":5681175,"method":"next"}
> 2014-08-14 21:47:09,041 WARN org.apache.hadoop.ipc.HBaseServer:
> (responseTooSlow):
> {"processingtimems":12320,"call":"next(5487686201525374976, 10), rpc
> version=1, client version=29, methodsFingerPrint=-1368823753","client":"
> 17.170.176.249:50269
>
> ","starttimems":1408052816698,"queuetimems":1,"class":"HRegionServer","responsesize":2986949,"method":"next"}
> 2014-08-14 21:49:23,833 WARN org.apache.hadoop.ipc.HBaseServer:
> (responseTooSlow):
> {"processingtimems":11389,"call":"next(1227841280814011139, 10), rpc
> version=1, client version=29, methodsFingerPrint=-1368823753","client":"
> 17.170.176.122:41976
>
> ","starttimems":1408052952415,"queuetimems":0,"class":"HRegionServer","responsesize":3160025,"method":"next"}
> 2014-08-14 21:49:23,869 WARN org.apache.hadoop.ipc.HBaseServer: Exception
> while changing ops : java.nio.channels.CancelledKeyException
> 2014-08-14 21:49:23,900 WARN org.apache.hadoop.ipc.HBaseServer:
> (responseTooSlow):
> {"processingtimems":11428,"call":"next(9103372947568217267, 10), rpc
> version=1, client version=29, methodsFingerPrint=-1368823753","client":"
> 17.170.176.41:35241
>
> ","starttimems":1408052952469,"queuetimems":0,"class":"HRegionServer","responsesize":1809158,"method":"next"}
> 2014-08-14 21:49:23,902 WARN org.apache.hadoop.ipc.HBaseServer:
> (responseTooSlow):
> {"processingtimems":11415,"call":"next(-3120240140302998196, 10), rpc
> version=1, client version=29, methodsFingerPrint=-1368823753","client":"
> 17.170.176.195:46046
>
> ","starttimems":1408052952468,"queuetimems":0,"class":"HRegionServer","responsesize":1826929,"method":"next"}
> 2014-08-14 21:49:24,050 WARN org.apache.hadoop.ipc.HBaseServer:
> (responseTooSlow):
> {"processingtimems":11438,"call":"next(3799907609071248384, 10), rpc
> version=1, client version=29, methodsFingerPrint=-1368823753","client":"
> 17.170.176.154:42797
>
> ","starttimems":1408052952459,"queuetimems":0,"class":"HRegionServer","responsesize":2628568,"method":"next"}
> 2014-08-14 21:49:24,057 WARN org.apache.hadoop.ipc.HBaseServer:
> (responseTooSlow):
> {"processingtimems":11843,"call":"next(-1679362783893333095, 10), rpc
> version=1, client version=29,
> methodsFingerPrint=-1368823753","client":"17.170
>
> Thanks,
> Rahul
>

Reply via email to