[
https://issues.apache.org/jira/browse/HBASE-1072?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
stack updated HBASE-1072:
-------------------------
Attachment: 1072.patch
Add limit to how long we'll wait on dfs close down.
> Change Thread.join on exit to a timed Thread.join
> -------------------------------------------------
>
> Key: HBASE-1072
> URL: https://issues.apache.org/jira/browse/HBASE-1072
> Project: Hadoop HBase
> Issue Type: Bug
> Reporter: stack
> Fix For: 0.19.0
>
> Attachments: 1072.patch
>
>
> Here is a hungup regionserver stuck on the running of the dfs shutdown thread:
> {code}
> "Thread-11" prio=10 tid=0x00007fcd00a9b400 nid=0x751d waiting on condition
> [0x0000000042458000..0x0000000042458d00]
> java.lang.Thread.State: TIMED_WAITING (sleeping)
> at java.lang.Thread.sleep(Native Method)
> at org.apache.hadoop.ipc.Client.stop(Client.java:667)
> at org.apache.hadoop.ipc.RPC$ClientCache.stopClient(RPC.java:189)
> at org.apache.hadoop.ipc.RPC$ClientCache.access$400(RPC.java:138)
> at org.apache.hadoop.ipc.RPC$Invoker.close(RPC.java:229)
> - locked <0x00007fcd06c6b470> (a org.apache.hadoop.ipc.RPC$Invoker)
> at org.apache.hadoop.ipc.RPC$Invoker.access$500(RPC.java:196)
> at org.apache.hadoop.ipc.RPC.stopProxy(RPC.java:353)
> at org.apache.hadoop.hdfs.DFSClient.close(DFSClient.java:213)
> - locked <0x00007fcd06c6b3a0> (a org.apache.hadoop.hdfs.DFSClient)
> at
> org.apache.hadoop.hdfs.DistributedFileSystem.close(DistributedFileSystem.java:243)
> at
> org.apache.hadoop.fs.FileSystem$Cache.closeAll(FileSystem.java:1413)
> - locked <0x00007fcd06ab9b68> (a
> org.apache.hadoop.fs.FileSystem$Cache)
> at org.apache.hadoop.fs.FileSystem.closeAll(FileSystem.java:236)
> at
> org.apache.hadoop.fs.FileSystem$ClientFinalizer.run(FileSystem.java:221)
> - locked <0x00007fcd06aaeee0> (a
> org.apache.hadoop.fs.FileSystem$ClientFinalizer)
> {code}
> Above is just doing this:
> {code}
> // wait until all connections are closed
> while (!connections.isEmpty()) {
> try {
> Thread.sleep(100);
> } catch (InterruptedException e) {
> }
> }
> {code}
> Might never go down or wont' go down promptly.
> Should interrupt threads if join timesout and just continue with exit.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.