[ https://issues.apache.org/jira/browse/HBASE-1042?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
stack updated HBASE-1042:
-------------------------
Attachment: 1042.patch
This patch catches Throwable in Leases and, on catching one, exits after
cleaning up existing leases. It also adds a test to checkOOME for the special
OOME that comes up out of a mapfile, where the OOME appears in the IOE message
rather than as the cause of the IOE (see the example below), so that we abort
rather than miss the OOME as we do now.
{code}
2008-12-02 12:58:51,274 INFO org.apache.hadoop.ipc.Server: IPC Server handler 0 on 60020, call openScanner([EMAIL PROTECTED], [EMAIL PROTECTED], [EMAIL PROTECTED], 9223372036854775807, null) from XX.XX.XX.106:55041: error: java.io.IOException: java.lang.OutOfMemoryError: Java heap space
java.io.IOException: java.lang.OutOfMemoryError: Java heap space
    at org.apache.hadoop.io.MapFile$Reader.readIndex(MapFile.java:337)
    at org.apache.hadoop.io.MapFile$Reader.midKey(MapFile.java:368)
    at org.apache.hadoop.hbase.io.HBaseMapFile$HBaseReader.<init>(HBaseMapFile.java:93)
    at org.apache.hadoop.hbase.io.BloomFilterMapFile$Reader.<init>(BloomFilterMapFile.java:66)
    at org.apache.hadoop.hbase.regionserver.HStoreFile.getReader(HStoreFile.java:443)
    at org.apache.hadoop.hbase.regionserver.StoreFileScanner.openReaders(StoreFileScanner.java:96)
    at org.apache.hadoop.hbase.regionserver.StoreFileScanner.<init>(StoreFileScanner.java:67)
    at org.apache.hadoop.hbase.regionserver.HStoreScanner.<init>(HStoreScanner.java:84)
    at org.apache.hadoop.hbase.regionserver.HStore.getScanner(HStore.java:2095)
    at org.apache.hadoop.hbase.regionserver.HRegion$HScanner.<init>(HRegion.java:1977)
    at org.apache.hadoop.hbase.regionserver.HRegion.getScanner(HRegion.java:1159)
    at org.apache.hadoop.hbase.regionserver.HRegionServer.openScanner(HRegionServer.java:1441)
    at sun.reflect.GeneratedMethodAccessor18.invoke(Unknown Source)
    at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
    at java.lang.reflect.Method.invoke(Method.java:597)
    at org.apache.hadoop.hbase.ipc.HbaseRPC$Server.call(HbaseRPC.java:634)
    at org.apache.hadoop.ipc.Server$Handler.run(Server.java:888)
{code}
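For illustration, a check that also recognizes an OOME reported only in an IOE message might look roughly like the sketch below. This is not the code in 1042.patch: the class name OomeCheckSketch and the method isOOME are hypothetical, and the string match assumes the message carries the error's class name, as in the trace above.

```java
import java.io.IOException;

// Hypothetical sketch; not the checkOOME code from 1042.patch.
final class OomeCheckSketch {
  private OomeCheckSketch() {}

  /**
   * Returns true if t is an OutOfMemoryError, has one somewhere in its
   * cause chain, or is (or wraps) an IOException whose message names one.
   */
  static boolean isOOME(Throwable t) {
    final String oomeName = OutOfMemoryError.class.getName();
    int depth = 0;
    // Bound the walk so a cyclic cause chain cannot loop forever.
    for (Throwable cur = t; cur != null && depth < 32; cur = cur.getCause(), depth++) {
      if (cur instanceof OutOfMemoryError) {
        return true;
      }
      if (cur instanceof IOException) {
        String msg = cur.getMessage();
        // The mapfile case: the OOME is only visible in the message text.
        if (msg != null && msg.contains(oomeName)) {
          return true;
        }
      }
    }
    return false;
  }

  public static void main(String[] args) {
    Throwable fromMapFile =
        new IOException("java.lang.OutOfMemoryError: Java heap space");
    System.out.println(isOOME(fromMapFile));
    System.out.println(isOOME(new IOException("some other failure")));
  }
}
```

A caller that sees isOOME return true would then take the abort path instead of carrying on, which is the behavior the patch is after.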
> OOME but we don't abort
> -----------------------
>
> Key: HBASE-1042
> URL: https://issues.apache.org/jira/browse/HBASE-1042
> Project: Hadoop HBase
> Issue Type: Bug
> Reporter: stack
> Fix For: 0.19.0
>
> Attachments: 1042.patch
>
>
> On the streamy cluster, saw a case where a graceful shutdown had been triggered
> rather than an abort on OOME. On graceful shutdown, we wait on leases to expire
> or be closed. The server wouldn't go down because it was waiting on leases to
> expire, but an OOME in Leases had killed the thread, so it was never going
> to expire anything. The node was stuck for four hours until someone noticed it.
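
The failure mode described above, and the general shape of a guard against it, can be sketched as follows. This is a simplified stand-in, not the HBase Leases class: LeaseCheckerSketch and all of its members are hypothetical names, and clearing the lease map on failure is only one assumed reading of "cleaning up existing leases".

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch of a lease-checker loop; not the HBase Leases class.
class LeaseCheckerSketch implements Runnable {
  // Lease name -> expiry time in milliseconds since the epoch.
  private final Map<String, Long> leases = new ConcurrentHashMap<>();
  private volatile boolean stopRequested = false;
  private volatile Throwable failure = null;

  void addLease(String name, long expiresAtMillis) {
    leases.put(name, expiresAtMillis);
  }

  int outstanding() { return leases.size(); }

  Throwable failure() { return failure; }

  void stop() { stopRequested = true; }

  // Expiry hook; a real implementation would notify a lease listener here.
  protected void expire(String name) {
    leases.remove(name);
  }

  @Override
  public void run() {
    try {
      while (!stopRequested) {
        long now = System.currentTimeMillis();
        for (Map.Entry<String, Long> e : leases.entrySet()) {
          if (e.getValue() <= now) {
            expire(e.getKey());
          }
        }
        Thread.sleep(10);
      }
    } catch (Throwable t) {
      // Catch everything, including Errors such as OOME, so the loop never
      // dies silently while a shutdown is waiting on leases it will never expire.
      if (t instanceof InterruptedException) {
        Thread.currentThread().interrupt();
      }
      failure = t;
      leases.clear();
    }
  }

  public static void main(String[] args) {
    LeaseCheckerSketch checker = new LeaseCheckerSketch() {
      @Override
      protected void expire(String name) {
        throw new OutOfMemoryError("simulated");
      }
    };
    checker.addLease("scanner-1", 0L);
    checker.run();
    System.out.println(checker.failure() + ", outstanding=" + checker.outstanding());
  }
}
```

With the catch in place, anything waiting for outstanding() to reach zero is released, and the recorded failure can be inspected to decide whether to abort.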