[ 
https://issues.apache.org/jira/browse/HBASE-854?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12627096#action_12627096
 ] 

Jim Kellerman commented on HBASE-854:
-------------------------------------

This is very strange. The failures only occur on the second test case in a 
Junit test in which we spin up multiple HBase and Hadoop clusters. But it 
doesn't always fail - that's the baffling part. The following exception causes 
the region server to abort, and because the client application cannot connect, 
the test times out. 

{code}
    [junit] 2008-08-26 21:24:50,536 FATAL [RegionServer:0] 
regionserver.HRegionServer(424): Unhandled exception. Aborting...
    [junit] java.io.IOException: Call failed on local exception
    [junit]     at org.apache.hadoop.ipc.Client.call(Client.java:718)
    [junit]     at 
org.apache.hadoop.hbase.ipc.HbaseRPC$Invoker.invoke(HbaseRPC.java:230)
    [junit]     at $Proxy8.getProtocolVersion(Unknown Source)
    [junit]     at 
org.apache.hadoop.hbase.ipc.HbaseRPC.getProxy(HbaseRPC.java:340)
    [junit]     at 
org.apache.hadoop.hbase.ipc.HbaseRPC.getProxy(HbaseRPC.java:327)
    [junit]     at 
org.apache.hadoop.hbase.ipc.HbaseRPC.getProxy(HbaseRPC.java:364)
    [junit]     at 
org.apache.hadoop.hbase.ipc.HbaseRPC.waitForProxy(HbaseRPC.java:302)
    [junit]     at 
org.apache.hadoop.hbase.regionserver.HRegionServer.reportForDuty(HRegionServer.java:706)
    [junit]     at 
org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:291)
    [junit]     at java.lang.Thread.run(Thread.java:619)
    [junit] Caused by: java.io.EOFException
    [junit]     at java.io.DataInputStream.readInt(DataInputStream.java:375)
    [junit]     at 
org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:499)
    [junit]     at org.apache.hadoop.ipc.Client$Connection.run(Client.java:441)
    [junit] 2008-08-26 21:24:50,538 DEBUG [RegionServer:0] 
hbase.RegionHistorian(316): Offlined
    [junit] Exception in thread "RegionServer:0" java.lang.NullPointerException
    [junit]     at 
org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:450)
    [junit]     at java.lang.Thread.run(Thread.java:619)
{code}

Now it could be related to the fact that the client cache is static in HbaseRPC:

{code}
  private static ClientCache CLIENTS=new ClientCache();
{code}

But why don't all tests that have multiple test cases fail all the time on the 
second cluster spin up?


> hbase-841 broke build on hudson?
> --------------------------------
>
>                 Key: HBASE-854
>                 URL: https://issues.apache.org/jira/browse/HBASE-854
>             Project: Hadoop HBase
>          Issue Type: Bug
>            Reporter: stack
>            Assignee: Jim Kellerman
>            Priority: Blocker
>             Fix For: 0.18.0
>
>         Attachments: hbase-854.patch
>
>
> Jim, you want to take a look at it?
> 841 changed interfaces.  Changed interfaces can make for odd issues like the 
> hangs exhibited up on hudson (stuff is failing for me on my laptop since 
> about the commit 841... timeouts.  I don't have same issue on branch).

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to