date:20130624

[jira] [Updated] (HBASE-8667) Master and Regionserver not able to communicate if both bound to different network interfaces on the same machine.

2013-06-24 Thread rajeshbabu (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-8667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

rajeshbabu updated HBASE-8667:
--

Attachment: HBASE-8667_trunk_v6.patch

[~stack]
bq. Seems like a hadoop1 incompatibiity?
Sorry for this. I have built tar ball with default hadoop profile 1.1.2, so 
didnt observe this. In present patch directly binding address to client 
socket(No change in Netutils.connect),so there wont be compatibility issue. I 
have built with 1.0.4 as well, its working fine.
Thanks Stack.

 Master and Regionserver not able to communicate if both bound to different 
 network interfaces on the same machine.
 --

 Key: HBASE-8667
 URL: https://issues.apache.org/jira/browse/HBASE-8667
 Project: HBase
  Issue Type: Bug
  Components: IPC/RPC
Reporter: rajeshbabu
Assignee: rajeshbabu
 Fix For: 0.98.0, 0.95.2, 0.94.9

 Attachments: HBASE-8667_trunk.patch, HBASE-8667_Trunk.patch, 
 HBASE-8667_Trunk-V2.patch, HBASE-8667_trunk_v4.patch, 
 HBASE-8667_trunk_v5.patch, HBASE-8667_trunk_v6.patch


 While testing HBASE-8640 fix found that master and regionserver running on 
 different interfaces are not communicating properly.
 I have two interfaces 1) lo 2) eth0 in my machine and default hostname 
 interface is lo.
 I have configured master ipc address to ip of eth0 interface.
 Started master and regionserver on the same machine.
 1) master rpc server bound to eth0 and RS rpc server bound to lo
 2) Since rpc client is not binding to any ip address, when RS is reporting RS 
 startup its getting registered with eth0 ip address(but actually it should 
 register localhost)
 Here are RS logs:
 {code}
 2013-05-31 06:05:28,608 WARN  [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: reportForDuty failed; 
 sleeping and then retrying.
 2013-05-31 06:05:31,609 INFO  [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Attempting connect to 
 Master server at 192.168.0.100,6,1369960497008
 2013-05-31 06:05:31,609 INFO  [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Telling master at 
 192.168.0.100,6,1369960497008 that we are up with port=60020, 
 startcode=1369960502544
 2013-05-31 06:05:31,618 DEBUG [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Config from master: 
 hbase.rootdir=hdfs://localhost:2851/hbase
 2013-05-31 06:05:31,618 DEBUG [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Config from master: 
 fs.default.name=hdfs://localhost:2851
 2013-05-31 06:05:31,618 INFO  [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Master passed us a 
 different hostname to use; was=localhost, but now=192.168.0.100
 {code}
 Here are master logs:
 {code}
 2013-05-31 06:05:31,615 INFO  [IPC Server handler 9 on 6] 
 org.apache.hadoop.hbase.master.ServerManager: Registering 
 server=192.168.0.100,60020,1369960502544
 {code}
 Since master has wrong rpc server address of RS, META is not getting assigned.
 {code}
 2013-05-31 06:05:34,362 DEBUG [master-192.168.0.100,6,1369960497008] 
 org.apache.hadoop.hbase.master.AssignmentManager: No previous transition plan 
 was found (or we are ignoring an existing plan) for .META.,,1.1028785192 so 
 generated a random one; hri=.META.,,1.1028785192, src=, 
 dest=192.168.0.100,60020,1369960502544; 1 (online=1, available=1) available 
 servers, forceNewPlan=false
 -
 org.apache.hadoop.hbase.master.AssignmentManager: Failed assignment of 
 .META.,,1.1028785192 to 192.168.0.100,60020,1369960502544, trying to assign 
 elsewhere instead; try=1 of 10
 java.net.ConnectException: Connection refused
   at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
   at 
 sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:592)
   at 
 org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206)
   at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:511)
   at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:481)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection.setupConnection(RpcClient.java:549)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection.setupIOstreams(RpcClient.java:813)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient.getConnection(RpcClient.java:1422)
   at org.apache.hadoop.hbase.ipc.RpcClient.call(RpcClient.java:1315)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient.callBlockingMethod(RpcClient.java:1532)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$BlockingRpcChannelImplementation.callBlockingMethod(RpcClient.java:1587)
   at

[jira] [Commented] (HBASE-8667) Master and Regionserver not able to communicate if both bound to different network interfaces on the same machine.

2013-06-24 Thread rajeshbabu (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-8667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13691721#comment-13691721
 ] 

rajeshbabu commented on HBASE-8667:
---

Thanks [~viralbajaria] for testing the patch.
[~anoop.hbase] 
bq. Seems NetUtils#connect(Socket socket, SocketAddress endpoint, SocketAddress 
localAddr, int timeout) not available with hadoop1.
Yes Anoop.Its not present in hadoop 1.0.4, In latest patch avoided this.

 Master and Regionserver not able to communicate if both bound to different 
 network interfaces on the same machine.
 --

 Key: HBASE-8667
 URL: https://issues.apache.org/jira/browse/HBASE-8667
 Project: HBase
  Issue Type: Bug
  Components: IPC/RPC
Reporter: rajeshbabu
Assignee: rajeshbabu
 Fix For: 0.98.0, 0.95.2, 0.94.9

 Attachments: HBASE-8667_trunk.patch, HBASE-8667_Trunk.patch, 
 HBASE-8667_Trunk-V2.patch, HBASE-8667_trunk_v4.patch, 
 HBASE-8667_trunk_v5.patch, HBASE-8667_trunk_v6.patch


 While testing HBASE-8640 fix found that master and regionserver running on 
 different interfaces are not communicating properly.
 I have two interfaces 1) lo 2) eth0 in my machine and default hostname 
 interface is lo.
 I have configured master ipc address to ip of eth0 interface.
 Started master and regionserver on the same machine.
 1) master rpc server bound to eth0 and RS rpc server bound to lo
 2) Since rpc client is not binding to any ip address, when RS is reporting RS 
 startup its getting registered with eth0 ip address(but actually it should 
 register localhost)
 Here are RS logs:
 {code}
 2013-05-31 06:05:28,608 WARN  [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: reportForDuty failed; 
 sleeping and then retrying.
 2013-05-31 06:05:31,609 INFO  [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Attempting connect to 
 Master server at 192.168.0.100,6,1369960497008
 2013-05-31 06:05:31,609 INFO  [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Telling master at 
 192.168.0.100,6,1369960497008 that we are up with port=60020, 
 startcode=1369960502544
 2013-05-31 06:05:31,618 DEBUG [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Config from master: 
 hbase.rootdir=hdfs://localhost:2851/hbase
 2013-05-31 06:05:31,618 DEBUG [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Config from master: 
 fs.default.name=hdfs://localhost:2851
 2013-05-31 06:05:31,618 INFO  [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Master passed us a 
 different hostname to use; was=localhost, but now=192.168.0.100
 {code}
 Here are master logs:
 {code}
 2013-05-31 06:05:31,615 INFO  [IPC Server handler 9 on 6] 
 org.apache.hadoop.hbase.master.ServerManager: Registering 
 server=192.168.0.100,60020,1369960502544
 {code}
 Since master has wrong rpc server address of RS, META is not getting assigned.
 {code}
 2013-05-31 06:05:34,362 DEBUG [master-192.168.0.100,6,1369960497008] 
 org.apache.hadoop.hbase.master.AssignmentManager: No previous transition plan 
 was found (or we are ignoring an existing plan) for .META.,,1.1028785192 so 
 generated a random one; hri=.META.,,1.1028785192, src=, 
 dest=192.168.0.100,60020,1369960502544; 1 (online=1, available=1) available 
 servers, forceNewPlan=false
 -
 org.apache.hadoop.hbase.master.AssignmentManager: Failed assignment of 
 .META.,,1.1028785192 to 192.168.0.100,60020,1369960502544, trying to assign 
 elsewhere instead; try=1 of 10
 java.net.ConnectException: Connection refused
   at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
   at 
 sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:592)
   at 
 org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206)
   at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:511)
   at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:481)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection.setupConnection(RpcClient.java:549)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection.setupIOstreams(RpcClient.java:813)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient.getConnection(RpcClient.java:1422)
   at org.apache.hadoop.hbase.ipc.RpcClient.call(RpcClient.java:1315)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient.callBlockingMethod(RpcClient.java:1532)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$BlockingRpcChannelImplementation.callBlockingMethod(RpcClient.java:1587)
   at 
 org.apache.hadoop.hbase.protobuf.generated.AdminProtos$AdminService$BlockingStub.openRegion(AdminProtos.java:15039)
   at

[jira] [Commented] (HBASE-8783) RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong server name

2013-06-24 Thread Hadoop QA (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-8783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13691740#comment-13691740
]

Hadoop QA commented on HBASE-8783:
--

{color:green}+1 overall{color}. Here are the results of testing the latest
attachment
http://issues.apache.org/jira/secure/attachment/12589372/HBASE-8783-v1.patch
against trunk revision .

{color:green}+1 @author{color}. The patch does not contain any @author
tags.

{color:green}+1 tests included{color}. The patch appears to include 6 new
or modified tests.

{color:green}+1 hadoop1.0{color}. The patch compiles against the hadoop
1.0 profile.

{color:green}+1 hadoop2.0{color}. The patch compiles against the hadoop
2.0 profile.

{color:green}+1 javadoc{color}. The javadoc tool did not generate any
warning messages.

{color:green}+1 javac{color}. The applied patch does not increase the
total number of javac compiler warnings.

{color:green}+1 findbugs{color}. The patch does not introduce any new
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}. The applied patch does not increase
the total number of release audit warnings.

{color:green}+1 lineLengths{color}. The patch does not introduce lines
longer than 100

{color:green}+1 site{color}. The mvn site goal succeeds with this patch.

{color:green}+1 core tests{color}. The patch passed unit tests in .

Test results:
https://builds.apache.org/job/PreCommit-HBASE-Build/6114//testReport/
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6114//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6114//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6114//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6114//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6114//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6114//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6114//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6114//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output:
https://builds.apache.org/job/PreCommit-HBASE-Build/6114//console

This message is automatically generated.

RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong
server name
-

Key: HBASE-8783
URL: https://issues.apache.org/jira/browse/HBASE-8783
Project: HBase
Issue Type: Bug
Components: snapshots
Affects Versions: 0.94.8, 0.95.1
Reporter: Matteo Bertozzi
Assignee: Matteo Bertozzi
Priority: Minor
Fix For: 0.95.2, 0.94.9

Attachments: HBASE-8783-0.94-v0.patch, HBASE-8783-v0.patch,
HBASE-8783-v1.patch

The ZKProcedureMemberRpcs of the RegionServerSnapshotManager may be
initialized with the wrong memberName.
{code}
2013-06-21 05:03:41,732 DEBUG
org.apache.hadoop.hbase.regionserver.HRegionServer: Initialize Snapshot
Manager
...
2013-06-21 05:03:41,875 INFO
org.apache.hadoop.hbase.regionserver.HRegionServer: Master passed us hostname
to use. Was=0.0.0.0, Now=srv-5.test.cloudera.com
{code}
The Region Server Name is used as memberName, but since the snapshot manger
is initialized before the RS receives the server name used by the master, the
zkprocedure will use the wrong name (0.0.0.0).
This will case the snapshot to fail with a TimeoutException since the master
will not receive the expected RS
{code}
Master:
ZKProcedureCoordinatorRpcs: Watching for acquire
node:/hbase/online-snapshot/acquired/foo23/srv-5.test.cloudera.com,60020,1371813451915
RS:
ZKProcedureMemberRpcs: Member: '0.0.0.0,60020,1371814996779' joining acquired
barrier for procedure (foo23) in zk
...
org.apache.hadoop.hbase.errorhandling.TimeoutException: Timeout elapsed!
Source:Timeout caused Foreign Exception Start:1371798732141,
End:1371798792141, diff:6, max:6 ms
{code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-8783) RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong server name

2013-06-24 Thread Matteo Bertozzi (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-8783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matteo Bertozzi updated HBASE-8783:
---

Attachment: HBASE-8783-0.94-v1.patch

v1 fixes the javadoc warning

 RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong 
 server name
 -

 Key: HBASE-8783
 URL: https://issues.apache.org/jira/browse/HBASE-8783
 Project: HBase
  Issue Type: Bug
  Components: snapshots
Affects Versions: 0.94.8, 0.95.1
Reporter: Matteo Bertozzi
Assignee: Matteo Bertozzi
Priority: Minor
 Fix For: 0.95.2, 0.94.9

 Attachments: HBASE-8783-0.94-v0.patch, HBASE-8783-0.94-v1.patch, 
 HBASE-8783-v0.patch, HBASE-8783-v1.patch


 The ZKProcedureMemberRpcs of the RegionServerSnapshotManager may be 
 initialized with the wrong memberName.
 {code}
 2013-06-21 05:03:41,732 DEBUG 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Initialize Snapshot 
 Manager
 ...
 2013-06-21 05:03:41,875 INFO 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Master passed us hostname 
 to use. Was=0.0.0.0, Now=srv-5.test.cloudera.com
 {code}
 The Region Server Name is used as memberName, but since the snapshot manger 
 is initialized before the RS receives the server name used by the master, the 
 zkprocedure will use the wrong name (0.0.0.0). 
 This will case the snapshot to fail with a TimeoutException since the master 
 will not receive the expected RS
 {code}
 Master:
 ZKProcedureCoordinatorRpcs: Watching for acquire 
 node:/hbase/online-snapshot/acquired/foo23/srv-5.test.cloudera.com,60020,1371813451915
 RS:
 ZKProcedureMemberRpcs: Member: '0.0.0.0,60020,1371814996779' joining acquired 
 barrier for procedure (foo23) in zk
 ...
 org.apache.hadoop.hbase.errorhandling.TimeoutException: Timeout elapsed! 
 Source:Timeout caused Foreign Exception Start:1371798732141, 
 End:1371798792141, diff:6, max:6 ms
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-8783) RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong server name

2013-06-24 Thread Hadoop QA (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-8783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13691760#comment-13691760
]

Hadoop QA commented on HBASE-8783:
--

{color:red}-1 overall{color}. Here are the results of testing the latest
attachment

http://issues.apache.org/jira/secure/attachment/12589379/HBASE-8783-0.94-v1.patch
against trunk revision .

{color:green}+1 @author{color}. The patch does not contain any @author
tags.

{color:green}+1 tests included{color}. The patch appears to include 6 new
or modified tests.

{color:red}-1 patch{color}. The patch command could not apply the patch.

Console output:
https://builds.apache.org/job/PreCommit-HBASE-Build/6116//console

This message is automatically generated.

RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong
server name
-

Attachments: HBASE-8783-0.94-v0.patch, HBASE-8783-0.94-v1.patch,
HBASE-8783-v0.patch, HBASE-8783-v1.patch

[jira] [Commented] (HBASE-8783) RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong server name

2013-06-24 Thread Lars Hofhansl (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-8783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13691772#comment-13691772
 ] 

Lars Hofhansl commented on HBASE-8783:
--

+1. Can we commit today, so that I can roll an RC?

 RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong 
 server name
 -

 Key: HBASE-8783
 URL: https://issues.apache.org/jira/browse/HBASE-8783
 Project: HBase
  Issue Type: Bug
  Components: snapshots
Affects Versions: 0.94.8, 0.95.1
Reporter: Matteo Bertozzi
Assignee: Matteo Bertozzi
Priority: Minor
 Fix For: 0.95.2, 0.94.9

 Attachments: HBASE-8783-0.94-v0.patch, HBASE-8783-0.94-v1.patch, 
 HBASE-8783-v0.patch, HBASE-8783-v1.patch


 The ZKProcedureMemberRpcs of the RegionServerSnapshotManager may be 
 initialized with the wrong memberName.
 {code}
 2013-06-21 05:03:41,732 DEBUG 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Initialize Snapshot 
 Manager
 ...
 2013-06-21 05:03:41,875 INFO 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Master passed us hostname 
 to use. Was=0.0.0.0, Now=srv-5.test.cloudera.com
 {code}
 The Region Server Name is used as memberName, but since the snapshot manger 
 is initialized before the RS receives the server name used by the master, the 
 zkprocedure will use the wrong name (0.0.0.0). 
 This will case the snapshot to fail with a TimeoutException since the master 
 will not receive the expected RS
 {code}
 Master:
 ZKProcedureCoordinatorRpcs: Watching for acquire 
 node:/hbase/online-snapshot/acquired/foo23/srv-5.test.cloudera.com,60020,1371813451915
 RS:
 ZKProcedureMemberRpcs: Member: '0.0.0.0,60020,1371814996779' joining acquired 
 barrier for procedure (foo23) in zk
 ...
 org.apache.hadoop.hbase.errorhandling.TimeoutException: Timeout elapsed! 
 Source:Timeout caused Foreign Exception Start:1371798732141, 
 End:1371798792141, diff:6, max:6 ms
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-5083) Backup HMaster should have http infoport open with link to the active master

2013-06-24 Thread Lars Hofhansl (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-5083?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-5083:
-

Attachment: HBASE-5083_trunk.patch

And again

 Backup HMaster should have http infoport open with link to the active master
 

 Key: HBASE-5083
 URL: https://issues.apache.org/jira/browse/HBASE-5083
 Project: HBase
  Issue Type: Improvement
  Components: master
Affects Versions: 0.92.0
Reporter: Jonathan Hsieh
Assignee: Cody Marcel
 Fix For: 0.94.9

 Attachments: backup_master.png, HBASE-5083.patch, HBASE-5083.patch, 
 HBASE-5083.patch, HBASE-5083.patch, HBASE-5083.patch, HBASE-5083_trunk.patch, 
 HBASE-5083_trunk.patch, HBASE-5083_trunk.patch, master.png, 
 Trunk_Backup_Master.png, Trunk_Master.png


 Without ssh'ing and jps/ps'ing, it is difficult to see if a backup hmaster is 
 up.  It seems like it would be good for a backup hmaster to have a basic web 
 page up on the info port so that users could see that it is up.  Also it 
 should probably either provide a link to the active master or automatically 
 forward to the active master.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-8667) Master and Regionserver not able to communicate if both bound to different network interfaces on the same machine.

2013-06-24 Thread Hadoop QA (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-8667?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13691777#comment-13691777
 ] 

Hadoop QA commented on HBASE-8667:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  
http://issues.apache.org/jira/secure/attachment/12589373/HBASE-8667_trunk_v6.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

  {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6115//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6115//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6115//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6115//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6115//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6115//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6115//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6115//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6115//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6115//console

This message is automatically generated.

 Master and Regionserver not able to communicate if both bound to different 
 network interfaces on the same machine.
 --

 Key: HBASE-8667
 URL: https://issues.apache.org/jira/browse/HBASE-8667
 Project: HBase
  Issue Type: Bug
  Components: IPC/RPC
Reporter: rajeshbabu
Assignee: rajeshbabu
 Fix For: 0.98.0, 0.95.2, 0.94.9

 Attachments: HBASE-8667_trunk.patch, HBASE-8667_Trunk.patch, 
 HBASE-8667_Trunk-V2.patch, HBASE-8667_trunk_v4.patch, 
 HBASE-8667_trunk_v5.patch, HBASE-8667_trunk_v6.patch


 While testing HBASE-8640 fix found that master and regionserver running on 
 different interfaces are not communicating properly.
 I have two interfaces 1) lo 2) eth0 in my machine and default hostname 
 interface is lo.
 I have configured master ipc address to ip of eth0 interface.
 Started master and regionserver on the same machine.
 1) master rpc server bound to eth0 and RS rpc server bound to lo
 2) Since rpc client is not binding to any ip address, when RS is reporting RS 
 startup its getting registered with eth0 ip address(but actually it should 
 register localhost)
 Here are RS logs:
 {code}
 2013-05-31 06:05:28,608 WARN  [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: reportForDuty failed; 
 sleeping and then retrying.
 2013-05-31 06:05:31,609 INFO  [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Attempting connect to 
 Master server at 192.168.0.100,6,1369960497008
 2013-05-31 06:05:31,609 INFO  [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Telling master at 
 192.168.0.100,6,1369960497008 that we are up with port=60020,

[jira] [Updated] (HBASE-8783) RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong server name

2013-06-24 Thread Matteo Bertozzi (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-8783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matteo Bertozzi updated HBASE-8783:
---

   Resolution: Fixed
Fix Version/s: 0.98.0
   Status: Resolved  (was: Patch Available)

committed to 0.94, 0.95 and trunk. thanks for the reviews

 RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong 
 server name
 -

 Key: HBASE-8783
 URL: https://issues.apache.org/jira/browse/HBASE-8783
 Project: HBase
  Issue Type: Bug
  Components: snapshots
Affects Versions: 0.94.8, 0.95.1
Reporter: Matteo Bertozzi
Assignee: Matteo Bertozzi
Priority: Minor
 Fix For: 0.98.0, 0.95.2, 0.94.9

 Attachments: HBASE-8783-0.94-v0.patch, HBASE-8783-0.94-v1.patch, 
 HBASE-8783-v0.patch, HBASE-8783-v1.patch


 The ZKProcedureMemberRpcs of the RegionServerSnapshotManager may be 
 initialized with the wrong memberName.
 {code}
 2013-06-21 05:03:41,732 DEBUG 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Initialize Snapshot 
 Manager
 ...
 2013-06-21 05:03:41,875 INFO 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Master passed us hostname 
 to use. Was=0.0.0.0, Now=srv-5.test.cloudera.com
 {code}
 The Region Server Name is used as memberName, but since the snapshot manger 
 is initialized before the RS receives the server name used by the master, the 
 zkprocedure will use the wrong name (0.0.0.0). 
 This will case the snapshot to fail with a TimeoutException since the master 
 will not receive the expected RS
 {code}
 Master:
 ZKProcedureCoordinatorRpcs: Watching for acquire 
 node:/hbase/online-snapshot/acquired/foo23/srv-5.test.cloudera.com,60020,1371813451915
 RS:
 ZKProcedureMemberRpcs: Member: '0.0.0.0,60020,1371814996779' joining acquired 
 barrier for procedure (foo23) in zk
 ...
 org.apache.hadoop.hbase.errorhandling.TimeoutException: Timeout elapsed! 
 Source:Timeout caused Foreign Exception Start:1371798732141, 
 End:1371798792141, diff:6, max:6 ms
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-8776) port HBASE-8723 to 0.94

2013-06-24 Thread Lars Hofhansl (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-8776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-8776:
-

Fix Version/s: (was: 0.94.9)
   0.94.10

Pushing to 0.94.10, since we (or at I) are still discussing.

 port HBASE-8723 to 0.94
 ---

 Key: HBASE-8776
 URL: https://issues.apache.org/jira/browse/HBASE-8776
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.8
Reporter: Sergey Shelukhin
Assignee: Sergey Shelukhin
 Fix For: 0.94.10

 Attachments: HBASE-8776-v0.patch




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-8667) Master and Regionserver not able to communicate if both bound to different network interfaces on the same machine.

2013-06-24 Thread Lars Hofhansl (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-8667?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-8667:
-

Fix Version/s: (was: 0.94.9)
   0.94.10

Pushing to 0.94.10

 Master and Regionserver not able to communicate if both bound to different 
 network interfaces on the same machine.
 --

 Key: HBASE-8667
 URL: https://issues.apache.org/jira/browse/HBASE-8667
 Project: HBase
  Issue Type: Bug
  Components: IPC/RPC
Reporter: rajeshbabu
Assignee: rajeshbabu
 Fix For: 0.98.0, 0.95.2, 0.94.10

 Attachments: HBASE-8667_trunk.patch, HBASE-8667_Trunk.patch, 
 HBASE-8667_Trunk-V2.patch, HBASE-8667_trunk_v4.patch, 
 HBASE-8667_trunk_v5.patch, HBASE-8667_trunk_v6.patch


 While testing HBASE-8640 fix found that master and regionserver running on 
 different interfaces are not communicating properly.
 I have two interfaces 1) lo 2) eth0 in my machine and default hostname 
 interface is lo.
 I have configured master ipc address to ip of eth0 interface.
 Started master and regionserver on the same machine.
 1) master rpc server bound to eth0 and RS rpc server bound to lo
 2) Since rpc client is not binding to any ip address, when RS is reporting RS 
 startup its getting registered with eth0 ip address(but actually it should 
 register localhost)
 Here are RS logs:
 {code}
 2013-05-31 06:05:28,608 WARN  [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: reportForDuty failed; 
 sleeping and then retrying.
 2013-05-31 06:05:31,609 INFO  [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Attempting connect to 
 Master server at 192.168.0.100,6,1369960497008
 2013-05-31 06:05:31,609 INFO  [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Telling master at 
 192.168.0.100,6,1369960497008 that we are up with port=60020, 
 startcode=1369960502544
 2013-05-31 06:05:31,618 DEBUG [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Config from master: 
 hbase.rootdir=hdfs://localhost:2851/hbase
 2013-05-31 06:05:31,618 DEBUG [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Config from master: 
 fs.default.name=hdfs://localhost:2851
 2013-05-31 06:05:31,618 INFO  [regionserver60020] 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Master passed us a 
 different hostname to use; was=localhost, but now=192.168.0.100
 {code}
 Here are master logs:
 {code}
 2013-05-31 06:05:31,615 INFO  [IPC Server handler 9 on 6] 
 org.apache.hadoop.hbase.master.ServerManager: Registering 
 server=192.168.0.100,60020,1369960502544
 {code}
 Since master has wrong rpc server address of RS, META is not getting assigned.
 {code}
 2013-05-31 06:05:34,362 DEBUG [master-192.168.0.100,6,1369960497008] 
 org.apache.hadoop.hbase.master.AssignmentManager: No previous transition plan 
 was found (or we are ignoring an existing plan) for .META.,,1.1028785192 so 
 generated a random one; hri=.META.,,1.1028785192, src=, 
 dest=192.168.0.100,60020,1369960502544; 1 (online=1, available=1) available 
 servers, forceNewPlan=false
 -
 org.apache.hadoop.hbase.master.AssignmentManager: Failed assignment of 
 .META.,,1.1028785192 to 192.168.0.100,60020,1369960502544, trying to assign 
 elsewhere instead; try=1 of 10
 java.net.ConnectException: Connection refused
   at sun.nio.ch.SocketChannelImpl.checkConnect(Native Method)
   at 
 sun.nio.ch.SocketChannelImpl.finishConnect(SocketChannelImpl.java:592)
   at 
 org.apache.hadoop.net.SocketIOWithTimeout.connect(SocketIOWithTimeout.java:206)
   at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:511)
   at org.apache.hadoop.net.NetUtils.connect(NetUtils.java:481)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection.setupConnection(RpcClient.java:549)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$Connection.setupIOstreams(RpcClient.java:813)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient.getConnection(RpcClient.java:1422)
   at org.apache.hadoop.hbase.ipc.RpcClient.call(RpcClient.java:1315)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient.callBlockingMethod(RpcClient.java:1532)
   at 
 org.apache.hadoop.hbase.ipc.RpcClient$BlockingRpcChannelImplementation.callBlockingMethod(RpcClient.java:1587)
   at 
 org.apache.hadoop.hbase.protobuf.generated.AdminProtos$AdminService$BlockingStub.openRegion(AdminProtos.java:15039)
   at 
 org.apache.hadoop.hbase.master.ServerManager.sendRegionOpen(ServerManager.java:627)
   at 
 org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1826)
   at

[jira] [Commented] (HBASE-8656) Rpc call may not be notified in SecureClient

2013-06-24 Thread Lars Hofhansl (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-8656?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13691785#comment-13691785
 ] 

Lars Hofhansl commented on HBASE-8656:
--

[~apurtell] Did you get a chance to test this? Patch looks good, in line with 
the non-secure client.
If not, I'll just push to 0.94.10.

 Rpc call may not be notified in SecureClient
 

 Key: HBASE-8656
 URL: https://issues.apache.org/jira/browse/HBASE-8656
 Project: HBase
  Issue Type: Bug
  Components: Client, IPC/RPC, security
Affects Versions: 0.94.7
Reporter: cuijianwei
Assignee: cuijianwei
 Fix For: 0.94.9

 Attachments: HBASE-8656-0.94-v1.txt


 In SecureClient.java, rpc responses will be processed by receiveResponse() 
 which looks like:
 {code}
 try {
 int id = in.readInt();// try to read an id
 if (LOG.isDebugEnabled())
   LOG.debug(getName() +  got value # + id);
 Call call = calls.remove(id);
 int state = in.readInt(); // read call status
 if (LOG.isDebugEnabled()) {
   LOG.debug(call #+id+ state is  + state);
 }
 if (state == Status.SUCCESS.state) {
   Writable value = ReflectionUtils.newInstance(valueClass, conf);
   value.readFields(in); // read value
   if (LOG.isDebugEnabled()) {
 LOG.debug(call #+id+, response is:\n+value.toString());
   }
   // it's possible that this call may have been cleaned up due to a 
 RPC
   // timeout, so check if it still exists before setting the value.
   if (call != null) {
 call.setValue(value);
   }
 } else if (state == Status.ERROR.state) {
   if (call != null) {
 call.setException(new 
 RemoteException(WritableUtils.readString(in), WritableUtils
 .readString(in)));
   }
 } else if (state == Status.FATAL.state) {
   // Close the connection
   markClosed(new RemoteException(WritableUtils.readString(in),
  WritableUtils.readString(in)));
 }
   } catch (IOException e) {
 if (e instanceof SocketTimeoutException  remoteId.rpcTimeout  0) {
   // Clean up open calls but don't treat this as a fatal condition,
   // since we expect certain responses to not make it by the specified
   // {@link ConnectionId#rpcTimeout}.
   closeException = e;
 } else {
   // Since the server did not respond within the default ping interval
   // time, treat this as a fatal condition and close this connection
   markClosed(e);
 }
   } finally {
 if (remoteId.rpcTimeout  0) {
   cleanupCalls(remoteId.rpcTimeout);
 }
   }
 }
 {code}
 In above code, in the try block, the call will be firstly removed from call 
 map by:
 {code}
 Call call = calls.remove(id);
 {code}
 There may be two cases leading the call couldn't be notified and the invoking 
 thread will wait forever. 
 Firstly, if the returned status is Status.FATAL.state by:
 {code}
 int state = in.readInt(); // read call status
 {code}
 The code will come into:
 {code}
 } else if (state == Status.FATAL.state) {
   // Close the connection
   markClosed(new RemoteException(WritableUtils.readString(in),
  WritableUtils.readString(in)));
 }
 {code}
 Here, the SecureConnection is marked as closed and all rpc calls in call map 
 of this connection will be notified to receive an exception. However, the 
 current rpc call has been removed from the call map, it won't be notified.
 Secondly, after the call has been removed by:
 {code}
 Call call = calls.remove(id);
 {code}
 If we encounter any exception before the 'try' block finished, the code will 
 come into 'catch' and 'finally' block, neither 'catch' block nor 'finally' 
 block will notify the rpc call because it has been removed from call map.
 Compared with receiveResponse() in HBaseClient.java, it may be better to get 
 the rpc call from call map and remove it at the end of the 'try' block.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-8790) NullPointerException throwed when stopping regionserver

2013-06-24 Thread Xiong LIU (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-8790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xiong LIU updated HBASE-8790:
-

Description: 
The Hbase cluster is a fresh start with one regionserver.
When we stop hbase, an unhandled NullPointerException is throwed in the 
regionserver.
The regionserver's log is as follows:

2013-06-21 10:21:11,284 INFO  [regionserver61020] regionserver.HRegionServer: 
Closing user regions
2013-06-21 10:21:14,288 DEBUG [regionserver61020] regionserver.HRegionServer: 
Waiting on 1028785192
2013-06-21 10:21:14,290 FATAL [regionserver61020] regionserver.HRegionServer: 
ABORTING region server HOSTNAME_TEST,61020,1371781086817
: Unhandled: null
java.lang.NullPointerException
at 
org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:988)
at 
org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:832)
at java.lang.Thread.run(Thread.java:662)
2013-06-21 10:21:14,292 FATAL [regionserver61020] regionserver.HRegionServer: 
RegionServer abort: loaded coprocessors are: [org.apache
.hadoop.hbase.coprocessor.MultiRowMutationEndpoint]
2013-06-21 10:21:14,293 INFO  [regionserver61020] regionserver.HRegionServer: 
STOPPED: Unhandled: null
2013-06-21 10:21:14,293 INFO  [regionserver61020] ipc.RpcServer: Stopping 
server on 61020

It seems that after closing user regions, the rssStub is null.

update:
we found that if setting hbase.client.ipc.pool.type to RoundRobinPool and 
hbase.client.ipc.pool.size to 10(possibly different value on your machine) in 
hbase-site.xml, the regionserver is continuously attempting connect to master.

  was:
The Hbase cluster is a fresh start with one regionserver.
When we stop hbase, an unhandled NullPointerException is throwed in the 
regionserver.
The regionserver's log is as follows:

2013-06-21 10:21:11,284 INFO  [regionserver61020] regionserver.HRegionServer: 
Closing user regions
2013-06-21 10:21:14,288 DEBUG [regionserver61020] regionserver.HRegionServer: 
Waiting on 1028785192
2013-06-21 10:21:14,290 FATAL [regionserver61020] regionserver.HRegionServer: 
ABORTING region server HOSTNAME_TEST,61020,1371781086817
: Unhandled: null
java.lang.NullPointerException
at 
org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:988)
at 
org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:832)
at java.lang.Thread.run(Thread.java:662)
2013-06-21 10:21:14,292 FATAL [regionserver61020] regionserver.HRegionServer: 
RegionServer abort: loaded coprocessors are: [org.apache
.hadoop.hbase.coprocessor.MultiRowMutationEndpoint]
2013-06-21 10:21:14,293 INFO  [regionserver61020] regionserver.HRegionServer: 
STOPPED: Unhandled: null
2013-06-21 10:21:14,293 INFO  [regionserver61020] ipc.RpcServer: Stopping 
server on 61020

It seems that after closing user regions, the rssStub is null.


 NullPointerException throwed when stopping regionserver
 ---

 Key: HBASE-8790
 URL: https://issues.apache.org/jira/browse/HBASE-8790
 Project: HBase
  Issue Type: Bug
  Components: regionserver
Affects Versions: 0.95.1
 Environment: CentOS 5.9 x86_64, java version 1.6.0_45, CDH4.3
Reporter: Xiong LIU

 The Hbase cluster is a fresh start with one regionserver.
 When we stop hbase, an unhandled NullPointerException is throwed in the 
 regionserver.
 The regionserver's log is as follows:
 2013-06-21 10:21:11,284 INFO  [regionserver61020] regionserver.HRegionServer: 
 Closing user regions
 2013-06-21 10:21:14,288 DEBUG [regionserver61020] regionserver.HRegionServer: 
 Waiting on 1028785192
 2013-06-21 10:21:14,290 FATAL [regionserver61020] regionserver.HRegionServer: 
 ABORTING region server HOSTNAME_TEST,61020,1371781086817
 : Unhandled: null
 java.lang.NullPointerException
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:988)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:832)
 at java.lang.Thread.run(Thread.java:662)
 2013-06-21 10:21:14,292 FATAL [regionserver61020] regionserver.HRegionServer: 
 RegionServer abort: loaded coprocessors are: [org.apache
 .hadoop.hbase.coprocessor.MultiRowMutationEndpoint]
 2013-06-21 10:21:14,293 INFO  [regionserver61020] regionserver.HRegionServer: 
 STOPPED: Unhandled: null
 2013-06-21 10:21:14,293 INFO  [regionserver61020] ipc.RpcServer: Stopping 
 server on 61020
 It seems that after closing user regions, the rssStub is null.
 update:
 we found that if setting hbase.client.ipc.pool.type to RoundRobinPool and 
 hbase.client.ipc.pool.size to 10(possibly different value on your machine) in 
 hbase-site.xml, the regionserver is continuously attempting connect to master.

--
This message is

[jira] [Commented] (HBASE-8783) RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong server name

2013-06-24 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-8783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13691800#comment-13691800
 ] 

Hudson commented on HBASE-8783:
---

Integrated in HBase-0.94-security #177 (See 
[https://builds.apache.org/job/HBase-0.94-security/177/])
HBASE-8783 RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with 
the wrong server name (Revision 1495945)

 Result = SUCCESS
mbertozzi : 
Files : 
* 
/hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/procedure/ProcedureMemberRpcs.java
* 
/hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/procedure/ZKProcedureCoordinatorRpcs.java
* 
/hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/procedure/ZKProcedureMemberRpcs.java
* 
/hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/procedure/ZKProcedureUtil.java
* 
/hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/regionserver/snapshot/RegionServerSnapshotManager.java
* 
/hbase/branches/0.94/src/test/java/org/apache/hadoop/hbase/procedure/TestZKProcedure.java
* 
/hbase/branches/0.94/src/test/java/org/apache/hadoop/hbase/procedure/TestZKProcedureControllers.java


 RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong 
 server name
 -

 Key: HBASE-8783
 URL: https://issues.apache.org/jira/browse/HBASE-8783
 Project: HBase
  Issue Type: Bug
  Components: snapshots
Affects Versions: 0.94.8, 0.95.1
Reporter: Matteo Bertozzi
Assignee: Matteo Bertozzi
Priority: Minor
 Fix For: 0.98.0, 0.95.2, 0.94.9

 Attachments: HBASE-8783-0.94-v0.patch, HBASE-8783-0.94-v1.patch, 
 HBASE-8783-v0.patch, HBASE-8783-v1.patch


 The ZKProcedureMemberRpcs of the RegionServerSnapshotManager may be 
 initialized with the wrong memberName.
 {code}
 2013-06-21 05:03:41,732 DEBUG 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Initialize Snapshot 
 Manager
 ...
 2013-06-21 05:03:41,875 INFO 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Master passed us hostname 
 to use. Was=0.0.0.0, Now=srv-5.test.cloudera.com
 {code}
 The Region Server Name is used as memberName, but since the snapshot manger 
 is initialized before the RS receives the server name used by the master, the 
 zkprocedure will use the wrong name (0.0.0.0). 
 This will case the snapshot to fail with a TimeoutException since the master 
 will not receive the expected RS
 {code}
 Master:
 ZKProcedureCoordinatorRpcs: Watching for acquire 
 node:/hbase/online-snapshot/acquired/foo23/srv-5.test.cloudera.com,60020,1371813451915
 RS:
 ZKProcedureMemberRpcs: Member: '0.0.0.0,60020,1371814996779' joining acquired 
 barrier for procedure (foo23) in zk
 ...
 org.apache.hadoop.hbase.errorhandling.TimeoutException: Timeout elapsed! 
 Source:Timeout caused Foreign Exception Start:1371798732141, 
 End:1371798792141, diff:6, max:6 ms
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-8790) NullPointerException throwed when stopping regionserver

2013-06-24 Thread Xiong LIU (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-8790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xiong LIU updated HBASE-8790:
-

Description: 
The Hbase cluster is a fresh start with one regionserver.
When we stop hbase, an unhandled NullPointerException is throwed in the 
regionserver.
The regionserver's log is as follows:

2013-06-21 10:21:11,284 INFO  [regionserver61020] regionserver.HRegionServer: 
Closing user regions
2013-06-21 10:21:14,288 DEBUG [regionserver61020] regionserver.HRegionServer: 
Waiting on 1028785192
2013-06-21 10:21:14,290 FATAL [regionserver61020] regionserver.HRegionServer: 
ABORTING region server HOSTNAME_TEST,61020,1371781086817
: Unhandled: null
java.lang.NullPointerException
at 
org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:988)
at 
org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:832)
at java.lang.Thread.run(Thread.java:662)
2013-06-21 10:21:14,292 FATAL [regionserver61020] regionserver.HRegionServer: 
RegionServer abort: loaded coprocessors are: [org.apache
.hadoop.hbase.coprocessor.MultiRowMutationEndpoint]
2013-06-21 10:21:14,293 INFO  [regionserver61020] regionserver.HRegionServer: 
STOPPED: Unhandled: null
2013-06-21 10:21:14,293 INFO  [regionserver61020] ipc.RpcServer: Stopping 
server on 61020

It seems that after closing user regions, the rssStub is null.

update:
we found that if setting hbase.client.ipc.pool.type to RoundRobinPool and 
hbase.client.ipc.pool.size to 10(possibly other values) in hbase-site.xml, the 
regionserver is continuously attempting connect to master.

  was:
The Hbase cluster is a fresh start with one regionserver.
When we stop hbase, an unhandled NullPointerException is throwed in the 
regionserver.
The regionserver's log is as follows:

2013-06-21 10:21:11,284 INFO  [regionserver61020] regionserver.HRegionServer: 
Closing user regions
2013-06-21 10:21:14,288 DEBUG [regionserver61020] regionserver.HRegionServer: 
Waiting on 1028785192
2013-06-21 10:21:14,290 FATAL [regionserver61020] regionserver.HRegionServer: 
ABORTING region server HOSTNAME_TEST,61020,1371781086817
: Unhandled: null
java.lang.NullPointerException
at 
org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:988)
at 
org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:832)
at java.lang.Thread.run(Thread.java:662)
2013-06-21 10:21:14,292 FATAL [regionserver61020] regionserver.HRegionServer: 
RegionServer abort: loaded coprocessors are: [org.apache
.hadoop.hbase.coprocessor.MultiRowMutationEndpoint]
2013-06-21 10:21:14,293 INFO  [regionserver61020] regionserver.HRegionServer: 
STOPPED: Unhandled: null
2013-06-21 10:21:14,293 INFO  [regionserver61020] ipc.RpcServer: Stopping 
server on 61020

It seems that after closing user regions, the rssStub is null.

update:
we found that if setting hbase.client.ipc.pool.type to RoundRobinPool and 
hbase.client.ipc.pool.size to 10(possibly different value on your machine) in 
hbase-site.xml, the regionserver is continuously attempting connect to master.


 NullPointerException throwed when stopping regionserver
 ---

 Key: HBASE-8790
 URL: https://issues.apache.org/jira/browse/HBASE-8790
 Project: HBase
  Issue Type: Bug
  Components: regionserver
Affects Versions: 0.95.1
 Environment: CentOS 5.9 x86_64, java version 1.6.0_45, CDH4.3
Reporter: Xiong LIU

 The Hbase cluster is a fresh start with one regionserver.
 When we stop hbase, an unhandled NullPointerException is throwed in the 
 regionserver.
 The regionserver's log is as follows:
 2013-06-21 10:21:11,284 INFO  [regionserver61020] regionserver.HRegionServer: 
 Closing user regions
 2013-06-21 10:21:14,288 DEBUG [regionserver61020] regionserver.HRegionServer: 
 Waiting on 1028785192
 2013-06-21 10:21:14,290 FATAL [regionserver61020] regionserver.HRegionServer: 
 ABORTING region server HOSTNAME_TEST,61020,1371781086817
 : Unhandled: null
 java.lang.NullPointerException
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:988)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:832)
 at java.lang.Thread.run(Thread.java:662)
 2013-06-21 10:21:14,292 FATAL [regionserver61020] regionserver.HRegionServer: 
 RegionServer abort: loaded coprocessors are: [org.apache
 .hadoop.hbase.coprocessor.MultiRowMutationEndpoint]
 2013-06-21 10:21:14,293 INFO  [regionserver61020] regionserver.HRegionServer: 
 STOPPED: Unhandled: null
 2013-06-21 10:21:14,293 INFO  [regionserver61020] ipc.RpcServer: Stopping 
 server on 61020
 It seems that after closing user regions, the rssStub is null.
 update:
 we found that if setting

[jira] [Updated] (HBASE-8790) NullPointerException throwed when stopping regionserver

2013-06-24 Thread Liang Xie (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-8790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liang Xie updated HBASE-8790:
-

Attachment: HBase-8790.txt

Attached is a trivial fix.
rssStub could be null while we hit ServiceException in tryRegionServerReport, 
then get it from createRegionServerStatusStub(), per Javadoc :

@return master + port, or null if server has been stopped

so we can ensure rssStub == null only happened while current server was 
stopped. and a simple fix should be just fine.

 NullPointerException throwed when stopping regionserver
 ---

 Key: HBASE-8790
 URL: https://issues.apache.org/jira/browse/HBASE-8790
 Project: HBase
  Issue Type: Bug
  Components: regionserver
Affects Versions: 0.95.1
 Environment: CentOS 5.9 x86_64, java version 1.6.0_45, CDH4.3
Reporter: Xiong LIU
 Attachments: HBase-8790.txt


 The Hbase cluster is a fresh start with one regionserver.
 When we stop hbase, an unhandled NullPointerException is throwed in the 
 regionserver.
 The regionserver's log is as follows:
 2013-06-21 10:21:11,284 INFO  [regionserver61020] regionserver.HRegionServer: 
 Closing user regions
 2013-06-21 10:21:14,288 DEBUG [regionserver61020] regionserver.HRegionServer: 
 Waiting on 1028785192
 2013-06-21 10:21:14,290 FATAL [regionserver61020] regionserver.HRegionServer: 
 ABORTING region server HOSTNAME_TEST,61020,1371781086817
 : Unhandled: null
 java.lang.NullPointerException
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:988)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:832)
 at java.lang.Thread.run(Thread.java:662)
 2013-06-21 10:21:14,292 FATAL [regionserver61020] regionserver.HRegionServer: 
 RegionServer abort: loaded coprocessors are: [org.apache
 .hadoop.hbase.coprocessor.MultiRowMutationEndpoint]
 2013-06-21 10:21:14,293 INFO  [regionserver61020] regionserver.HRegionServer: 
 STOPPED: Unhandled: null
 2013-06-21 10:21:14,293 INFO  [regionserver61020] ipc.RpcServer: Stopping 
 server on 61020
 It seems that after closing user regions, the rssStub is null.
 update:
 we found that if setting hbase.client.ipc.pool.type to RoundRobinPool and 
 hbase.client.ipc.pool.size to 10(possibly other values) in hbase-site.xml, 
 the regionserver is continuously attempting connect to master.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-8790) NullPointerException throwed when stopping regionserver

2013-06-24 Thread Liang Xie (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-8790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Liang Xie updated HBASE-8790:
-

Assignee: Liang Xie
  Status: Patch Available  (was: Open)

 NullPointerException throwed when stopping regionserver
 ---

 Key: HBASE-8790
 URL: https://issues.apache.org/jira/browse/HBASE-8790
 Project: HBase
  Issue Type: Bug
  Components: regionserver
Affects Versions: 0.95.1
 Environment: CentOS 5.9 x86_64, java version 1.6.0_45, CDH4.3
Reporter: Xiong LIU
Assignee: Liang Xie
 Attachments: HBase-8790.txt


 The Hbase cluster is a fresh start with one regionserver.
 When we stop hbase, an unhandled NullPointerException is throwed in the 
 regionserver.
 The regionserver's log is as follows:
 2013-06-21 10:21:11,284 INFO  [regionserver61020] regionserver.HRegionServer: 
 Closing user regions
 2013-06-21 10:21:14,288 DEBUG [regionserver61020] regionserver.HRegionServer: 
 Waiting on 1028785192
 2013-06-21 10:21:14,290 FATAL [regionserver61020] regionserver.HRegionServer: 
 ABORTING region server HOSTNAME_TEST,61020,1371781086817
 : Unhandled: null
 java.lang.NullPointerException
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:988)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:832)
 at java.lang.Thread.run(Thread.java:662)
 2013-06-21 10:21:14,292 FATAL [regionserver61020] regionserver.HRegionServer: 
 RegionServer abort: loaded coprocessors are: [org.apache
 .hadoop.hbase.coprocessor.MultiRowMutationEndpoint]
 2013-06-21 10:21:14,293 INFO  [regionserver61020] regionserver.HRegionServer: 
 STOPPED: Unhandled: null
 2013-06-21 10:21:14,293 INFO  [regionserver61020] ipc.RpcServer: Stopping 
 server on 61020
 It seems that after closing user regions, the rssStub is null.
 update:
 we found that if setting hbase.client.ipc.pool.type to RoundRobinPool and 
 hbase.client.ipc.pool.size to 10(possibly other values) in hbase-site.xml, 
 the regionserver is continuously attempting connect to master.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-8783) RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong server name

2013-06-24 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-8783?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13691808#comment-13691808
 ] 

Hudson commented on HBASE-8783:
---

Integrated in HBase-0.94 #1022 (See 
[https://builds.apache.org/job/HBase-0.94/1022/])
HBASE-8783 RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with 
the wrong server name (Revision 1495945)

 Result = SUCCESS
mbertozzi : 
Files : 
* 
/hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/procedure/ProcedureMemberRpcs.java
* 
/hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/procedure/ZKProcedureCoordinatorRpcs.java
* 
/hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/procedure/ZKProcedureMemberRpcs.java
* 
/hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/procedure/ZKProcedureUtil.java
* 
/hbase/branches/0.94/src/main/java/org/apache/hadoop/hbase/regionserver/snapshot/RegionServerSnapshotManager.java
* 
/hbase/branches/0.94/src/test/java/org/apache/hadoop/hbase/procedure/TestZKProcedure.java
* 
/hbase/branches/0.94/src/test/java/org/apache/hadoop/hbase/procedure/TestZKProcedureControllers.java


 RSSnapshotManager.ZKProcedureMemberRpcs may be initialized with the wrong 
 server name
 -

 Key: HBASE-8783
 URL: https://issues.apache.org/jira/browse/HBASE-8783
 Project: HBase
  Issue Type: Bug
  Components: snapshots
Affects Versions: 0.94.8, 0.95.1
Reporter: Matteo Bertozzi
Assignee: Matteo Bertozzi
Priority: Minor
 Fix For: 0.98.0, 0.95.2, 0.94.9

 Attachments: HBASE-8783-0.94-v0.patch, HBASE-8783-0.94-v1.patch, 
 HBASE-8783-v0.patch, HBASE-8783-v1.patch


 The ZKProcedureMemberRpcs of the RegionServerSnapshotManager may be 
 initialized with the wrong memberName.
 {code}
 2013-06-21 05:03:41,732 DEBUG 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Initialize Snapshot 
 Manager
 ...
 2013-06-21 05:03:41,875 INFO 
 org.apache.hadoop.hbase.regionserver.HRegionServer: Master passed us hostname 
 to use. Was=0.0.0.0, Now=srv-5.test.cloudera.com
 {code}
 The Region Server Name is used as memberName, but since the snapshot manger 
 is initialized before the RS receives the server name used by the master, the 
 zkprocedure will use the wrong name (0.0.0.0). 
 This will case the snapshot to fail with a TimeoutException since the master 
 will not receive the expected RS
 {code}
 Master:
 ZKProcedureCoordinatorRpcs: Watching for acquire 
 node:/hbase/online-snapshot/acquired/foo23/srv-5.test.cloudera.com,60020,1371813451915
 RS:
 ZKProcedureMemberRpcs: Member: '0.0.0.0,60020,1371814996779' joining acquired 
 barrier for procedure (foo23) in zk
 ...
 org.apache.hadoop.hbase.errorhandling.TimeoutException: Timeout elapsed! 
 Source:Timeout caused Foreign Exception Start:1371798732141, 
 End:1371798792141, diff:6, max:6 ms
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-8790) NullPointerException thrown when stopping regionserver

2013-06-24 Thread Ted Yu (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-8790?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-8790:
--

Summary: NullPointerException thrown when stopping regionserver  (was: 
NullPointerException throwed when stopping regionserver)

 NullPointerException thrown when stopping regionserver
 --

 Key: HBASE-8790
 URL: https://issues.apache.org/jira/browse/HBASE-8790
 Project: HBase
  Issue Type: Bug
  Components: regionserver
Affects Versions: 0.95.1
 Environment: CentOS 5.9 x86_64, java version 1.6.0_45, CDH4.3
Reporter: Xiong LIU
Assignee: Liang Xie
 Attachments: HBase-8790.txt


 The Hbase cluster is a fresh start with one regionserver.
 When we stop hbase, an unhandled NullPointerException is throwed in the 
 regionserver.
 The regionserver's log is as follows:
 2013-06-21 10:21:11,284 INFO  [regionserver61020] regionserver.HRegionServer: 
 Closing user regions
 2013-06-21 10:21:14,288 DEBUG [regionserver61020] regionserver.HRegionServer: 
 Waiting on 1028785192
 2013-06-21 10:21:14,290 FATAL [regionserver61020] regionserver.HRegionServer: 
 ABORTING region server HOSTNAME_TEST,61020,1371781086817
 : Unhandled: null
 java.lang.NullPointerException
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:988)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:832)
 at java.lang.Thread.run(Thread.java:662)
 2013-06-21 10:21:14,292 FATAL [regionserver61020] regionserver.HRegionServer: 
 RegionServer abort: loaded coprocessors are: [org.apache
 .hadoop.hbase.coprocessor.MultiRowMutationEndpoint]
 2013-06-21 10:21:14,293 INFO  [regionserver61020] regionserver.HRegionServer: 
 STOPPED: Unhandled: null
 2013-06-21 10:21:14,293 INFO  [regionserver61020] ipc.RpcServer: Stopping 
 server on 61020
 It seems that after closing user regions, the rssStub is null.
 update:
 we found that if setting hbase.client.ipc.pool.type to RoundRobinPool and 
 hbase.client.ipc.pool.size to 10(possibly other values) in hbase-site.xml, 
 the regionserver is continuously attempting connect to master.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-8790) NullPointerException thrown when stopping regionserver

2013-06-24 Thread Ted Yu (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-8790?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13691828#comment-13691828
 ] 

Ted Yu commented on HBASE-8790:
---

Looks good to me.

 NullPointerException thrown when stopping regionserver
 --

 Key: HBASE-8790
 URL: https://issues.apache.org/jira/browse/HBASE-8790
 Project: HBase
  Issue Type: Bug
  Components: regionserver
Affects Versions: 0.95.1
 Environment: CentOS 5.9 x86_64, java version 1.6.0_45, CDH4.3
Reporter: Xiong LIU
Assignee: Liang Xie
 Attachments: HBase-8790.txt


 The Hbase cluster is a fresh start with one regionserver.
 When we stop hbase, an unhandled NullPointerException is throwed in the 
 regionserver.
 The regionserver's log is as follows:
 2013-06-21 10:21:11,284 INFO  [regionserver61020] regionserver.HRegionServer: 
 Closing user regions
 2013-06-21 10:21:14,288 DEBUG [regionserver61020] regionserver.HRegionServer: 
 Waiting on 1028785192
 2013-06-21 10:21:14,290 FATAL [regionserver61020] regionserver.HRegionServer: 
 ABORTING region server HOSTNAME_TEST,61020,1371781086817
 : Unhandled: null
 java.lang.NullPointerException
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:988)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:832)
 at java.lang.Thread.run(Thread.java:662)
 2013-06-21 10:21:14,292 FATAL [regionserver61020] regionserver.HRegionServer: 
 RegionServer abort: loaded coprocessors are: [org.apache
 .hadoop.hbase.coprocessor.MultiRowMutationEndpoint]
 2013-06-21 10:21:14,293 INFO  [regionserver61020] regionserver.HRegionServer: 
 STOPPED: Unhandled: null
 2013-06-21 10:21:14,293 INFO  [regionserver61020] ipc.RpcServer: Stopping 
 server on 61020
 It seems that after closing user regions, the rssStub is null.
 update:
 we found that if setting hbase.client.ipc.pool.type to RoundRobinPool and 
 hbase.client.ipc.pool.size to 10(possibly other values) in hbase-site.xml, 
 the regionserver is continuously attempting connect to master.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-5083) Backup HMaster should have http infoport open with link to the active master

2013-06-24 Thread Hadoop QA (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-5083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13691836#comment-13691836
]

Hadoop QA commented on HBASE-5083:
--

{color:red}-1 overall{color}. Here are the results of testing the latest
attachment

http://issues.apache.org/jira/secure/attachment/12589380/HBASE-5083_trunk.patch
against trunk revision .

{color:green}+1 @author{color}. The patch does not contain any @author
tags.

{color:green}+1 tests included{color}. The patch appears to include 3 new
or modified tests.

{color:green}+1 hadoop1.0{color}. The patch compiles against the hadoop
1.0 profile.

{color:green}+1 hadoop2.0{color}. The patch compiles against the hadoop
2.0 profile.

{color:green}+1 javadoc{color}. The javadoc tool did not generate any
warning messages.

{color:green}+1 javac{color}. The applied patch does not increase the
total number of javac compiler warnings.

{color:green}+1 findbugs{color}. The patch does not introduce any new
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}. The applied patch does not increase
the total number of release audit warnings.

{color:red}-1 lineLengths{color}. The patch introduces lines longer than
100

{color:green}+1 site{color}. The mvn site goal succeeds with this patch.

{color:green}+1 core tests{color}. The patch passed unit tests in .

Test results:
https://builds.apache.org/job/PreCommit-HBASE-Build/6117//testReport/
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6117//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6117//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6117//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6117//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6117//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6117//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6117//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/6117//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output:
https://builds.apache.org/job/PreCommit-HBASE-Build/6117//console

This message is automatically generated.

Backup HMaster should have http infoport open with link to the active master

Key: HBASE-5083
URL: https://issues.apache.org/jira/browse/HBASE-5083
Project: HBase
Issue Type: Improvement
Components: master
Affects Versions: 0.92.0
Reporter: Jonathan Hsieh
Assignee: Cody Marcel
Fix For: 0.94.9

Attachments: backup_master.png, HBASE-5083.patch, HBASE-5083.patch,
HBASE-5083.patch, HBASE-5083.patch, HBASE-5083.patch, HBASE-5083_trunk.patch,
HBASE-5083_trunk.patch, HBASE-5083_trunk.patch, master.png,
Trunk_Backup_Master.png, Trunk_Master.png

Without ssh'ing and jps/ps'ing, it is difficult to see if a backup hmaster is
up. It seems like it would be good for a backup hmaster to have a basic web
page up on the info port so that users could see that it is up. Also it
should probably either provide a link to the active master or automatically
forward to the active master.

[jira] [Commented] (HBASE-5083) Backup HMaster should have http infoport open with link to the active master

2013-06-24 Thread Hadoop QA (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-5083?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13691837#comment-13691837
]