[jira] [Updated] (HBASE-8558) When a regionserver dies, a client performing put operations hangs
[ https://issues.apache.org/jira/browse/HBASE-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Liang Xie updated HBASE-8558:
-----------------------------
    Attachment: HBASE-8558-0.94.txt

When a regionserver dies, a client performing put operations hangs
------------------------------------------------------------------
                Key: HBASE-8558
                URL: https://issues.apache.org/jira/browse/HBASE-8558
            Project: HBase
         Issue Type: Bug
         Components: Client
   Affects Versions: 0.94.5, 0.94.14
           Reporter: wanbin
        Attachments: HBASE-8558-0.94.txt

I ran jstack on the client host. The result is below:

{code}
"hbase-tablepool-60-thread-34" daemon prio=10 tid=0x7f1e65a48000 nid=0x5173 runnable [0x579cc000]
   java.lang.Thread.State: RUNNABLE
	at sun.nio.ch.EPollArrayWrapper.epollWait(Native Method)
	at sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:210)
	at sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:65)
	at sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69)
	- locked <0x000758cb0780> (a sun.nio.ch.Util$2)
	- locked <0x000758cb0770> (a java.util.Collections$UnmodifiableSet)
	- locked <0x000758cb0548> (a sun.nio.ch.EPollSelectorImpl)
	at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80)
	at org.apache.hadoop.net.SocketIOWithTimeout$SelectorPool.select(SocketIOWithTimeout.java:336)
	at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:158)
	at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:153)
	at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:114)
	at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:65)
	at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123)
	- locked <0x000754e978a0> (a java.io.BufferedOutputStream)
	at java.io.DataOutputStream.flush(DataOutputStream.java:106)
	at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.sendParam(HBaseClient.java:620)
	- locked <0x000754e97880> (a java.io.DataOutputStream)
	at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:975)
	at org.apache.hadoop.hbase.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:86)
	at $Proxy13.multi(Unknown Source)
	at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1395)
	at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1393)
	at org.apache.hadoop.hbase.client.ServerCallable.withoutRetries(ServerCallable.java:210)
	at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1402)
	at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1390)
	at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
	at java.util.concurrent.FutureTask.run(FutureTask.java:138)
	at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
	at java.lang.Thread.run(Thread.java:662)
{code}

This thread has hung for an hour. Meanwhile, another thread is trying to close the connection:

{code}
"IPC Client (1983049639) connection to dump002030.cm6.tbsite.net/10.246.2.30:30020 from admin" daemon prio=10 tid=0x7f1e70674800 nid=0x3d76 waiting for monitor entry [0x4bc0f000]
   java.lang.Thread.State: BLOCKED (on object monitor)
	at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123)
	- waiting to lock <0x000754e978a0> (a java.io.BufferedOutputStream)
	at java.io.DataOutputStream.flush(DataOutputStream.java:106)
	at java.io.FilterOutputStream.close(FilterOutputStream.java:140)
	at org.apache.hadoop.io.IOUtils.cleanup(IOUtils.java:237)
	at org.apache.hadoop.io.IOUtils.closeStream(IOUtils.java:254)
	at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.close(HBaseClient.java:715)
	- locked <0x000754e7b818> (a org.apache.hadoop.hbase.ipc.HBaseClient$Connection)
	at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.run(HBaseClient.java:587)
{code}

dump002030.cm6.tbsite.net is a dead regionserver. Reading the HBase source code, I discovered that connection.out is created without a timeout:

{code}
this.out = new DataOutputStream(new BufferedOutputStream(NetUtils.getOutputStream(socket)));
{code}

As far as I can see, this means epoll_wait will block indefinitely.

--
This message was sent by Atlassian JIRA (v6.1.4#6159)
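The mechanism behind the hang is that Hadoop's SocketOutputStream waits for writability with a selector, and a timeout of 0 means "wait forever". As a hedged plain-Java sketch of that selector-based timed write (the helper below is illustrative and self-contained, not Hadoop's SocketIOWithTimeout, though it follows the same idea):

```java
import java.io.IOException;
import java.net.SocketTimeoutException;
import java.nio.ByteBuffer;
import java.nio.channels.SelectionKey;
import java.nio.channels.Selector;
import java.nio.channels.SocketChannel;

class TimedWrite {
    /**
     * Write buf to ch, waiting at most timeoutMs for the socket to become
     * writable. Throws SocketTimeoutException instead of hanging forever.
     */
    static void writeWithTimeout(SocketChannel ch, ByteBuffer buf, long timeoutMs)
            throws IOException {
        ch.configureBlocking(false);
        try (Selector sel = Selector.open()) {
            SelectionKey key = ch.register(sel, SelectionKey.OP_WRITE);
            while (buf.hasRemaining()) {
                // Note: select(0) would block indefinitely -- that is exactly
                // the behavior the report describes for a zero timeout.
                if (sel.select(timeoutMs) == 0) {
                    throw new SocketTimeoutException(
                        "write not ready within " + timeoutMs + " ms");
                }
                sel.selectedKeys().clear();
                ch.write(buf);
            }
            key.cancel();
        }
    }
}
```

With a healthy peer the helper completes immediately; only when the peer stops draining the socket does the deadline matter.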
[jira] [Assigned] (HBASE-8558) When a regionserver dies, a client performing put operations hangs
[ https://issues.apache.org/jira/browse/HBASE-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Liang Xie reassigned HBASE-8558:
--------------------------------
    Assignee: Liang Xie
[jira] [Updated] (HBASE-8558) When a regionserver dies, a client performing put operations hangs
[ https://issues.apache.org/jira/browse/HBASE-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Liang Xie updated HBASE-8558:
-----------------------------
    Affects Version/s: 0.94.14
               Status: Patch Available (was: Open)
[jira] [Updated] (HBASE-8558) Add timeout limit for HBaseClient dataOutputStream
[ https://issues.apache.org/jira/browse/HBASE-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Liang Xie updated HBASE-8558:
-----------------------------
    Summary: Add timeout limit for HBaseClient dataOutputStream (was: When a regionserver dies, a client performing put operations hangs)
[jira] [Commented] (HBASE-8558) Add timeout limit for HBaseClient dataOutputStream
[ https://issues.apache.org/jira/browse/HBASE-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13853801#comment-13853801 ]

Liang Xie commented on HBASE-8558:
----------------------------------
Thanks [~wanbin] for your detailed report! The current implementation uses a default timeout of 0, so it really needs an explicit setting :) Right now this is only a 0.94 branch issue; all 0.96+ branches already use the equivalent pattern:

{code}
NetUtils.getOutputStream(socket, pingInterval);
{code}

[~lhofhansl], I didn't add or run any test case, but the change is small, so I think it's OK :)
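Passing a positive timeout is exactly what prevents the original stall. As a hedged plain-Java illustration (the helper and its names are hypothetical, not the HBaseClient code), a writer with a selector deadline fails fast when its peer stops reading, instead of hanging in epoll_wait:

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.StandardSocketOptions;
import java.nio.ByteBuffer;
import java.nio.channels.SelectionKey;
import java.nio.channels.Selector;
import java.nio.channels.ServerSocketChannel;
import java.nio.channels.SocketChannel;

class StalledWriteDemo {
    /** Returns true if the whole buffer was written, false on timeout. */
    static boolean tryWrite(SocketChannel ch, ByteBuffer buf, long timeoutMs)
            throws IOException {
        ch.configureBlocking(false);
        try (Selector sel = Selector.open()) {
            ch.register(sel, SelectionKey.OP_WRITE);
            while (buf.hasRemaining()) {
                if (sel.select(timeoutMs) == 0) {
                    return false;   // peer stalled: give up instead of hanging
                }
                sel.selectedKeys().clear();
                ch.write(buf);
            }
            return true;
        }
    }

    /** Simulate a dead peer: it accepts the connection but never reads. */
    static boolean writeToDeadPeer() throws IOException {
        try (ServerSocketChannel ss = ServerSocketChannel.open()) {
            ss.setOption(StandardSocketOptions.SO_RCVBUF, 4096);
            ss.bind(new InetSocketAddress("127.0.0.1", 0));
            try (SocketChannel client = SocketChannel.open()) {
                client.setOption(StandardSocketOptions.SO_SNDBUF, 4096);
                client.connect(ss.getLocalAddress());
                try (SocketChannel peer = ss.accept()) {
                    // Far more data than the small socket buffers can absorb,
                    // so the write must eventually stall.
                    ByteBuffer big = ByteBuffer.allocate(2 * 1024 * 1024);
                    return tryWrite(client, big, 200);
                }
            }
        }
    }
}
```

With a zero timeout the same writer would sit in the selector forever, which is the hour-long hang seen in the jstack output above.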
[jira] [Created] (HBASE-10213) Add read log size per second metrics for replication source
cuijianwei created HBASE-10213:
-------------------------------
            Summary: Add read log size per second metrics for replication source
                Key: HBASE-10213
                URL: https://issues.apache.org/jira/browse/HBASE-10213
            Project: HBase
         Issue Type: Improvement
         Components: metrics, Replication
   Affects Versions: 0.94.14
           Reporter: cuijianwei
           Priority: Minor

The current replication source metrics include logEditsReadRate, shippedBatchesRate, etc., which indicate to some extent how fast data is replicated to the peer cluster. However, these metrics do not make clear how many bytes are being replicated. In a production environment it may be important to know the size of the data replicated per second, because services may be affected if the network becomes busy.
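A bytes-per-second gauge of the kind proposed here can be sketched as follows (class and method names are illustrative, not the actual HBase metrics API):

```java
import java.util.concurrent.atomic.AtomicLong;

/**
 * Minimal sketch of a bytes-per-second rate metric: readers accumulate a
 * byte count, and the metrics system periodically snapshots and resets it.
 */
class ByteRate {
    private final AtomicLong bytes = new AtomicLong();
    private long lastSnapshotNanos = System.nanoTime();

    /** Called by the log reader for every batch of entries read. */
    void inc(long n) {
        bytes.addAndGet(n);
    }

    /** Called once per reporting period; resets the measurement window. */
    synchronized double snapshotBytesPerSecond() {
        long now = System.nanoTime();
        double seconds = (now - lastSnapshotNanos) / 1e9;
        lastSnapshotNanos = now;
        long n = bytes.getAndSet(0);           // start a fresh window
        return seconds > 0 ? n / seconds : 0.0;
    }
}
```

The reset-on-snapshot design means each reading reflects only the traffic since the previous report, which is what an operator watching network load actually wants.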
[jira] [Updated] (HBASE-10213) Add read log size per second metrics for replication source
[ https://issues.apache.org/jira/browse/HBASE-10213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

cuijianwei updated HBASE-10213:
-------------------------------
    Attachment: HBASE-10213-0.94-v1.patch

This patch adds a metric 'logReadRateInByte' to show how many bytes are read by the source per second.
[jira] [Updated] (HBASE-10213) Add read log size per second metrics for replication source
[ https://issues.apache.org/jira/browse/HBASE-10213?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Liang Xie updated HBASE-10213:
------------------------------
    Assignee: cuijianwei
      Status: Patch Available (was: Open)
[jira] [Commented] (HBASE-7781) Update security unit tests to use a KDC if available
[ https://issues.apache.org/jira/browse/HBASE-7781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13853857#comment-13853857 ]

ramkrishna.s.vasudevan commented on HBASE-7781:
-----------------------------------------------
Before proceeding with this JIRA, I went through what is given in all the JIRAs mentioned here. HADOOP-8078 also tries to start ApacheDS, but it seems to be an older version. HADOOP-9848 introduces MiniKDC into the Hadoop project itself as a module. So would we also be introducing MiniKDC on the HBase side, with all security test cases running it along with the cluster? And will the MiniKDC available in HBase be a separate module (like in Hadoop), or a class that just allows starting a MiniKDC?

Update security unit tests to use a KDC if available
----------------------------------------------------
                Key: HBASE-7781
                URL: https://issues.apache.org/jira/browse/HBASE-7781
            Project: HBase
         Issue Type: Test
         Components: security, test
           Reporter: Gary Helmling
           Assignee: ramkrishna.s.vasudevan
           Priority: Blocker
            Fix For: 0.98.0

We currently have large holes in the test coverage of HBase with security enabled. Two recent examples of bugs which really should have been caught with testing are HBASE-7771 and HBASE-7772. The long-standing problem with testing with security enabled has been the requirement for supporting Kerberos infrastructure. We need to close this gap and provide some automated testing with security enabled, if necessary standing up and provisioning a temporary KDC as an option for running integration tests; see HADOOP-8078 and HADOOP-9004, where a similar approach was taken.
[jira] [Created] (HBASE-10214) Regionserver shuts down improperly and leaves the dir in .old undeleted
binlijin created HBASE-10214:
-----------------------------
            Summary: Regionserver shuts down improperly and leaves the dir in .old undeleted
                Key: HBASE-10214
                URL: https://issues.apache.org/jira/browse/HBASE-10214
            Project: HBase
         Issue Type: Bug
           Reporter: binlijin

RegionServer log:

{code}
2013-12-18 15:17:45,771 DEBUG org.apache.hadoop.hbase.regionserver.HRegionServer: Waiting on 51b27391410efdca841db264df46085f
2013-12-18 15:17:45,776 INFO org.apache.hadoop.hbase.regionserver.HRegionServer: Connected to master at null
2013-12-18 15:17:48,776 INFO org.apache.hadoop.hbase.regionserver.HRegionServer: STOPPED: Exiting; cluster shutdown set and not carrying any regions
2013-12-18 15:17:48,776 FATAL org.apache.hadoop.hbase.regionserver.HRegionServer: ABORTING region server node,60020,1384410974572: Unhandled exception: null
java.lang.NullPointerException
	at org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:880)
	at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:753)
	at java.lang.Thread.run(Thread.java:662)
{code}
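The log ("Connected to master at null" followed by an NPE in tryRegionServerReport) suggests the periodic report loop dereferences a master connection that is null during cluster shutdown. A minimal sketch of the guard such a loop needs (the names here are illustrative, not the actual HRegionServer code):

```java
/**
 * Sketch of the failure mode: a report loop that dereferences its master
 * handle without a null check throws NPE when no master is reachable at
 * shutdown. Reading the volatile field once and checking it avoids that.
 */
class ReportLoop {
    interface Master { void report(String load); }

    private volatile Master master;   // null while (re)connecting or at shutdown

    void setMaster(Master m) { this.master = m; }

    /** Returns true if a report was actually sent. */
    boolean tryReport(String load) {
        Master m = master;            // read once: field may be nulled concurrently
        if (m == null) {
            return false;             // skip this cycle instead of throwing NPE
        }
        m.report(load);
        return true;
    }
}
```

Skipping a report cycle is harmless; aborting the regionserver on an unhandled NPE is what leaves the .old directory behind.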
[jira] [Updated] (HBASE-10214) Regionserver shuts down improperly and leaves the dir in .old undeleted
[ https://issues.apache.org/jira/browse/HBASE-10214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

binlijin updated HBASE-10214:
-----------------------------
    Attachment: HBASE-10214.patch
[jira] [Updated] (HBASE-10214) Regionserver shuts down improperly and leaves the dir in .old undeleted
[ https://issues.apache.org/jira/browse/HBASE-10214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

binlijin updated HBASE-10214:
-----------------------------
    Attachment: HBASE-10214-94.patch
[jira] [Commented] (HBASE-10214) Regionserver shuts down improperly and leaves the dir in .old undeleted
[ https://issues.apache.org/jira/browse/HBASE-10214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13853876#comment-13853876 ]

binlijin commented on HBASE-10214:
----------------------------------
It looks like trunk doesn't have this problem; the patch is based on the 0.94 branch.
[jira] [Updated] (HBASE-10214) RegionServer shuts down improperly and leaves the dir in .old undeleted.
[ https://issues.apache.org/jira/browse/HBASE-10214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] binlijin updated HBASE-10214: - Attachment: (was: HBASE-10214.patch)
[jira] [Updated] (HBASE-10161) [AccessController] Tolerate regions in recovery
[ https://issues.apache.org/jira/browse/HBASE-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John updated HBASE-10161: --- Status: Open (was: Patch Available) [AccessController] Tolerate regions in recovery --- Key: HBASE-10161 URL: https://issues.apache.org/jira/browse/HBASE-10161 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Andrew Purtell Assignee: Anoop Sam John Priority: Blocker Fix For: 0.98.0, 0.96.2, 0.99.0 Attachments: HBASE-10161.patch, HBASE-10161_V2.patch AccessController fixes for the issue also affecting VisibilityController described in HBASE-10148. Coprocessors that initialize in postOpen upcalls must check if the region is still in recovery and defer initialization until recovery is complete. We need to add a new CP hook for post-recovery upcalls and modify existing CPs to defer initialization to this new hook as needed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
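The deferral described above can be sketched as a small state machine: initialize from postOpen only when the region is not recovering, and otherwise wait for the proposed post-recovery hook. The class, flag, and hook names below are illustrative, not the actual coprocessor API or the attached patch.

```java
// Illustrative sketch of a coprocessor that tolerates regions in recovery.
public class RecoveryAwareCoprocessor {
    private boolean initialized = false;

    /** Called from the postOpen upcall. */
    public void postOpen(boolean regionRecovering) {
        if (regionRecovering) {
            // Reads against the region would be inconsistent during log
            // replay; wait for the post-recovery hook instead.
            return;
        }
        initialize();
    }

    /** Called from the (proposed) post-recovery hook once replay completes. */
    public void postRecoveryComplete() {
        if (!initialized) {
            initialize();
        }
    }

    private void initialize() { initialized = true; }

    public boolean isInitialized() { return initialized; }
}
```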
[jira] [Updated] (HBASE-10161) [AccessController] Tolerate regions in recovery
[ https://issues.apache.org/jira/browse/HBASE-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John updated HBASE-10161: --- Attachment: (was: HBASE-10161_V2.patch)
[jira] [Updated] (HBASE-10161) [AccessController] Tolerate regions in recovery
[ https://issues.apache.org/jira/browse/HBASE-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John updated HBASE-10161: --- Attachment: HBASE-10161_V2.patch
[jira] [Updated] (HBASE-10161) [AccessController] Tolerate regions in recovery
[ https://issues.apache.org/jira/browse/HBASE-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John updated HBASE-10161: --- Status: Patch Available (was: Open)
[jira] [Created] (HBASE-10215) TableNotFoundException should be thrown after removing stale znode in ETH
rajeshbabu created HBASE-10215: -- Summary: TableNotFoundException should be thrown after removing stale znode in ETH Key: HBASE-10215 URL: https://issues.apache.org/jira/browse/HBASE-10215 Project: HBase Issue Type: Bug Components: master Affects Versions: 0.94.14, 0.96.1 Reporter: rajeshbabu Assignee: rajeshbabu Priority: Minor Fix For: 0.98.0, 0.94.16, 0.96.2, 0.99.0 Suppose the master goes down while creating a table: the table's znode is left in the ENABLING state, and the master recovers such tables on restart even when there are no META entries for the table. While recovering the table we check whether it exists in META, and if not we remove the znode. After removing the znode we need to throw TableNotFoundException. Presently the exception is not thrown, so the znode is recreated and stays stale forever: we cannot delete it even on master restart, and we cannot create a table with the same name either.
{code}
// Check if table exists
if (!MetaReader.tableExists(catalogTracker, tableName)) {
  // retainAssignment is true only during recovery. In normal case it is false
  if (!this.skipTableStateCheck) {
    throw new TableNotFoundException(tableName);
  }
  try {
    this.assignmentManager.getZKTable().removeEnablingTable(tableName, true);
  } catch (KeeperException e) {
    // TODO : Use HBCK to clear such nodes
    LOG.warn("Failed to delete the ENABLING node for the table " + tableName
        + ". The table will remain unusable. Run HBCK to manually fix the problem.");
  }
}
{code}
-- This message was sent by Atlassian JIRA (v6.1.4#6159)
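The control flow being proposed — remove the stale znode, then still throw — can be sketched with ZooKeeper mocked out as a plain set. Everything here (the class name, the simplified exception, the set standing in for znodes) is an illustrative stand-in for the EnableTableHandler code quoted above, not the actual patch.

```java
import java.util.HashSet;
import java.util.Set;

public class EnableTableHandlerSketch {
    static class TableNotFoundException extends Exception {
        TableNotFoundException(String table) { super(table); }
    }

    // Stand-in for the ENABLING znodes kept under ZooKeeper.
    static final Set<String> enablingZnodes = new HashSet<>();

    static void prepare(String tableName, boolean existsInMeta,
                        boolean skipTableStateCheck) throws TableNotFoundException {
        if (!existsInMeta) {
            if (!skipTableStateCheck) {
                // Normal (non-recovery) path: fail immediately.
                throw new TableNotFoundException(tableName);
            }
            // Recovery path: clear the stale ENABLING znode first...
            enablingZnodes.remove(tableName);
            // ...and, per the proposed fix, throw anyway so the caller stops
            // and never recreates the znode for a table absent from META.
            throw new TableNotFoundException(tableName);
        }
        // Table exists in META: proceed with enabling as usual.
    }
}
```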
[jira] [Commented] (HBASE-10175) 2-thread ChaosMonkey steps on its own toes
[ https://issues.apache.org/jira/browse/HBASE-10175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13853931#comment-13853931 ] Hadoop QA commented on HBASE-10175: --- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12619725/HBASE-10175.patch against trunk revision . ATTACHMENT ID: 12619725 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 21 new or modified tests. {color:green}+1 hadoop1.0{color}. The patch compiles against the hadoop 1.0 profile. {color:green}+1 hadoop1.1{color}. The patch compiles against the hadoop 1.1 profile. {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100 {color:red}-1 site{color}. The patch appears to cause mvn site goal to fail. {color:red}-1 core tests{color}. 
The patch failed these unit tests: org.apache.hadoop.hbase.security.access.TestAccessController Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/8239//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8239//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8239//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8239//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8239//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8239//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8239//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8239//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8239//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8239//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/8239//console This message is automatically generated. 
2-thread ChaosMonkey steps on its own toes -- Key: HBASE-10175 URL: https://issues.apache.org/jira/browse/HBASE-10175 Project: HBase Issue Type: Improvement Components: test Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Priority: Minor Attachments: HBASE-10175.patch A ChaosMonkey with one destructive thread and one volatility (flush-compact-split, etc.) thread steps on its own toes and logs a lot of exceptions. A simple solution would be to catch most (or all) of them, like NotServingRegionException, and log less (not a full call stack, for example; it's not very useful anyway). A more complicated, complementary one would be to keep track of which regions the destructive thread affects and use other regions for the volatile one. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
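The "catch and log less" idea can be sketched as a tiny classifier for action failures. NotServingRegionException is stubbed locally here rather than taken from HBase, so this is only an illustration of the proposed logging policy, not the ChaosMonkey code itself.

```java
// Sketch of quieting expected exceptions in a two-thread ChaosMonkey.
public class ChaosLogging {
    // Local stub; the real class is org.apache.hadoop.hbase.NotServingRegionException.
    static class NotServingRegionException extends RuntimeException {
        NotServingRegionException(String m) { super(m); }
    }

    /** Returns the log line to emit for an action failure. */
    static String describeFailure(Throwable t) {
        if (t instanceof NotServingRegionException) {
            // Expected when the destructive thread just killed or moved the
            // region this action targeted: one line, no stack trace.
            return "Action skipped (expected): " + t.getMessage();
        }
        // Unexpected failures keep the full class name for debugging.
        return "Action failed: " + t.getClass().getName() + ": " + t.getMessage();
    }
}
```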
[jira] [Updated] (HBASE-10215) TableNotFoundException should be thrown after removing stale znode in ETH
[ https://issues.apache.org/jira/browse/HBASE-10215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] rajeshbabu updated HBASE-10215: --- Status: Patch Available (was: Open)
[jira] [Updated] (HBASE-10215) TableNotFoundException should be thrown after removing stale znode in ETH
[ https://issues.apache.org/jira/browse/HBASE-10215?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] rajeshbabu updated HBASE-10215: --- Attachment: HBASE-10215.patch Patch for trunk. Please review.
[jira] [Commented] (HBASE-10213) Add read log size per second metrics for replication source
[ https://issues.apache.org/jira/browse/HBASE-10213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13853941#comment-13853941 ] Hadoop QA commented on HBASE-10213: --- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12619783/HBASE-10213-0.94-v1.patch against trunk revision . ATTACHMENT ID: 12619783 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:red}-1 patch{color}. The patch command could not apply the patch. Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/8242//console This message is automatically generated. Add read log size per second metrics for replication source --- Key: HBASE-10213 URL: https://issues.apache.org/jira/browse/HBASE-10213 Project: HBase Issue Type: Improvement Components: metrics, Replication Affects Versions: 0.94.14 Reporter: cuijianwei Assignee: cuijianwei Priority: Minor Attachments: HBASE-10213-0.94-v1.patch The current metrics of the replication source include logEditsReadRate, shippedBatchesRate, etc., which indicate to some extent how fast data is replicated to the peer cluster. However, these metrics do not make clear how many bytes are being replicated. In a production environment it may be important to know the size of the data replicated per second, because services may be affected if the network becomes busy. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
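A bytes-per-second replication metric of the kind requested can be sketched as a simple accumulate-and-snapshot counter. The class below is an illustrative stand-in for the 0.94 metrics helpers (it is not the MetricsRate class from the patch), with the clock passed in explicitly so the math is visible.

```java
// Sketch of a "log read size per second" metric for a replication source.
public class ReplicationSourceMetricsSketch {
    private long bytesAccumulated = 0;
    private long lastSnapshotMs;

    public ReplicationSourceMetricsSketch(long nowMs) {
        this.lastSnapshotMs = nowMs;
    }

    /** Call with the serialized size of each WALEdit read from the log. */
    public void incrLogReadSize(long bytes) {
        bytesAccumulated += bytes;
    }

    /** Periodic snapshot: returns bytes/sec since the last snapshot, then resets. */
    public double snapshotRate(long nowMs) {
        long elapsedMs = Math.max(1, nowMs - lastSnapshotMs);
        double rate = bytesAccumulated * 1000.0 / elapsedMs;
        bytesAccumulated = 0;
        lastSnapshotMs = nowMs;
        return rate;
    }
}
```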
[jira] [Commented] (HBASE-8859) truncate_preserve should get table split keys as it is instead of converting them to string type and then again to bytes
[ https://issues.apache.org/jira/browse/HBASE-8859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13853945#comment-13853945 ] Hadoop QA commented on HBASE-8859: -- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12619748/HBASE-8859_trunk_4.patch against trunk revision . ATTACHMENT ID: 12619748 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 hadoop1.0{color}. The patch compiles against the hadoop 1.0 profile. {color:green}+1 hadoop1.1{color}. The patch compiles against the hadoop 1.1 profile. {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100 {color:red}-1 site{color}. The patch appears to cause mvn site goal to fail. {color:red}-1 core tests{color}. 
The patch failed these unit tests: org.apache.hadoop.hbase.security.access.TestAccessController Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/8241//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8241//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8241//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8241//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8241//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8241//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8241//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8241//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8241//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8241//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/8241//console This message is automatically generated. 
truncate_preserve should get table split keys as it is instead of converting them to string type and then again to bytes Key: HBASE-8859 URL: https://issues.apache.org/jira/browse/HBASE-8859 Project: HBase Issue Type: Bug Components: scripts Affects Versions: 0.95.1 Reporter: rajeshbabu Assignee: rajeshbabu Fix For: 0.98.0, 0.99.0 Attachments: HBASE-8859-Test_to_reproduce.patch, HBASE-8859_trunk.patch, HBASE-8859_trunk_2.patch, HBASE-8859_trunk_3.patch, HBASE-8859_trunk_4.patch If we use int, long, or double bytes as split keys, we do not recreate the table with the same split keys, because converting them directly to strings and then back to bytes yields different split keys; sometimes we get an IllegalArgumentException because the (converted) split keys collide. Instead, we can get the split keys directly from HTable and pass them while creating the table.
{code}
h_table = org.apache.hadoop.hbase.client.HTable.new(conf, table_name)
splits = h_table.getRegionLocations().keys().map{|i| i.getStartKey}
splits = org.apache.hadoop.hbase.util.Bytes.toByteArrays(splits)
{code}
{code}
Truncating 'emp3' table (it may take a while):
- Disabling table...
- Dropping table...
- Creating table with region boundaries...
ERROR: java.lang.IllegalArgumentException: All split keys must be unique, found duplicate:
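The core of the bug is that a String round trip is lossy for arbitrary key bytes: invalid UTF-8 sequences collapse to the replacement character, so two distinct split keys can become identical after the conversion, which is exactly the "All split keys must be unique" failure shown in the error output above. A minimal, self-contained demonstration:

```java
import java.nio.charset.StandardCharsets;

// Demonstrates why converting arbitrary split-key bytes to a String and
// back is lossy: invalid UTF-8 bytes decode to U+FFFD (the replacement
// character), erasing the difference between keys.
public class SplitKeyRoundTrip {
    static byte[] roundTrip(byte[] key) {
        return new String(key, StandardCharsets.UTF_8).getBytes(StandardCharsets.UTF_8);
    }
}
```

Passing the raw byte arrays straight through (as the patch does via Bytes.toByteArrays on the keys read from HTable) avoids the character decoding entirely.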
[jira] [Commented] (HBASE-8558) Add timeout limit for HBaseClient dataOutputStream
[ https://issues.apache.org/jira/browse/HBASE-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13853950#comment-13853950 ] Hadoop QA commented on HBASE-8558: -- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12619777/HBASE-8558-0.94.txt against trunk revision . ATTACHMENT ID: 12619777 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:red}-1 patch{color}. The patch command could not apply the patch. Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/8244//console This message is automatically generated. Add timeout limit for HBaseClient dataOutputStream -- Key: HBASE-8558 URL: https://issues.apache.org/jira/browse/HBASE-8558 Project: HBase Issue Type: Bug Components: Client Affects Versions: 0.94.5, 0.94.14 Reporter: wanbin Assignee: Liang Xie Attachments: HBASE-8558-0.94.txt I ran jstack on the client host. The result is below.
hbase-tablepool-60-thread-34 daemon prio=10 tid=0x7f1e65a48000 nid=0x5173 runnable [0x579cc000]
   java.lang.Thread.State: RUNNABLE
	at sun.nio.ch.EPollArrayWrapper.epollWait(Native Method)
	at sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:210)
	at sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:65)
	at sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69)
	- locked 0x000758cb0780 (a sun.nio.ch.Util$2)
	- locked 0x000758cb0770 (a java.util.Collections$UnmodifiableSet)
	- locked 0x000758cb0548 (a sun.nio.ch.EPollSelectorImpl)
	at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80)
	at org.apache.hadoop.net.SocketIOWithTimeout$SelectorPool.select(SocketIOWithTimeout.java:336)
	at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:158)
	at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:153)
	at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:114)
	at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:65)
	at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123)
	- locked 0x000754e978a0 (a java.io.BufferedOutputStream)
	at java.io.DataOutputStream.flush(DataOutputStream.java:106)
	at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.sendParam(HBaseClient.java:620)
	- locked 0x000754e97880 (a java.io.DataOutputStream)
	at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:975)
	at org.apache.hadoop.hbase.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:86)
	at $Proxy13.multi(Unknown Source)
	at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1395)
	at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1393)
	at org.apache.hadoop.hbase.client.ServerCallable.withoutRetries(ServerCallable.java:210)
	at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1402)
	at
org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1390)
	at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
	at java.util.concurrent.FutureTask.run(FutureTask.java:138)
	at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
	at java.lang.Thread.run(Thread.java:662)
This thread has hung for one hour. Meanwhile, another thread is trying to close the connection:
IPC Client (1983049639) connection to dump002030.cm6.tbsite.net/10.246.2.30:30020 from admin daemon prio=10 tid=0x7f1e70674800 nid=0x3d76 waiting for monitor entry [0x4bc0f000]
   java.lang.Thread.State: BLOCKED (on object monitor)
	at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123)
	- waiting to lock 0x000754e978a0 (a java.io.BufferedOutputStream)
	at java.io.DataOutputStream.flush(DataOutputStream.java:106)
	at java.io.FilterOutputStream.close(FilterOutputStream.java:140)
	at
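The hang is possible because Socket.setSoTimeout only bounds reads; a write blocked on an unresponsive peer can wait indefinitely unless the output stream is built with a write timeout (in Hadoop, SocketIOWithTimeout / NetUtils.getOutputStream(socket, timeout) serve this purpose, and the issue title suggests wiring that into HBaseClient's dataOutputStream). Below is a standalone sketch of the underlying technique — a non-blocking write bounded by a selector deadline — not the attached patch:

```java
import java.io.IOException;
import java.nio.ByteBuffer;
import java.nio.channels.SelectionKey;
import java.nio.channels.Selector;
import java.nio.channels.SocketChannel;

// Sketch of a bounded-time socket write using NIO. Returns false if the
// deadline passes before the whole buffer is written, instead of blocking
// forever the way a plain SocketOutputStream write can.
public class TimedWrite {
    public static boolean writeWithTimeout(SocketChannel ch, ByteBuffer buf, long timeoutMs)
            throws IOException {
        ch.configureBlocking(false);
        long deadline = System.currentTimeMillis() + timeoutMs;
        try (Selector sel = Selector.open()) {
            ch.register(sel, SelectionKey.OP_WRITE);
            while (buf.hasRemaining()) {
                long remaining = deadline - System.currentTimeMillis();
                if (remaining <= 0) {
                    return false; // timed out mid-write; caller can tear down the connection
                }
                if (sel.select(remaining) > 0) {
                    sel.selectedKeys().clear();
                    ch.write(buf); // may be a partial write; loop until drained
                }
            }
        }
        return true;
    }
}
```

A small write to a healthy peer completes immediately; against a peer whose TCP window is full (as in the jstack above), the method returns false once the deadline passes rather than holding the stream lock for an hour.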
[jira] [Commented] (HBASE-10214) RegionServer shuts down improperly and leaves the dir in .old undeleted.
[ https://issues.apache.org/jira/browse/HBASE-10214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13853952#comment-13853952 ] Jean-Marc Spaggiari commented on HBASE-10214: - Hi [~aoxiang], which HBase version did you try with? The trace doesn't seem to align with a recent one.
[jira] [Commented] (HBASE-10214) RegionServer shuts down improperly and leaves the dir in .old undeleted.
[ https://issues.apache.org/jira/browse/HBASE-10214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13853959#comment-13853959 ] binlijin commented on HBASE-10214: -- [~jmspaggi], I use 0.94.10; this patch is for the 0.94 branch.
[jira] [Commented] (HBASE-9346) HBCK should provide an option to check if regions boundaries are the same in META and in stores.
[ https://issues.apache.org/jira/browse/HBASE-9346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13853960#comment-13853960 ] Jean-Marc Spaggiari commented on HBASE-9346: For the >= 0 vs > 0, I think we should keep >= 0. The storesLastKey should almost never be equal to metaLastKey, but there is nothing to prevent that, so it still can be. If it's never equal, then >= will not hurt. If it is, then >= will be good to have. I might be wrong ;) But that seems to be correct. HBCK should provide an option to check if regions boundaries are the same in META and in stores. Key: HBASE-9346 URL: https://issues.apache.org/jira/browse/HBASE-9346 Project: HBase Issue Type: Bug Components: hbck, Operability Affects Versions: 0.94.14, 0.98.1, 0.99.0, 0.96.1.1 Reporter: Jean-Marc Spaggiari Assignee: Jean-Marc Spaggiari Attachments: HBASE-9346-v0-0.94.patch, HBASE-9346-v1-trunk.patch, HBASE-9346-v2-trunk.patch, HBASE-9346-v3-trunk.patch, HBASE-9346-v4-trunk.patch, HBASE-9346-v5-trunk.patch, HBASE-9346-v6-trunk.patch, HBASE-9346-v7-trunk.patch, HBASE-9346-v8-trunk.patch If META doesn't have the same region boundaries as the store files, writes and reads might go to the wrong place. We need to provide a way to check that within HBCK. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
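The comparison being debated can be made concrete. Using an unsigned lexicographic byte comparison (mimicking org.apache.hadoop.hbase.util.Bytes.compareTo), keeping >= 0 means the rare storesLastKey == metaLastKey case is still flagged, which is the behavior argued for in the comment above. The class and method names below are illustrative, not the HBCK patch:

```java
// Sketch of the region-boundary check: flag a problem when the last key
// found in the store files sorts at or beyond META's end key.
public class BoundaryCheck {
    // Unsigned lexicographic compare, like Bytes.compareTo in HBase.
    static int compareUnsigned(byte[] a, byte[] b) {
        int n = Math.min(a.length, b.length);
        for (int i = 0; i < n; i++) {
            int d = (a[i] & 0xff) - (b[i] & 0xff);
            if (d != 0) return d;
        }
        return a.length - b.length;
    }

    /** ">= 0" keeps the rare-but-possible equal case flagged as well. */
    static boolean lastKeyOutOfBounds(byte[] storesLastKey, byte[] metaLastKey) {
        return compareUnsigned(storesLastKey, metaLastKey) >= 0;
    }
}
```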
[jira] [Commented] (HBASE-10214) RegionServer shuts down improperly and leaves the dir in .old undeleted.
[ https://issues.apache.org/jira/browse/HBASE-10214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13853967#comment-13853967 ] Jean-Marc Spaggiari commented on HBASE-10214: - I'm not able to find the same lines in 0.94.10 either. Line 880 of HRegionServer is: {code} closeWAL(abortRequested ? false : true); {code} Line 753 is: {code} registerMBean(); {code} Code for 0.94.10 and 0.94.15 is the same for tryRegionServerReport(), so that should not be an issue. But it might be interesting to see what was throwing this NPE... Was it hbaseMaster, like in your patch?
[jira] [Commented] (HBASE-10173) Need HFile version check in security coprocessors
[ https://issues.apache.org/jira/browse/HBASE-10173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13853979#comment-13853979 ] Hudson commented on HBASE-10173: FAILURE: Integrated in HBase-0.98 #26 (See [https://builds.apache.org/job/HBase-0.98/26/]) HBASE-10173. Need HFile version check in security coprocessors (apurtell: rev 1552504) * /hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/HFile.java * /hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/security/access/AccessController.java * /hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/security/visibility/VisibilityController.java * /hbase/branches/0.98/hbase-server/src/test/java/org/apache/hadoop/hbase/rest/TestScannersWithLabels.java * /hbase/branches/0.98/hbase-thrift/src/test/java/org/apache/hadoop/hbase/thrift2/TestThriftHBaseServiceHandlerWithLabels.java Need HFile version check in security coprocessors - Key: HBASE-10173 URL: https://issues.apache.org/jira/browse/HBASE-10173 Project: HBase Issue Type: Improvement Components: security Affects Versions: 0.98.0, 0.99.0 Reporter: Anoop Sam John Assignee: Andrew Purtell Priority: Critical Fix For: 0.98.0, 0.99.0 Attachments: 10173.patch, 10173.patch, HBASE-10173.patch, HBASE-10173_partial.patch Cell-level visibility labels are stored as cell tags, so HFile V3 is the minimum version that can support this feature. Better to have a version check in VisibilityController: if someone uses this CP with an older HFile version such as V2, we should throw an error. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
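The check being described is a fail-fast guard at coprocessor startup. A hedged sketch of the idea — the class, constant, and method names here are illustrative, not the actual HFile or VisibilityController API:

```java
// Sketch of refusing to start a security coprocessor when the configured
// HFile format version cannot carry the cell tags that labels rely on.
public class HFileVersionGuard {
    // HFile V3 is the first version that supports cell tags (per the issue).
    static final int MIN_VERSION_WITH_TAGS = 3;

    static void checkVersion(int configuredVersion) {
        if (configuredVersion < MIN_VERSION_WITH_TAGS) {
            throw new IllegalStateException(
                "HFile format version " + configuredVersion
                + " does not support cell tags; version "
                + MIN_VERSION_WITH_TAGS + " or later is required");
        }
    }
}
```

Failing here, at coprocessor start, surfaces the misconfiguration immediately instead of letting label data be written without its tags.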
[jira] [Commented] (HBASE-10138) incorrect or confusing test value is used in block caches
[ https://issues.apache.org/jira/browse/HBASE-10138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13853978#comment-13853978 ] Hudson commented on HBASE-10138: FAILURE: Integrated in HBase-0.98 #26 (See [https://builds.apache.org/job/HBase-0.98/26/]) HBASE-10138. Incorrect or confusing test value is used in block caches (Sergey Shelukhin) (apurtell: rev 1552505) * /hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/CacheConfig.java * /hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/bucket/BucketCache.java * /hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFile.java * /hbase/branches/0.98/hbase-server/src/test/java/org/apache/hadoop/hbase/io/hfile/bucket/TestBucketCache.java * /hbase/branches/0.98/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestStore.java * /hbase/branches/0.98/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestStoreFile.java incorrect or confusing test value is used in block caches - Key: HBASE-10138 URL: https://issues.apache.org/jira/browse/HBASE-10138 Project: HBase Issue Type: Bug Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Fix For: 0.98.0, 0.99.0 Attachments: HBASE-10138.patch DEFAULT_BLOCKSIZE_SMALL is described as: {code} // Make default block size for StoreFiles 8k while testing. TODO: FIX! // Need to make it 8k for testing. public static final int DEFAULT_BLOCKSIZE_SMALL = 8 * 1024; {code} This value is used on production path in CacheConfig thru HStore/HRegion, and passed to various cache object. We should change it to actual block size, or if it is somehow by design at least we should clarify it and remove the comment. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10207) ZKVisibilityLabelWatcher : Populate the labels cache on startup
[ https://issues.apache.org/jira/browse/HBASE-10207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13853980#comment-13853980 ] Hudson commented on HBASE-10207: FAILURE: Integrated in HBase-0.98 #26 (See [https://builds.apache.org/job/HBase-0.98/26/]) HBASE-10207 ZKVisibilityLabelWatcher : Populate the labels cache on startup (anoopsamjohn: rev 1552489) * /hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/security/visibility/ZKVisibilityLabelWatcher.java ZKVisibilityLabelWatcher : Populate the labels cache on startup --- Key: HBASE-10207 URL: https://issues.apache.org/jira/browse/HBASE-10207 Project: HBase Issue Type: Bug Affects Versions: 0.98.0 Reporter: Anoop Sam John Assignee: Anoop Sam John Priority: Blocker Fix For: 0.98.0, 0.99.0 Attachments: HBASE-10207.patch -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10214) Regionserver shutdown impropery and leave the dir in .old not delete.
[ https://issues.apache.org/jira/browse/HBASE-10214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13853984#comment-13853984 ] binlijin commented on HBASE-10214: -- Oh, sorry, the line numbers do not match the Apache HBase 0.94.10 version; this is our own internal version, which is based on Apache HBase 0.94.10. {code} long now = System.currentTimeMillis(); if ((now - lastMsg) >= msgInterval) { doMetrics(); tryRegionServerReport(); // 753 lastMsg = System.currentTimeMillis(); } if (!this.stopped) this.sleeper.sleep(); void tryRegionServerReport() throws IOException { HServerLoad hsl = buildServerLoad(); // Why we do this? this.requestCount.set(0); try { this.hbaseMaster.regionServerReport(this.serverNameFromMasterPOV.getVersionedBytes(), hsl); // line 880 } catch (IOException ioe) { if (ioe instanceof RemoteException) { ioe = ((RemoteException)ioe).unwrapRemoteException(); } if (ioe instanceof YouAreDeadException) { // This will be caught and handled as a fatal error in run() throw ioe; } // Couldn't connect to the master, get location from zk and reconnect // Method blocks until new master is found or we are stopped getMaster(); } } {code} Regionserver shutdown impropery and leave the dir in .old not delete. 
- Key: HBASE-10214 URL: https://issues.apache.org/jira/browse/HBASE-10214 Project: HBase Issue Type: Bug Reporter: binlijin Attachments: HBASE-10214-94.patch RegionServer log {code} 2013-12-18 15:17:45,771 DEBUG org.apache.hadoop.hbase.regionserver.HRegionServer: Waiting on 51b27391410efdca841db264df46085f 2013-12-18 15:17:45,776 INFO org.apache.hadoop.hbase.regionserver.HRegionServer: Connected to master at null 2013-12-18 15:17:48,776 INFO org.apache.hadoop.hbase.regionserver.HRegionServer: STOPPED: Exiting; cluster shutdown set and not carrying any regions 2013-12-18 15:17:48,776 FATAL org.apache.hadoop.hbase.regionserver.HRegionServer: ABORTING region server node,60020,1384410974572: Unhandled exception: null java.lang.NullPointerException at org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:880) at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:753) at java.lang.Thread.run(Thread.java:662) {code} -- This message was sent by Atlassian JIRA (v6.1.4#6159)
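The failure mode under discussion — hbaseMaster becoming null during cluster shutdown while the run loop still calls tryRegionServerReport() — can be reduced to a small sketch. MasterProtocol and the field names below are simplified stand-ins, not the real HBase 0.94 types, and the null guard shown is one possible shape of a fix, not the attached patch.

```java
import java.io.IOException;

// Simplified stand-in for the NPE scenario; MasterProtocol and these fields
// are invented for the sketch and are not the real HBase 0.94 types.
public class ReportGuardSketch {
    interface MasterProtocol {
        void regionServerReport(byte[] versionedServerName, Object serverLoad) throws IOException;
    }

    private volatile MasterProtocol hbaseMaster; // nulled once the master connection is torn down

    // Defensive variant: read the field once and skip the report while no
    // master connection exists, instead of dereferencing null (the NPE at :880).
    boolean tryRegionServerReport() throws IOException {
        MasterProtocol master = this.hbaseMaster;
        if (master == null) {
            return false; // master gone (e.g. cluster shutdown); nothing to report to
        }
        master.regionServerReport(new byte[0], new Object());
        return true;
    }

    public static void main(String[] args) throws IOException {
        ReportGuardSketch rs = new ReportGuardSketch();
        System.out.println(rs.tryRegionServerReport()); // prints false, no NPE
    }
}
```

Reading the field into a local first also narrows the race window where another thread nulls the field between the check and the call.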
[jira] [Commented] (HBASE-10161) [AccessController] Tolerate regions in recovery
[ https://issues.apache.org/jira/browse/HBASE-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13853990#comment-13853990 ] Hadoop QA commented on HBASE-10161: --- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12619800/HBASE-10161_V2.patch against trunk revision . ATTACHMENT ID: 12619800 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 hadoop1.0{color}. The patch compiles against the hadoop 1.0 profile. {color:green}+1 hadoop1.1{color}. The patch compiles against the hadoop 1.1 profile. {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100 {color:red}-1 site{color}. The patch appears to cause mvn site goal to fail. {color:red}-1 core tests{color}. 
The patch failed these unit tests: org.apache.hadoop.hbase.security.access.TestAccessController Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/8243//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8243//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8243//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8243//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8243//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8243//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8243//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8243//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8243//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8243//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/8243//console This message is automatically generated. 
[AccessController] Tolerate regions in recovery --- Key: HBASE-10161 URL: https://issues.apache.org/jira/browse/HBASE-10161 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Andrew Purtell Assignee: Anoop Sam John Priority: Blocker Fix For: 0.98.0, 0.96.2, 0.99.0 Attachments: HBASE-10161.patch, HBASE-10161_V2.patch AccessController fixes for the issue also affecting VisibilityController described on HBASE-10148. Coprocessors that initialize in postOpen upcalls must check if the region is still in recovery and defer initialization until recovery is complete. We need to add a new CP hook for post recovery upcalls and modify existing CPs to defer initialization until this new hook as needed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10207) ZKVisibilityLabelWatcher : Populate the labels cache on startup
[ https://issues.apache.org/jira/browse/HBASE-10207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854006#comment-13854006 ] Hudson commented on HBASE-10207: FAILURE: Integrated in HBase-TRUNK #4741 (See [https://builds.apache.org/job/HBase-TRUNK/4741/]) HBASE-10207 ZKVisibilityLabelWatcher : Populate the labels cache on startup (anoopsamjohn: rev 1552488) * /hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/security/visibility/ZKVisibilityLabelWatcher.java ZKVisibilityLabelWatcher : Populate the labels cache on startup --- Key: HBASE-10207 URL: https://issues.apache.org/jira/browse/HBASE-10207 Project: HBase Issue Type: Bug Affects Versions: 0.98.0 Reporter: Anoop Sam John Assignee: Anoop Sam John Priority: Blocker Fix For: 0.98.0, 0.99.0 Attachments: HBASE-10207.patch -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10173) Need HFile version check in security coprocessors
[ https://issues.apache.org/jira/browse/HBASE-10173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854005#comment-13854005 ] Hudson commented on HBASE-10173: FAILURE: Integrated in HBase-TRUNK #4741 (See [https://builds.apache.org/job/HBase-TRUNK/4741/]) HBASE-10173. Need HFile version check in security coprocessors (apurtell: rev 1552503) * /hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/HFile.java * /hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/security/access/AccessController.java * /hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/security/visibility/VisibilityController.java * /hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/rest/TestScannersWithLabels.java * /hbase/trunk/hbase-thrift/src/test/java/org/apache/hadoop/hbase/thrift2/TestThriftHBaseServiceHandlerWithLabels.java Need HFile version check in security coprocessors - Key: HBASE-10173 URL: https://issues.apache.org/jira/browse/HBASE-10173 Project: HBase Issue Type: Improvement Components: security Affects Versions: 0.98.0, 0.99.0 Reporter: Anoop Sam John Assignee: Andrew Purtell Priority: Critical Fix For: 0.98.0, 0.99.0 Attachments: 10173.patch, 10173.patch, HBASE-10173.patch, HBASE-10173_partial.patch Cell level visibility labels are stored as cell tags. So HFile V3 is the minimum version which can support this feature. Better to have a version check in VisibilityController. Some one using this CP but with any HFile version as V2, we can better throw error. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10214) Regionserver shutdown impropery and leave the dir in .old not delete.
[ https://issues.apache.org/jira/browse/HBASE-10214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854013#comment-13854013 ] Jean-Marc Spaggiari commented on HBASE-10214: - Ok. Make sense now ;) Thanks for the clarification. Is there any risk for isClusterUp() to return true but for hbaseMaster to be null? If so, we will still get a NPE. no? Regionserver shutdown impropery and leave the dir in .old not delete. - Key: HBASE-10214 URL: https://issues.apache.org/jira/browse/HBASE-10214 Project: HBase Issue Type: Bug Reporter: binlijin Attachments: HBASE-10214-94.patch RegionServer log {code} 2013-12-18 15:17:45,771 DEBUG org.apache.hadoop.hbase.regionserver.HRegionServer: Waiting on 51b27391410efdca841db264df46085f 2013-12-18 15:17:45,776 INFO org.apache.hadoop.hbase.regionserver.HRegionServer: Connected to master at null 2013-12-18 15:17:48,776 INFO org.apache.hadoop.hbase.regionserver.HRegionServer: STOPPED: Exiting; cluster shutdown set and not carrying any regions 2013-12-18 15:17:48,776 FATAL org.apache.hadoop.hbase.regionserver.HRegionServer: ABORTING region server node,60020,1384410974572: Unhandled exception: null java.lang.NullPointerException at org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:880) at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:753) at java.lang.Thread.run(Thread.java:662) {code} -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-9151) HBCK cannot fix when meta server znode deleted, this can happen if all region servers stopped and there are no logs to split.
[ https://issues.apache.org/jira/browse/HBASE-9151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854023#comment-13854023 ] Hadoop QA commented on HBASE-9151: -- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12619761/HBASE-9151.patch against trunk revision . ATTACHMENT ID: 12619761 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 3 new or modified tests. {color:green}+1 hadoop1.0{color}. The patch compiles against the hadoop 1.0 profile. {color:green}+1 hadoop1.1{color}. The patch compiles against the hadoop 1.1 profile. {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100 {color:red}-1 site{color}. The patch appears to cause mvn site goal to fail. {color:red}-1 core tests{color}. The patch failed these unit tests: org.apache.hadoop.hbase.regionserver.TestRSKilledWhenInitializing org.apache.hadoop.hbase.security.access.TestAccessController {color:red}-1 core zombie tests{color}. 
There are 1 zombie test(s): at org.apache.hadoop.hbase.TestAcidGuarantees.testMixedAtomicity(TestAcidGuarantees.java:351) Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/8245//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8245//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8245//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8245//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8245//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8245//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8245//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8245//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8245//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8245//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/8245//console This message is automatically generated. HBCK cannot fix when meta server znode deleted, this can happen if all region servers stopped and there are no logs to split. 
- Key: HBASE-9151 URL: https://issues.apache.org/jira/browse/HBASE-9151 Project: HBase Issue Type: Bug Components: hbck Reporter: rajeshbabu Assignee: rajeshbabu Fix For: 0.98.0, 0.99.0 Attachments: HBASE-9151.patch When the meta server znode is deleted and meta is in FAILED_OPEN state, hbck cannot fix it. This scenario can occur when all region servers are stopped by the stop command and no RS is started within 10 secs (with default configurations). {code} public void assignMeta() throws KeeperException { MetaRegionTracker.deleteMetaLocation(this.watcher); assign(HRegionInfo.FIRST_META_REGIONINFO, true); } {code} -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10138) incorrect or confusing test value is used in block caches
[ https://issues.apache.org/jira/browse/HBASE-10138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854025#comment-13854025 ] Hudson commented on HBASE-10138: FAILURE: Integrated in HBase-0.98-on-Hadoop-1.1 #23 (See [https://builds.apache.org/job/HBase-0.98-on-Hadoop-1.1/23/]) HBASE-10138. Incorrect or confusing test value is used in block caches (Sergey Shelukhin) (apurtell: rev 1552505) * /hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/CacheConfig.java * /hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/bucket/BucketCache.java * /hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFile.java * /hbase/branches/0.98/hbase-server/src/test/java/org/apache/hadoop/hbase/io/hfile/bucket/TestBucketCache.java * /hbase/branches/0.98/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestStore.java * /hbase/branches/0.98/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestStoreFile.java incorrect or confusing test value is used in block caches - Key: HBASE-10138 URL: https://issues.apache.org/jira/browse/HBASE-10138 Project: HBase Issue Type: Bug Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Fix For: 0.98.0, 0.99.0 Attachments: HBASE-10138.patch DEFAULT_BLOCKSIZE_SMALL is described as: {code} // Make default block size for StoreFiles 8k while testing. TODO: FIX! // Need to make it 8k for testing. public static final int DEFAULT_BLOCKSIZE_SMALL = 8 * 1024; {code} This value is used on production path in CacheConfig thru HStore/HRegion, and passed to various cache object. We should change it to actual block size, or if it is somehow by design at least we should clarify it and remove the comment. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10207) ZKVisibilityLabelWatcher : Populate the labels cache on startup
[ https://issues.apache.org/jira/browse/HBASE-10207?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854027#comment-13854027 ] Hudson commented on HBASE-10207: FAILURE: Integrated in HBase-0.98-on-Hadoop-1.1 #23 (See [https://builds.apache.org/job/HBase-0.98-on-Hadoop-1.1/23/]) HBASE-10207 ZKVisibilityLabelWatcher : Populate the labels cache on startup (anoopsamjohn: rev 1552489) * /hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/security/visibility/ZKVisibilityLabelWatcher.java ZKVisibilityLabelWatcher : Populate the labels cache on startup --- Key: HBASE-10207 URL: https://issues.apache.org/jira/browse/HBASE-10207 Project: HBase Issue Type: Bug Affects Versions: 0.98.0 Reporter: Anoop Sam John Assignee: Anoop Sam John Priority: Blocker Fix For: 0.98.0, 0.99.0 Attachments: HBASE-10207.patch -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10173) Need HFile version check in security coprocessors
[ https://issues.apache.org/jira/browse/HBASE-10173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854026#comment-13854026 ] Hudson commented on HBASE-10173: FAILURE: Integrated in HBase-0.98-on-Hadoop-1.1 #23 (See [https://builds.apache.org/job/HBase-0.98-on-Hadoop-1.1/23/]) HBASE-10173. Need HFile version check in security coprocessors (apurtell: rev 1552504) * /hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/HFile.java * /hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/security/access/AccessController.java * /hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/security/visibility/VisibilityController.java * /hbase/branches/0.98/hbase-server/src/test/java/org/apache/hadoop/hbase/rest/TestScannersWithLabels.java * /hbase/branches/0.98/hbase-thrift/src/test/java/org/apache/hadoop/hbase/thrift2/TestThriftHBaseServiceHandlerWithLabels.java Need HFile version check in security coprocessors - Key: HBASE-10173 URL: https://issues.apache.org/jira/browse/HBASE-10173 Project: HBase Issue Type: Improvement Components: security Affects Versions: 0.98.0, 0.99.0 Reporter: Anoop Sam John Assignee: Andrew Purtell Priority: Critical Fix For: 0.98.0, 0.99.0 Attachments: 10173.patch, 10173.patch, HBASE-10173.patch, HBASE-10173_partial.patch Cell level visibility labels are stored as cell tags. So HFile V3 is the minimum version which can support this feature. Better to have a version check in VisibilityController. Some one using this CP but with any HFile version as V2, we can better throw error. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10214) Regionserver shutdown impropery and leave the dir in .old not delete.
[ https://issues.apache.org/jira/browse/HBASE-10214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13854055#comment-13854055 ] binlijin commented on HBASE-10214: -- No, it looks impossible. Regionserver shutdown impropery and leave the dir in .old not delete. - Key: HBASE-10214 URL: https://issues.apache.org/jira/browse/HBASE-10214 Project: HBase Issue Type: Bug Reporter: binlijin Attachments: HBASE-10214-94.patch RegionServer log {code} 2013-12-18 15:17:45,771 DEBUG org.apache.hadoop.hbase.regionserver.HRegionServer: Waiting on 51b27391410efdca841db264df46085f 2013-12-18 15:17:45,776 INFO org.apache.hadoop.hbase.regionserver.HRegionServer: Connected to master at null 2013-12-18 15:17:48,776 INFO org.apache.hadoop.hbase.regionserver.HRegionServer: STOPPED: Exiting; cluster shutdown set and not carrying any regions 2013-12-18 15:17:48,776 FATAL org.apache.hadoop.hbase.regionserver.HRegionServer: ABORTING region server node,60020,1384410974572: Unhandled exception: null java.lang.NullPointerException at org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:880) at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:753) at java.lang.Thread.run(Thread.java:662) {code} -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-9151) HBCK cannot fix when meta server znode deleted, this can happen if all region servers stopped and there are no logs to split.
[ https://issues.apache.org/jira/browse/HBASE-9151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854057#comment-13854057 ] rajeshbabu commented on HBASE-9151: --- TestRSKilledWhenInitializing test case failure is related to the patch. I will fix and upload new patch. HBCK cannot fix when meta server znode deleted, this can happen if all region servers stopped and there are no logs to split. - Key: HBASE-9151 URL: https://issues.apache.org/jira/browse/HBASE-9151 Project: HBase Issue Type: Bug Components: hbck Reporter: rajeshbabu Assignee: rajeshbabu Fix For: 0.98.0, 0.99.0 Attachments: HBASE-9151.patch When meta server znode deleted and meta in FAILED_OPEN state, then hbck cannot fix it. This scenario can come when all region servers stopped by stop command and didnt start any RS within 10 secs(with default configurations). {code} public void assignMeta() throws KeeperException { MetaRegionTracker.deleteMetaLocation(this.watcher); assign(HRegionInfo.FIRST_META_REGIONINFO, true); } {code} -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Updated] (HBASE-10214) Regionserver shutdown impropery and leave the dir in .old not delete.
[ https://issues.apache.org/jira/browse/HBASE-10214?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] binlijin updated HBASE-10214: - Attachment: HBASE-10214-94-V2.patch Regionserver shutdown impropery and leave the dir in .old not delete. - Key: HBASE-10214 URL: https://issues.apache.org/jira/browse/HBASE-10214 Project: HBase Issue Type: Bug Reporter: binlijin Attachments: HBASE-10214-94-V2.patch, HBASE-10214-94.patch RegionServer log {code} 2013-12-18 15:17:45,771 DEBUG org.apache.hadoop.hbase.regionserver.HRegionServer: Waiting on 51b27391410efdca841db264df46085f 2013-12-18 15:17:45,776 INFO org.apache.hadoop.hbase.regionserver.HRegionServer: Connected to master at null 2013-12-18 15:17:48,776 INFO org.apache.hadoop.hbase.regionserver.HRegionServer: STOPPED: Exiting; cluster shutdown set and not carrying any regions 2013-12-18 15:17:48,776 FATAL org.apache.hadoop.hbase.regionserver.HRegionServer: ABORTING region server node,60020,1384410974572: Unhandled exception: null java.lang.NullPointerException at org.apache.hadoop.hbase.regionserver.HRegionServer.tryRegionServerReport(HRegionServer.java:880) at org.apache.hadoop.hbase.regionserver.HRegionServer.run(HRegionServer.java:753) at java.lang.Thread.run(Thread.java:662) {code} -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10173) Need HFile version check in security coprocessors
[ https://issues.apache.org/jira/browse/HBASE-10173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854072#comment-13854072 ] Anoop Sam John commented on HBASE-10173: https://builds.apache.org/job/PreCommit-HBASE-Build/8241//testReport/org.apache.hadoop.hbase.security.access/TestAccessController/testCellPermissions/ Failure is related to commit.. I can give a small addendum here. Need HFile version check in security coprocessors - Key: HBASE-10173 URL: https://issues.apache.org/jira/browse/HBASE-10173 Project: HBase Issue Type: Improvement Components: security Affects Versions: 0.98.0, 0.99.0 Reporter: Anoop Sam John Assignee: Andrew Purtell Priority: Critical Fix For: 0.98.0, 0.99.0 Attachments: 10173.patch, 10173.patch, HBASE-10173.patch, HBASE-10173_partial.patch Cell level visibility labels are stored as cell tags. So HFile V3 is the minimum version which can support this feature. Better to have a version check in VisibilityController. Some one using this CP but with any HFile version as V2, we can better throw error. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Created] (HBASE-10216) Change HBase to support local compactions
David Witten created HBASE-10216: Summary: Change HBase to support local compactions Key: HBASE-10216 URL: https://issues.apache.org/jira/browse/HBASE-10216 Project: HBase Issue Type: New Feature Components: Compaction Environment: All Reporter: David Witten As I understand it, compactions read data from DFS and write to DFS. This means that even when the reading occurs on the local host (because the region server has a local copy), all the writing must go over the network to the other replicas. This proposal suggests that HBase would perform much better if all the reading and writing occurred locally and did not go over the network. I propose that the DFS interface be extended to provide a method that would merge files, so that the merging and deleting can be performed on local data nodes with no file contents moving over the network. The method would take a list of paths to be merged and deleted, the merged-file path, and an indication of a file-format-aware class that would be run on each data node to perform the merge. The merge method provided by this merging class would be passed files open for reading for all the files to be merged and one file open for writing. It would read all the input files and append to the output file using some standard API that would work across all DFS implementations. The DFS would ensure that the merge had happened properly on all replicas before returning to the caller. It could be that greater resiliency could be achieved by implementing the deletion as a separate phase that is only done after enough of the replicas had completed the merge. HBase would be changed to use the new merge method for compactions, and would provide an implementation of the merging class that works with HFiles. This proposal would require custom code that understands the file format to be runnable by the data nodes to manage the merge. 
So there would need to be a facility to load classes into DFS if there isn't such a facility already. Or, less generally, HDFS could build in support for HFile merging. The merge method might be optional. If the DFS implementation did not provide it, a generic version that performed the merge on top of the regular DFS interfaces would be used. It may be that this method needs to be tweaked or ignored when the region server does not have a local copy of the data so that, as happens currently, one copy of the data moves to the region server. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
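As a rough illustration of the API shape the proposal describes: a format-aware merger interface plus a DFS extension point that runs it locally on each replica. All names here (FileMerger, MergeableFileSystem, ConcatMerger) are invented for this sketch, and a trivial concatenating merger stands in for the HFile-aware implementation the proposal envisions.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.util.List;

// Hypothetical API sketch for the proposed local-compaction DFS extension.
public class LocalCompactionSketch {
    /** Format-aware merge logic, shipped to each data node holding a replica. */
    interface FileMerger {
        void merge(List<InputStream> inputs, OutputStream output) throws IOException;
    }

    /** The extension the proposal asks of the DFS: merge + delete executed
     *  locally on every replica, with no file contents crossing the network. */
    interface MergeableFileSystem {
        void mergeFiles(List<String> inputPaths, String outputPath, FileMerger merger)
            throws IOException;
    }

    /** Trivial merger that concatenates inputs — a stand-in for an HFile-aware one. */
    static class ConcatMerger implements FileMerger {
        public void merge(List<InputStream> inputs, OutputStream output) throws IOException {
            for (InputStream in : inputs) {
                in.transferTo(output); // stream each input file into the merged output
            }
        }
    }

    public static void main(String[] args) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        new ConcatMerger().merge(
            List.of(new ByteArrayInputStream("a".getBytes()),
                    new ByteArrayInputStream("b".getBytes())),
            out);
        System.out.println(out); // prints "ab"
    }
}
```

A real HFile-aware merger would have to rewrite block indexes and bloom filters rather than concatenate, which is part of why the proposal needs format-specific code running on the data nodes.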
[jira] [Updated] (HBASE-10173) Need HFile version check in security coprocessors
[ https://issues.apache.org/jira/browse/HBASE-10173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John updated HBASE-10173: --- Attachment: HBASE-10173_Addendum.patch Addendum to fix test failure . [~apurtell] , [~ram_krish] what do you guys say? Need HFile version check in security coprocessors - Key: HBASE-10173 URL: https://issues.apache.org/jira/browse/HBASE-10173 Project: HBase Issue Type: Improvement Components: security Affects Versions: 0.98.0, 0.99.0 Reporter: Anoop Sam John Assignee: Andrew Purtell Priority: Critical Fix For: 0.98.0, 0.99.0 Attachments: 10173.patch, 10173.patch, HBASE-10173.patch, HBASE-10173_Addendum.patch, HBASE-10173_partial.patch Cell level visibility labels are stored as cell tags. So HFile V3 is the minimum version which can support this feature. Better to have a version check in VisibilityController. Some one using this CP but with any HFile version as V2, we can better throw error. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10216) Change HBase to support local compactions
[ https://issues.apache.org/jira/browse/HBASE-10216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13854117#comment-13854117 ] Liang Xie commented on HBASE-10216: --- Sounds crazy on first read, but yep, it seems reasonable. It would need a lot of work on the HDFS side: you need the corresponding data blocks to always be allocated to the same data nodes, and then the proposed merge could probably bypass most of the network operations. The current HDFS code, however, gives no guarantee that all of an HFile's underlying data blocks land on the same nodes :) Change HBase to support local compactions - Key: HBASE-10216 URL: https://issues.apache.org/jira/browse/HBASE-10216 Project: HBase Issue Type: New Feature Components: Compaction Environment: All Reporter: David Witten As I understand it, compactions read data from DFS and write to DFS. This means that even when the reading occurs on the local host (because the region server has a local copy), all the writing must go over the network to the other replicas. This proposal suggests that HBase would perform much better if all the reading and writing occurred locally and did not go over the network. I propose that the DFS interface be extended to provide a method that would merge files, so that the merging and deleting can be performed on local data nodes with no file contents moving over the network. The method would take a list of paths to be merged and deleted, the merged-file path, and an indication of a file-format-aware class that would be run on each data node to perform the merge. The merge method provided by this merging class would be passed files open for reading for all the files to be merged and one file open for writing. It would read all the input files and append to the output file using some standard API that would work across all DFS implementations. The DFS would ensure that the merge had happened properly on all replicas before returning to the caller. 
It could be that greater resiliency could be achieved by implementing the deletion as a separate phase that is only done after enough of the replicas have completed the merge. HBase would be changed to use the new merge method for compactions, and would provide an implementation of the merging class that works with HFiles. This proposal would require custom code that understands the file format to be runnable by the data nodes to manage the merge, so there would need to be a facility to load classes into the DFS, if such a facility doesn't already exist. Or, less generally, HDFS could build in support for HFile merging. The merge method might be optional: if the DFS implementation did not provide it, a generic version that performed the merge on top of the regular DFS interfaces would be used. It may be that this method needs to be tweaked or ignored when the region server does not have a local copy of the data so that, as happens currently, one copy of the data moves to the region server. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
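The proposal leaves the merge API unspecified; as a rough illustration only, here is a self-contained Java sketch of what such a format-aware merge hook might look like (all names are hypothetical, with plain byte concatenation standing in for the generic fallback merge):

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;
import java.util.Arrays;
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical format-aware merger the DFS would run on each data node.
interface FileMerger {
    void merge(List<InputStream> inputs, OutputStream output) throws IOException;
}

public class LocalMergeSketch {
    // Generic fallback merger: plain byte concatenation. An HFile-aware
    // implementation would rewrite blocks and indexes instead.
    static final FileMerger CONCAT = (inputs, output) -> {
        byte[] buf = new byte[8192];
        for (InputStream in : inputs) {
            int n;
            while ((n = in.read(buf)) != -1) {
                output.write(buf, 0, n);
            }
        }
    };

    // Drives the merger over in-memory "files" to keep the sketch runnable.
    static byte[] mergeAll(FileMerger merger, byte[]... files) throws IOException {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        List<InputStream> inputs = Arrays.stream(files)
            .map(f -> (InputStream) new ByteArrayInputStream(f))
            .collect(Collectors.toList());
        merger.merge(inputs, out);
        return out.toByteArray();
    }

    public static void main(String[] args) throws IOException {
        byte[] merged = mergeAll(CONCAT, "abc".getBytes(), "def".getBytes());
        System.out.println(new String(merged)); // prints "abcdef"
    }
}
```

A real HFile-aware merger would rewrite data blocks and indexes rather than concatenate bytes; the sketch only shows the shape of the interface the DFS would invoke on each data node.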
[jira] [Commented] (HBASE-10173) Need HFile version check in security coprocessors
[ https://issues.apache.org/jira/browse/HBASE-10173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854118#comment-13854118 ] Andrew Purtell commented on HBASE-10173: Yep, annoying this didn't show up locally. Need HFile version check in security coprocessors - Key: HBASE-10173 URL: https://issues.apache.org/jira/browse/HBASE-10173 Project: HBase Issue Type: Improvement Components: security Affects Versions: 0.98.0, 0.99.0 Reporter: Anoop Sam John Assignee: Andrew Purtell Priority: Critical Fix For: 0.98.0, 0.99.0 Attachments: 10173.patch, 10173.patch, HBASE-10173.patch, HBASE-10173_Addendum.patch, HBASE-10173_partial.patch Cell level visibility labels are stored as cell tags, so HFile V3 is the minimum version that can support this feature. Better to have a version check in VisibilityController: if someone uses this CP with HFile version V2, we should throw an error. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
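The guard being asked for is a simple version comparison; a self-contained sketch (class and constant names hypothetical, modeled on the hfile.format.version setting, not the actual patch) might look like:

```java
// Sketch of the requested guard: refuse to start the visibility coprocessor
// unless the configured HFile format version is at least 3, since cell tags
// only exist from HFile V3 onward.
public class HFileVersionGuard {
    static final int MIN_VERSION_FOR_TAGS = 3; // HFile V3 introduced cell tags

    static void checkVersion(int configuredVersion) {
        if (configuredVersion < MIN_VERSION_FOR_TAGS) {
            throw new IllegalStateException(
                "Visibility labels require HFile version >= " + MIN_VERSION_FOR_TAGS
                + ", but hfile.format.version = " + configuredVersion);
        }
    }

    public static void main(String[] args) {
        checkVersion(3); // OK, V3 supports tags
        try {
            checkVersion(2);
            throw new AssertionError("expected rejection of HFile V2");
        } catch (IllegalStateException expected) {
            System.out.println("rejected: " + expected.getMessage());
        }
    }
}
```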
[jira] [Commented] (HBASE-10215) TableNotFoundException should be thrown after removing stale znode in ETH
[ https://issues.apache.org/jira/browse/HBASE-10215?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854120#comment-13854120 ] Hadoop QA commented on HBASE-10215: --- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12619812/HBASE-10215.patch against trunk revision . ATTACHMENT ID: 12619812 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 3 new or modified tests. {color:green}+1 hadoop1.0{color}. The patch compiles against the hadoop 1.0 profile. {color:green}+1 hadoop1.1{color}. The patch compiles against the hadoop 1.1 profile. {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100 {color:red}-1 site{color}. The patch appears to cause mvn site goal to fail. {color:red}-1 core tests{color}. 
The patch failed these unit tests: org.apache.hadoop.hbase.security.access.TestAccessController Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/8246//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8246//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8246//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8246//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8246//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8246//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8246//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8246//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8246//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8246//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/8246//console This message is automatically generated. 
TableNotFoundException should be thrown after removing stale znode in ETH - Key: HBASE-10215 URL: https://issues.apache.org/jira/browse/HBASE-10215 Project: HBase Issue Type: Bug Components: master Affects Versions: 0.96.1, 0.94.14 Reporter: rajeshbabu Assignee: rajeshbabu Priority: Minor Fix For: 0.98.0, 0.94.16, 0.96.2, 0.99.0 Attachments: HBASE-10215.patch Let's suppose the master went down while creating a table; then the znode will be left in ENABLING state, and the master is to recover it on restart if there are no meta entries for the table. While recovering the table we check whether the table exists in meta or not; if not, we remove the znode. After removing the znode we need to throw TableNotFoundException. Presently the exception is not thrown, so the znode will be recreated and will be stale forever. Even on master restart we cannot delete it, and we cannot create a table with the same name either. {code}
// Check if table exists
if (!MetaReader.tableExists(catalogTracker, tableName)) {
  // retainAssignment is true only during recovery. In normal case it is false
  if (!this.skipTableStateCheck) {
    throw new TableNotFoundException(tableName);
  }
  try {
    this.assignmentManager.getZKTable().removeEnablingTable(tableName, true);
  } catch (KeeperException e) {
    // TODO : Use HBCK to clear such nodes
    LOG.warn("Failed to delete the ENABLING node for the table " + tableName
        + ". The table will remain unusable. Run HBCK to manually fix the problem.");
  }
}
{code}
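Stripped of HBase internals, the intended fix is to clean up the stale znode and then throw in the recovery path as well; a toy model of that control flow (all names hypothetical, not the actual patch) is:

```java
import java.util.HashSet;
import java.util.Set;

// Simplified model of the EnableTableHandler fix: when the table is missing
// from meta, the recovery path must remove the stale ENABLING znode AND then
// throw TableNotFoundException, so the znode is never recreated.
public class EnableTableModel {
    static class TableNotFoundException extends RuntimeException {
        TableNotFoundException(String table) { super(table); }
    }

    final Set<String> metaTables = new HashSet<>();
    final Set<String> enablingZnodes = new HashSet<>();

    void prepare(String tableName, boolean skipTableStateCheck) {
        if (!metaTables.contains(tableName)) {
            // skipTableStateCheck is true only during recovery.
            if (skipTableStateCheck) {
                // Clean up the stale znode first ...
                enablingZnodes.remove(tableName);
            }
            // ... then fail in BOTH paths (the bug was returning silently on
            // the recovery path, which let the znode be recreated).
            throw new TableNotFoundException(tableName);
        }
    }

    public static void main(String[] args) {
        EnableTableModel m = new EnableTableModel();
        m.enablingZnodes.add("t1"); // stale ENABLING znode, table not in meta
        boolean threw = false;
        try {
            m.prepare("t1", true);  // recovery path
        } catch (TableNotFoundException e) {
            threw = true;
        }
        System.out.println(threw + " " + m.enablingZnodes.isEmpty()); // true true
    }
}
```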
[jira] [Commented] (HBASE-10173) Need HFile version check in security coprocessors
[ https://issues.apache.org/jira/browse/HBASE-10173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854121#comment-13854121 ] Andrew Purtell commented on HBASE-10173: +1 on addendum Need HFile version check in security coprocessors - Key: HBASE-10173 URL: https://issues.apache.org/jira/browse/HBASE-10173 Project: HBase Issue Type: Improvement Components: security Affects Versions: 0.98.0, 0.99.0 Reporter: Anoop Sam John Assignee: Andrew Purtell Priority: Critical Fix For: 0.98.0, 0.99.0 Attachments: 10173.patch, 10173.patch, HBASE-10173.patch, HBASE-10173_Addendum.patch, HBASE-10173_partial.patch Cell level visibility labels are stored as cell tags, so HFile V3 is the minimum version that can support this feature. Better to have a version check in VisibilityController: if someone uses this CP with HFile version V2, we should throw an error. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Updated] (HBASE-10193) Cleanup HRegion if one of the store fails to open at region initialization
[ https://issues.apache.org/jira/browse/HBASE-10193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ramkrishna.s.vasudevan updated HBASE-10193: --- Fix Version/s: 0.99.0 0.96.2 0.94.15 0.98.0 Cleanup HRegion if one of the store fails to open at region initialization -- Key: HBASE-10193 URL: https://issues.apache.org/jira/browse/HBASE-10193 Project: HBase Issue Type: Bug Components: regionserver Affects Versions: 0.96.1, 0.94.14 Reporter: Aditya Kishore Assignee: Aditya Kishore Priority: Critical Fix For: 0.98.0, 0.94.15, 0.96.2, 0.99.0 Attachments: HBASE-10193.patch, HBASE-10193_0.94.patch, HBASE-10193_0.94_v2.patch, HBASE-10193_0.94_v3.patch, HBASE-10193_0.94_v4.patch, HBASE-10193_v2.patch, HBASE-10193_v3.patch, HBASE-10193_v4.patch While investigating a different issue, I realized that the fix for HBASE-9737 is not sufficient to prevent resource leak if a region fails to open for some reason, say a corrupt HFile. The region may have, by then, opened other good HFiles in that store or other stores if it has more than one column family and their streams may leak if not closed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
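The cleanup pattern this fix calls for, closing whatever opened successfully before propagating the failure, can be sketched without HBase classes (all names hypothetical; HBase's actual code deals with Store and StoreFile objects):

```java
import java.io.Closeable;
import java.io.IOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.concurrent.Callable;

// Sketch of the cleanup pattern: if opening any store fails during region
// initialization, close the stores that already opened before rethrowing,
// so their underlying streams do not leak.
public class RegionOpenSketch {
    interface Store extends Closeable {}

    static List<Store> openAll(List<Callable<Store>> openers) throws Exception {
        List<Store> opened = new ArrayList<>();
        try {
            for (Callable<Store> o : openers) {
                opened.add(o.call());
            }
            return opened;
        } catch (Exception e) {
            // Roll back: release everything that opened before the failure.
            for (Store s : opened) {
                try { s.close(); } catch (IOException ignored) { }
            }
            throw e;
        }
    }

    public static void main(String[] args) {
        List<Store> closed = new ArrayList<>();
        Callable<Store> ok = () -> new Store() {
            @Override public void close() { closed.add(this); }
        };
        Callable<Store> bad = () -> { throw new IOException("corrupt HFile"); };
        try {
            openAll(Arrays.asList(ok, ok, bad));
        } catch (Exception expected) {
            System.out.println("closed " + closed.size() + " stores"); // closed 2 stores
        }
    }
}
```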
[jira] [Commented] (HBASE-10193) Cleanup HRegion if one of the store fails to open at region initialization
[ https://issues.apache.org/jira/browse/HBASE-10193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854125#comment-13854125 ] ramkrishna.s.vasudevan commented on HBASE-10193: Committed to 0.96, trunk and 0.94. Not able to commit to 0.98. Says access denied. [~anoop.hbase], [~apurtell] - could you pls commit to 0.98. Cleanup HRegion if one of the store fails to open at region initialization -- Key: HBASE-10193 URL: https://issues.apache.org/jira/browse/HBASE-10193 Project: HBase Issue Type: Bug Components: regionserver Affects Versions: 0.96.1, 0.94.14 Reporter: Aditya Kishore Assignee: Aditya Kishore Priority: Critical Fix For: 0.98.0, 0.94.15, 0.96.2, 0.99.0 Attachments: HBASE-10193.patch, HBASE-10193_0.94.patch, HBASE-10193_0.94_v2.patch, HBASE-10193_0.94_v3.patch, HBASE-10193_0.94_v4.patch, HBASE-10193_v2.patch, HBASE-10193_v3.patch, HBASE-10193_v4.patch While investigating a different issue, I realized that the fix for HBASE-9737 is not sufficient to prevent resource leak if a region fails to open for some reason, say a corrupt HFile. The region may have, by then, opened other good HFiles in that store or other stores if it has more than one column family and their streams may leak if not closed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10206) Explain tags in the hbase book
[ https://issues.apache.org/jira/browse/HBASE-10206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854128#comment-13854128 ] ramkrishna.s.vasudevan commented on HBASE-10206: Committed to trunk. Need to commit to 0.98. So leaving it open. [~anoop.hbase],[~apurtell] - Could you pls commit this to 0.98? Explain tags in the hbase book -- Key: HBASE-10206 URL: https://issues.apache.org/jira/browse/HBASE-10206 Project: HBase Issue Type: Task Components: documentation Affects Versions: 0.98.0, 0.99.0 Reporter: ramkrishna.s.vasudevan Assignee: ramkrishna.s.vasudevan Fix For: 0.98.0 Attachments: HBASE-10206.patch, HBASE-10206.patch -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-9721) RegionServer should not accept regionOpen RPC intended for another(previous) server
[ https://issues.apache.org/jira/browse/HBASE-9721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854135#comment-13854135 ] Jimmy Xiang commented on HBASE-9721: bq. should the RS check both the znode version and data before open the region? I think I prefer to put the sn (or just the startcode?) in the RPC as this patch does since we may do assignment without ZK later on. RegionServer should not accept regionOpen RPC intended for another(previous) server --- Key: HBASE-9721 URL: https://issues.apache.org/jira/browse/HBASE-9721 Project: HBase Issue Type: Bug Affects Versions: 0.98.0, 0.96.0 Reporter: Enis Soztutar Assignee: Enis Soztutar Attachments: hbase-9721_v0.patch, hbase-9721_v1.patch, hbase-9721_v2.patch On a test cluster, this following events happened with ITBLL and CM leading to meta being unavailable until master is restarted. An RS carrying meta died, and master assigned the region to one of the RSs. {code} 2013-10-03 23:30:06,611 INFO [MASTER_META_SERVER_OPERATIONS-gs-hdp2-secure-1380781860-hbase-12:6-1] master.AssignmentManager: Assigning hbase:meta,,1.1588230740 to gs-hdp2-secure-1380781860-hbase-5.cs1cloud.internal,60020,1380842900820 2013-10-03 23:30:06,611 INFO [MASTER_META_SERVER_OPERATIONS-gs-hdp2-secure-1380781860-hbase-12:6-1] master.RegionStates: Transitioned {1588230740 state=OFFLINE, ts=1380843006601, server=null} to {1588230740 state=PENDING_OPEN, ts=1380843006611, server=gs-hdp2-secure-1380781860-hbase-5.cs1cloud.internal,60020,1380842900820} 2013-10-03 23:30:06,611 DEBUG [MASTER_META_SERVER_OPERATIONS-gs-hdp2-secure-1380781860-hbase-12:6-1] master.ServerManager: New admin connection to gs-hdp2-secure-1380781860-hbase-5.cs1cloud.internal,60020,1380842900820 {code} At the same time, the RS that meta recently got assigned also died (due to CM), and restarted: {code} 2013-10-03 23:30:07,636 DEBUG [RpcServer.handler=17,port=6] master.ServerManager: REPORT: Server 
gs-hdp2-secure-1380781860-hbase-8.cs1cloud.internal,60020,1380843002494 came back up, removed it from the dead servers list 2013-10-03 23:30:08,769 INFO [RpcServer.handler=18,port=6] master.ServerManager: Triggering server recovery; existingServer gs-hdp2-secure-1380781860-hbase-5.cs1cloud.internal,60020,1380842900820 looks stale, new server:gs-hdp2-secure-1380781860-hbase-5.cs1cloud.internal,60020,1380843006362 2013-10-03 23:30:08,771 DEBUG [RpcServer.handler=18,port=6] master.AssignmentManager: Checking region=hbase:meta,,1.1588230740, zk server=gs-hdp2-secure-1380781860-hbase-5.cs1cloud.internal,60020,1380842900820 current=gs-hdp2-secure-1380781860-hbase-5.cs1cloud.internal,60020,1380842900820, matches=true 2013-10-03 23:30:08,771 DEBUG [RpcServer.handler=18,port=6] master.ServerManager: Added=gs-hdp2-secure-1380781860-hbase-5.cs1cloud.internal,60020,1380842900820 to dead servers, submitted shutdown handler to be executed meta=true 2013-10-03 23:30:08,771 INFO [RpcServer.handler=18,port=6] master.ServerManager: Registering server=gs-hdp2-secure-1380781860-hbase-5.cs1cloud.internal,60020,1380843006362 2013-10-03 23:30:08,772 INFO [MASTER_META_SERVER_OPERATIONS-gs-hdp2-secure-1380781860-hbase-12:6-2] handler.MetaServerShutdownHandler: Splitting hbase:meta logs for gs-hdp2-secure-1380781860-hbase-5.cs1cloud.internal,60020,1380842900820 {code} AM/SSH sees that the RS that died was carrying meta, but the assignment RPC request was still not sent: {code} 2013-10-03 23:30:08,791 DEBUG [MASTER_META_SERVER_OPERATIONS-gs-hdp2-secure-1380781860-hbase-12:6-2] master.AssignmentManager: Checking region=hbase:meta,,1.1588230740, zk server=gs-hdp2-secure-1380781860-hbase-5.cs1cloud.internal,60020,1380842900820 current=gs-hdp2-secure-1380781860-hbase-5.cs1cloud.internal,60020,1380842900820, matches=true 2013-10-03 23:30:08,791 INFO [MASTER_META_SERVER_OPERATIONS-gs-hdp2-secure-1380781860-hbase-12:6-2] handler.MetaServerShutdownHandler: Server 
gs-hdp2-secure-1380781860-hbase-5.cs1cloud.internal,60020,1380842900820 was carrying META. Trying to assign. 2013-10-03 23:30:08,791 DEBUG [MASTER_META_SERVER_OPERATIONS-gs-hdp2-secure-1380781860-hbase-12:6-2] master.RegionStates: Offline 1588230740 with current state=PENDING_OPEN, expected state=OFFLINE/SPLITTING/MERGING 2013-10-03 23:30:08,791 INFO [MASTER_META_SERVER_OPERATIONS-gs-hdp2-secure-1380781860-hbase-12:6-2] master.RegionStates: Transitioned {1588230740 state=PENDING_OPEN, ts=1380843006611, server=gs-hdp2-secure-1380781860-hbase-5.cs1cloud.internal,60020,1380842900820} to {1588230740 state=OFFLINE, ts=1380843008791, server=null}
[jira] [Commented] (HBASE-10173) Need HFile version check in security coprocessors
[ https://issues.apache.org/jira/browse/HBASE-10173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854138#comment-13854138 ] ramkrishna.s.vasudevan commented on HBASE-10173: Sorry, took some time to understand. So the ACL region came in after some other region had come up first, and that is why we are moving the check to start(). Correct? Need HFile version check in security coprocessors - Key: HBASE-10173 URL: https://issues.apache.org/jira/browse/HBASE-10173 Project: HBase Issue Type: Improvement Components: security Affects Versions: 0.98.0, 0.99.0 Reporter: Anoop Sam John Assignee: Andrew Purtell Priority: Critical Fix For: 0.98.0, 0.99.0 Attachments: 10173.patch, 10173.patch, HBASE-10173.patch, HBASE-10173_Addendum.patch, HBASE-10173_partial.patch Cell level visibility labels are stored as cell tags, so HFile V3 is the minimum version that can support this feature. Better to have a version check in VisibilityController: if someone uses this CP with HFile version V2, we should throw an error. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Reopened] (HBASE-10173) Need HFile version check in security coprocessors
[ https://issues.apache.org/jira/browse/HBASE-10173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John reopened HBASE-10173: Need HFile version check in security coprocessors - Key: HBASE-10173 URL: https://issues.apache.org/jira/browse/HBASE-10173 Project: HBase Issue Type: Improvement Components: security Affects Versions: 0.98.0, 0.99.0 Reporter: Anoop Sam John Assignee: Andrew Purtell Priority: Critical Fix For: 0.98.0, 0.99.0 Attachments: 10173.patch, 10173.patch, HBASE-10173.patch, HBASE-10173_Addendum.patch, HBASE-10173_partial.patch Cell level visibility labels are stored as cell tags, so HFile V3 is the minimum version that can support this feature. Better to have a version check in VisibilityController: if someone uses this CP with HFile version V2, we should throw an error. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10206) Explain tags in the hbase book
[ https://issues.apache.org/jira/browse/HBASE-10206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854145#comment-13854145 ] Andrew Purtell commented on HBASE-10206: On another issue [~stack] mentioned that he copies the current trunk docs to branch and commits that just before a RC - did I remember that correctly? Seems a fine way to do it for now because the doc for branch is the same as trunk. Explain tags in the hbase book -- Key: HBASE-10206 URL: https://issues.apache.org/jira/browse/HBASE-10206 Project: HBase Issue Type: Task Components: documentation Affects Versions: 0.98.0, 0.99.0 Reporter: ramkrishna.s.vasudevan Assignee: ramkrishna.s.vasudevan Fix For: 0.98.0 Attachments: HBASE-10206.patch, HBASE-10206.patch -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10173) Need HFile version check in security coprocessors
[ https://issues.apache.org/jira/browse/HBASE-10173?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854144#comment-13854144 ] Anoop Sam John commented on HBASE-10173: Not exactly, Ram... initialize() will be called on the CP object created for the _acl_ region. For the other regions it is not called, so the boolean is always false :( Moving this to start() will make sure that the check is done for all regions. Need HFile version check in security coprocessors - Key: HBASE-10173 URL: https://issues.apache.org/jira/browse/HBASE-10173 Project: HBase Issue Type: Improvement Components: security Affects Versions: 0.98.0, 0.99.0 Reporter: Anoop Sam John Assignee: Andrew Purtell Priority: Critical Fix For: 0.98.0, 0.99.0 Attachments: 10173.patch, 10173.patch, HBASE-10173.patch, HBASE-10173_Addendum.patch, HBASE-10173_partial.patch Cell level visibility labels are stored as cell tags, so HFile V3 is the minimum version that can support this feature. Better to have a version check in VisibilityController: if someone uses this CP with HFile version V2, we should throw an error. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
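The lifecycle difference can be modeled without HBase classes: a hook that fires only for the _acl_ region leaves the flag false everywhere else, while one that fires per coprocessor instance sets it uniformly (a toy model, all names hypothetical):

```java
import java.util.Arrays;
import java.util.List;

// Toy model of the lifecycle bug: the initialize() hook ran only for the
// _acl_ region's coprocessor instance, so the "version checked" flag stayed
// false on every other region. Doing the check in start(), which runs for
// every coprocessor instance, covers all regions.
public class CpLifecycleModel {
    boolean versionChecked = false;

    void start() {                    // fires for every region's CP instance
        versionChecked = true;
    }

    void initialize(String region) {  // fired only for the _acl_ region
        if (region.equals("_acl_")) {
            versionChecked = true;
        }
    }

    static long checkedCount(List<String> regions, boolean checkInStart) {
        return regions.stream().map(r -> {
            CpLifecycleModel cp = new CpLifecycleModel();
            if (checkInStart) {
                cp.start();
            } else {
                cp.initialize(r);
            }
            return cp;
        }).filter(cp -> cp.versionChecked).count();
    }

    public static void main(String[] args) {
        List<String> regions = Arrays.asList("_acl_", "t1,r1", "t2,r1");
        System.out.println(checkedCount(regions, false)); // buggy path: 1
        System.out.println(checkedCount(regions, true));  // fixed path: 3
    }
}
```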
[jira] [Resolved] (HBASE-10173) Need HFile version check in security coprocessors
[ https://issues.apache.org/jira/browse/HBASE-10173?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John resolved HBASE-10173. Resolution: Fixed Committed the addendum to 0.98 and trunk. Need HFile version check in security coprocessors - Key: HBASE-10173 URL: https://issues.apache.org/jira/browse/HBASE-10173 Project: HBase Issue Type: Improvement Components: security Affects Versions: 0.98.0, 0.99.0 Reporter: Anoop Sam John Assignee: Andrew Purtell Priority: Critical Fix For: 0.98.0, 0.99.0 Attachments: 10173.patch, 10173.patch, HBASE-10173.patch, HBASE-10173_Addendum.patch, HBASE-10173_partial.patch Cell level visibility labels are stored as cell tags, so HFile V3 is the minimum version that can support this feature. Better to have a version check in VisibilityController: if someone uses this CP with HFile version V2, we should throw an error. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10161) [AccessController] Tolerate regions in recovery
[ https://issues.apache.org/jira/browse/HBASE-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854146#comment-13854146 ] Anoop Sam John commented on HBASE-10161: Test failure is addressed by the addendum committed to HBASE-10173. [AccessController] Tolerate regions in recovery --- Key: HBASE-10161 URL: https://issues.apache.org/jira/browse/HBASE-10161 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Andrew Purtell Assignee: Anoop Sam John Priority: Blocker Fix For: 0.98.0, 0.96.2, 0.99.0 Attachments: HBASE-10161.patch, HBASE-10161_V2.patch AccessController fixes for the issue also affecting VisibilityController described on HBASE-10148. Coprocessors that initialize in postOpen upcalls must check if the region is still in recovery and defer initialization until recovery is complete. We need to add a new CP hook for post recovery upcalls and modify existing CPs to defer initialization until this new hook as needed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10193) Cleanup HRegion if one of the store fails to open at region initialization
[ https://issues.apache.org/jira/browse/HBASE-10193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854151#comment-13854151 ] Anoop Sam John commented on HBASE-10193: Committed to 0.98 branch as well.. Cleanup HRegion if one of the store fails to open at region initialization -- Key: HBASE-10193 URL: https://issues.apache.org/jira/browse/HBASE-10193 Project: HBase Issue Type: Bug Components: regionserver Affects Versions: 0.96.1, 0.94.14 Reporter: Aditya Kishore Assignee: Aditya Kishore Priority: Critical Fix For: 0.98.0, 0.94.15, 0.96.2, 0.99.0 Attachments: HBASE-10193.patch, HBASE-10193_0.94.patch, HBASE-10193_0.94_v2.patch, HBASE-10193_0.94_v3.patch, HBASE-10193_0.94_v4.patch, HBASE-10193_v2.patch, HBASE-10193_v3.patch, HBASE-10193_v4.patch While investigating a different issue, I realized that the fix for HBASE-9737 is not sufficient to prevent resource leak if a region fails to open for some reason, say a corrupt HFile. The region may have, by then, opened other good HFiles in that store or other stores if it has more than one column family and their streams may leak if not closed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Comment Edited] (HBASE-10193) Cleanup HRegion if one of the store fails to open at region initialization
[ https://issues.apache.org/jira/browse/HBASE-10193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854151#comment-13854151 ] Anoop Sam John edited comment on HBASE-10193 at 12/20/13 4:55 PM: -- Committed to 0.98 branch as well.. Thanks for the patch Aditya. Thanks Ted and Ram for the reviews was (Author: anoop.hbase): Committed to 0.98 branch as well.. Cleanup HRegion if one of the store fails to open at region initialization -- Key: HBASE-10193 URL: https://issues.apache.org/jira/browse/HBASE-10193 Project: HBase Issue Type: Bug Components: regionserver Affects Versions: 0.96.1, 0.94.14 Reporter: Aditya Kishore Assignee: Aditya Kishore Priority: Critical Fix For: 0.98.0, 0.94.15, 0.96.2, 0.99.0 Attachments: HBASE-10193.patch, HBASE-10193_0.94.patch, HBASE-10193_0.94_v2.patch, HBASE-10193_0.94_v3.patch, HBASE-10193_0.94_v4.patch, HBASE-10193_v2.patch, HBASE-10193_v3.patch, HBASE-10193_v4.patch While investigating a different issue, I realized that the fix for HBASE-9737 is not sufficient to prevent resource leak if a region fails to open for some reason, say a corrupt HFile. The region may have, by then, opened other good HFiles in that store or other stores if it has more than one column family and their streams may leak if not closed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-7781) Update security unit tests to use a KDC if available
[ https://issues.apache.org/jira/browse/HBASE-7781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854153#comment-13854153 ] Andrew Purtell commented on HBASE-7781: --- Have a look at the utility classes (and tests) under hbase-server src/test in org.apache.hadoop.hbase.security. We want helpers that allow a test writer to start a mini KDC. I don't think we can depend on Hadoop's mini KDC module yet until it is in a release. Then it would be nice to have an integration test that starts up the mini KDC and uses it if running in a minicluster configuration. Update security unit tests to use a KDC if available Key: HBASE-7781 URL: https://issues.apache.org/jira/browse/HBASE-7781 Project: HBase Issue Type: Test Components: security, test Reporter: Gary Helmling Assignee: ramkrishna.s.vasudevan Priority: Blocker Fix For: 0.98.0 We currently have large holes in the test coverage of HBase with security enabled. Two recent examples of bugs which really should have been caught with testing are HBASE-7771 and HBASE-7772. The long standing problem with testing with security enabled has been the requirement for supporting kerberos infrastructure. We need to close this gap and provide some automated testing with security enabled, if necessary standing up and provisioning a temporary KDC as an option for running integration tests, see HADOOP-8078 and HADOOP-9004 where a similar approach was taken. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Resolved] (HBASE-10193) Cleanup HRegion if one of the store fails to open at region initialization
[ https://issues.apache.org/jira/browse/HBASE-10193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John resolved HBASE-10193. Resolution: Fixed Fix Version/s: (was: 0.94.15) 0.94.16 Hadoop Flags: Reviewed Cleanup HRegion if one of the store fails to open at region initialization -- Key: HBASE-10193 URL: https://issues.apache.org/jira/browse/HBASE-10193 Project: HBase Issue Type: Bug Components: regionserver Affects Versions: 0.96.1, 0.94.14 Reporter: Aditya Kishore Assignee: Aditya Kishore Priority: Critical Fix For: 0.98.0, 0.94.16, 0.96.2, 0.99.0 Attachments: HBASE-10193.patch, HBASE-10193_0.94.patch, HBASE-10193_0.94_v2.patch, HBASE-10193_0.94_v3.patch, HBASE-10193_0.94_v4.patch, HBASE-10193_v2.patch, HBASE-10193_v3.patch, HBASE-10193_v4.patch While investigating a different issue, I realized that the fix for HBASE-9737 is not sufficient to prevent resource leak if a region fails to open for some reason, say a corrupt HFile. The region may have, by then, opened other good HFiles in that store or other stores if it has more than one column family and their streams may leak if not closed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10193) Cleanup HRegion if one of the store fails to open at region initialization
[ https://issues.apache.org/jira/browse/HBASE-10193?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854154#comment-13854154 ] Andrew Purtell commented on HBASE-10193: Thanks Anoop. Local permissions problem Ram? Your access in the repo should be fine. I just did a svn copy from trunk to create the branch, nothing unusual there. Cleanup HRegion if one of the store fails to open at region initialization -- Key: HBASE-10193 URL: https://issues.apache.org/jira/browse/HBASE-10193 Project: HBase Issue Type: Bug Components: regionserver Affects Versions: 0.96.1, 0.94.14 Reporter: Aditya Kishore Assignee: Aditya Kishore Priority: Critical Fix For: 0.98.0, 0.94.16, 0.96.2, 0.99.0 Attachments: HBASE-10193.patch, HBASE-10193_0.94.patch, HBASE-10193_0.94_v2.patch, HBASE-10193_0.94_v3.patch, HBASE-10193_0.94_v4.patch, HBASE-10193_v2.patch, HBASE-10193_v3.patch, HBASE-10193_v4.patch While investigating a different issue, I realized that the fix for HBASE-9737 is not sufficient to prevent resource leak if a region fails to open for some reason, say a corrupt HFile. The region may have, by then, opened other good HFiles in that store or other stores if it has more than one column family and their streams may leak if not closed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10161) [AccessController] Tolerate regions in recovery
[ https://issues.apache.org/jira/browse/HBASE-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854156#comment-13854156 ] Andrew Purtell commented on HBASE-10161: +1 [AccessController] Tolerate regions in recovery --- Key: HBASE-10161 URL: https://issues.apache.org/jira/browse/HBASE-10161 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Andrew Purtell Assignee: Anoop Sam John Priority: Blocker Fix For: 0.98.0, 0.96.2, 0.99.0 Attachments: HBASE-10161.patch, HBASE-10161_V2.patch AccessController fixes for the issue also affecting VisibilityController described on HBASE-10148. Coprocessors that initialize in postOpen upcalls must check if the region is still in recovery and defer initialization until recovery is complete. We need to add a new CP hook for post recovery upcalls and modify existing CPs to defer initialization until this new hook as needed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10161) [AccessController] Tolerate regions in recovery
[ https://issues.apache.org/jira/browse/HBASE-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854160#comment-13854160 ] Andrew Purtell commented on HBASE-10161: One question. This part: {code}
Index: hbase-server/src/main/java/org/apache/hadoop/hbase/security/access/AccessControlLists.java
===================================================================
--- hbase-server/src/main/java/org/apache/hadoop/hbase/security/access/AccessControlLists.java (revision 1552489)
+++ hbase-server/src/main/java/org/apache/hadoop/hbase/security/access/AccessControlLists.java (working copy)
@@ -116,6 +116,8 @@
         Compression.Algorithm.NONE.getName(), true, true, 8 * 1024,
         HConstants.FOREVER, BloomType.NONE.toString(),
         HConstants.REPLICATION_SCOPE_LOCAL));
+    ACL_TABLEDESC.setValue(Bytes.toBytes(HConstants.DISALLOW_WRITES_IN_RECOVERING),
+      Bytes.toBytes(true));
   }
{code} For future use? [AccessController] Tolerate regions in recovery --- Key: HBASE-10161 URL: https://issues.apache.org/jira/browse/HBASE-10161 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Andrew Purtell Assignee: Anoop Sam John Priority: Blocker Fix For: 0.98.0, 0.96.2, 0.99.0 Attachments: HBASE-10161.patch, HBASE-10161_V2.patch AccessController fixes for the issue also affecting VisibilityController described on HBASE-10148. Coprocessors that initialize in postOpen upcalls must check if the region is still in recovery and defer initialization until recovery is complete. We need to add a new CP hook for post recovery upcalls and modify existing CPs to defer initialization until this new hook as needed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10206) Explain tags in the hbase book
[ https://issues.apache.org/jira/browse/HBASE-10206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13854158#comment-13854158 ] ramkrishna.s.vasudevan commented on HBASE-10206: Yes, but this doc should come only in 0.98, right? So better we commit there too. But as I said, I am not able to commit to 0.98 due to permission reasons. Explain tags in the hbase book -- Key: HBASE-10206 URL: https://issues.apache.org/jira/browse/HBASE-10206 Project: HBase Issue Type: Task Components: documentation Affects Versions: 0.98.0, 0.99.0 Reporter: ramkrishna.s.vasudevan Assignee: ramkrishna.s.vasudevan Fix For: 0.98.0 Attachments: HBASE-10206.patch, HBASE-10206.patch -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Updated] (HBASE-9151) HBCK cannot fix when meta server znode deleted, this can happen if all region servers stopped and there are no logs to split.
[ https://issues.apache.org/jira/browse/HBASE-9151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] rajeshbabu updated HBASE-9151: -- Attachment: HBASE-9151_v2.patch Fixed TestRSKilledWhenInitializing in the current patch. HBCK cannot fix when meta server znode deleted, this can happen if all region servers stopped and there are no logs to split. - Key: HBASE-9151 URL: https://issues.apache.org/jira/browse/HBASE-9151 Project: HBase Issue Type: Bug Components: hbck Reporter: rajeshbabu Assignee: rajeshbabu Fix For: 0.98.0, 0.99.0 Attachments: HBASE-9151.patch, HBASE-9151_v2.patch When the meta server znode is deleted and meta is in FAILED_OPEN state, hbck cannot fix it. This scenario can come when all region servers are stopped by the stop command and no RS is started within 10 secs (with default configurations).
{code}
public void assignMeta() throws KeeperException {
  MetaRegionTracker.deleteMetaLocation(this.watcher);
  assign(HRegionInfo.FIRST_META_REGIONINFO, true);
}
{code}
-- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-7781) Update security unit tests to use a KDC if available
[ https://issues.apache.org/jira/browse/HBASE-7781?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13854166#comment-13854166 ] ramkrishna.s.vasudevan commented on HBASE-7781: --- bq. Have a look at the utility classes (and tests) under hbase-server src/test in org.apache.hadoop.hbase.security Yes Andy. Exactly doing that. I understand that using MiniKDC we could define our own principals and add proper KDC configurations for the NN, DN, RS and master. Using that, the security tests should be running. I hope in test cases too, once security is enabled through Kerberos, a secure DN and secure NN start running. Let me see that. Update security unit tests to use a KDC if available Key: HBASE-7781 URL: https://issues.apache.org/jira/browse/HBASE-7781 Project: HBase Issue Type: Test Components: security, test Reporter: Gary Helmling Assignee: ramkrishna.s.vasudevan Priority: Blocker Fix For: 0.98.0 We currently have large holes in the test coverage of HBase with security enabled. Two recent examples of bugs which really should have been caught with testing are HBASE-7771 and HBASE-7772. The long standing problem with testing with security enabled has been the requirement for supporting kerberos infrastructure. We need to close this gap and provide some automated testing with security enabled, if necessary standing up and provisioning a temporary KDC as an option for running integration tests, see HADOOP-8078 and HADOOP-9004 where a similar approach was taken. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-9346) HBCK should provide an option to check if regions boundaries are the same in META and in stores.
[ https://issues.apache.org/jira/browse/HBASE-9346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13854177#comment-13854177 ] Andrew Purtell commented on HBASE-9346: --- +1 for 0.98 HBCK should provide an option to check if regions boundaries are the same in META and in stores. Key: HBASE-9346 URL: https://issues.apache.org/jira/browse/HBASE-9346 Project: HBase Issue Type: Bug Components: hbck, Operability Affects Versions: 0.94.14, 0.98.1, 0.99.0, 0.96.1.1 Reporter: Jean-Marc Spaggiari Assignee: Jean-Marc Spaggiari Attachments: HBASE-9346-v0-0.94.patch, HBASE-9346-v1-trunk.patch, HBASE-9346-v2-trunk.patch, HBASE-9346-v3-trunk.patch, HBASE-9346-v4-trunk.patch, HBASE-9346-v5-trunk.patch, HBASE-9346-v6-trunk.patch, HBASE-9346-v7-trunk.patch, HBASE-9346-v8-trunk.patch If META doesn't have the same region boundaries as the store files, writes and reads might go to the wrong place. We need to provide a way to check that within HBCK. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10216) Change HBase to support local compactions
[ https://issues.apache.org/jira/browse/HBASE-10216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854179#comment-13854179 ] David Witten commented on HBASE-10216: -- I'm no HDFS expert. But I had imagined that a data node, D, performing a merge would just do the merge with local files, then tell the name node that D has a replica for all the data blocks for the merged file. Change HBase to support local compactions - Key: HBASE-10216 URL: https://issues.apache.org/jira/browse/HBASE-10216 Project: HBase Issue Type: New Feature Components: Compaction Environment: All Reporter: David Witten As I understand it compactions will read data from DFS and write to DFS. This means that even when the reading occurs on the local host (because region server has a local copy) all the writing must go over the network to the other replicas. This proposal suggests that HBase would perform much better if all the reading and writing occurred locally and did not go over the network. I propose that the DFS interface be extended to provide method that would merge files so that the merging and deleting can be performed on local data nodes with no file contents moving over the network. The method would take a list of paths to be merged and deleted and the merged file path and an indication of a file-format-aware class that would be run on each data node to perform the merge. The merge method provided by this merging class would be passed files open for reading for all the files to be merged and one file open for writing. The custom class provided merge method would read all the input files and append to the output file using some standard API that would work across all DFS implementations. The DFS would ensure that the merge had happened properly on all replicas before returning to the caller. 
It could be that greater resiliency could be achieved by implementing the deletion as a separate phase that is only done after enough of the replicas had completed the merge. HBase would be changed to use the new merge method for compactions, and would provide an implementation of the merging class that works with HFiles. This proposal would require a custom code that understands the file format to be runnable by the data nodes to manage the merge. So there would need to be a facility to load classes into DFS if there isn't such a facility already. Or, less generally, HDFS could build in support for HFile merging. The merge method might be optional. If the DFS implementation did not provide it a generic version that performed the merge on top of the regular DFS interfaces would be used. It may be that this method needs to be tweaked or ignored when the region server does not have a local copy data so that, as happens currently, one copy of the data moves to the region server. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
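The merge call the proposal describes can be illustrated with plain local files. Everything below is hypothetical — LocalMergeSketch, FormatAwareMerger, and mergeLocally are invented names, and the concatenating merger stands in for a real HFile-aware implementation — it only sketches the shape of the API: a list of input paths, an output path, and a pluggable format-aware merge step, with deletion of the inputs afterward.

```java
import java.io.IOException;
import java.io.OutputStream;
import java.nio.file.Files;
import java.nio.file.Path;
import java.util.List;

// Hypothetical sketch of the proposed DFS merge API: inputs, output, and a
// format-aware merger class, all running against local files.
public class LocalMergeSketch {

    // The pluggable, format-aware piece; an HFile-aware implementation would
    // go here. This toy version just concatenates bytes.
    public interface FormatAwareMerger {
        void merge(List<Path> inputs, OutputStream out) throws IOException;
    }

    public static final FormatAwareMerger CONCAT = (inputs, out) -> {
        for (Path p : inputs) {
            Files.copy(p, out); // read each input locally, append to the output
        }
    };

    // Stand-in for the proposed DFS call: merge inputs into output, then
    // delete the inputs (the proposal notes deletion could be a later phase).
    public static void mergeLocally(List<Path> inputs, Path output, FormatAwareMerger merger)
            throws IOException {
        try (OutputStream out = Files.newOutputStream(output)) {
            merger.merge(inputs, out);
        }
        for (Path p : inputs) {
            Files.delete(p);
        }
    }

    public static void main(String[] args) throws IOException {
        Path a = Files.createTempFile("hfile-a", ".bin");
        Path b = Files.createTempFile("hfile-b", ".bin");
        Files.write(a, "row1\n".getBytes());
        Files.write(b, "row2\n".getBytes());
        Path merged = Files.createTempFile("merged", ".bin");
        mergeLocally(List.of(a, b), merged, CONCAT);
        String content = new String(Files.readAllBytes(merged));
        if (!content.equals("row1\nrow2\n")) throw new AssertionError(content);
        if (Files.exists(a) || Files.exists(b)) throw new AssertionError("inputs not deleted");
        System.out.println("merged " + content.length() + " bytes locally");
    }
}
```

In the real proposal the DFS, not the caller, would run the merger on every replica's datanode and confirm all replicas before returning; deleting inputs in a separate, later phase (as the description suggests) would give the resiliency margin mentioned above.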
[jira] [Commented] (HBASE-10161) [AccessController] Tolerate regions in recovery
[ https://issues.apache.org/jira/browse/HBASE-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13854185#comment-13854185 ] Anoop Sam John commented on HBASE-10161: We have initialized boolean checks now. I think I can remove this. Fine on that Andy? [AccessController] Tolerate regions in recovery --- Key: HBASE-10161 URL: https://issues.apache.org/jira/browse/HBASE-10161 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Andrew Purtell Assignee: Anoop Sam John Priority: Blocker Fix For: 0.98.0, 0.96.2, 0.99.0 Attachments: HBASE-10161.patch, HBASE-10161_V2.patch AccessController fixes for the issue also affecting VisibilityController described on HBASE-10148. Coprocessors that initialize in postOpen upcalls must check if the region is still in recovery and defer initialization until recovery is complete. We need to add a new CP hook for post recovery upcalls and modify existing CPs to defer initialization until this new hook as needed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10213) Add read log size per second metrics for replication source
[ https://issues.apache.org/jira/browse/HBASE-10213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13854189#comment-13854189 ] Andrew Purtell commented on HBASE-10213: To get a good HadoopQA result, it will need a patch against trunk.
{code}
index 3831bba..8315c3a 100644
--- src/main/java/org/apache/hadoop/hbase/replication/regionserver/ReplicationSource.java
+++ src/main/java/org/apache/hadoop/hbase/replication/regionserver/ReplicationSource.java
@@ -458,6 +458,7 @@ public class ReplicationSource extends Thread
       throws IOException {
     long seenEntries = 0;
     this.repLogReader.seek();
+    long persitionBeforeRead = this.repLogReader.getPosition();
     HLog.Entry entry = this.repLogReader.readNextAndSetPosition();
     while (entry != null) {
{code}
persitionBeforeRead should be positionBeforeRead.
{code}
index da0905c..e32a3bc 100644
--- src/main/java/org/apache/hadoop/hbase/replication/regionserver/ReplicationSourceMetrics.java
+++ src/main/java/org/apache/hadoop/hbase/replication/regionserver/ReplicationSourceMetrics.java
@@ -66,6 +66,9 @@ public class ReplicationSourceMetrics implements Updater {
    */
   public final MetricsIntValue sizeOfLogQueue = new MetricsIntValue("sizeOfLogQueue", registry);
+
+  /** Rate of log entries read by the source */
+  public MetricsRate logReadRateInByte = new MetricsRate("logReadRateInByte", registry);
{code}
The usual convention for names with units is to pluralize the unit, so logReadRateInBytes. Add read log size per second metrics for replication source --- Key: HBASE-10213 URL: https://issues.apache.org/jira/browse/HBASE-10213 Project: HBase Issue Type: Improvement Components: metrics, Replication Affects Versions: 0.94.14 Reporter: cuijianwei Assignee: cuijianwei Priority: Minor Attachments: HBASE-10213-0.94-v1.patch The current metrics of replication source contain logEditsReadRate, shippedBatchesRate, etc, which could indicate how fast the data is replicated to the peer cluster to some extent.
However, it is not clear enough to know how many bytes replicating to peer cluster from these metrics. In production environment, it may be important to know the size of replicating data per second because the services may be affected if the network become busy. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
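The position-delta accounting the patch above introduces (positionBeforeRead) can be sketched in isolation. This is a toy stand-in, not HBase code: CountingReader imitates the getPosition()/readNextAndSetPosition() shape of repLogReader, and the delta is what would feed a bytes-rate metric such as the proposed logReadRateInBytes.

```java
import java.io.ByteArrayInputStream;
import java.io.DataInputStream;
import java.io.IOException;

// Toy illustration of the patch's accounting: record the reader's position
// before consuming entries, then report (positionAfter - positionBefore)
// as the number of log bytes read in that batch.
public class PositionDeltaMetric {

    public static class CountingReader {
        private final DataInputStream in;
        private long position = 0;

        public CountingReader(byte[] log) {
            this.in = new DataInputStream(new ByteArrayInputStream(log));
        }

        public long getPosition() { return position; }

        // read one fixed-size "entry" (8 bytes), advancing the position
        public Long readNextAndSetPosition() throws IOException {
            if (in.available() < 8) return null;
            long v = in.readLong();
            position += 8;
            return v;
        }
    }

    // returns the number of bytes consumed while draining the log
    public static long bytesReadDraining(CountingReader reader) throws IOException {
        long positionBeforeRead = reader.getPosition();   // as in the patch
        while (reader.readNextAndSetPosition() != null) { /* ship entries */ }
        return reader.getPosition() - positionBeforeRead; // feed the bytes-rate metric
    }

    public static void main(String[] args) throws IOException {
        byte[] log = new byte[8 * 3]; // three 8-byte entries
        long read = bytesReadDraining(new CountingReader(log));
        if (read != 24) throw new AssertionError(read);
        System.out.println("bytes read this batch: " + read);
    }
}
```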
[jira] [Commented] (HBASE-10161) [AccessController] Tolerate regions in recovery
[ https://issues.apache.org/jira/browse/HBASE-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854191#comment-13854191 ] Andrew Purtell commented on HBASE-10161: bq. I think I can remove this. Fine on that Andy? Sure [AccessController] Tolerate regions in recovery --- Key: HBASE-10161 URL: https://issues.apache.org/jira/browse/HBASE-10161 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Andrew Purtell Assignee: Anoop Sam John Priority: Blocker Fix For: 0.98.0, 0.96.2, 0.99.0 Attachments: HBASE-10161.patch, HBASE-10161_V2.patch AccessController fixes for the issue also affecting VisibilityController described on HBASE-10148. Coprocessors that initialize in postOpen upcalls must check if the region is still in recovery and defer initialization until recovery is complete. We need to add a new CP hook for post recovery upcalls and modify existing CPs to defer initialization until this new hook as needed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10161) [AccessController] Tolerate regions in recovery
[ https://issues.apache.org/jira/browse/HBASE-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854198#comment-13854198 ] Anoop Sam John commented on HBASE-10161: V3 which avoids the change in AccessControlLists. Going to commit now [AccessController] Tolerate regions in recovery --- Key: HBASE-10161 URL: https://issues.apache.org/jira/browse/HBASE-10161 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Andrew Purtell Assignee: Anoop Sam John Priority: Blocker Fix For: 0.98.0, 0.96.2, 0.99.0 Attachments: HBASE-10161.patch, HBASE-10161_V2.patch, HBASE-10161_V3.patch AccessController fixes for the issue also affecting VisibilityController described on HBASE-10148. Coprocessors that initialize in postOpen upcalls must check if the region is still in recovery and defer initialization until recovery is complete. We need to add a new CP hook for post recovery upcalls and modify existing CPs to defer initialization until this new hook as needed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Updated] (HBASE-10161) [AccessController] Tolerate regions in recovery
[ https://issues.apache.org/jira/browse/HBASE-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anoop Sam John updated HBASE-10161: --- Attachment: HBASE-10161_V3.patch [AccessController] Tolerate regions in recovery --- Key: HBASE-10161 URL: https://issues.apache.org/jira/browse/HBASE-10161 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Andrew Purtell Assignee: Anoop Sam John Priority: Blocker Fix For: 0.98.0, 0.96.2, 0.99.0 Attachments: HBASE-10161.patch, HBASE-10161_V2.patch, HBASE-10161_V3.patch AccessController fixes for the issue also affecting VisibilityController described on HBASE-10148. Coprocessors that initialize in postOpen upcalls must check if the region is still in recovery and defer initialization until recovery is complete. We need to add a new CP hook for post recovery upcalls and modify existing CPs to defer initialization until this new hook as needed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10161) [AccessController] Tolerate regions in recovery
[ https://issues.apache.org/jira/browse/HBASE-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854206#comment-13854206 ] Anoop Sam John commented on HBASE-10161: Ping [~stack]. This is required in 96 branch also. Pls +1 [AccessController] Tolerate regions in recovery --- Key: HBASE-10161 URL: https://issues.apache.org/jira/browse/HBASE-10161 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Andrew Purtell Assignee: Anoop Sam John Priority: Blocker Fix For: 0.98.0, 0.96.2, 0.99.0 Attachments: HBASE-10161.patch, HBASE-10161_V2.patch, HBASE-10161_V3.patch AccessController fixes for the issue also affecting VisibilityController described on HBASE-10148. Coprocessors that initialize in postOpen upcalls must check if the region is still in recovery and defer initialization until recovery is complete. We need to add a new CP hook for post recovery upcalls and modify existing CPs to defer initialization until this new hook as needed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10047) postScannerFilterRow consumes a lot of CPU in tall table scans
[ https://issues.apache.org/jira/browse/HBASE-10047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854211#comment-13854211 ] Andrew Purtell commented on HBASE-10047: The set of installed coprocessors can change at runtime concurrent with iteration of the list. postScannerFilterRow consumes a lot of CPU in tall table scans -- Key: HBASE-10047 URL: https://issues.apache.org/jira/browse/HBASE-10047 Project: HBase Issue Type: Bug Reporter: Lars Hofhansl Attachments: 10047-0.94-sample-v2.txt, 10047-0.94-sample.txt, postScannerFilterRow.png Continuing my profiling quest, I find that in scanning tall table (and filtering everything on the server) a quarter of the time is now spent in the postScannerFilterRow coprocessor hook. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
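The concern above — the installed coprocessor set mutating at runtime while it is being iterated — is what copy-on-write collections address. A minimal stdlib sketch (java.util.concurrent.CopyOnWriteArrayList standing in for HBase's SortedCopyOnWriteSet) shows that an iterator keeps working over its snapshot even when an element is removed concurrently:

```java
import java.util.Iterator;
import java.util.concurrent.CopyOnWriteArrayList;

// Copy-on-write collections iterate over a snapshot taken when the iterator
// is created, so a concurrent removal (e.g. unloading an errant coprocessor)
// never throws ConcurrentModificationException mid-scan.
public class CowIterationDemo {
    public static void main(String[] args) {
        CopyOnWriteArrayList<String> coprocs = new CopyOnWriteArrayList<>();
        coprocs.add("AccessController");
        coprocs.add("VisibilityController");

        Iterator<String> it = coprocs.iterator(); // snapshot taken here
        coprocs.remove("AccessController");       // concurrent removal, no CME

        int seen = 0;
        while (it.hasNext()) { it.next(); seen++; }
        if (seen != 2) throw new AssertionError(seen);       // snapshot still has both
        if (coprocs.size() != 1) throw new AssertionError(); // live list shrank
        System.out.println("iterated snapshot of " + seen + ", live size " + coprocs.size());
    }
}
```

The trade-off is that every mutation copies the backing array, which is why this structure fits rarely-changing, frequently-iterated lists like installed coprocessors — and also why iteration itself stays cheap enough to matter in a hot path like postScannerFilterRow.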
[jira] [Commented] (HBASE-10206) Explain tags in the hbase book
[ https://issues.apache.org/jira/browse/HBASE-10206?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854221#comment-13854221 ] Andrew Purtell commented on HBASE-10206: Right, so when prepping the RC I can copy the entire manual over from trunk, we don't have to bring commits to the manual to the branch piece by piece. Explain tags in the hbase book -- Key: HBASE-10206 URL: https://issues.apache.org/jira/browse/HBASE-10206 Project: HBase Issue Type: Task Components: documentation Affects Versions: 0.98.0, 0.99.0 Reporter: ramkrishna.s.vasudevan Assignee: ramkrishna.s.vasudevan Fix For: 0.98.0 Attachments: HBASE-10206.patch, HBASE-10206.patch -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10216) Change HBase to support local compactions
[ https://issues.apache.org/jira/browse/HBASE-10216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13854261#comment-13854261 ] haosdent commented on HBASE-10216: -- I don't think local compaction is feasible. HDFS stores HFiles as many blocks, and these blocks have a fixed size. Providing a method to merge files in HDFS may not bring an outstanding improvement. In other words, HDFS local reads may be enough for this. Change HBase to support local compactions - Key: HBASE-10216 URL: https://issues.apache.org/jira/browse/HBASE-10216 Project: HBase Issue Type: New Feature Components: Compaction Environment: All Reporter: David Witten As I understand it compactions will read data from DFS and write to DFS. This means that even when the reading occurs on the local host (because region server has a local copy) all the writing must go over the network to the other replicas. This proposal suggests that HBase would perform much better if all the reading and writing occurred locally and did not go over the network. I propose that the DFS interface be extended to provide method that would merge files so that the merging and deleting can be performed on local data nodes with no file contents moving over the network. The method would take a list of paths to be merged and deleted and the merged file path and an indication of a file-format-aware class that would be run on each data node to perform the merge. The merge method provided by this merging class would be passed files open for reading for all the files to be merged and one file open for writing. The custom class provided merge method would read all the input files and append to the output file using some standard API that would work across all DFS implementations. The DFS would ensure that the merge had happened properly on all replicas before returning to the caller. 
It could be that greater resiliency could be achieved by implementing the deletion as a separate phase that is only done after enough of the replicas had completed the merge. HBase would be changed to use the new merge method for compactions, and would provide an implementation of the merging class that works with HFiles. This proposal would require a custom code that understands the file format to be runnable by the data nodes to manage the merge. So there would need to be a facility to load classes into DFS if there isn't such a facility already. Or, less generally, HDFS could build in support for HFile merging. The merge method might be optional. If the DFS implementation did not provide it a generic version that performed the merge on top of the regular DFS interfaces would be used. It may be that this method needs to be tweaked or ignored when the region server does not have a local copy data so that, as happens currently, one copy of the data moves to the region server. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10161) [AccessController] Tolerate regions in recovery
[ https://issues.apache.org/jira/browse/HBASE-10161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854311#comment-13854311 ] Anoop Sam John commented on HBASE-10161: Committed to 0.98 and Trunk. Will add to 0.96 as well once Stack gives a go. [AccessController] Tolerate regions in recovery --- Key: HBASE-10161 URL: https://issues.apache.org/jira/browse/HBASE-10161 Project: HBase Issue Type: Bug Affects Versions: 0.96.0 Reporter: Andrew Purtell Assignee: Anoop Sam John Priority: Blocker Fix For: 0.98.0, 0.96.2, 0.99.0 Attachments: HBASE-10161.patch, HBASE-10161_V2.patch, HBASE-10161_V3.patch AccessController fixes for the issue also affecting VisibilityController described on HBASE-10148. Coprocessors that initialize in postOpen upcalls must check if the region is still in recovery and defer initialization until recovery is complete. We need to add a new CP hook for post recovery upcalls and modify existing CPs to defer initialization until this new hook as needed. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-9648) collection one expired storefile causes it to be replaced by another expired storefile
[ https://issues.apache.org/jira/browse/HBASE-9648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13854374#comment-13854374 ] Sergey Shelukhin commented on HBASE-9648: - Just clarifying, why is it hard to create the writer when it is needed, i.e. for the case when there are seemingly no KVs when you were creating the writer? I think coprocs cannot screw up the seqIds, because the set of files is already chosen, so that should be ok. collection one expired storefile causes it to be replaced by another expired storefile -- Key: HBASE-9648 URL: https://issues.apache.org/jira/browse/HBASE-9648 Project: HBase Issue Type: Bug Components: Compaction Reporter: Sergey Shelukhin Assignee: Jean-Marc Spaggiari Attachments: HBASE-9648-v0-0.94.patch, HBASE-9648-v0-trunk.patch, HBASE-9648-v1-trunk.patch, HBASE-9648-v2-trunk.patch, HBASE-9648-v3-trunk.patch, HBASE-9648.patch There's a shortcut in compaction selection that causes expired store files to be selected for quick deletion. However, there's also the code that ensures we write at least one file to preserve seqnum. This new empty file is expired, because it has no data, presumably. So it's collected again, etc. This affects 94, probably also 96. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
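The feedback loop in the description can be modeled in a few lines. This is a toy simulation, not HBase's compaction code — ToyStoreFile, isExpired, and compactExpired are invented names, and treating an empty file as expired mirrors the "presumably" in the description: each pass selects the expired file, compaction emits an empty placeholder to preserve the seqnum, and the placeholder is immediately eligible again.

```java
import java.util.ArrayList;
import java.util.List;

// Toy model of the expired-storefile loop: selection picks files past TTL,
// compaction keeps one empty output to preserve seqnum, and that empty
// output looks expired on the very next selection pass.
public class ExpiredSelectionLoop {

    public static class ToyStoreFile {
        final long maxTimestamp; // newest cell ts; irrelevant when empty
        final boolean empty;
        public ToyStoreFile(long maxTimestamp, boolean empty) {
            this.maxTimestamp = maxTimestamp;
            this.empty = empty;
        }
    }

    // the "expired" shortcut: a file with no live data is trivially expired
    public static boolean isExpired(ToyStoreFile f, long now, long ttl) {
        return f.empty || f.maxTimestamp < now - ttl;
    }

    // compacting only expired files: drop their cells but keep one (empty)
    // output file so the store's max seqnum is preserved
    public static ToyStoreFile compactExpired(List<ToyStoreFile> expired) {
        return new ToyStoreFile(Long.MIN_VALUE, true);
    }

    public static void main(String[] args) {
        long now = 1000, ttl = 100;
        List<ToyStoreFile> store = new ArrayList<>();
        store.add(new ToyStoreFile(800, false)); // data older than now - ttl
        int selections = 0;
        for (int pass = 0; pass < 3; pass++) {
            ToyStoreFile f = store.get(0);
            if (isExpired(f, now, ttl)) {
                selections++;                       // selected again...
                store.set(0, compactExpired(store)); // ...and replaced by another expired file
            }
        }
        if (selections != 3) throw new AssertionError(selections);
        System.out.println("selected for compaction " + selections + " times in 3 passes");
    }
}
```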
[jira] [Commented] (HBASE-10175) 2-thread ChaosMonkey steps on its own toes
[ https://issues.apache.org/jira/browse/HBASE-10175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854375#comment-13854375 ] Sergey Shelukhin commented on HBASE-10175: -- I don't think test failure can be related. [~enis] you want to review? 2-thread ChaosMonkey steps on its own toes -- Key: HBASE-10175 URL: https://issues.apache.org/jira/browse/HBASE-10175 Project: HBase Issue Type: Improvement Components: test Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Priority: Minor Attachments: HBASE-10175.patch ChaosMonkey with one destructive and one volatility (flush-compact-split-etc.) threads steps on its own toes and logs a lot of exceptions. A simple solution would be to catch most (or all), like NotServingRegionException, and log less (not a full callstack for example, it's not very useful anyway). A more complicated/complementary one would be to keep track which regions the destructive thread affects and use other regions for volatile one. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-8558) Add timeout limit for HBaseClient dataOutputStream
[ https://issues.apache.org/jira/browse/HBASE-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13854397#comment-13854397 ] Lars Hofhansl commented on HBASE-8558: -- So the issue is: While we were writing something a RegionServer went down and we sit there forever waiting? Add timeout limit for HBaseClient dataOutputStream -- Key: HBASE-8558 URL: https://issues.apache.org/jira/browse/HBASE-8558 Project: HBase Issue Type: Bug Components: Client Affects Versions: 0.94.5, 0.94.14 Reporter: wanbin Assignee: Liang Xie Attachments: HBASE-8558-0.94.txt I run jstack at the client host. The result is below:
{code}
hbase-tablepool-60-thread-34 daemon prio=10 tid=0x7f1e65a48000 nid=0x5173 runnable [0x579cc000]
   java.lang.Thread.State: RUNNABLE
	at sun.nio.ch.EPollArrayWrapper.epollWait(Native Method)
	at sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:210)
	at sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:65)
	at sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69)
	- locked 0x000758cb0780 (a sun.nio.ch.Util$2)
	- locked 0x000758cb0770 (a java.util.Collections$UnmodifiableSet)
	- locked 0x000758cb0548 (a sun.nio.ch.EPollSelectorImpl)
	at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80)
	at org.apache.hadoop.net.SocketIOWithTimeout$SelectorPool.select(SocketIOWithTimeout.java:336)
	at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:158)
	at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:153)
	at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:114)
	at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:65)
	at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123)
	- locked 0x000754e978a0 (a java.io.BufferedOutputStream)
	at java.io.DataOutputStream.flush(DataOutputStream.java:106)
	at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.sendParam(HBaseClient.java:620)
	- locked 0x000754e97880 (a java.io.DataOutputStream)
	at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:975)
	at org.apache.hadoop.hbase.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:86)
	at $Proxy13.multi(Unknown Source)
	at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1395)
	at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1393)
	at org.apache.hadoop.hbase.client.ServerCallable.withoutRetries(ServerCallable.java:210)
	at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1402)
	at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1390)
	at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
	at java.util.concurrent.FutureTask.run(FutureTask.java:138)
	at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886)
	at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908)
	at java.lang.Thread.run(Thread.java:662)
{code}
This thread has hung for one hour. Meanwhile another thread tries to close the connection:
{code}
IPC Client (1983049639) connection to dump002030.cm6.tbsite.net/10.246.2.30:30020 from admin daemon prio=10 tid=0x7f1e70674800 nid=0x3d76 waiting for monitor entry [0x4bc0f000]
   java.lang.Thread.State: BLOCKED (on object monitor)
	at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123)
	- waiting to lock 0x000754e978a0 (a java.io.BufferedOutputStream)
	at java.io.DataOutputStream.flush(DataOutputStream.java:106)
	at java.io.FilterOutputStream.close(FilterOutputStream.java:140)
	at org.apache.hadoop.io.IOUtils.cleanup(IOUtils.java:237)
	at org.apache.hadoop.io.IOUtils.closeStream(IOUtils.java:254)
	at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.close(HBaseClient.java:715)
	- locked 0x000754e7b818 (a org.apache.hadoop.hbase.ipc.HBaseClient$Connection)
	at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.run(HBaseClient.java:587)
{code}
dump002030.cm6.tbsite.net is the dead regionserver. I read the HBase source code and discovered that connection.out doesn't set a timeout:
{code}
this.out = new DataOutputStream(new BufferedOutputStream(NetUtils.getOutputStream(socket)));
{code}
I see this means epoll_wait will block indefinitely. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
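The general fix direction — bounding a socket write with a select timeout, which is the same pattern Hadoop's SocketIOWithTimeout implements — can be sketched with plain NIO. This is a stand-alone illustration, not the attached patch; writeWithTimeout fails with an IOException after the deadline instead of parking in epoll_wait forever.

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.nio.ByteBuffer;
import java.nio.channels.SelectionKey;
import java.nio.channels.Selector;
import java.nio.channels.ServerSocketChannel;
import java.nio.channels.SocketChannel;

// Minimal selector-based timed write: instead of blocking indefinitely for
// writability (the hang in the jstack above), wait at most timeoutMs per
// select and surface a timeout as an IOException the caller can handle.
public class TimedWriteSketch {

    public static void writeWithTimeout(SocketChannel ch, ByteBuffer buf, long timeoutMs)
            throws IOException {
        try (Selector sel = Selector.open()) {
            ch.register(sel, SelectionKey.OP_WRITE); // channel must be non-blocking
            while (buf.hasRemaining()) {
                if (sel.select(timeoutMs) == 0) {
                    throw new IOException("write timed out after " + timeoutMs + " ms");
                }
                sel.selectedKeys().clear();
                ch.write(buf); // drain as much as the send buffer accepts
            }
        }
    }

    public static void main(String[] args) throws IOException {
        // loopback demo: a healthy peer, so the write completes within the timeout
        try (ServerSocketChannel server = ServerSocketChannel.open()) {
            server.bind(new InetSocketAddress("127.0.0.1", 0));
            try (SocketChannel client = SocketChannel.open(server.getLocalAddress())) {
                client.configureBlocking(false);
                ByteBuffer buf = ByteBuffer.wrap("put".getBytes());
                writeWithTimeout(client, buf, 1000);
                if (buf.hasRemaining()) throw new AssertionError("not fully written");
                System.out.println("write completed within timeout");
            }
        }
    }
}
```

Note that Socket.setSoTimeout only bounds reads; writes need the selector approach above (or Hadoop's NetUtils/SocketOutputStream write-timeout variants), which is exactly why the unbounded connection.out in the report could hang forever.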
[jira] [Commented] (HBASE-10047) postScannerFilterRow consumes a lot of CPU in tall table scans
[ https://issues.apache.org/jira/browse/HBASE-10047?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854409#comment-13854409 ] Lars Hofhansl commented on HBASE-10047: --- Interesting, didn't realize that can happen. After the region is loaded? Ohh, when we detect an error we remove the coprocessor. SortedCopyOnWriteSet should have been a hint too :) The first patch is still valid, since we're only removing after the region was loaded. I didn't measure any perf improvement with v2 anyway, it seems instanceof is not the issue. postScannerFilterRow consumes a lot of CPU in tall table scans -- Key: HBASE-10047 URL: https://issues.apache.org/jira/browse/HBASE-10047 Project: HBase Issue Type: Bug Reporter: Lars Hofhansl Attachments: 10047-0.94-sample-v2.txt, 10047-0.94-sample.txt, postScannerFilterRow.png Continuing my profiling quest, I find that in scanning tall table (and filtering everything on the server) a quarter of the time is now spent in the postScannerFilterRow coprocessor hook. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10216) Change HBase to support local compactions
[ https://issues.apache.org/jira/browse/HBASE-10216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13854414#comment-13854414 ] Vladimir Rodionov commented on HBASE-10216: --- This should be opened as an HDFS ticket. Provide an API to *register* a new file with a given path and block locations. This may benefit HDFS copy a lot as well: blocks can be copied locally and the new file created with just one HDFS API call, registerFile(Path path, BlockLocation[] locations). Compaction will be performed locally (mostly) and the coordinator of the compaction will call *registerFile(Path path, BlockLocation[] locations)* when all involved nodes are finished. Change HBase to support local compactions - Key: HBASE-10216 URL: https://issues.apache.org/jira/browse/HBASE-10216 Project: HBase Issue Type: New Feature Components: Compaction Environment: All Reporter: David Witten As I understand it compactions will read data from DFS and write to DFS. This means that even when the reading occurs on the local host (because region server has a local copy) all the writing must go over the network to the other replicas. This proposal suggests that HBase would perform much better if all the reading and writing occurred locally and did not go over the network. I propose that the DFS interface be extended to provide method that would merge files so that the merging and deleting can be performed on local data nodes with no file contents moving over the network. The method would take a list of paths to be merged and deleted and the merged file path and an indication of a file-format-aware class that would be run on each data node to perform the merge. The merge method provided by this merging class would be passed files open for reading for all the files to be merged and one file open for writing. 
The custom class provided merge method would read all the input files and append to the output file using some standard API that would work across all DFS implementations. The DFS would ensure that the merge had happened properly on all replicas before returning to the caller. It could be that greater resiliency could be achieved by implementing the deletion as a separate phase that is only done after enough of the replicas had completed the merge. HBase would be changed to use the new merge method for compactions, and would provide an implementation of the merging class that works with HFiles. This proposal would require a custom code that understands the file format to be runnable by the data nodes to manage the merge. So there would need to be a facility to load classes into DFS if there isn't such a facility already. Or, less generally, HDFS could build in support for HFile merging. The merge method might be optional. If the DFS implementation did not provide it a generic version that performed the merge on top of the regular DFS interfaces would be used. It may be that this method needs to be tweaked or ignored when the region server does not have a local copy data so that, as happens currently, one copy of the data moves to the region server. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
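The registerFile(Path, BlockLocation[]) idea above can be shown with a toy in-memory namenode. All names here are invented for illustration (ToyNameNode and BlockLocation below are not real HDFS classes); the point being sketched is that once the datanodes hold the merged block replicas locally, registration is a single metadata-only call, with no block data crossing the network.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Toy model of the suggested registerFile(path, locations) call: the
// compaction coordinator registers a finished file whose blocks already
// exist on the listed datanodes.
public class RegisterFileSketch {

    public static class BlockLocation {
        final long blockId;
        final List<String> hosts; // datanodes already holding this block locally
        public BlockLocation(long blockId, List<String> hosts) {
            this.blockId = blockId;
            this.hosts = hosts;
        }
    }

    public static class ToyNameNode {
        private final Map<String, List<BlockLocation>> namespace = new HashMap<>();

        // metadata-only registration: no block data moves over the network here
        public void registerFile(String path, List<BlockLocation> locations) {
            if (namespace.containsKey(path)) {
                throw new IllegalStateException("already exists: " + path);
            }
            namespace.put(path, new ArrayList<>(locations));
        }

        public List<BlockLocation> getBlockLocations(String path) {
            return namespace.get(path);
        }
    }

    public static void main(String[] args) {
        ToyNameNode nn = new ToyNameNode();
        // hypothetical path and datanode names, for illustration only
        nn.registerFile("/hbase/data/t1/cf/merged-hfile",
            List.of(new BlockLocation(1L, List.of("dn1", "dn2", "dn3"))));
        int blocks = nn.getBlockLocations("/hbase/data/t1/cf/merged-hfile").size();
        if (blocks != 1) throw new AssertionError(blocks);
        System.out.println("registered merged file with " + blocks + " block(s)");
    }
}
```

A real version would also have to validate replica consistency and lease semantics on the namenode side — which is part of why the thread suggests this belongs in an HDFS ticket.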
[jira] [Updated] (HBASE-10095) Selective WALEdit encryption
[ https://issues.apache.org/jira/browse/HBASE-10095?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-10095: --- Affects Version/s: (was: 0.98.0) 0.99.0 Fix Version/s: (was: 0.98.0) I've spent some time looking at how to accomplish this. We have implemented WALEdit encryption using a WALCellCodec, which is necessary because WALEdits are stratified by rows, not columns, so some cells in a WALEdit will be encrypted and some not if we are selectively doing this. In the WALCellCodec context, we only have information about the cell, we can't get a reference to anything that will lead to family information. Replication provides an existing example of how to do family-specific WALEdit modification. Replication modifies WALEdits by adding a WALActionsListener at a high level where it has access to the server. The WALEdit type already has fields for carrying scope information. We could do something similar here: We could add a field to WALEdit indicating if it should be encrypted or not and register a listener (up in HStore?) that sets it accordingly, but this is not enough because WALCellCodecs only see Cells, not the WALEdit that contains them. I have experimented with a few interface changes and am not happy with any of the results so far. So I am going to move this out. Selective WALEdit encryption Key: HBASE-10095 URL: https://issues.apache.org/jira/browse/HBASE-10095 Project: HBase Issue Type: Improvement Affects Versions: 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell The SecureWALProtobufWriter currently will encrypt every WAL entry if WAL encryption is enabled. However, SecureWALProtobufReader can distinguish between encrypted and unencrypted entries, and we encrypt every entry individually in part because the reader can skip and seek around during split and recovery, but also in part to enable selective encryption of WALedits. 
We should consider encrypting only the WALedits of column families for which HBASE-7544 features are configured. If few column families are encrypted relative to all CFs on the cluster, the performance difference will be significant. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
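If a codec could be handed the set of encrypted families - one possible shape of the interface changes discussed above - the per-cell decision itself is simple. A minimal sketch under that assumption; the `Cell` class, the XOR placeholder "cipher", and the constructor-injected family set are all illustrative, not the real WALCellCodec API:

```java
import java.util.Set;

public class SelectiveWalEncryptSketch {

    /** Minimal stand-in for a WAL cell: just the family name and the value payload. */
    static final class Cell {
        final String family;
        final byte[] value;
        Cell(String family, byte[] value) { this.family = family; this.value = value; }
    }

    private final Set<String> encryptedFamilies;

    SelectiveWalEncryptSketch(Set<String> encryptedFamilies) {
        this.encryptedFamilies = encryptedFamilies;
    }

    /** Decide per cell, as a codec would have to: transform only cells of configured families. */
    byte[] encode(Cell cell) {
        if (!encryptedFamilies.contains(cell.family)) {
            return cell.value;               // pass through in the clear
        }
        byte[] out = cell.value.clone();     // placeholder "cipher": XOR, not real crypto
        for (int i = 0; i < out.length; i++) {
            out[i] ^= 0x5A;
        }
        return out;
    }

    public static void main(String[] args) {
        SelectiveWalEncryptSketch codec = new SelectiveWalEncryptSketch(Set.of("secret"));
        byte[] plain = codec.encode(new Cell("public", new byte[] { 42 }));
        byte[] enc = codec.encode(new Cell("secret", new byte[] { 42 }));
        System.out.println("plain unchanged: " + (plain[0] == 42)
            + ", secret changed: " + (enc[0] != 42));
    }
}
```

The hard part the comment identifies is not this decision but plumbing: in HBase the codec is constructed without access to the table schema, so it cannot know which families are configured for encryption.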
[jira] [Comment Edited] (HBASE-10095) Selective WALEdit encryption
[ https://issues.apache.org/jira/browse/HBASE-10095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854422#comment-13854422 ] Andrew Purtell edited comment on HBASE-10095 at 12/20/13 7:02 PM: -- I've spent some time looking at how to accomplish this. We have implemented WALEdit encryption using a WALCellCodec, which is necessary because WALEdits are stratified by rows, not columns, so some cells in a WALEdit will be encrypted and some not if we are selectively doing this. In the WALCellCodec context we only have information about the cell; we can't get a reference to anything that will lead to family information. Replication provides an existing example of how to do family-specific WALEdit modification. Replication modifies WALEdits by adding a WALActionsListener at a high level where it has access to the server. The WALEdit type already has fields for carrying scope information. We could do something similar here: we could add a field to WALEdit indicating which cells, for which families, within it should be encrypted, and register a listener (up in HStore?) that sets it accordingly, but this is not enough because WALCellCodecs only see Cells, not the WALEdit that contains them. I have experimented with a few interface changes and am not happy with any of the results so far. So I am going to move this out. was (Author: apurtell): I've spent some time looking at how to accomplish this. We have implemented WALEdit encryption using a WALCellCodec, which is necessary because WALEdits are stratified by rows, not columns, so some cells in a WALEdit will be encrypted and some not if we are selectively doing this. In the WALCellCodec context we only have information about the cell; we can't get a reference to anything that will lead to family information. Replication provides an existing example of how to do family-specific WALEdit modification. Replication modifies WALEdits by adding a WALActionsListener at a high level where it has access to the server. The WALEdit type already has fields for carrying scope information. We could do something similar here: we could add a field to WALEdit indicating if it should be encrypted or not and register a listener (up in HStore?) that sets it accordingly, but this is not enough because WALCellCodecs only see Cells, not the WALEdit that contains them. I have experimented with a few interface changes and am not happy with any of the results so far. So I am going to move this out. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Comment Edited] (HBASE-10095) Selective WALEdit encryption
[ https://issues.apache.org/jira/browse/HBASE-10095?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854422#comment-13854422 ] Andrew Purtell edited comment on HBASE-10095 at 12/20/13 7:04 PM: -- I've spent some time looking at how to accomplish this. We have implemented WALEdit encryption using a WALCellCodec, which is necessary because WALEdits are stratified by rows, not columns, so some cells in a WALEdit will be encrypted and some not if we are selectively doing this. In the WALCellCodec context we only have information about the cell; we can't get a reference to anything that will lead to family information. Replication provides an existing example of how to do family-specific WALEdit modification. Replication modifies WALEdits by adding a WALActionsListener at a high level where it has access to the server. The WALEdit type already has fields for carrying scope information. We could do something similar here: we could add a field to WALEdit indicating which cells, for which families, within it should be encrypted, and register a listener (up in HStore?) that sets it accordingly, but that would still not be quite enough because WALCellCodecs only see Cells, not the WALEdit that contains them. I have experimented with a few interface changes and am not happy with any of the results so far. So I am going to move this out. was (Author: apurtell): I've spent some time looking at how to accomplish this. We have implemented WALEdit encryption using a WALCellCodec, which is necessary because WALEdits are stratified by rows, not columns, so some cells in a WALEdit will be encrypted and some not if we are selectively doing this. In the WALCellCodec context we only have information about the cell; we can't get a reference to anything that will lead to family information. Replication provides an existing example of how to do family-specific WALEdit modification. Replication modifies WALEdits by adding a WALActionsListener at a high level where it has access to the server. The WALEdit type already has fields for carrying scope information. We could do something similar here: we could add a field to WALEdit indicating which cells, for which families, within it should be encrypted, and register a listener (up in HStore?) that sets it accordingly, but this is not enough because WALCellCodecs only see Cells, not the WALEdit that contains them. I have experimented with a few interface changes and am not happy with any of the results so far. So I am going to move this out. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10210) during master startup, RS can be you-are-dead-ed by master in error
[ https://issues.apache.org/jira/browse/HBASE-10210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854434#comment-13854434 ] Jimmy Xiang commented on HBASE-10210: - Looks like when the master starts up, we don't put those regionservers in ZK into the online server list. Check RegionServerTracker#start. Will fixing this fix the issue? during master startup, RS can be you-are-dead-ed by master in error --- Key: HBASE-10210 URL: https://issues.apache.org/jira/browse/HBASE-10210 Project: HBase Issue Type: Bug Affects Versions: 0.98.0, 0.96.1, 0.99.0, 0.96.1.1 Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Attachments: HBASE-10210.patch Not sure of the root cause yet; I am at the "how did this ever work" stage. We see this problem in 0.96.1, but didn't in 0.96.0 + some patches. It looks like RS information arriving from two sources - ZK and the server itself - can conflict. The master doesn't handle such cases (timestamp match), and technically timestamps can collide for two separate servers anyway. So, the master YouAreDead-s the already-recorded reporting RS, and adds it too. Then it discovers that the new server has died with a fatal error! Note the threads: addition is called from master initialization and from RPC. {noformat} 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Finished waiting for region servers count to settle; checked in 2, slept for 18262 ms, expecting minimum of 1, maximum of 2147483647, master is running. 
2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Registering server=h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.HMaster: Registered server found up in zk but who has not yet reported in: h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Triggering server recovery; existingServer h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 looks stale, new server:h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Master doesn't enable ServerShutdownHandler during initialization, delay expiring server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 ... 2013-12-19 11:16:46,925 ERROR [RpcServer.handler=7,port=6] master.HMaster: Region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 reported a fatal error: ABORTING region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800: org.apache.hadoop.hbase.YouAreDeadException: Server REPORT rejected; currently processing h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 as dead server {noformat} Presumably some of the recent ZK listener related changes b -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10210) during master startup, RS can be you-are-dead-ed by master in error
[ https://issues.apache.org/jira/browse/HBASE-10210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854445#comment-13854445 ] Sergey Shelukhin commented on HBASE-10210: -- You mean the online servers in the tracker? It does add them to its internal list. Can you elaborate a bit? If they are put into the other online-server list, wouldn't it make the issue worse? As far as I see in the check...AndAdd method and around it, there's no provision for one server to be added twice; if it was already there, the same issue will happen: report rejected. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Comment Edited] (HBASE-10210) during master startup, RS can be you-are-dead-ed by master in error
[ https://issues.apache.org/jira/browse/HBASE-10210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854445#comment-13854445 ] Sergey Shelukhin edited comment on HBASE-10210 at 12/20/13 7:15 PM: You mean the online servers in the tracker? It does add them to its internal list. Can you elaborate a bit? If they are put into the other online-server list, wouldn't it make the issue worse? As far as I see in the check...AndAdd method and around it, there's no provision for one server to be added twice; if it was already there, the same issue will happen: it will expire the old one (from ZK), then get the report rejected. was (Author: sershe): You mean the online servers in the tracker? It does add them to its internal list. Can you elaborate a bit? If they are put into the other online-server list, wouldn't it make the issue worse? As far as I see in the check...AndAdd method and around it, there's no provision for one server to be added twice; if it was already there, the same issue will happen: report rejected. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-10210) during master startup, RS can be you-are-dead-ed by master in error
[ https://issues.apache.org/jira/browse/HBASE-10210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854452#comment-13854452 ] Jimmy Xiang commented on HBASE-10210: - I have not thought through the issue yet. For now, as far as I know, ServerManager has a list, and RegionServerTracker has a list too. The start call only adds the RSes from ZK to the list in RegionServerTracker, which is right. However, for the first run, should we also add them to the list in ServerManager? -- This message was sent by Atlassian JIRA (v6.1.4#6159)
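The race under discussion can be mimicked with a toy registration map: the same ServerName (host, port, startcode) arrives once from the ZK scan at master startup and once from the server's own RPC report, and a second arrival of an identical name has no clean path. This is a hypothetical sketch of the conflict, not the actual ServerManager code:

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

public class ServerRegistrationSketch {
    // toy stand-in for an online-server map, keyed by "host,port,startcode"
    private final Map<String, Long> onlineServers = new ConcurrentHashMap<>();

    /**
     * Two registration paths can race: the ZK scan at master startup and the
     * server's own report. When the identical name arrives a second time, it is
     * not a restarted server, yet a naive check treats the recorded entry as stale.
     */
    String checkAndRecord(String serverName, long startcode) {
        Long existing = onlineServers.putIfAbsent(serverName, startcode);
        if (existing == null) {
            return "registered";
        }
        // same name, same startcode: the second arrival is a duplicate, not a restart
        return existing == startcode ? "rejected-duplicate" : "expired-old";
    }

    public static void main(String[] args) {
        ServerRegistrationSketch m = new ServerRegistrationSketch();
        String fromZk = m.checkAndRecord("hbase-8,60020,1387451803800", 1387451803800L);
        String fromRpc = m.checkAndRecord("hbase-8,60020,1387451803800", 1387451803800L);
        System.out.println(fromZk + " then " + fromRpc); // prints "registered then rejected-duplicate"
    }
}
```

In the log above the real master takes the "expired-old" branch even for the duplicate case, which is what leads to the YouAreDeadException for a live server.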
[jira] [Commented] (HBASE-10183) Need enforce a reserved range of system tag types
[ https://issues.apache.org/jira/browse/HBASE-10183?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854454#comment-13854454 ] Jeffrey Zhong commented on HBASE-10183: --- Sounds good. Please close it as a dup. Thanks. Need enforce a reserved range of system tag types - Key: HBASE-10183 URL: https://issues.apache.org/jira/browse/HBASE-10183 Project: HBase Issue Type: Task Components: HFile Affects Versions: 0.98.0 Reporter: Jeffrey Zhong Assignee: ramkrishna.s.vasudevan Priority: Critical Fix For: 0.98.0 If we don't reserve a range of system tag types now, let's say 0-64 (the total tag type range is 0-255), we'll have a hard time when introducing a new system tag type in the future, because the new tag type may collide with an existing user tag type, as tags are open to users as well. [~ram_krish], [~anoop.hbase] What do you guys think? Thanks! -- This message was sent by Atlassian JIRA (v6.1.4#6159)
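The reservation itself amounts to a range check at tag-creation time. A sketch assuming the 0-64 split suggested above; the constant and method names here are made up for illustration:

```java
public class TagTypeRangeSketch {
    // assumed split of the one-byte tag-type space (0-255), per the proposal:
    // types up to 64 reserved for system use, the rest open to users
    static final int MAX_RESERVED_SYSTEM_TAG_TYPE = 64;

    /** Reject user-supplied tag types that fall into the reserved system range. */
    static void checkUserTagType(int type) {
        if (type < 0 || type > 255) {
            throw new IllegalArgumentException("tag type must fit in one byte: " + type);
        }
        if (type <= MAX_RESERVED_SYSTEM_TAG_TYPE) {
            throw new IllegalArgumentException("tag types 0-"
                + MAX_RESERVED_SYSTEM_TAG_TYPE + " are reserved for system use: " + type);
        }
    }

    public static void main(String[] args) {
        checkUserTagType(200);              // user range: accepted
        boolean rejected = false;
        try {
            checkUserTagType(10);           // system range: rejected
        } catch (IllegalArgumentException e) {
            rejected = true;
        }
        System.out.println("reserved type rejected: " + rejected); // prints "reserved type rejected: true"
    }
}
```

The point of enforcing this early is exactly the one made in the issue: without a reserved range, a future system tag type could collide with a user tag type already written to HFiles.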
[jira] [Commented] (HBASE-8558) Add timeout limit for HBaseClient dataOutputStream
[ https://issues.apache.org/jira/browse/HBASE-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854459#comment-13854459 ] Lars Hofhansl commented on HBASE-8558: -- +1 Add timeout limit for HBaseClient dataOutputStream -- Key: HBASE-8558 URL: https://issues.apache.org/jira/browse/HBASE-8558 Project: HBase Issue Type: Bug Components: Client Affects Versions: 0.94.5, 0.94.14 Reporter: wanbin Assignee: Liang Xie Attachments: HBASE-8558-0.94.txt I run jstack at client host. The result is below. hbase-tablepool-60-thread-34 daemon prio=10 tid=0x7f1e65a48000 nid=0x5173 runnable [0x579cc000] java.lang.Thread.State: RUNNABLE at sun.nio.ch.EPollArrayWrapper.epollWait(Native Method) at sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:210) at sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:65) at sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69) - locked 0x000758cb0780 (a sun.nio.ch.Util$2) - locked 0x000758cb0770 (a java.util.Collections$UnmodifiableSet) - locked 0x000758cb0548 (a sun.nio.ch.EPollSelectorImpl) at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80) at org.apache.hadoop.net.SocketIOWithTimeout$SelectorPool.select(SocketIOWithTimeout.java:336) at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:158) at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:153) at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:114) at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:65) at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123) - locked 0x000754e978a0 (a java.io.BufferedOutputStream) at java.io.DataOutputStream.flush(DataOutputStream.java:106) at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.sendParam(HBaseClient.java:620) - locked 0x000754e97880 (a java.io.DataOutputStream) at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:975) at 
org.apache.hadoop.hbase.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:86) at $Proxy13.multi(Unknown Source) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1395) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1393) at org.apache.hadoop.hbase.client.ServerCallable.withoutRetries(ServerCallable.java:210) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1402) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1390) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at java.util.concurrent.FutureTask.run(FutureTask.java:138) at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908) at java.lang.Thread.run(Thread.java:662) This thread has hung for one hour. Meanwhile, another thread tries to close the connection: IPC Client (1983049639) connection to dump002030.cm6.tbsite.net/10.246.2.30:30020 from admin daemon prio=10 tid=0x7f1e70674800 nid=0x3d76 waiting for monitor entry [0x4bc0f000] java.lang.Thread.State: BLOCKED (on object monitor) at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123) - waiting to lock 0x000754e978a0 (a java.io.BufferedOutputStream) at java.io.DataOutputStream.flush(DataOutputStream.java:106) at java.io.FilterOutputStream.close(FilterOutputStream.java:140) at org.apache.hadoop.io.IOUtils.cleanup(IOUtils.java:237) at org.apache.hadoop.io.IOUtils.closeStream(IOUtils.java:254) at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.close(HBaseClient.java:715) - locked 0x000754e7b818 (a org.apache.hadoop.hbase.ipc.HBaseClient$Connection) at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.run(HBaseClient.java:587) dump002030.cm6.tbsite.net is dead 
regionserver. I read the HBase source code and discovered that connection.out doesn't set a timeout: this.out = new DataOutputStream(new BufferedOutputStream(NetUtils.getOutputStream(socket))); I believe this means epoll_wait will block indefinitely. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
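Hadoop's SocketIOWithTimeout (visible in the stack trace above) implements timeouts by doing non-blocking I/O under a selector with a bounded select, and NetUtils.getOutputStream has a variant that takes a timeout; presumably the attached patch constructs the stream with such a write timeout instead of the no-timeout form. The mechanism can be sketched with plain NIO - this is an illustration of the technique, not the patch itself:

```java
import java.io.IOException;
import java.net.InetSocketAddress;
import java.net.ServerSocket;
import java.net.Socket;
import java.net.SocketTimeoutException;
import java.net.StandardSocketOptions;
import java.nio.ByteBuffer;
import java.nio.channels.SelectionKey;
import java.nio.channels.Selector;
import java.nio.channels.SocketChannel;

public class WriteTimeoutSketch {

    /** Write with a deadline: when the send buffer is full and the peer never drains it,
     *  give up after timeoutMs instead of blocking in epoll_wait forever. */
    static void writeWithTimeout(SocketChannel ch, ByteBuffer buf, long timeoutMs)
            throws IOException {
        try (Selector sel = Selector.open()) {
            ch.register(sel, SelectionKey.OP_WRITE);
            while (buf.hasRemaining()) {
                if (ch.write(buf) > 0) continue;     // made progress, keep writing
                if (sel.select(timeoutMs) == 0) {    // no space freed within the deadline
                    throw new SocketTimeoutException("write stalled for " + timeoutMs + " ms");
                }
                sel.selectedKeys().clear();
            }
        }
    }

    /** Demo: write to a peer that never reads; returns true if the write timed out. */
    static boolean demo() throws Exception {
        ServerSocket server = new ServerSocket();
        server.setReceiveBufferSize(8 * 1024);       // keep the peer's window small
        server.bind(new InetSocketAddress("127.0.0.1", 0));
        SocketChannel ch = SocketChannel.open();
        ch.setOption(StandardSocketOptions.SO_SNDBUF, 8 * 1024);
        ch.connect(new InetSocketAddress("127.0.0.1", server.getLocalPort()));
        Socket peer = server.accept();               // accept but never read: a "dead" peer
        ch.configureBlocking(false);
        boolean timedOut = false;
        try {
            writeWithTimeout(ch, ByteBuffer.allocate(4 * 1024 * 1024), 200);
        } catch (SocketTimeoutException e) {
            timedOut = true;
        }
        ch.close(); peer.close(); server.close();
        return timedOut;
    }

    public static void main(String[] args) throws Exception {
        System.out.println("write timed out: " + demo());
    }
}
```

With a blocking stream and no timeout, the same stalled write parks the thread inside epoll_wait indefinitely, which matches the one-hour hang in the jstack output.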
[jira] [Commented] (HBASE-10210) during master startup, RS can be you-are-dead-ed by master in error
[ https://issues.apache.org/jira/browse/HBASE-10210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854467#comment-13854467 ] Sergey Shelukhin commented on HBASE-10210: -- That's what it does in the loop after waiting for reporting servers (only for non-reported ones), as far as I see. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
[jira] [Commented] (HBASE-8529) checkOpen is missing from multi, mutate, get and multiGet etc.
[ https://issues.apache.org/jira/browse/HBASE-8529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13854472#comment-13854472 ] Jeffrey Zhong commented on HBASE-8529: -- Thanks [~anoop.hbase], [~ram_krish] for the reviews! I've integrated it into the 0.98 and trunk branches. checkOpen is missing from multi, mutate, get and multiGet etc. -- Key: HBASE-8529 URL: https://issues.apache.org/jira/browse/HBASE-8529 Project: HBase Issue Type: Bug Reporter: Jeffrey Zhong Assignee: Jeffrey Zhong Priority: Minor Fix For: 0.98.0, 0.99.0 Attachments: hbase-8529.patch I saw we have checkOpen in all those functions in 0.94 while they're missing from trunk. Does anyone know why? For multi and mutate, if we don't call checkOpen we could flood our logs with a bunch of "DFSOutputStream is closed" errors when we sync the WAL. -- This message was sent by Atlassian JIRA (v6.1.4#6159)
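The guard being restored is a fail-fast check at the top of each RPC handler. A toy sketch of the pattern - not the actual RegionServer code; the class, field, and message here are illustrative:

```java
import java.io.IOException;
import java.util.concurrent.atomic.AtomicBoolean;

public class CheckOpenSketch {
    private final AtomicBoolean stopped = new AtomicBoolean(false);

    /** Fail fast instead of attempting work (and flooding the logs with WAL-sync
     *  errors) once the server is stopping. Loosely modeled on checkOpen. */
    void checkOpen() throws IOException {
        if (stopped.get()) {
            throw new IOException("server is not running yet or is stopping");
        }
    }

    /** Stand-in for an RPC handler such as multi/mutate: guard first, then do the work. */
    String multi(String request) throws IOException {
        checkOpen();
        return "ok:" + request;
    }

    void stop() { stopped.set(true); }

    public static void main(String[] args) throws IOException {
        CheckOpenSketch rs = new CheckOpenSketch();
        System.out.println(rs.multi("put"));   // served while open; prints "ok:put"
        rs.stop();
        try {
            rs.multi("put");
        } catch (IOException e) {
            System.out.println("rejected: " + e.getMessage());
        }
    }
}
```

The payoff is the one described in the issue: a request that arrives while the server is shutting down is rejected with one clean exception rather than failing deep inside the WAL sync path.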
[jira] [Updated] (HBASE-8529) checkOpen is missing from multi, mutate, get and multiGet etc.
[ https://issues.apache.org/jira/browse/HBASE-8529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeffrey Zhong updated HBASE-8529: - Resolution: Fixed Status: Resolved (was: Patch Available) -- This message was sent by Atlassian JIRA (v6.1.4#6159)