[jira] [Commented] (HBASE-8731) Use the JDK 1.7 in the precommit env for trunk
[ https://issues.apache.org/jira/browse/HBASE-8731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13856808#comment-13856808 ] Nicolas Liochon commented on HBASE-8731: On precommit, we don't own the script the script that changes the variable. A while ago, I spoke to Giri who said he could change this for us, and ask me to create this jira. I reopen it, so :-) Use the JDK 1.7 in the precommit env for trunk -- Key: HBASE-8731 URL: https://issues.apache.org/jira/browse/HBASE-8731 Project: HBase Issue Type: Improvement Components: build Affects Versions: 0.98.0 Reporter: Nicolas Liochon Assignee: Giridharan Kesavan Fix For: 0.98.0 HBase today uses the jdk 1.6. In the past it created issues when we tried to use 1.7 for the core build while the precommit was on 1.6. Having the precommit on 1.7 would solve this. The best is to start with trunk. Likely 0.95 will come next, and may be, a day, 0.94. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Reopened] (HBASE-8731) Use the JDK 1.7 in the precommit env for trunk
[ https://issues.apache.org/jira/browse/HBASE-8731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicolas Liochon reopened HBASE-8731: Assignee: Giridharan Kesavan (was: stack) Use the JDK 1.7 in the precommit env for trunk -- Key: HBASE-8731 URL: https://issues.apache.org/jira/browse/HBASE-8731 Project: HBase Issue Type: Improvement Components: build Affects Versions: 0.98.0 Reporter: Nicolas Liochon Assignee: Giridharan Kesavan Fix For: 0.98.0 HBase today uses the jdk 1.6. In the past it created issues when we tried to use 1.7 for the core build while the precommit was on 1.6. Having the precommit on 1.7 would solve this. The best is to start with trunk. Likely 0.95 will come next, and may be, a day, 0.94. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Comment Edited] (HBASE-8731) Use the JDK 1.7 in the precommit env for trunk
[ https://issues.apache.org/jira/browse/HBASE-8731?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13856808#comment-13856808 ] Nicolas Liochon edited comment on HBASE-8731 at 12/26/13 9:53 AM: -- On precommit, we don't own the script that changes the variable. A while ago, I spoke to Giri who said he could change this for us, and ask me to create this jira. I reopen it, so :-) was (Author: nkeywal): On precommit, we don't own the script the script that changes the variable. A while ago, I spoke to Giri who said he could change this for us, and ask me to create this jira. I reopen it, so :-) Use the JDK 1.7 in the precommit env for trunk -- Key: HBASE-8731 URL: https://issues.apache.org/jira/browse/HBASE-8731 Project: HBase Issue Type: Improvement Components: build Affects Versions: 0.99.0 Reporter: Nicolas Liochon Assignee: Giridharan Kesavan HBase today uses the jdk 1.6. In the past it created issues when we tried to use 1.7 for the core build while the precommit was on 1.6. Having the precommit on 1.7 would solve this. The best is to start with trunk. Likely 0.95 will come next, and may be, a day, 0.94. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-8731) Use the JDK 1.7 in the precommit env for trunk
[ https://issues.apache.org/jira/browse/HBASE-8731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicolas Liochon updated HBASE-8731: --- Affects Version/s: (was: 0.98.0) 0.99.0 Use the JDK 1.7 in the precommit env for trunk -- Key: HBASE-8731 URL: https://issues.apache.org/jira/browse/HBASE-8731 Project: HBase Issue Type: Improvement Components: build Affects Versions: 0.99.0 Reporter: Nicolas Liochon Assignee: Giridharan Kesavan HBase today uses the jdk 1.6. In the past it created issues when we tried to use 1.7 for the core build while the precommit was on 1.6. Having the precommit on 1.7 would solve this. The best is to start with trunk. Likely 0.95 will come next, and may be, a day, 0.94. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-8731) Use the JDK 1.7 in the precommit env for trunk
[ https://issues.apache.org/jira/browse/HBASE-8731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nicolas Liochon updated HBASE-8731: --- Fix Version/s: (was: 0.98.0) Use the JDK 1.7 in the precommit env for trunk -- Key: HBASE-8731 URL: https://issues.apache.org/jira/browse/HBASE-8731 Project: HBase Issue Type: Improvement Components: build Affects Versions: 0.99.0 Reporter: Nicolas Liochon Assignee: Giridharan Kesavan HBase today uses the jdk 1.6. In the past it created issues when we tried to use 1.7 for the core build while the precommit was on 1.6. Having the precommit on 1.7 would solve this. The best is to start with trunk. Likely 0.95 will come next, and may be, a day, 0.94. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-7226) HRegion.checkAndMutate uses incorrect comparison result for , =, and =
[ https://issues.apache.org/jira/browse/HBASE-7226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13856819#comment-13856819 ] Hudson commented on HBASE-7226: --- SUCCESS: Integrated in HBase-0.98-on-Hadoop-1.1 #37 (See [https://builds.apache.org/job/HBase-0.98-on-Hadoop-1.1/37/]) HBASE-7226 HRegion.checkAndMutate uses incorrect comparison result for , =, and = (tedyu: rev 1553454) * /hbase/branches/0.98/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java * /hbase/branches/0.98/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestHRegion.java HRegion.checkAndMutate uses incorrect comparison result for , =, and = --- Key: HBASE-7226 URL: https://issues.apache.org/jira/browse/HBASE-7226 Project: HBase Issue Type: Bug Components: regionserver Reporter: Feng Honghua Assignee: Feng Honghua Fix For: 0.98.0, 0.94.16, 0.99.0 Attachments: HBASE-7226-trunk-v2.patch, HBASE-7226-trunk.patch, HRegion_HBASE_7226_0.94.2.patch in HRegion.checkAndMutate, incorrect comparison results are used for , =, and =, as below: switch (compareOp) { case LESS: matches = compareResult = 0; // should be '' here break; case LESS_OR_EQUAL: matches = compareResult 0; // should be '=' here break; case EQUAL: matches = compareResult == 0; break; case NOT_EQUAL: matches = compareResult != 0; break; case GREATER_OR_EQUAL: matches = compareResult 0; // should be '=' here break; case GREATER: matches = compareResult = 0; // should be '' here break; -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-6104) Require EXEC permission to call coprocessor endpoints
[ https://issues.apache.org/jira/browse/HBASE-6104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13856850#comment-13856850 ] Hudson commented on HBASE-6104: --- FAILURE: Integrated in HBase-TRUNK #4761 (See [https://builds.apache.org/job/HBase-TRUNK/4761/]) Revert HBASE-6104. Revert initial commit and addendum (apurtell: rev 1553450) * /hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/coprocessor/EndpointObserver.java * /hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java * /hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/RegionCoprocessorHost.java * /hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/security/access/AccessController.java * /hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/security/access/TestAccessController.java * /hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/security/visibility/TestVisibilityLabelsWithACL.java Amend HBASE-6104. TestAccessController#testCoprocessorExec is improperly failing (apurtell: rev 1553446) * /hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/security/access/TestAccessController.java Require EXEC permission to call coprocessor endpoints - Key: HBASE-6104 URL: https://issues.apache.org/jira/browse/HBASE-6104 Project: HBase Issue Type: New Feature Components: Coprocessors, security Reporter: Gary Helmling Assignee: Andrew Purtell Fix For: 0.99.0 Attachments: 6104-addendum-1.patch, 6104-revert.patch, 6104.patch, 6104.patch, 6104.patch, 6104.patch, 6104.patch The EXEC action currently exists as only a placeholder in access control. It should really be used to enforce access to coprocessor endpoint RPC calls, which are currently unrestricted. How the ACLs to support this would be modeled deserves some discussion: * Should access be scoped to a specific table and CoprocessorProtocol extension? * Should it be possible to grant access to a CoprocessorProtocol implementation globally (regardless of table)? * Are per-method restrictions necessary? * Should we expose hooks available to endpoint implementors so that they could additionally apply their own permission checks? Some CP endpoints may want to require READ permissions, others may want to enforce WRITE, or READ + WRITE. To apply these kinds of checks we would also have to extend the RegionObserver interface to provide hooks wrapping HRegion.exec(). -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-7226) HRegion.checkAndMutate uses incorrect comparison result for , =, and =
[ https://issues.apache.org/jira/browse/HBASE-7226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13856848#comment-13856848 ] Hudson commented on HBASE-7226: --- FAILURE: Integrated in HBase-TRUNK #4761 (See [https://builds.apache.org/job/HBase-TRUNK/4761/]) HBASE-7226 HRegion.checkAndMutate uses incorrect comparison result for , =, and = (tedyu: rev 1553453) * /hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java * /hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestHRegion.java HRegion.checkAndMutate uses incorrect comparison result for , =, and = --- Key: HBASE-7226 URL: https://issues.apache.org/jira/browse/HBASE-7226 Project: HBase Issue Type: Bug Components: regionserver Reporter: Feng Honghua Assignee: Feng Honghua Fix For: 0.98.0, 0.94.16, 0.99.0 Attachments: HBASE-7226-trunk-v2.patch, HBASE-7226-trunk.patch, HRegion_HBASE_7226_0.94.2.patch in HRegion.checkAndMutate, incorrect comparison results are used for , =, and =, as below: switch (compareOp) { case LESS: matches = compareResult = 0; // should be '' here break; case LESS_OR_EQUAL: matches = compareResult 0; // should be '=' here break; case EQUAL: matches = compareResult == 0; break; case NOT_EQUAL: matches = compareResult != 0; break; case GREATER_OR_EQUAL: matches = compareResult 0; // should be '=' here break; case GREATER: matches = compareResult = 0; // should be '' here break; -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10238) TestAccessController#verifyDenied can fail even if ADE is thrown
[ https://issues.apache.org/jira/browse/HBASE-10238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13856849#comment-13856849 ] Hudson commented on HBASE-10238: FAILURE: Integrated in HBase-TRUNK #4761 (See [https://builds.apache.org/job/HBase-TRUNK/4761/]) Revert HBASE-10238. Revert initial commit and addendum (apurtell: rev 1553442) * /hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/security/access/TestAccessController.java Amend HBASE-10238. Special case for UndeclaredThrowableException (apurtell: rev 1553438) * /hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/security/access/TestAccessController.java TestAccessController#verifyDenied can fail even if ADE is thrown Key: HBASE-10238 URL: https://issues.apache.org/jira/browse/HBASE-10238 Project: HBase Issue Type: Bug Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Attachments: 10238-addendum-1.patch, 10238.patch In TestAccessController#verifyDenied there is code attempting to deal with UTEs with AccessDeniedException as a nested cause. Although it would seem the code intends to handle the exception reported by a recently failed test on the 0.98 Hadoop 1.1 Jenkins build here: {noformat} java.lang.reflect.UndeclaredThrowableException: Unknown exception in doAs Caused by: java.security.PrivilegedActionException: com.google.protobuf.ServiceException: Error calling method PingService.noop Caused by: com.google.protobuf.ServiceException: Error calling method PingService.noop Caused by: org.apache.hadoop.hbase.security.AccessDeniedExceptionorg.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions (user=UserB, scope=testCoprocessorExec, family=, action=EXEC) Caused by: org.apache.hadoop.hbase.ipc.RemoteWithExtrasException: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient permissions (user=UserB, scope=testCoprocessorExec, family=, action=EXEC) ... {noformat} it didn't work properly. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-7386) Investigate providing some supervisor support for znode deletion
[ https://issues.apache.org/jira/browse/HBASE-7386?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Samir Ahmic updated HBASE-7386: --- Attachment: HBASE-7386-src.patch HBASE-7386-conf.patch HBASE-7386-bin.patch Investigate providing some supervisor support for znode deletion Key: HBASE-7386 URL: https://issues.apache.org/jira/browse/HBASE-7386 Project: HBase Issue Type: Task Components: master, regionserver, scripts Reporter: Gregory Chanan Assignee: stack Priority: Blocker Attachments: HBASE-7386-bin.patch, HBASE-7386-conf.patch, HBASE-7386-src.patch, HBASE-7386-v0.patch, supervisordconfigs-v0.patch There a couple of JIRAs for deleting the znode on a process failure: HBASE-5844 (RS) HBASE-5926 (Master) which are pretty neat; on process failure, they delete the znode of the underlying process so HBase can recover faster. These JIRAs were implemented via the startup scripts; i.e. the script hangs around and waits for the process to exit, then deletes the znode. There are a few problems associated with this approach, as listed in the below JIRAs: 1) Hides startup output in script https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463401page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463401 2) two hbase processes listed per launched daemon https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463409page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463409 3) Not run by a real supervisor https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463409page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463409 4) Weird output after kill -9 actual process in standalone mode https://issues.apache.org/jira/browse/HBASE-5926?focusedCommentId=13506801page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13506801 5) Can kill existing RS if called again https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463401page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463401 6) Hides stdout/stderr[6] https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13506832page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13506832 I suspect running in via something like supervisor.d can solve these issues if we provide the right support. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-7386) Investigate providing some supervisor support for znode deletion
[ https://issues.apache.org/jira/browse/HBASE-7386?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13856860#comment-13856860 ] Samir Ahmic commented on HBASE-7386: Based on what was discussed here i have created set of scripts and configs that support running hbase processes under python supervisor control. I did not touch any scrips from bin directory as i see this as optional solution for managing hbase processes. Only change in hbase source is adding 'autorestart' option in HMasterCommandLine.java. Scripts does not require any config changes or additional env variables. All testing was done on 0.96 branch and partially on 0.94 using supervisor 3.0 version, OS was Centos 5.8 and 5.10. All suggestions and critics are welcome:) Cheers Investigate providing some supervisor support for znode deletion Key: HBASE-7386 URL: https://issues.apache.org/jira/browse/HBASE-7386 Project: HBase Issue Type: Task Components: master, regionserver, scripts Reporter: Gregory Chanan Assignee: stack Priority: Blocker Attachments: HBASE-7386-bin.patch, HBASE-7386-conf.patch, HBASE-7386-src.patch, HBASE-7386-v0.patch, supervisordconfigs-v0.patch There a couple of JIRAs for deleting the znode on a process failure: HBASE-5844 (RS) HBASE-5926 (Master) which are pretty neat; on process failure, they delete the znode of the underlying process so HBase can recover faster. These JIRAs were implemented via the startup scripts; i.e. the script hangs around and waits for the process to exit, then deletes the znode. There are a few problems associated with this approach, as listed in the below JIRAs: 1) Hides startup output in script https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463401page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463401 2) two hbase processes listed per launched daemon https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463409page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463409 3) Not run by a real supervisor https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463409page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463409 4) Weird output after kill -9 actual process in standalone mode https://issues.apache.org/jira/browse/HBASE-5926?focusedCommentId=13506801page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13506801 5) Can kill existing RS if called again https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13463401page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13463401 6) Hides stdout/stderr[6] https://issues.apache.org/jira/browse/HBASE-5844?focusedCommentId=13506832page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13506832 I suspect running in via something like supervisor.d can solve these issues if we provide the right support. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10229) Support OperationAttributes in Increment and Append in Shell
[ https://issues.apache.org/jira/browse/HBASE-10229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13856865#comment-13856865 ] ramkrishna.s.vasudevan commented on HBASE-10229: I plan to commit this if all are fine with this. This would help me to give the final patch for HBASE-10228. I have it but thought it would be better if I update it after this goes in. Support OperationAttributes in Increment and Append in Shell Key: HBASE-10229 URL: https://issues.apache.org/jira/browse/HBASE-10229 Project: HBase Issue Type: Improvement Components: shell Affects Versions: 0.98.0 Reporter: ramkrishna.s.vasudevan Assignee: ramkrishna.s.vasudevan Attachments: HBASE-10229_1.patch -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10147) Canary additions
[ https://issues.apache.org/jira/browse/HBASE-10147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gustavo Anatoly updated HBASE-10147: Attachment: HBASE-10147.patch Hi, Stack. Could you please review the patch? Thanks. Canary additions Key: HBASE-10147 URL: https://issues.apache.org/jira/browse/HBASE-10147 Project: HBase Issue Type: Improvement Reporter: stack Assignee: Gustavo Anatoly Attachments: HBASE-10147.patch I've been using the canary to quickly identify the dodgy machine in my cluster. It is useful for this. What would make it better would be: + Rather than saying how long it took to get a region after you have gotten the region, it'd be sweet to log BEFORE you went to get the region the regionname and the server it is on. I ask for this because as is, I have to wait for the canary to timeout which can be a while. + Second ask is that when I pass the -t, that when it fails, it says what it failed against -- what region and hopefully what server location (might be hard). -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10169) Batch coprocessor
[ https://issues.apache.org/jira/browse/HBASE-10169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13856889#comment-13856889 ] Jingcheng Du commented on HBASE-10169: -- [~ghelmling], [~apurtell]. Hi Gary, Andy. It's a good idea to return the CoprocessorServiceResponse in a batch without combining in the server side. But it's very difficult to implement the coprocessor transparently. In the HTable, we have the method public T extends Service, R Mapbyte[],R coprocessorService(final ClassT service, byte[] startKey, byte[] endKey, final Batch.CallT,R callable). The Batch.call allows users to decide how to execute the coprocessor. final RegionCoprocessorRpcChannel channel = new RegionCoprocessorRpcChannel(connection, tableName, r); T instance = ProtobufUtil.newServiceStub(service, channel); R result = callable.call(instance); The RegionCoprocessorRpcChannel executes the HRegionServer.execService against a single region each time. If we use a RegionServerCoprocessorRpcChannel to directly invoke the HRegionServer.execMultiService() instead, the returned value of the callable.call(instance) should be ListR, which is not right. So it's difficult to have the batch execution for coprocessor service.in a transparent way. Maybe we could add a new API for the HTable, like public T extends Service, R Mapbyte[],R coprocessorService(final ClassT service, Descriptors.MethodDescriptor method, byte[] startKey, byte[] endKey) to realize the batch coprocessor execution. But users are not allowed to decide how to execute the coprocessor since the Batch.Call is not in the arguments any more. How do you think about this? Please help advise. Thanks! Batch coprocessor - Key: HBASE-10169 URL: https://issues.apache.org/jira/browse/HBASE-10169 Project: HBase Issue Type: Sub-task Components: Coprocessors Affects Versions: 0.99.0 Reporter: Jingcheng Du Assignee: Jingcheng Du Attachments: Batch Coprocessor Design Document.docx, HBASE-10169.patch This is designed to improve the coprocessor invocation in the client side. Currently the coprocessor invocation is to send a call to each region. If there’s one region server, and 100 regions are located in this server, each coprocessor invocation will send 100 calls, each call uses a single thread in the client side. The threads will run out soon when the coprocessor invocations are heavy. In this design, all the calls to the same region server will be grouped into one in a single coprocessor invocation. This call will be spread into each region in the server side, and the results will be merged ahead in the server side before being returned to the client. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10147) Canary additions
[ https://issues.apache.org/jira/browse/HBASE-10147?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gustavo Anatoly updated HBASE-10147: Status: Patch Available (was: Open) Canary additions Key: HBASE-10147 URL: https://issues.apache.org/jira/browse/HBASE-10147 Project: HBase Issue Type: Improvement Reporter: stack Assignee: Gustavo Anatoly Attachments: HBASE-10147.patch I've been using the canary to quickly identify the dodgy machine in my cluster. It is useful for this. What would make it better would be: + Rather than saying how long it took to get a region after you have gotten the region, it'd be sweet to log BEFORE you went to get the region the regionname and the server it is on. I ask for this because as is, I have to wait for the canary to timeout which can be a while. + Second ask is that when I pass the -t, that when it fails, it says what it failed against -- what region and hopefully what server location (might be hard). -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10147) Canary additions
[ https://issues.apache.org/jira/browse/HBASE-10147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13856918#comment-13856918 ] Hadoop QA commented on HBASE-10147: --- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12620502/HBASE-10147.patch against trunk revision . ATTACHMENT ID: 12620502 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 hadoop1.0{color}. The patch compiles against the hadoop 1.0 profile. {color:green}+1 hadoop1.1{color}. The patch compiles against the hadoop 1.1 profile. {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100 {color:red}-1 site{color}. The patch appears to cause mvn site goal to fail. {color:red}-1 core tests{color}. The patch failed these unit tests: Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/8284//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8284//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8284//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8284//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8284//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8284//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8284//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8284//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8284//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8284//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/8284//console This message is automatically generated. Canary additions Key: HBASE-10147 URL: https://issues.apache.org/jira/browse/HBASE-10147 Project: HBase Issue Type: Improvement Reporter: stack Assignee: Gustavo Anatoly Attachments: HBASE-10147.patch I've been using the canary to quickly identify the dodgy machine in my cluster. It is useful for this. What would make it better would be: + Rather than saying how long it took to get a region after you have gotten the region, it'd be sweet to log BEFORE you went to get the region the regionname and the server it is on. I ask for this because as is, I have to wait for the canary to timeout which can be a while. + Second ask is that when I pass the -t, that when it fails, it says what it failed against -- what region and hopefully what server location (might be hard). -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Created] (HBASE-10239) Improve determinism and debugability of TestAccessController
Andrew Purtell created HBASE-10239: -- Summary: Improve determinism and debugability of TestAccessController Key: HBASE-10239 URL: https://issues.apache.org/jira/browse/HBASE-10239 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Fix For: 0.98.0, 0.99.0 Separate grant and revoke API invocations to static helper methods in SecureUtils (rename to SecureTestUtils). Wait for permissions cache updates using a Predicate. Log the API calls, state checks, and waits. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Assigned] (HBASE-10239) Improve determinism and debugability of TestAccessController
[ https://issues.apache.org/jira/browse/HBASE-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell reassigned HBASE-10239: -- Assignee: Andrew Purtell Improve determinism and debugability of TestAccessController Key: HBASE-10239 URL: https://issues.apache.org/jira/browse/HBASE-10239 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Separate grant and revoke API invocations to static helper methods in SecureUtils (rename to SecureTestUtils). Wait for permissions cache updates using a Predicate. Log the API calls, state checks, and waits. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-6104) Require EXEC permission to call coprocessor endpoints
[ https://issues.apache.org/jira/browse/HBASE-6104?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13856945#comment-13856945 ] Andrew Purtell commented on HBASE-6104: --- TestAccessController improvements filed as HBASE-10239 Require EXEC permission to call coprocessor endpoints - Key: HBASE-6104 URL: https://issues.apache.org/jira/browse/HBASE-6104 Project: HBase Issue Type: New Feature Components: Coprocessors, security Reporter: Gary Helmling Assignee: Andrew Purtell Fix For: 0.99.0 Attachments: 6104-addendum-1.patch, 6104-revert.patch, 6104.patch, 6104.patch, 6104.patch, 6104.patch, 6104.patch The EXEC action currently exists as only a placeholder in access control. It should really be used to enforce access to coprocessor endpoint RPC calls, which are currently unrestricted. How the ACLs to support this would be modeled deserves some discussion: * Should access be scoped to a specific table and CoprocessorProtocol extension? * Should it be possible to grant access to a CoprocessorProtocol implementation globally (regardless of table)? * Are per-method restrictions necessary? * Should we expose hooks available to endpoint implementors so that they could additionally apply their own permission checks? Some CP endpoints may want to require READ permissions, others may want to enforce WRITE, or READ + WRITE. To apply these kinds of checks we would also have to extend the RegionObserver interface to provide hooks wrapping HRegion.exec(). -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10239) Improve determinism and debugability of TestAccessController
[ https://issues.apache.org/jira/browse/HBASE-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-10239: --- Description: Separate grant and revoke API invocations to static helper methods in SecureTestUtils. Wait for permissions cache updates using a Predicate. Log the API calls, state checks, and waits. (was: Separate grant and revoke API invocations to static helper methods in SecureUtils (rename to SecureTestUtils). Wait for permissions cache updates using a Predicate. Log the API calls, state checks, and waits. ) Improve determinism and debugability of TestAccessController Key: HBASE-10239 URL: https://issues.apache.org/jira/browse/HBASE-10239 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Separate grant and revoke API invocations to static helper methods in SecureTestUtils. Wait for permissions cache updates using a Predicate. Log the API calls, state checks, and waits. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Created] (HBASE-10240) Remove 0.94-0.96 migration code
Andrew Purtell created HBASE-10240: -- Summary: Remove 0.94-0.96 migration code Key: HBASE-10240 URL: https://issues.apache.org/jira/browse/HBASE-10240 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Remove the objects and code only needed for supporting migration to 0.96 from 0.94. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10228) Support setCellVisibility and setAuthorizations in Shell
[ https://issues.apache.org/jira/browse/HBASE-10228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ramkrishna.s.vasudevan updated HBASE-10228: --- Status: Patch Available (was: Open) Support setCellVisibility and setAuthorizations in Shell Key: HBASE-10228 URL: https://issues.apache.org/jira/browse/HBASE-10228 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0 Reporter: ramkrishna.s.vasudevan Assignee: ramkrishna.s.vasudevan Fix For: 0.98.0, 0.99.0 Attachments: HBASE-10228.patch Currently the shell scripts supports attributes but not the new apis in the Query class. So this JIRA is to support them thro shell. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10228) Support setCellVisibility and setAuthorizations in Shell
[ https://issues.apache.org/jira/browse/HBASE-10228?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ramkrishna.s.vasudevan updated HBASE-10228: --- Attachment: HBASE-10228.patch Attaching the patch here. After HBASE-10229 gets committed will update this patch once again. Support setCellVisibility and setAuthorizations in Shell Key: HBASE-10228 URL: https://issues.apache.org/jira/browse/HBASE-10228 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0 Reporter: ramkrishna.s.vasudevan Assignee: ramkrishna.s.vasudevan Fix For: 0.98.0, 0.99.0 Attachments: HBASE-10228.patch Currently the shell scripts supports attributes but not the new apis in the Query class. So this JIRA is to support them thro shell. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-8763) [BRAINSTORM] Combine MVCC and SeqId
[ https://issues.apache.org/jira/browse/HBASE-8763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13856987#comment-13856987 ] Sergey Shelukhin commented on HBASE-8763: - Any update on this? Resolving this confusion would help greatly. [BRAINSTORM] Combine MVCC and SeqId --- Key: HBASE-8763 URL: https://issues.apache.org/jira/browse/HBASE-8763 Project: HBase Issue Type: Improvement Components: regionserver Reporter: Enis Soztutar Attachments: hbase-8736-poc.patch, hbase-8763_wip1.patch HBASE-8701 and a lot of recent issues include good discussions about mvcc + seqId semantics. It seems that having mvcc and the seqId complicates the comparator semantics a lot in regards to flush + WAL replay + compactions + delete markers and out of order puts. Thinking more about it I don't think we need a MVCC write number which is different than the seqId. We can keep the MVCC semantics, read point and smallest read points intact, but combine mvcc write number and seqId. This will allow cleaner semantics + implementation + smaller data files. We can do some brainstorming for 0.98. We still have to verify that this would be semantically correct, it should be so by my current understanding. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Created] (HBASE-10241) implement mvcc-consistent scanners (across recovery)
Sergey Shelukhin created HBASE-10241: Summary: implement mvcc-consistent scanners (across recovery) Key: HBASE-10241 URL: https://issues.apache.org/jira/browse/HBASE-10241 Project: HBase Issue Type: New Feature Components: HFile, regionserver, Scanners Affects Versions: 0.99.0 Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Scanners currently use mvcc for consistency. However, mvcc is lost on server restart, or even a region move. This JIRA is to enable the scanners to transfer mvcc (or seqId, or some other number, see HBASE-8763) between servers. First, client scanner needs to get and store the readpoint. Second, mvcc needs to be preserved in WAL. Third, the mvcc needs to be stored in store files per KV and discarded when not needed. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Created] (HBASE-10242) client-side mvcc tracking in scanners
Sergey Shelukhin created HBASE-10242: Summary: client-side mvcc tracking in scanners Key: HBASE-10242 URL: https://issues.apache.org/jira/browse/HBASE-10242 Project: HBase Issue Type: Sub-task Reporter: Sergey Shelukhin Scanners should be able to track mvcc read point and send it to server. This is a subtask, so server can use this mvcc as best it can, but doesn't actually have to guarantee it within the scope of this jira. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Created] (HBASE-10243) store mvcc in WAL
Sergey Shelukhin created HBASE-10243: Summary: store mvcc in WAL Key: HBASE-10243 URL: https://issues.apache.org/jira/browse/HBASE-10243 Project: HBase Issue Type: Sub-task Reporter: Sergey Shelukhin Priority: Minor mvcc needs to be stored in WAL. Right now seqId is already stored, so if they are combined, it would be removed or deprecated. Might also happen before this jira. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Created] (HBASE-10244) store mvcc in store files per KV, discard during compactions based on scanner timeout
Sergey Shelukhin created HBASE-10244: Summary: store mvcc in store files per KV, discard during compactions based on scanner timeout Key: HBASE-10244 URL: https://issues.apache.org/jira/browse/HBASE-10244 Project: HBase Issue Type: Sub-task Components: Compaction, regionserver Reporter: Sergey Shelukhin Initially, we can store via KV tag. For perf reasons, it might make sense to make it a first-class member of KV, but that would require HFileV4 -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10244) store mvcc in store files per KV for longer time, discard during compactions based on scanner timeout
[ https://issues.apache.org/jira/browse/HBASE-10244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-10244: - Summary: store mvcc in store files per KV for longer time, discard during compactions based on scanner timeout (was: store mvcc in store files per KV, discard during compactions based on scanner timeout) store mvcc in store files per KV for longer time, discard during compactions based on scanner timeout - Key: HBASE-10244 URL: https://issues.apache.org/jira/browse/HBASE-10244 Project: HBase Issue Type: Sub-task Components: Compaction, regionserver Reporter: Sergey Shelukhin -Initially, we can store via KV tag. For perf reasons, it might make sense to make it a first-class member of KV, but that would require HFileV4- -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10244) store mvcc in store files per KV, discard during compactions based on scanner timeout
[ https://issues.apache.org/jira/browse/HBASE-10244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857007#comment-13857007 ] Sergey Shelukhin commented on HBASE-10244: -- Hmm, the first part is already done (comment in KeyValue for mvcc field is wrong), so we only need to extend it store mvcc in store files per KV, discard during compactions based on scanner timeout - Key: HBASE-10244 URL: https://issues.apache.org/jira/browse/HBASE-10244 Project: HBase Issue Type: Sub-task Components: Compaction, regionserver Reporter: Sergey Shelukhin Initially, we can store via KV tag. For perf reasons, it might make sense to make it a first-class member of KV, but that would require HFileV4 -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10244) store mvcc in store files per KV, discard during compactions based on scanner timeout
[ https://issues.apache.org/jira/browse/HBASE-10244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-10244: - Description: -Initially, we can store via KV tag. For perf reasons, it might make sense to make it a first-class member of KV, but that would require HFileV4- (was: Initially, we can store via KV tag. For perf reasons, it might make sense to make it a first-class member of KV, but that would require HFileV4) store mvcc in store files per KV, discard during compactions based on scanner timeout - Key: HBASE-10244 URL: https://issues.apache.org/jira/browse/HBASE-10244 Project: HBase Issue Type: Sub-task Components: Compaction, regionserver Reporter: Sergey Shelukhin -Initially, we can store via KV tag. For perf reasons, it might make sense to make it a first-class member of KV, but that would require HFileV4- -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10241) implement mvcc-consistent scanners (across recovery)
[ https://issues.apache.org/jira/browse/HBASE-10241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-10241: - Attachment: Consistent scanners.pdf One-pager implement mvcc-consistent scanners (across recovery) Key: HBASE-10241 URL: https://issues.apache.org/jira/browse/HBASE-10241 Project: HBase Issue Type: New Feature Components: HFile, regionserver, Scanners Affects Versions: 0.99.0 Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Attachments: Consistent scanners.pdf Scanners currently use mvcc for consistency. However, mvcc is lost on server restart, or even a region move. This JIRA is to enable the scanners to transfer mvcc (or seqId, or some other number, see HBASE-8763) between servers. First, client scanner needs to get and store the readpoint. Second, mvcc needs to be preserved in WAL. Third, the mvcc needs to be stored in store files per KV and discarded when not needed. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10240) Remove 0.94-0.96 migration code
[ https://issues.apache.org/jira/browse/HBASE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857027#comment-13857027 ] Lars Hofhansl commented on HBASE-10240: --- Are we saying one can only upgrade from 0.94 to 0.96 (but not to 0.98+)? Remove 0.94-0.96 migration code Key: HBASE-10240 URL: https://issues.apache.org/jira/browse/HBASE-10240 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Remove the objects and code only needed for supporting migration to 0.96 from 0.94. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10239) Improve determinism and debugability of TestAccessController
[ https://issues.apache.org/jira/browse/HBASE-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-10239: --- Attachment: wip-10239.patch Something like this, attaching work in progress patch. Only updates a few tests, need to move the rest over. Improve determinism and debugability of TestAccessController Key: HBASE-10239 URL: https://issues.apache.org/jira/browse/HBASE-10239 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Attachments: wip-10239.patch Separate grant and revoke API invocations to static helper methods in SecureTestUtils. Wait for permissions cache updates using a Predicate. Log the API calls, state checks, and waits. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10244) store mvcc in store files per KV for longer time, discard during compactions based on scanner timeout
[ https://issues.apache.org/jira/browse/HBASE-10244?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857029#comment-13857029 ] Lars Hofhansl commented on HBASE-10244: --- Came here to say that. The memstoreTS is already stored in the HFiles and removed as soon as possible during a compaction (see HBASE-8166). store mvcc in store files per KV for longer time, discard during compactions based on scanner timeout - Key: HBASE-10244 URL: https://issues.apache.org/jira/browse/HBASE-10244 Project: HBase Issue Type: Sub-task Components: Compaction, regionserver Reporter: Sergey Shelukhin -Initially, we can store via KV tag. For perf reasons, it might make sense to make it a first-class member of KV, but that would require HFileV4- -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10239) Improve determinism and debugability of TestAccessController
[ https://issues.apache.org/jira/browse/HBASE-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-10239: --- Attachment: (was: wip-10239.patch) Improve determinism and debugability of TestAccessController Key: HBASE-10239 URL: https://issues.apache.org/jira/browse/HBASE-10239 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Attachments: wip-10239.patch Separate grant and revoke API invocations to static helper methods in SecureTestUtils. Wait for permissions cache updates using a Predicate. Log the API calls, state checks, and waits. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10239) Improve determinism and debugability of TestAccessController
[ https://issues.apache.org/jira/browse/HBASE-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-10239: --- Attachment: wip-10239.patch Improve determinism and debugability of TestAccessController Key: HBASE-10239 URL: https://issues.apache.org/jira/browse/HBASE-10239 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Attachments: wip-10239.patch Separate grant and revoke API invocations to static helper methods in SecureTestUtils. Wait for permissions cache updates using a Predicate. Log the API calls, state checks, and waits. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10239) Improve determinism and debugability of TestAccessController
[ https://issues.apache.org/jira/browse/HBASE-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-10239: --- Attachment: wip-10239.patch Improve determinism and debugability of TestAccessController Key: HBASE-10239 URL: https://issues.apache.org/jira/browse/HBASE-10239 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Attachments: wip-10239.patch Separate grant and revoke API invocations to static helper methods in SecureTestUtils. Wait for permissions cache updates using a Predicate. Log the API calls, state checks, and waits. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10239) Improve determinism and debugability of TestAccessController
[ https://issues.apache.org/jira/browse/HBASE-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-10239: --- Attachment: (was: wip-10239.patch) Improve determinism and debugability of TestAccessController Key: HBASE-10239 URL: https://issues.apache.org/jira/browse/HBASE-10239 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Attachments: wip-10239.patch Separate grant and revoke API invocations to static helper methods in SecureTestUtils. Wait for permissions cache updates using a Predicate. Log the API calls, state checks, and waits. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10228) Support setCellVisibility and setAuthorizations in Shell
[ https://issues.apache.org/jira/browse/HBASE-10228?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857032#comment-13857032 ] Hadoop QA commented on HBASE-10228: --- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12620520/HBASE-10228.patch against trunk revision . ATTACHMENT ID: 12620520 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 3 new or modified tests. {color:green}+1 hadoop1.0{color}. The patch compiles against the hadoop 1.0 profile. {color:green}+1 hadoop1.1{color}. The patch compiles against the hadoop 1.1 profile. {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100 {color:red}-1 site{color}. The patch appears to cause mvn site goal to fail. {color:red}-1 core tests{color}. The patch failed these unit tests: org.apache.hadoop.hbase.util.TestHBaseFsck Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/8285//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8285//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8285//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8285//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8285//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8285//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8285//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8285//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8285//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8285//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/8285//console This message is automatically generated. Support setCellVisibility and setAuthorizations in Shell Key: HBASE-10228 URL: https://issues.apache.org/jira/browse/HBASE-10228 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0 Reporter: ramkrishna.s.vasudevan Assignee: ramkrishna.s.vasudevan Fix For: 0.98.0, 0.99.0 Attachments: HBASE-10228.patch Currently the shell scripts supports attributes but not the new apis in the Query class. So this JIRA is to support them thro shell. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Assigned] (HBASE-10242) client-side mvcc tracking in scanners
[ https://issues.apache.org/jira/browse/HBASE-10242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin reassigned HBASE-10242: Assignee: Sergey Shelukhin client-side mvcc tracking in scanners - Key: HBASE-10242 URL: https://issues.apache.org/jira/browse/HBASE-10242 Project: HBase Issue Type: Sub-task Components: HFile, regionserver, Scanners Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Scanners should be able to track mvcc read point and send it to server. This is a subtask, so server can use this mvcc as best it can, but doesn't actually have to guarantee it within the scope of this jira. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10244) store mvcc in store files per KV for longer time, discard during compactions based on scanner timeout
[ https://issues.apache.org/jira/browse/HBASE-10244?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-10244: - Priority: Minor (was: Major) store mvcc in store files per KV for longer time, discard during compactions based on scanner timeout - Key: HBASE-10244 URL: https://issues.apache.org/jira/browse/HBASE-10244 Project: HBase Issue Type: Sub-task Components: Compaction, regionserver Reporter: Sergey Shelukhin Priority: Minor -Initially, we can store via KV tag. For perf reasons, it might make sense to make it a first-class member of KV, but that would require HFileV4- -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10210) during master startup, RS can be you-are-dead-ed by master in error
[ https://issues.apache.org/jira/browse/HBASE-10210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857037#comment-13857037 ] Sergey Shelukhin commented on HBASE-10210: -- ping? [~enis] do you want to review? during master startup, RS can be you-are-dead-ed by master in error --- Key: HBASE-10210 URL: https://issues.apache.org/jira/browse/HBASE-10210 Project: HBase Issue Type: Bug Affects Versions: 0.98.0, 0.96.1, 0.99.0, 0.96.1.1 Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Attachments: HBASE-10210.patch Not sure of the root cause yet, I am at how did this ever work stage. We see this problem in 0.96.1, but didn't in 0.96.0 + some patches. It looks like RS information arriving from 2 sources - ZK and server itself, can conflict. Master doesn't handle such cases (timestamp match), and anyway technically timestamps can collide for two separate servers. So, master YouAreDead-s the already-recorded reporting RS, and adds it too. Then it discovers that the new server has died with fatal error! Note the threads. Addition is called from master initialization and from RPC. {noformat} 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Finished waiting for region servers count to settle; checked in 2, slept for 18262 ms, expecting minimum of 1, maximum of 2147483647, master is running. 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Registering server=h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.HMaster: Registered server found up in zk but who has not yet reported in: h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Triggering server recovery; existingServer h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 looks stale, new server:h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Master doesn't enable ServerShutdownHandler during initialization, delay expiring server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 ... 2013-12-19 11:16:46,925 ERROR [RpcServer.handler=7,port=6] master.HMaster: Region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 reported a fatal error: ABORTING region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800: org.apache.hadoop.hbase.YouAreDeadException: Server REPORT rejected; currently processing h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 as dead server {noformat} Presumably some of the recent ZK listener related changes b -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10240) Remove 0.94-0.96 migration code
[ https://issues.apache.org/jira/browse/HBASE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857069#comment-13857069 ] Andrew Purtell commented on HBASE-10240: Previously we have upgraded users through each major release, but yeah with 0.98 coming quickly after 0.96 I can see a direct migration from 0.94 to 0.98 being convenient. I have no strong opinion either way. How long do we keep migration support for 0.94? Remove 0.94-0.96 migration code Key: HBASE-10240 URL: https://issues.apache.org/jira/browse/HBASE-10240 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Remove the objects and code only needed for supporting migration to 0.96 from 0.94. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10240) Remove 0.94-0.96 migration code
[ https://issues.apache.org/jira/browse/HBASE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857076#comment-13857076 ] Lars Hofhansl commented on HBASE-10240: --- With the Apache hat on, I think we want to foster upgrading to new version. With the Salesforce hat on, we plan to upgrade from 0.94 to a 1.x version in 2014. If the upgrade code is (1) not getting in the way or (2) hard to maintain, I'd vote to keep it in at least until one of these becomes true. Remove 0.94-0.96 migration code Key: HBASE-10240 URL: https://issues.apache.org/jira/browse/HBASE-10240 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Remove the objects and code only needed for supporting migration to 0.96 from 0.94. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10210) during master startup, RS can be you-are-dead-ed by master in error
[ https://issues.apache.org/jira/browse/HBASE-10210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857109#comment-13857109 ] Enis Soztutar commented on HBASE-10210: --- It seems that we still have two different mechanisms to track online region servers. RegionServerTracker via zk, and ServerManager via regionServerStartup + regionServerReport. What if we do not accept a region server report unless we processed the zk notification. Would that work? We can ignore the region server report until we get the zk notification. We can stop registering the RS on region server startup or region server reports. Only RSTracker can register new region servers. ServerManager can use the same list from RST? during master startup, RS can be you-are-dead-ed by master in error --- Key: HBASE-10210 URL: https://issues.apache.org/jira/browse/HBASE-10210 Project: HBase Issue Type: Bug Affects Versions: 0.98.0, 0.96.1, 0.99.0, 0.96.1.1 Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Attachments: HBASE-10210.patch Not sure of the root cause yet, I am at how did this ever work stage. We see this problem in 0.96.1, but didn't in 0.96.0 + some patches. It looks like RS information arriving from 2 sources - ZK and server itself, can conflict. Master doesn't handle such cases (timestamp match), and anyway technically timestamps can collide for two separate servers. So, master YouAreDead-s the already-recorded reporting RS, and adds it too. Then it discovers that the new server has died with fatal error! Note the threads. Addition is called from master initialization and from RPC. {noformat} 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Finished waiting for region servers count to settle; checked in 2, slept for 18262 ms, expecting minimum of 1, maximum of 2147483647, master is running. 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Registering server=h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.HMaster: Registered server found up in zk but who has not yet reported in: h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Triggering server recovery; existingServer h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 looks stale, new server:h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Master doesn't enable ServerShutdownHandler during initialization, delay expiring server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 ... 2013-12-19 11:16:46,925 ERROR [RpcServer.handler=7,port=6] master.HMaster: Region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 reported a fatal error: ABORTING region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800: org.apache.hadoop.hbase.YouAreDeadException: Server REPORT rejected; currently processing h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 as dead server {noformat} Presumably some of the recent ZK listener related changes b -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10240) Remove 0.94-0.96 migration code
[ https://issues.apache.org/jira/browse/HBASE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857112#comment-13857112 ] Enis Soztutar commented on HBASE-10240: --- I would prefer to keep the option of 0.94 - 0.98 available, which would ease out the user's burden. 0.96 and 0.98 code bases are not that different, so keeping that in makes managing the patches easier as well I think. Remove 0.94-0.96 migration code Key: HBASE-10240 URL: https://issues.apache.org/jira/browse/HBASE-10240 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Remove the objects and code only needed for supporting migration to 0.96 from 0.94. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-8558) Add timeout limit for HBaseClient dataOutputStream
[ https://issues.apache.org/jira/browse/HBASE-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857117#comment-13857117 ] Enis Soztutar commented on HBASE-8558: -- Does this ONLY affect 0.94? We should not commit to 0.94 only unless the other branches are not affected. Could you please provide trunk, 0.96 and 0.98 patches as well. Add timeout limit for HBaseClient dataOutputStream -- Key: HBASE-8558 URL: https://issues.apache.org/jira/browse/HBASE-8558 Project: HBase Issue Type: Bug Components: Client Affects Versions: 0.94.5, 0.94.14 Reporter: wanbin Assignee: Liang Xie Fix For: 0.94.16 Attachments: HBASE-8558-0.94-security.txt, HBASE-8558-0.94.txt I run jstack at client host. The result is below. hbase-tablepool-60-thread-34 daemon prio=10 tid=0x7f1e65a48000 nid=0x5173 runnable [0x579cc000] java.lang.Thread.State: RUNNABLE at sun.nio.ch.EPollArrayWrapper.epollWait(Native Method) at sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:210) at sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:65) at sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69) - locked 0x000758cb0780 (a sun.nio.ch.Util$2) - locked 0x000758cb0770 (a java.util.Collections$UnmodifiableSet) - locked 0x000758cb0548 (a sun.nio.ch.EPollSelectorImpl) at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80) at org.apache.hadoop.net.SocketIOWithTimeout$SelectorPool.select(SocketIOWithTimeout.java:336) at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:158) at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:153) at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:114) at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:65) at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123) - locked 0x000754e978a0 (a java.io.BufferedOutputStream) at java.io.DataOutputStream.flush(DataOutputStream.java:106) at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.sendParam(HBaseClient.java:620) - locked 0x000754e97880 (a java.io.DataOutputStream) at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:975) at org.apache.hadoop.hbase.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:86) at $Proxy13.multi(Unknown Source) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1395) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1393) at org.apache.hadoop.hbase.client.ServerCallable.withoutRetries(ServerCallable.java:210) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1402) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1390) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at java.util.concurrent.FutureTask.run(FutureTask.java:138) at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908) at java.lang.Thread.run(Thread.java:662) This thread have hung for one hours Meanwhile other thread try to close connection IPC Client (1983049639) connection to dump002030.cm6.tbsite.net/10.246.2.30:30020 from admin daemon prio=10 tid=0x7f1e70674800 nid=0x3d76 waiting for monitor entry [0x4bc0f000] java.lang.Thread.State: BLOCKED (on object monitor) at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123) - waiting to lock 0x000754e978a0 (a java.io.BufferedOutputStream) at java.io.DataOutputStream.flush(DataOutputStream.java:106) at java.io.FilterOutputStream.close(FilterOutputStream.java:140) at org.apache.hadoop.io.IOUtils.cleanup(IOUtils.java:237) at org.apache.hadoop.io.IOUtils.closeStream(IOUtils.java:254) at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.close(HBaseClient.java:715) - locked 0x000754e7b818 (a org.apache.hadoop.hbase.ipc.HBaseClient$Connection) at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.run(HBaseClient.java:587) dump002030.cm6.tbsite.net is dead regionserver. I read hbase sourececode, discover connection.out doesn't set timeout this.out = new DataOutputStream (new
[jira] [Commented] (HBASE-10175) 2-thread ChaosMonkey steps on its own toes
[ https://issues.apache.org/jira/browse/HBASE-10175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857120#comment-13857120 ] Enis Soztutar commented on HBASE-10175: --- +1. 2-thread ChaosMonkey steps on its own toes -- Key: HBASE-10175 URL: https://issues.apache.org/jira/browse/HBASE-10175 Project: HBase Issue Type: Improvement Components: test Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Priority: Minor Attachments: HBASE-10175.patch ChaosMonkey with one destructive and one volatility (flush-compact-split-etc.) threads steps on its own toes and logs a lot of exceptions. A simple solution would be to catch most (or all), like NotServingRegionException, and log less (not a full callstack for example, it's not very useful anyway). A more complicated/complementary one would be to keep track which regions the destructive thread affects and use other regions for volatile one. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10240) Remove 0.94-0.96 migration code
[ https://issues.apache.org/jira/browse/HBASE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857123#comment-13857123 ] Himanshu Vashishtha commented on HBASE-10240: - Yeah, I agree with keeping this code in 0.98 as it is getting released so soon after 0.96.0. Otherwise, it will add one extra step for users who wish to upgrade from 0.94 to 0.98. Remove 0.94-0.96 migration code Key: HBASE-10240 URL: https://issues.apache.org/jira/browse/HBASE-10240 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Remove the objects and code only needed for supporting migration to 0.96 from 0.94. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10210) during master startup, RS can be you-are-dead-ed by master in error
[ https://issues.apache.org/jira/browse/HBASE-10210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857134#comment-13857134 ] Sergey Shelukhin commented on HBASE-10210: -- What advantage would that give? That's another source of state which becomes important if we do this during master startup, RS can be you-are-dead-ed by master in error --- Key: HBASE-10210 URL: https://issues.apache.org/jira/browse/HBASE-10210 Project: HBase Issue Type: Bug Affects Versions: 0.98.0, 0.96.1, 0.99.0, 0.96.1.1 Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Attachments: HBASE-10210.patch Not sure of the root cause yet, I am at how did this ever work stage. We see this problem in 0.96.1, but didn't in 0.96.0 + some patches. It looks like RS information arriving from 2 sources - ZK and server itself, can conflict. Master doesn't handle such cases (timestamp match), and anyway technically timestamps can collide for two separate servers. So, master YouAreDead-s the already-recorded reporting RS, and adds it too. Then it discovers that the new server has died with fatal error! Note the threads. Addition is called from master initialization and from RPC. {noformat} 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Finished waiting for region servers count to settle; checked in 2, slept for 18262 ms, expecting minimum of 1, maximum of 2147483647, master is running. 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Registering server=h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.HMaster: Registered server found up in zk but who has not yet reported in: h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Triggering server recovery; existingServer h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 looks stale, new server:h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Master doesn't enable ServerShutdownHandler during initialization, delay expiring server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 ... 2013-12-19 11:16:46,925 ERROR [RpcServer.handler=7,port=6] master.HMaster: Region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 reported a fatal error: ABORTING region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800: org.apache.hadoop.hbase.YouAreDeadException: Server REPORT rejected; currently processing h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 as dead server {noformat} Presumably some of the recent ZK listener related changes b -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10210) during master startup, RS can be you-are-dead-ed by master in error
[ https://issues.apache.org/jira/browse/HBASE-10210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857135#comment-13857135 ] Sergey Shelukhin commented on HBASE-10210: -- [~jxiang] it is not 100% safe, timestamps can collide (I am assuming you mean don't restart on equals) during master startup, RS can be you-are-dead-ed by master in error --- Key: HBASE-10210 URL: https://issues.apache.org/jira/browse/HBASE-10210 Project: HBase Issue Type: Bug Affects Versions: 0.98.0, 0.96.1, 0.99.0, 0.96.1.1 Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Attachments: HBASE-10210.patch Not sure of the root cause yet, I am at how did this ever work stage. We see this problem in 0.96.1, but didn't in 0.96.0 + some patches. It looks like RS information arriving from 2 sources - ZK and server itself, can conflict. Master doesn't handle such cases (timestamp match), and anyway technically timestamps can collide for two separate servers. So, master YouAreDead-s the already-recorded reporting RS, and adds it too. Then it discovers that the new server has died with fatal error! Note the threads. Addition is called from master initialization and from RPC. {noformat} 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Finished waiting for region servers count to settle; checked in 2, slept for 18262 ms, expecting minimum of 1, maximum of 2147483647, master is running. 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Registering server=h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.HMaster: Registered server found up in zk but who has not yet reported in: h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Triggering server recovery; existingServer h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 looks stale, new server:h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Master doesn't enable ServerShutdownHandler during initialization, delay expiring server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 ... 2013-12-19 11:16:46,925 ERROR [RpcServer.handler=7,port=6] master.HMaster: Region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 reported a fatal error: ABORTING region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800: org.apache.hadoop.hbase.YouAreDeadException: Server REPORT rejected; currently processing h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 as dead server {noformat} Presumably some of the recent ZK listener related changes b -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10175) 2-thread ChaosMonkey steps on its own toes
[ https://issues.apache.org/jira/browse/HBASE-10175?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857137#comment-13857137 ] Sergey Shelukhin commented on HBASE-10175: -- will commit today if no objections 2-thread ChaosMonkey steps on its own toes -- Key: HBASE-10175 URL: https://issues.apache.org/jira/browse/HBASE-10175 Project: HBase Issue Type: Improvement Components: test Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Priority: Minor Attachments: HBASE-10175.patch ChaosMonkey with one destructive and one volatility (flush-compact-split-etc.) threads steps on its own toes and logs a lot of exceptions. A simple solution would be to catch most (or all), like NotServingRegionException, and log less (not a full callstack for example, it's not very useful anyway). A more complicated/complementary one would be to keep track which regions the destructive thread affects and use other regions for volatile one. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10240) Remove 0.94-0.96 migration code
[ https://issues.apache.org/jira/browse/HBASE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-10240: --- Assignee: (was: Andrew Purtell) Remove 0.94-0.96 migration code Key: HBASE-10240 URL: https://issues.apache.org/jira/browse/HBASE-10240 Project: HBase Issue Type: Improvement Reporter: Andrew Purtell Fix For: 0.99.0 Remove the objects and code only needed for supporting migration to 0.96 from 0.94. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10240) Remove 0.94-0.96 migration code
[ https://issues.apache.org/jira/browse/HBASE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-10240: --- Affects Version/s: (was: 0.99.0) (was: 0.98.0) Fix Version/s: (was: 0.98.0) Ok, unscheduled from 0.98 Remove 0.94-0.96 migration code Key: HBASE-10240 URL: https://issues.apache.org/jira/browse/HBASE-10240 Project: HBase Issue Type: Improvement Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.99.0 Remove the objects and code only needed for supporting migration to 0.96 from 0.94. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10239) Improve determinism and debugability of TestAccessController
[ https://issues.apache.org/jira/browse/HBASE-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-10239: --- Status: Patch Available (was: Open) Improve determinism and debugability of TestAccessController Key: HBASE-10239 URL: https://issues.apache.org/jira/browse/HBASE-10239 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Attachments: 10239.patch, wip-10239.patch Separate grant and revoke API invocations to static helper methods in SecureTestUtils. Wait for permissions cache updates using a Predicate. Log the API calls, state checks, and waits. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10239) Improve determinism and debugability of TestAccessController
[ https://issues.apache.org/jira/browse/HBASE-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-10239: --- Attachment: 10239.patch Done. Passes JDK6 and 7 locally. Improve determinism and debugability of TestAccessController Key: HBASE-10239 URL: https://issues.apache.org/jira/browse/HBASE-10239 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Attachments: 10239.patch, wip-10239.patch Separate grant and revoke API invocations to static helper methods in SecureTestUtils. Wait for permissions cache updates using a Predicate. Log the API calls, state checks, and waits. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10210) during master startup, RS can be you-are-dead-ed by master in error
[ https://issues.apache.org/jira/browse/HBASE-10210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857142#comment-13857142 ] Enis Soztutar commented on HBASE-10210: --- bq. That's another source of state which becomes important if we do this What do you mean? The znodes from RS's are already very important for expiring the servers. We should get rid of two different lists and use a combined list from zk, no? during master startup, RS can be you-are-dead-ed by master in error --- Key: HBASE-10210 URL: https://issues.apache.org/jira/browse/HBASE-10210 Project: HBase Issue Type: Bug Affects Versions: 0.98.0, 0.96.1, 0.99.0, 0.96.1.1 Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Attachments: HBASE-10210.patch Not sure of the root cause yet, I am at how did this ever work stage. We see this problem in 0.96.1, but didn't in 0.96.0 + some patches. It looks like RS information arriving from 2 sources - ZK and server itself, can conflict. Master doesn't handle such cases (timestamp match), and anyway technically timestamps can collide for two separate servers. So, master YouAreDead-s the already-recorded reporting RS, and adds it too. Then it discovers that the new server has died with fatal error! Note the threads. Addition is called from master initialization and from RPC. {noformat} 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Finished waiting for region servers count to settle; checked in 2, slept for 18262 ms, expecting minimum of 1, maximum of 2147483647, master is running. 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Registering server=h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.HMaster: Registered server found up in zk but who has not yet reported in: h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Triggering server recovery; existingServer h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 looks stale, new server:h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Master doesn't enable ServerShutdownHandler during initialization, delay expiring server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 ... 2013-12-19 11:16:46,925 ERROR [RpcServer.handler=7,port=6] master.HMaster: Region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 reported a fatal error: ABORTING region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800: org.apache.hadoop.hbase.YouAreDeadException: Server REPORT rejected; currently processing h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 as dead server {noformat} Presumably some of the recent ZK listener related changes b -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Comment Edited] (HBASE-8558) Add timeout limit for HBaseClient dataOutputStream
[ https://issues.apache.org/jira/browse/HBASE-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857143#comment-13857143 ] Lars Hofhansl edited comment on HBASE-8558 at 12/26/13 10:42 PM: - According to Liang above, this is only a 0.94 issue. (I am usually careful labeling jiras with the correct fix targets) was (Author: lhofhansl): According to Liang above, this is only a 0.94 issue. (I am usually careful labeling jiras with the correct fix targets, so if it's only targeted to 0.94) Add timeout limit for HBaseClient dataOutputStream -- Key: HBASE-8558 URL: https://issues.apache.org/jira/browse/HBASE-8558 Project: HBase Issue Type: Bug Components: Client Affects Versions: 0.94.5, 0.94.14 Reporter: wanbin Assignee: Liang Xie Fix For: 0.94.16 Attachments: HBASE-8558-0.94-security.txt, HBASE-8558-0.94.txt I run jstack at client host. The result is below. hbase-tablepool-60-thread-34 daemon prio=10 tid=0x7f1e65a48000 nid=0x5173 runnable [0x579cc000] java.lang.Thread.State: RUNNABLE at sun.nio.ch.EPollArrayWrapper.epollWait(Native Method) at sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:210) at sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:65) at sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69) - locked 0x000758cb0780 (a sun.nio.ch.Util$2) - locked 0x000758cb0770 (a java.util.Collections$UnmodifiableSet) - locked 0x000758cb0548 (a sun.nio.ch.EPollSelectorImpl) at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80) at org.apache.hadoop.net.SocketIOWithTimeout$SelectorPool.select(SocketIOWithTimeout.java:336) at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:158) at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:153) at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:114) at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:65) at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123) - locked 0x000754e978a0 (a java.io.BufferedOutputStream) at java.io.DataOutputStream.flush(DataOutputStream.java:106) at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.sendParam(HBaseClient.java:620) - locked 0x000754e97880 (a java.io.DataOutputStream) at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:975) at org.apache.hadoop.hbase.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:86) at $Proxy13.multi(Unknown Source) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1395) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1393) at org.apache.hadoop.hbase.client.ServerCallable.withoutRetries(ServerCallable.java:210) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1402) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1390) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at java.util.concurrent.FutureTask.run(FutureTask.java:138) at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908) at java.lang.Thread.run(Thread.java:662) This thread have hung for one hours Meanwhile other thread try to close connection IPC Client (1983049639) connection to dump002030.cm6.tbsite.net/10.246.2.30:30020 from admin daemon prio=10 tid=0x7f1e70674800 nid=0x3d76 waiting for monitor entry [0x4bc0f000] java.lang.Thread.State: BLOCKED (on object monitor) at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123) - waiting to lock 0x000754e978a0 (a java.io.BufferedOutputStream) at java.io.DataOutputStream.flush(DataOutputStream.java:106) at java.io.FilterOutputStream.close(FilterOutputStream.java:140) at org.apache.hadoop.io.IOUtils.cleanup(IOUtils.java:237) at org.apache.hadoop.io.IOUtils.closeStream(IOUtils.java:254) at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.close(HBaseClient.java:715) - locked 0x000754e7b818 (a org.apache.hadoop.hbase.ipc.HBaseClient$Connection) at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.run(HBaseClient.java:587)
[jira] [Commented] (HBASE-8558) Add timeout limit for HBaseClient dataOutputStream
[ https://issues.apache.org/jira/browse/HBASE-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857143#comment-13857143 ] Lars Hofhansl commented on HBASE-8558: -- According to Liang above, this is only a 0.94 issue. (I am usually careful labeling jiras with the correct fix targets, so if it's only targeted to 0.94) Add timeout limit for HBaseClient dataOutputStream -- Key: HBASE-8558 URL: https://issues.apache.org/jira/browse/HBASE-8558 Project: HBase Issue Type: Bug Components: Client Affects Versions: 0.94.5, 0.94.14 Reporter: wanbin Assignee: Liang Xie Fix For: 0.94.16 Attachments: HBASE-8558-0.94-security.txt, HBASE-8558-0.94.txt I run jstack at client host. The result is below. hbase-tablepool-60-thread-34 daemon prio=10 tid=0x7f1e65a48000 nid=0x5173 runnable [0x579cc000] java.lang.Thread.State: RUNNABLE at sun.nio.ch.EPollArrayWrapper.epollWait(Native Method) at sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:210) at sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:65) at sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69) - locked 0x000758cb0780 (a sun.nio.ch.Util$2) - locked 0x000758cb0770 (a java.util.Collections$UnmodifiableSet) - locked 0x000758cb0548 (a sun.nio.ch.EPollSelectorImpl) at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80) at org.apache.hadoop.net.SocketIOWithTimeout$SelectorPool.select(SocketIOWithTimeout.java:336) at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:158) at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:153) at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:114) at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:65) at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123) - locked 0x000754e978a0 (a java.io.BufferedOutputStream) at java.io.DataOutputStream.flush(DataOutputStream.java:106) at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.sendParam(HBaseClient.java:620) - locked 0x000754e97880 (a java.io.DataOutputStream) at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:975) at org.apache.hadoop.hbase.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:86) at $Proxy13.multi(Unknown Source) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1395) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1393) at org.apache.hadoop.hbase.client.ServerCallable.withoutRetries(ServerCallable.java:210) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1402) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1390) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at java.util.concurrent.FutureTask.run(FutureTask.java:138) at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908) at java.lang.Thread.run(Thread.java:662) This thread have hung for one hours Meanwhile other thread try to close connection IPC Client (1983049639) connection to dump002030.cm6.tbsite.net/10.246.2.30:30020 from admin daemon prio=10 tid=0x7f1e70674800 nid=0x3d76 waiting for monitor entry [0x4bc0f000] java.lang.Thread.State: BLOCKED (on object monitor) at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123) - waiting to lock 0x000754e978a0 (a java.io.BufferedOutputStream) at java.io.DataOutputStream.flush(DataOutputStream.java:106) at java.io.FilterOutputStream.close(FilterOutputStream.java:140) at org.apache.hadoop.io.IOUtils.cleanup(IOUtils.java:237) at org.apache.hadoop.io.IOUtils.closeStream(IOUtils.java:254) at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.close(HBaseClient.java:715) - locked 0x000754e7b818 (a org.apache.hadoop.hbase.ipc.HBaseClient$Connection) at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.run(HBaseClient.java:587) dump002030.cm6.tbsite.net is dead regionserver. I read hbase sourececode, discover connection.out doesn't set timeout this.out = new DataOutputStream (new
[jira] [Updated] (HBASE-6104) Require EXEC permission to call coprocessor endpoints
[ https://issues.apache.org/jira/browse/HBASE-6104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Purtell updated HBASE-6104: -- Attachment: 6104.patch Back for more. Attaching new patch that won't apply until after HBASE-10239 goes in. New test case for EXEC permission passes several times locally on JDK6 and 7, including the specific JDK (Oracle 6u43) I managed to reproduce the Jenkins failure with. Could be (re)considered for commit to trunk. Require EXEC permission to call coprocessor endpoints - Key: HBASE-6104 URL: https://issues.apache.org/jira/browse/HBASE-6104 Project: HBase Issue Type: New Feature Components: Coprocessors, security Reporter: Gary Helmling Assignee: Andrew Purtell Fix For: 0.99.0 Attachments: 6104-addendum-1.patch, 6104-revert.patch, 6104.patch, 6104.patch, 6104.patch, 6104.patch, 6104.patch, 6104.patch The EXEC action currently exists as only a placeholder in access control. It should really be used to enforce access to coprocessor endpoint RPC calls, which are currently unrestricted. How the ACLs to support this would be modeled deserves some discussion: * Should access be scoped to a specific table and CoprocessorProtocol extension? * Should it be possible to grant access to a CoprocessorProtocol implementation globally (regardless of table)? * Are per-method restrictions necessary? * Should we expose hooks available to endpoint implementors so that they could additionally apply their own permission checks? Some CP endpoints may want to require READ permissions, others may want to enforce WRITE, or READ + WRITE. To apply these kinds of checks we would also have to extend the RegionObserver interface to provide hooks wrapping HRegion.exec(). -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10210) during master startup, RS can be you-are-dead-ed by master in error
[ https://issues.apache.org/jira/browse/HBASE-10210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857150#comment-13857150 ] Sergey Shelukhin commented on HBASE-10210: -- Current logic is to read ZK nodes at startup only if the servers don't phone in... I'll have to check the code on whether it would work if ZK has problems. Even if not; I guess we could change to read from ZK and then wait, but I am not sure what advantage it gives. The issue in this bug is purely sync code bug, not some fundamental problem with RS registration. during master startup, RS can be you-are-dead-ed by master in error --- Key: HBASE-10210 URL: https://issues.apache.org/jira/browse/HBASE-10210 Project: HBase Issue Type: Bug Affects Versions: 0.98.0, 0.96.1, 0.99.0, 0.96.1.1 Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Attachments: HBASE-10210.patch Not sure of the root cause yet, I am at how did this ever work stage. We see this problem in 0.96.1, but didn't in 0.96.0 + some patches. It looks like RS information arriving from 2 sources - ZK and server itself, can conflict. Master doesn't handle such cases (timestamp match), and anyway technically timestamps can collide for two separate servers. So, master YouAreDead-s the already-recorded reporting RS, and adds it too. Then it discovers that the new server has died with fatal error! Note the threads. Addition is called from master initialization and from RPC. {noformat} 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Finished waiting for region servers count to settle; checked in 2, slept for 18262 ms, expecting minimum of 1, maximum of 2147483647, master is running. 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Registering server=h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.HMaster: Registered server found up in zk but who has not yet reported in: h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Triggering server recovery; existingServer h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 looks stale, new server:h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Master doesn't enable ServerShutdownHandler during initialization, delay expiring server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 ... 2013-12-19 11:16:46,925 ERROR [RpcServer.handler=7,port=6] master.HMaster: Region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 reported a fatal error: ABORTING region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800: org.apache.hadoop.hbase.YouAreDeadException: Server REPORT rejected; currently processing h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 as dead server {noformat} Presumably some of the recent ZK listener related changes b -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10229) Support OperationAttributes in Increment and Append in Shell
[ https://issues.apache.org/jira/browse/HBASE-10229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857151#comment-13857151 ] Andrew Purtell commented on HBASE-10229: +1, looks good to me Support OperationAttributes in Increment and Append in Shell Key: HBASE-10229 URL: https://issues.apache.org/jira/browse/HBASE-10229 Project: HBase Issue Type: Improvement Components: shell Affects Versions: 0.98.0 Reporter: ramkrishna.s.vasudevan Assignee: ramkrishna.s.vasudevan Attachments: HBASE-10229_1.patch -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-8558) Add timeout limit for HBaseClient dataOutputStream
[ https://issues.apache.org/jira/browse/HBASE-8558?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857154#comment-13857154 ] Enis Soztutar commented on HBASE-8558: -- Sorry for false alarm. I was looking into 0.94 code apparently when I was reviewing the changes for this patch. Thanks for the notification. Add timeout limit for HBaseClient dataOutputStream -- Key: HBASE-8558 URL: https://issues.apache.org/jira/browse/HBASE-8558 Project: HBase Issue Type: Bug Components: Client Affects Versions: 0.94.5, 0.94.14 Reporter: wanbin Assignee: Liang Xie Fix For: 0.94.16 Attachments: HBASE-8558-0.94-security.txt, HBASE-8558-0.94.txt I run jstack at client host. The result is below. hbase-tablepool-60-thread-34 daemon prio=10 tid=0x7f1e65a48000 nid=0x5173 runnable [0x579cc000] java.lang.Thread.State: RUNNABLE at sun.nio.ch.EPollArrayWrapper.epollWait(Native Method) at sun.nio.ch.EPollArrayWrapper.poll(EPollArrayWrapper.java:210) at sun.nio.ch.EPollSelectorImpl.doSelect(EPollSelectorImpl.java:65) at sun.nio.ch.SelectorImpl.lockAndDoSelect(SelectorImpl.java:69) - locked 0x000758cb0780 (a sun.nio.ch.Util$2) - locked 0x000758cb0770 (a java.util.Collections$UnmodifiableSet) - locked 0x000758cb0548 (a sun.nio.ch.EPollSelectorImpl) at sun.nio.ch.SelectorImpl.select(SelectorImpl.java:80) at org.apache.hadoop.net.SocketIOWithTimeout$SelectorPool.select(SocketIOWithTimeout.java:336) at org.apache.hadoop.net.SocketIOWithTimeout.doIO(SocketIOWithTimeout.java:158) at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:153) at org.apache.hadoop.net.SocketOutputStream.write(SocketOutputStream.java:114) at java.io.BufferedOutputStream.flushBuffer(BufferedOutputStream.java:65) at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123) - locked 0x000754e978a0 (a java.io.BufferedOutputStream) at java.io.DataOutputStream.flush(DataOutputStream.java:106) at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.sendParam(HBaseClient.java:620) - locked 0x000754e97880 (a java.io.DataOutputStream) at org.apache.hadoop.hbase.ipc.HBaseClient.call(HBaseClient.java:975) at org.apache.hadoop.hbase.ipc.WritableRpcEngine$Invoker.invoke(WritableRpcEngine.java:86) at $Proxy13.multi(Unknown Source) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1395) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3$1.call(HConnectionManager.java:1393) at org.apache.hadoop.hbase.client.ServerCallable.withoutRetries(ServerCallable.java:210) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1402) at org.apache.hadoop.hbase.client.HConnectionManager$HConnectionImplementation$3.call(HConnectionManager.java:1390) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303) at java.util.concurrent.FutureTask.run(FutureTask.java:138) at java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:886) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:908) at java.lang.Thread.run(Thread.java:662) This thread have hung for one hours Meanwhile other thread try to close connection IPC Client (1983049639) connection to dump002030.cm6.tbsite.net/10.246.2.30:30020 from admin daemon prio=10 tid=0x7f1e70674800 nid=0x3d76 waiting for monitor entry [0x4bc0f000] java.lang.Thread.State: BLOCKED (on object monitor) at java.io.BufferedOutputStream.flush(BufferedOutputStream.java:123) - waiting to lock 0x000754e978a0 (a java.io.BufferedOutputStream) at java.io.DataOutputStream.flush(DataOutputStream.java:106) at java.io.FilterOutputStream.close(FilterOutputStream.java:140) at org.apache.hadoop.io.IOUtils.cleanup(IOUtils.java:237) at org.apache.hadoop.io.IOUtils.closeStream(IOUtils.java:254) at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.close(HBaseClient.java:715) - locked 0x000754e7b818 (a org.apache.hadoop.hbase.ipc.HBaseClient$Connection) at org.apache.hadoop.hbase.ipc.HBaseClient$Connection.run(HBaseClient.java:587) dump002030.cm6.tbsite.net is dead regionserver. I read hbase sourececode, discover connection.out doesn't set timeout this.out = new DataOutputStream (new
[jira] [Commented] (HBASE-9858) Integration test and LoadTestTool support for cell Visibility
[ https://issues.apache.org/jira/browse/HBASE-9858?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857156#comment-13857156 ] Andrew Purtell commented on HBASE-9858: --- In my opinion, it's ok to add command line switches to LTT for new schema features/settings or modes of operation. However, when we get down into the details of how cells should be constructed, and what tags to use, and such, it gets ugly fast. I think the existing command line flag for adding tags should be removed actually. Let's consider a new command line option that specifies a class to use for value generation. The parameter can be a colon delimited one, like -read and -write. The first element would be the name of the class and all remaining elements, if present, would be passed as String[] to the class to initialize it, e.g. {{-generator org.apache.hadoop.hbase.security.visibility.LoadTestDataGenerator}}. Integration test and LoadTestTool support for cell Visibility - Key: HBASE-9858 URL: https://issues.apache.org/jira/browse/HBASE-9858 Project: HBase Issue Type: Sub-task Components: security Affects Versions: 0.98.0 Reporter: Anoop Sam John Assignee: Anoop Sam John Fix For: 0.98.0 Attachments: HBASE-9858.patch Cell level visibility should have an integration test and LoadTestTool support. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Resolved] (HBASE-10218) Port HBASE-10142 'TestLogRolling#testLogRollOnDatanodeDeath test failure' to 0.96 branch
[ https://issues.apache.org/jira/browse/HBASE-10218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu resolved HBASE-10218. Resolution: Fixed Hadoop Flags: Reviewed Port HBASE-10142 'TestLogRolling#testLogRollOnDatanodeDeath test failure' to 0.96 branch Key: HBASE-10218 URL: https://issues.apache.org/jira/browse/HBASE-10218 Project: HBase Issue Type: Bug Reporter: Ted Yu Assignee: Ted Yu Fix For: 0.96.2 Attachments: 10218-v1.txt, 10218-v2.txt TestLogRolling#testLogRollOnDatanodeDeath used to fail quite often. This is fixed by HBASE-10142 in 0.98 and above. Toward the end of HBASE-10142, Jonathan and Enis proposed porting the fix to 0.96 and Stack ack'ed. This issue ports the fix to 0.96 branch -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10150) Write attachment Id of tested patch into JIRA comment
[ https://issues.apache.org/jira/browse/HBASE-10150?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated HBASE-10150: --- Resolution: Fixed Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Write attachment Id of tested patch into JIRA comment - Key: HBASE-10150 URL: https://issues.apache.org/jira/browse/HBASE-10150 Project: HBase Issue Type: Sub-task Reporter: Ted Yu Assignee: Ted Yu Fix For: 0.99.0 Attachments: 10150-v1.txt, 10150-v2.txt The optimization proposed in HBASE-10044 wouldn't work if QA bot doesn't know the attachment Id of the most recently tested patch. The first step is to write attachment Id of tested patch into JIRA comment. For details, see HADOOP-10163 -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Resolved] (HBASE-10044) test-patch.sh should accept documents by known file extensions
[ https://issues.apache.org/jira/browse/HBASE-10044?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu resolved HBASE-10044. Resolution: Fixed test-patch.sh should accept documents by known file extensions -- Key: HBASE-10044 URL: https://issues.apache.org/jira/browse/HBASE-10044 Project: HBase Issue Type: Test Reporter: Ted Yu Assignee: Ted Yu Fix For: 0.99.0 Attachments: 10044-v1.txt, 10044-v2.txt, 10044-v3.txt Currently only htm[l] files are filtered out when test-patch.sh looks for patch attachment. In the email thread, 'Extensions for patches accepted by QA bot', consensus was to accept the following file extensions only: .patch .txt .diff -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10239) Improve determinism and debugability of TestAccessController
[ https://issues.apache.org/jira/browse/HBASE-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857176#comment-13857176 ] Hadoop QA commented on HBASE-10239: --- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12620556/10239.patch against trunk revision . ATTACHMENT ID: 12620556 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 11 new or modified tests. {color:green}+1 hadoop1.0{color}. The patch compiles against the hadoop 1.0 profile. {color:green}+1 hadoop1.1{color}. The patch compiles against the hadoop 1.1 profile. {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100 {color:red}-1 site{color}. The patch appears to cause mvn site goal to fail. {color:green}+1 core tests{color}. The patch passed unit tests in . Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/8286//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8286//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8286//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8286//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8286//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8286//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8286//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8286//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8286//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8286//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/8286//console This message is automatically generated. Improve determinism and debugability of TestAccessController Key: HBASE-10239 URL: https://issues.apache.org/jira/browse/HBASE-10239 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Attachments: 10239.patch, wip-10239.patch Separate grant and revoke API invocations to static helper methods in SecureTestUtils. Wait for permissions cache updates using a Predicate. Log the API calls, state checks, and waits. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10176) Canary#sniff() should close the HTable instance
[ https://issues.apache.org/jira/browse/HBASE-10176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated HBASE-10176: --- Attachment: 10176-v1.txt Canary#sniff() should close the HTable instance --- Key: HBASE-10176 URL: https://issues.apache.org/jira/browse/HBASE-10176 Project: HBase Issue Type: Bug Reporter: Ted Yu Priority: Minor Attachments: 10176-v1.txt {code} table = new HTable(admin.getConfiguration(), tableDesc.getName()); {code} HTable instance should be closed by the end of the method. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Assigned] (HBASE-10176) Canary#sniff() should close the HTable instance
[ https://issues.apache.org/jira/browse/HBASE-10176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu reassigned HBASE-10176: -- Assignee: Ted Yu Canary#sniff() should close the HTable instance --- Key: HBASE-10176 URL: https://issues.apache.org/jira/browse/HBASE-10176 Project: HBase Issue Type: Bug Reporter: Ted Yu Assignee: Ted Yu Priority: Minor Attachments: 10176-v1.txt {code} table = new HTable(admin.getConfiguration(), tableDesc.getName()); {code} HTable instance should be closed by the end of the method. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10176) Canary#sniff() should close the HTable instance
[ https://issues.apache.org/jira/browse/HBASE-10176?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated HBASE-10176: --- Status: Patch Available (was: Open) Canary#sniff() should close the HTable instance --- Key: HBASE-10176 URL: https://issues.apache.org/jira/browse/HBASE-10176 Project: HBase Issue Type: Bug Reporter: Ted Yu Assignee: Ted Yu Priority: Minor Attachments: 10176-v1.txt {code} table = new HTable(admin.getConfiguration(), tableDesc.getName()); {code} HTable instance should be closed by the end of the method. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10210) during master startup, RS can be you-are-dead-ed by master in error
[ https://issues.apache.org/jira/browse/HBASE-10210?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857190#comment-13857190 ] Enis Soztutar commented on HBASE-10210: --- After some offline discussion, I think it will be fine to fix this as a bug for sync issues, and do another jira for fixing up the root cause, which is the fact that we maintain two separate lists for online servers, do RS register using RS start + report, but expiry using zk. Let me do another pass at the patch. during master startup, RS can be you-are-dead-ed by master in error --- Key: HBASE-10210 URL: https://issues.apache.org/jira/browse/HBASE-10210 Project: HBase Issue Type: Bug Affects Versions: 0.98.0, 0.96.1, 0.99.0, 0.96.1.1 Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Attachments: HBASE-10210.patch Not sure of the root cause yet, I am at how did this ever work stage. We see this problem in 0.96.1, but didn't in 0.96.0 + some patches. It looks like RS information arriving from 2 sources - ZK and server itself, can conflict. Master doesn't handle such cases (timestamp match), and anyway technically timestamps can collide for two separate servers. So, master YouAreDead-s the already-recorded reporting RS, and adds it too. Then it discovers that the new server has died with fatal error! Note the threads. Addition is called from master initialization and from RPC. {noformat} 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Finished waiting for region servers count to settle; checked in 2, slept for 18262 ms, expecting minimum of 1, maximum of 2147483647, master is running. 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.ServerManager: Registering server=h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,290 INFO [master:h2-ubuntu12-sec-1387431063-hbase-10:6] master.HMaster: Registered server found up in zk but who has not yet reported in: h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Triggering server recovery; existingServer h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 looks stale, new server:h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 2013-12-19 11:16:45,380 INFO [RpcServer.handler=4,port=6] master.ServerManager: Master doesn't enable ServerShutdownHandler during initialization, delay expiring server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 ... 2013-12-19 11:16:46,925 ERROR [RpcServer.handler=7,port=6] master.HMaster: Region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 reported a fatal error: ABORTING region server h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800: org.apache.hadoop.hbase.YouAreDeadException: Server REPORT rejected; currently processing h2-ubuntu12-sec-1387431063-hbase-8.cs1cloud.internal,60020,1387451803800 as dead server {noformat} Presumably some of the recent ZK listener related changes b -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-7226) HRegion.checkAndMutate uses incorrect comparison result for , =, and =
[ https://issues.apache.org/jira/browse/HBASE-7226?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857193#comment-13857193 ] Hudson commented on HBASE-7226: --- SUCCESS: Integrated in HBase-TRUNK-on-Hadoop-1.1 #26 (See [https://builds.apache.org/job/HBase-TRUNK-on-Hadoop-1.1/26/]) HBASE-7226 HRegion.checkAndMutate uses incorrect comparison result for , =, and = (tedyu: rev 1553453) * /hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java * /hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestHRegion.java HRegion.checkAndMutate uses incorrect comparison result for , =, and = --- Key: HBASE-7226 URL: https://issues.apache.org/jira/browse/HBASE-7226 Project: HBase Issue Type: Bug Components: regionserver Reporter: Feng Honghua Assignee: Feng Honghua Fix For: 0.98.0, 0.94.16, 0.99.0 Attachments: HBASE-7226-trunk-v2.patch, HBASE-7226-trunk.patch, HRegion_HBASE_7226_0.94.2.patch in HRegion.checkAndMutate, incorrect comparison results are used for , =, and =, as below: switch (compareOp) { case LESS: matches = compareResult = 0; // should be '' here break; case LESS_OR_EQUAL: matches = compareResult 0; // should be '=' here break; case EQUAL: matches = compareResult == 0; break; case NOT_EQUAL: matches = compareResult != 0; break; case GREATER_OR_EQUAL: matches = compareResult 0; // should be '=' here break; case GREATER: matches = compareResult = 0; // should be '' here break; -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10224) Line length check in test-patch.sh is broken
[ https://issues.apache.org/jira/browse/HBASE-10224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated HBASE-10224: --- Status: Patch Available (was: Open) Line length check in test-patch.sh is broken Key: HBASE-10224 URL: https://issues.apache.org/jira/browse/HBASE-10224 Project: HBase Issue Type: Test Reporter: Ted Yu Attachments: 10176-v1.txt Here is related code in the script: {code} MAX_LINE_LENGTH_PATCH=`expr $MAX_LINE_LENGTH + 1` ... ll=`echo $lines | wc -l` if [[ $ll -gt $MAX_LINE_LENGTH_PATCH ]]; then {code} Here was the result from dry run: {code} TYus-MacBook-Pro:m7 tyu$ lines=`cat ~/trunk/7226-trunk.patch | grep ^+ | grep -v ^@@ | grep -v ^+++ | grep -v import | grep -v hbase.protobuf.generated | awk -v len=101 'length ($0) len' | head -n 10` TYus-MacBook-Pro:m7 tyu$ ll=`echo $lines | wc -l` TYus-MacBook-Pro:m7 tyu$ echo $ll 3 TYus-MacBook-Pro:m7 tyu$ echo $lines + // Test CompareOp.LESS_OR_EQUAL: original = val2, compare with val2, succeed (value still = val2) + // Test CompareOp.LESS_OR_EQUAL: original = val2, compare with val1, succeed (now value = val3) + // Test CompareOp.GREATER_OR_EQUAL: original = val2, compare with val2, succeed (value still = val2) {code} The value of $ll should be compared with 0, not the value of $MAX_LINE_LENGTH_PATCH -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10224) Line length check in test-patch.sh is broken
[ https://issues.apache.org/jira/browse/HBASE-10224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated HBASE-10224: --- Attachment: 10176-v1.txt Line length check in test-patch.sh is broken Key: HBASE-10224 URL: https://issues.apache.org/jira/browse/HBASE-10224 Project: HBase Issue Type: Test Reporter: Ted Yu Attachments: 10176-v1.txt Here is related code in the script: {code} MAX_LINE_LENGTH_PATCH=`expr $MAX_LINE_LENGTH + 1` ... ll=`echo $lines | wc -l` if [[ $ll -gt $MAX_LINE_LENGTH_PATCH ]]; then {code} Here was the result from dry run: {code} TYus-MacBook-Pro:m7 tyu$ lines=`cat ~/trunk/7226-trunk.patch | grep ^+ | grep -v ^@@ | grep -v ^+++ | grep -v import | grep -v hbase.protobuf.generated | awk -v len=101 'length ($0) len' | head -n 10` TYus-MacBook-Pro:m7 tyu$ ll=`echo $lines | wc -l` TYus-MacBook-Pro:m7 tyu$ echo $ll 3 TYus-MacBook-Pro:m7 tyu$ echo $lines + // Test CompareOp.LESS_OR_EQUAL: original = val2, compare with val2, succeed (value still = val2) + // Test CompareOp.LESS_OR_EQUAL: original = val2, compare with val1, succeed (now value = val3) + // Test CompareOp.GREATER_OR_EQUAL: original = val2, compare with val2, succeed (value still = val2) {code} The value of $ll should be compared with 0, not the value of $MAX_LINE_LENGTH_PATCH -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Assigned] (HBASE-10224) Line length check in test-patch.sh is broken
[ https://issues.apache.org/jira/browse/HBASE-10224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu reassigned HBASE-10224: -- Assignee: Ted Yu Line length check in test-patch.sh is broken Key: HBASE-10224 URL: https://issues.apache.org/jira/browse/HBASE-10224 Project: HBase Issue Type: Test Reporter: Ted Yu Assignee: Ted Yu Attachments: 10176-v1.txt Here is related code in the script: {code} MAX_LINE_LENGTH_PATCH=`expr $MAX_LINE_LENGTH + 1` ... ll=`echo $lines | wc -l` if [[ $ll -gt $MAX_LINE_LENGTH_PATCH ]]; then {code} Here was the result from dry run: {code} TYus-MacBook-Pro:m7 tyu$ lines=`cat ~/trunk/7226-trunk.patch | grep ^+ | grep -v ^@@ | grep -v ^+++ | grep -v import | grep -v hbase.protobuf.generated | awk -v len=101 'length ($0) len' | head -n 10` TYus-MacBook-Pro:m7 tyu$ ll=`echo $lines | wc -l` TYus-MacBook-Pro:m7 tyu$ echo $ll 3 TYus-MacBook-Pro:m7 tyu$ echo $lines + // Test CompareOp.LESS_OR_EQUAL: original = val2, compare with val2, succeed (value still = val2) + // Test CompareOp.LESS_OR_EQUAL: original = val2, compare with val1, succeed (now value = val3) + // Test CompareOp.GREATER_OR_EQUAL: original = val2, compare with val2, succeed (value still = val2) {code} The value of $ll should be compared with 0, not the value of $MAX_LINE_LENGTH_PATCH -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-7226) HRegion.checkAndMutate uses incorrect comparison result for , =, and =
[ https://issues.apache.org/jira/browse/HBASE-7226?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lars Hofhansl updated HBASE-7226: - Fix Version/s: 0.96.2 HRegion.checkAndMutate uses incorrect comparison result for , =, and = --- Key: HBASE-7226 URL: https://issues.apache.org/jira/browse/HBASE-7226 Project: HBase Issue Type: Bug Components: regionserver Reporter: Feng Honghua Assignee: Feng Honghua Fix For: 0.98.0, 0.94.16, 0.96.2, 0.99.0 Attachments: HBASE-7226-trunk-v2.patch, HBASE-7226-trunk.patch, HRegion_HBASE_7226_0.94.2.patch in HRegion.checkAndMutate, incorrect comparison results are used for , =, and =, as below: switch (compareOp) { case LESS: matches = compareResult = 0; // should be '' here break; case LESS_OR_EQUAL: matches = compareResult 0; // should be '=' here break; case EQUAL: matches = compareResult == 0; break; case NOT_EQUAL: matches = compareResult != 0; break; case GREATER_OR_EQUAL: matches = compareResult 0; // should be '=' here break; case GREATER: matches = compareResult = 0; // should be '' here break; -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10176) Canary#sniff() should close the HTable instance
[ https://issues.apache.org/jira/browse/HBASE-10176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857239#comment-13857239 ] Hadoop QA commented on HBASE-10176: --- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12620563/10176-v1.txt against trunk revision . ATTACHMENT ID: 12620563 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 hadoop1.0{color}. The patch compiles against the hadoop 1.0 profile. {color:green}+1 hadoop1.1{color}. The patch compiles against the hadoop 1.1 profile. {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100 {color:red}-1 site{color}. The patch appears to cause mvn site goal to fail. {color:green}+1 core tests{color}. The patch passed unit tests in . Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/8287//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8287//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8287//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8287//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8287//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8287//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8287//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8287//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8287//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8287//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/8287//console This message is automatically generated. Canary#sniff() should close the HTable instance --- Key: HBASE-10176 URL: https://issues.apache.org/jira/browse/HBASE-10176 Project: HBase Issue Type: Bug Reporter: Ted Yu Assignee: Ted Yu Priority: Minor Attachments: 10176-v1.txt {code} table = new HTable(admin.getConfiguration(), tableDesc.getName()); {code} HTable instance should be closed by the end of the method. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Created] (HBASE-10245) Mastor abord starting when region already in PENDING_OPEN state
Jean-Marc Spaggiari created HBASE-10245: --- Summary: Mastor abord starting when region already in PENDING_OPEN state Key: HBASE-10245 URL: https://issues.apache.org/jira/browse/HBASE-10245 Project: HBase Issue Type: Bug Components: master Affects Versions: 0.94.14 Reporter: Jean-Marc Spaggiari Not sure if it's impacting other versions too. When master is starting while a region was PENDING_OPEN, master abort starting. java.lang.IllegalStateException: Unexpected state : page,www\x1Fhttp\x1F-1\x1F/vote/comment/27996/1/\x1Fnull,1379104524006.17bee313797fc1ce982c0e31fdb6620c. state=PENDING_OPEN, ts=1388065670415, server=node6,60020,1388027343261 .. Cannot transit it to OFFLINE. at org.apache.hadoop.hbase.master.AssignmentManager.setOfflineInZooKeeper(AssignmentManager.java:1890) at org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1690) at org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1426) at org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1398) at org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1393) at org.apache.hadoop.hbase.master.handler.ClosedRegionHandler.process(ClosedRegionHandler.java:105) at org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.java:175) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:744) -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10245) Master aborts starting when region already in PENDING_OPEN state
[ https://issues.apache.org/jira/browse/HBASE-10245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated HBASE-10245: --- Summary: Master aborts starting when region already in PENDING_OPEN state (was: Mastor abord starting when region already in PENDING_OPEN state) Master aborts starting when region already in PENDING_OPEN state Key: HBASE-10245 URL: https://issues.apache.org/jira/browse/HBASE-10245 Project: HBase Issue Type: Bug Components: master Affects Versions: 0.94.14 Reporter: Jean-Marc Spaggiari Not sure if it's impacting other versions too. When master is starting while a region was PENDING_OPEN, master abort starting. java.lang.IllegalStateException: Unexpected state : page,www\x1Fhttp\x1F-1\x1F/vote/comment/27996/1/\x1Fnull,1379104524006.17bee313797fc1ce982c0e31fdb6620c. state=PENDING_OPEN, ts=1388065670415, server=node6,60020,1388027343261 .. Cannot transit it to OFFLINE. at org.apache.hadoop.hbase.master.AssignmentManager.setOfflineInZooKeeper(AssignmentManager.java:1890) at org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1690) at org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1426) at org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1398) at org.apache.hadoop.hbase.master.AssignmentManager.assign(AssignmentManager.java:1393) at org.apache.hadoop.hbase.master.handler.ClosedRegionHandler.process(ClosedRegionHandler.java:105) at org.apache.hadoop.hbase.executor.EventHandler.run(EventHandler.java:175) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1145) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:615) at java.lang.Thread.run(Thread.java:744) -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10224) Line length check in test-patch.sh is broken
[ https://issues.apache.org/jira/browse/HBASE-10224?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857253#comment-13857253 ] Hadoop QA commented on HBASE-10224: --- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12620572/10176-v1.txt against trunk revision . ATTACHMENT ID: 12620572 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 3 new or modified tests. {color:green}+1 hadoop1.0{color}. The patch compiles against the hadoop 1.0 profile. {color:green}+1 hadoop1.1{color}. The patch compiles against the hadoop 1.1 profile. {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 1.3.9) warnings. {color:green}+1 release audit{color}. The applied patch does not increase the total number of release audit warnings. {color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100 {color:red}-1 site{color}. The patch appears to cause mvn site goal to fail. {color:green}+1 core tests{color}. The patch passed unit tests in . Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/8288//testReport/ Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8288//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8288//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8288//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8288//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8288//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8288//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8288//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8288//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/8288//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/8288//console This message is automatically generated. Line length check in test-patch.sh is broken Key: HBASE-10224 URL: https://issues.apache.org/jira/browse/HBASE-10224 Project: HBase Issue Type: Test Reporter: Ted Yu Assignee: Ted Yu Attachments: 10176-v1.txt Here is related code in the script: {code} MAX_LINE_LENGTH_PATCH=`expr $MAX_LINE_LENGTH + 1` ... ll=`echo $lines | wc -l` if [[ $ll -gt $MAX_LINE_LENGTH_PATCH ]]; then {code} Here was the result from dry run: {code} TYus-MacBook-Pro:m7 tyu$ lines=`cat ~/trunk/7226-trunk.patch | grep ^+ | grep -v ^@@ | grep -v ^+++ | grep -v import | grep -v hbase.protobuf.generated | awk -v len=101 'length ($0) len' | head -n 10` TYus-MacBook-Pro:m7 tyu$ ll=`echo $lines | wc -l` TYus-MacBook-Pro:m7 tyu$ echo $ll 3 TYus-MacBook-Pro:m7 tyu$ echo $lines + // Test CompareOp.LESS_OR_EQUAL: original = val2, compare with val2, succeed (value still = val2) + // Test CompareOp.LESS_OR_EQUAL: original = val2, compare with val1, succeed (now value = val3) + // Test CompareOp.GREATER_OR_EQUAL: original = val2, compare with val2, succeed (value still = val2) {code} The value of $ll should be compared with 0, not the value of $MAX_LINE_LENGTH_PATCH -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10176) Canary#sniff() should close the HTable instance
[ https://issues.apache.org/jira/browse/HBASE-10176?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857259#comment-13857259 ] Elliott Clark commented on HBASE-10176: --- +1 Canary#sniff() should close the HTable instance --- Key: HBASE-10176 URL: https://issues.apache.org/jira/browse/HBASE-10176 Project: HBase Issue Type: Bug Reporter: Ted Yu Assignee: Ted Yu Priority: Minor Attachments: 10176-v1.txt {code} table = new HTable(admin.getConfiguration(), tableDesc.getName()); {code} HTable instance should be closed by the end of the method. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10240) Remove 0.94-0.96 migration code
[ https://issues.apache.org/jira/browse/HBASE-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857261#comment-13857261 ] Jean-Marc Spaggiari commented on HBASE-10240: - Late, but I agree with others. Always easier to keep the upgrade options to late comer. I agree that since the releases are close to each others, some might want to skip some of them... Remove 0.94-0.96 migration code Key: HBASE-10240 URL: https://issues.apache.org/jira/browse/HBASE-10240 Project: HBase Issue Type: Improvement Reporter: Andrew Purtell Fix For: 0.99.0 Remove the objects and code only needed for supporting migration to 0.96 from 0.94. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10239) Improve determinism and debugability of TestAccessController
[ https://issues.apache.org/jira/browse/HBASE-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857295#comment-13857295 ] ramkrishna.s.vasudevan commented on HBASE-10239: Went thro the patch. Looks good to me. +1. The updateACL is needed for solving the issue in HBASE-6104? Improve determinism and debugability of TestAccessController Key: HBASE-10239 URL: https://issues.apache.org/jira/browse/HBASE-10239 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Attachments: 10239.patch, wip-10239.patch Separate grant and revoke API invocations to static helper methods in SecureTestUtils. Wait for permissions cache updates using a Predicate. Log the API calls, state checks, and waits. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10229) Support OperationAttributes in Increment and Append in Shell
[ https://issues.apache.org/jira/browse/HBASE-10229?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ramkrishna.s.vasudevan updated HBASE-10229: --- Fix Version/s: 0.99.0 0.98.0 Support OperationAttributes in Increment and Append in Shell Key: HBASE-10229 URL: https://issues.apache.org/jira/browse/HBASE-10229 Project: HBase Issue Type: Improvement Components: shell Affects Versions: 0.98.0 Reporter: ramkrishna.s.vasudevan Assignee: ramkrishna.s.vasudevan Fix For: 0.98.0, 0.99.0 Attachments: HBASE-10229_1.patch -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10229) Support OperationAttributes in Increment and Append in Shell
[ https://issues.apache.org/jira/browse/HBASE-10229?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857298#comment-13857298 ] ramkrishna.s.vasudevan commented on HBASE-10229: Committed to trunk. I tried committing thro svn tool to 0.98. I still get permission denied. [~apurtell],[~anoop.hbase] - could you commit to 0.98. Support OperationAttributes in Increment and Append in Shell Key: HBASE-10229 URL: https://issues.apache.org/jira/browse/HBASE-10229 Project: HBase Issue Type: Improvement Components: shell Affects Versions: 0.98.0 Reporter: ramkrishna.s.vasudevan Assignee: ramkrishna.s.vasudevan Fix For: 0.98.0, 0.99.0 Attachments: HBASE-10229_1.patch -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10239) Improve determinism and debugability of TestAccessController
[ https://issues.apache.org/jira/browse/HBASE-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857304#comment-13857304 ] ramkrishna.s.vasudevan commented on HBASE-10239: One small concern, In the verifyAllowed and verifyDenied we check {code} if (obj != null obj instanceof List?) { {code} But incase of verifyAllowed if the obj itself is null then we silently come out of the method thinking it is success. But it should actually fail because in the verifyAllowed we expect the result. Improve determinism and debugability of TestAccessController Key: HBASE-10239 URL: https://issues.apache.org/jira/browse/HBASE-10239 Project: HBase Issue Type: Improvement Affects Versions: 0.98.0, 0.99.0 Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 0.98.0, 0.99.0 Attachments: 10239.patch, wip-10239.patch Separate grant and revoke API invocations to static helper methods in SecureTestUtils. Wait for permissions cache updates using a Predicate. Log the API calls, state checks, and waits. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-9846) Integration test and LoadTestTool support for cell ACLs
[ https://issues.apache.org/jira/browse/HBASE-9846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857305#comment-13857305 ] ramkrishna.s.vasudevan commented on HBASE-9846: --- I tried out the pattern in HBASE-9858 but the core change is that extended the MultiThreadedReader/Writer/Updater and overridden some of the methods in them. So wanted to execute the puts/gets and all the mutations as a specific user. The user list and the permissions are passed as input to the tool. When we run this as a cluster how to run a specific action with a specific user? Using user.runAs did not work as expected. Integration test and LoadTestTool support for cell ACLs --- Key: HBASE-9846 URL: https://issues.apache.org/jira/browse/HBASE-9846 Project: HBase Issue Type: Sub-task Affects Versions: 0.98.0 Reporter: Andrew Purtell Assignee: ramkrishna.s.vasudevan Fix For: 0.98.0 Cell level ACLs should have an integration test and LoadTestTool support. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Resolved] (HBASE-10105) SaslUtil#encodeIdentifier may throw NullPointerException
[ https://issues.apache.org/jira/browse/HBASE-10105?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu resolved HBASE-10105. Resolution: Later SaslUtil#encodeIdentifier may throw NullPointerException Key: HBASE-10105 URL: https://issues.apache.org/jira/browse/HBASE-10105 Project: HBase Issue Type: Bug Affects Versions: 0.99.0 Reporter: Ted Yu Attachments: org.apache.hadoop.hbase.security.TestHBaseSaslRpcClient-output.txt Encountered the following exception when running TestHBaseSaslRpcClient on Mac: {code} 2013-12-08 14:18:24,754 ERROR [main] security.TestHBaseSaslRpcClient(243): java.lang.NullPointerException at java.lang.String.init(String.java:593) at org.apache.hadoop.hbase.security.SaslUtil.encodeIdentifier(SaslUtil.java:38) at org.apache.hadoop.hbase.security.HBaseSaslRpcClient$SaslClientCallbackHandler.init(HBaseSaslRpcClient.java:259) at org.apache.hadoop.hbase.security.HBaseSaslRpcClient.init(HBaseSaslRpcClient.java:78) at org.apache.hadoop.hbase.security.TestHBaseSaslRpcClient.assertSuccessCreationDigestPrincipal(TestHBaseSaslRpcClient.java:240) at org.apache.hadoop.hbase.security.TestHBaseSaslRpcClient.testHBaseSaslRpcClientCreation(TestHBaseSaslRpcClient.java:122) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47) at org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12) at org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44) at org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17) at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271) at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70) at org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50) at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238) at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63) at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236) at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53) at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229) at org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26) at org.junit.runners.ParentRunner.run(ParentRunner.java:309) at org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:234) at org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:133) at org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:114) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.apache.maven.surefire.util.ReflectionUtils.invokeMethodWithArray(ReflectionUtils.java:188) at org.apache.maven.surefire.booter.ProviderFactory$ProviderProxy.invoke(ProviderFactory.java:166) at org.apache.maven.surefire.booter.ProviderFactory.invokeProvider(ProviderFactory.java:86) at org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:101) at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:74) 2013-12-08 14:18:24,755 DEBUG [main] security.HBaseSaslRpcClient(76): Creating SASL DIGEST-MD5 client to authenticate to service at null {code} Here is related code: {code} return new String(Base64.encodeBase64(identifier)); {code} Looks like Base64.encodeBase64() returned a null. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-9948) HMaster should handle duplicate log split requests
[ https://issues.apache.org/jira/browse/HBASE-9948?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857340#comment-13857340 ] Ted Yu commented on HBASE-9948: --- HMaster should handle IOException for duplicate split request so that it doesn't abort. HMaster should handle duplicate log split requests -- Key: HBASE-9948 URL: https://issues.apache.org/jira/browse/HBASE-9948 Project: HBase Issue Type: Bug Reporter: Ted Yu I saw the following in test output for TestRestartCluster: {code} 2013-11-11 19:59:55,538 DEBUG [M:0;kiyo:36213] master.SplitLogManager(327): Scheduling batch of logs to split 2013-11-11 19:59:55,538 INFO [M:0;kiyo:36213] master.SplitLogManager(329): started splitting 1 logs in [hdfs://localhost:46376/user/hortonzy/hbase/WALs/kk,44962,138410193-splitting] 2013-11-11 19:59:55,538 WARN [M:0;kiyo:36213] master.SplitLogManager(1048): Failure because two threads can't wait for the same task; path=/hbase/splitWAL/WALs%2Fkk%2C44962%2C138410193-splitting%2Fkk%252C44962%252C138410193.138413702.meta 2013-11-11 19:59:55,538 FATAL [M:0;kiyo:36213] master.HMaster(2188): Master server abort: loaded coprocessors are: [org.apache.hadoop.hbase.coprocessor.MultiRowMutationEndpoint] 2013-11-11 19:59:55,538 FATAL [M:0;kiyo:36213] master.HMaster(2193): Unhandled exception. Starting shutdown. java.io.IOException: duplicate log split scheduled for hdfs://localhost:46376/user/hortonzy/hbase/WALs/kk,44962,138410193-splitting/kk%2C44962%2C138410193.138413702.meta at org.apache.hadoop.hbase.master.SplitLogManager.splitLogDistributed(SplitLogManager.java:343) at org.apache.hadoop.hbase.master.MasterFileSystem.splitLog(MasterFileSystem.java:409) at org.apache.hadoop.hbase.master.MasterFileSystem.splitMetaLog(MasterFileSystem.java:301) at org.apache.hadoop.hbase.master.MasterFileSystem.splitMetaLog(MasterFileSystem.java:292) at org.apache.hadoop.hbase.master.HMaster.assignMeta(HMaster.java:1038) at org.apache.hadoop.hbase.master.HMaster.finishInitialization(HMaster.java:868) at org.apache.hadoop.hbase.master.HMaster.run(HMaster.java:605) at java.lang.Thread.run(Thread.java:724) 2013-11-11 19:59:55,539 INFO [M:0;kiyo:36213] master.HMaster(2386): Aborting 2013-11-11 19:59:55,539 DEBUG [M:0;kiyo:36213] master.HMaster(1234): Stopping service threads {code} HMaster should handle duplicate log split requests, instead of aborting. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10242) client-side mvcc tracking in scanners
[ https://issues.apache.org/jira/browse/HBASE-10242?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857341#comment-13857341 ] Sergey Shelukhin commented on HBASE-10242: -- Problem (talking to myself): splits and merges. Actually quite a bit of a problem, because by default the scanner will lose consistency on these - it's a new region so new mvcc will be obtained; but the simple alternative (tracking them and reusing mvcc) is even worse, esp. for merge - for example, if the first region had much lower mvcc before merge because it came from different server, the second region data with its old high mvcc numbers could become invisible. Probably, when obtaining the first mvcc for the region, the scanner will have track it for key range, and re-use it across splits / separate requests for merges. Almost makes you wonder if getting mvccs in advance is worth it for consistent scanner - that would be one request to a few RSes (that hold the requisite regions). client-side mvcc tracking in scanners - Key: HBASE-10242 URL: https://issues.apache.org/jira/browse/HBASE-10242 Project: HBase Issue Type: Sub-task Components: HFile, regionserver, Scanners Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Scanners should be able to track mvcc read point and send it to server. This is a subtask, so server can use this mvcc as best it can, but doesn't actually have to guarantee it within the scope of this jira. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Created] (HBASE-10246) Wrap long lines in recently added source files
Ted Yu created HBASE-10246: -- Summary: Wrap long lines in recently added source files Key: HBASE-10246 URL: https://issues.apache.org/jira/browse/HBASE-10246 Project: HBase Issue Type: Task Reporter: Ted Yu Priority: Minor Due to ineffective line length detection, several newly added files have long lines in them. The following is a partial list: IntegrationTestTableSnapshotInputFormat.java ClientSideRegionScanner.java TableSnapshotInputFormat.java PerformanceEvaluationScan.java TestTableSnapshotScanner.java TestTableSnapshotInputFormat.java Long lines in these files should be wrapped. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (HBASE-10241) implement mvcc-consistent scanners (across recovery)
[ https://issues.apache.org/jira/browse/HBASE-10241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13857348#comment-13857348 ] Sergey Shelukhin commented on HBASE-10241: -- one-pager doesn't cover merges and splits. Looks like mvccs on client will have to be tracked per range, rather than region. For even more consistent scanner, mvccs might even optionally be pre-fetched from all regions, but that is for later implement mvcc-consistent scanners (across recovery) Key: HBASE-10241 URL: https://issues.apache.org/jira/browse/HBASE-10241 Project: HBase Issue Type: New Feature Components: HFile, regionserver, Scanners Affects Versions: 0.99.0 Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Attachments: Consistent scanners.pdf Scanners currently use mvcc for consistency. However, mvcc is lost on server restart, or even a region move. This JIRA is to enable the scanners to transfer mvcc (or seqId, or some other number, see HBASE-8763) between servers. First, client scanner needs to get and store the readpoint. Second, mvcc needs to be preserved in WAL. Third, the mvcc needs to be stored in store files per KV and discarded when not needed. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Updated] (HBASE-10175) 2-thread ChaosMonkey steps on its own toes
[ https://issues.apache.org/jira/browse/HBASE-10175?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sergey Shelukhin updated HBASE-10175: - Resolution: Fixed Fix Version/s: 0.99.0 Status: Resolved (was: Patch Available) committed to trunk 2-thread ChaosMonkey steps on its own toes -- Key: HBASE-10175 URL: https://issues.apache.org/jira/browse/HBASE-10175 Project: HBase Issue Type: Improvement Components: test Reporter: Sergey Shelukhin Assignee: Sergey Shelukhin Priority: Minor Fix For: 0.99.0 Attachments: HBASE-10175.patch ChaosMonkey with one destructive and one volatility (flush-compact-split-etc.) threads steps on its own toes and logs a lot of exceptions. A simple solution would be to catch most (or all), like NotServingRegionException, and log less (not a full callstack for example, it's not very useful anyway). A more complicated/complementary one would be to keep track which regions the destructive thread affects and use other regions for volatile one. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Created] (HBASE-10247) Client promises about timestamps
Lars Hofhansl created HBASE-10247: - Summary: Client promises about timestamps Key: HBASE-10247 URL: https://issues.apache.org/jira/browse/HBASE-10247 Project: HBase Issue Type: Brainstorming Reporter: Lars Hofhansl Priority: Minor This is to start a discussion about timestamp promises declared per table of CF. For example if a client promises only monotonically increasing timestamps (or no custom set timestamps) and VERSIONS=1, we can aggressively and easily remove old versions of the same row/fam/col from the memstore before we flush, just by supplying a comparator that ignores the timestamp (i.e. two KV just differing by TS would be considered equal). That would increase the performance of counters significantly. -- This message was sent by Atlassian JIRA (v6.1.5#6160)