[jira] [Updated] (HBASE-9002) TestDistributedLogSplitting.testRecoveredEdits fails

2013-07-22 Thread Jeffrey Zhong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeffrey Zhong updated HBASE-9002:
-

Attachment: hbase-9002.patch

 TestDistributedLogSplitting.testRecoveredEdits fails
 

 Key: HBASE-9002
 URL: https://issues.apache.org/jira/browse/HBASE-9002
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
 Fix For: 0.95.2

 Attachments: hbase-9002.patch


 https://builds.apache.org/job/hbase-0.95/342/testReport/org.apache.hadoop.hbase.master/TestDistributedLogSplitting/testRecoveredEdits/
 {code}
 java.lang.AssertionError: expected:1000 but was:0
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.master.TestDistributedLogSplitting.testRecoveredEdits(TestDistributedLogSplitting.java:209)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:601)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
 {code}
 [~jxiang] or [~jeffreyz] want to take a look at this one?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-9002) TestDistributedLogSplitting.testRecoveredEdits fails

2013-07-22 Thread Jeffrey Zhong (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9002?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jeffrey Zhong updated HBASE-9002:
-

Assignee: Jeffrey Zhong
  Status: Patch Available  (was: Open)

 TestDistributedLogSplitting.testRecoveredEdits fails
 

 Key: HBASE-9002
 URL: https://issues.apache.org/jira/browse/HBASE-9002
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: Jeffrey Zhong
 Fix For: 0.95.2

 Attachments: hbase-9002.patch


 https://builds.apache.org/job/hbase-0.95/342/testReport/org.apache.hadoop.hbase.master/TestDistributedLogSplitting/testRecoveredEdits/
 {code}
 java.lang.AssertionError: expected:1000 but was:0
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.master.TestDistributedLogSplitting.testRecoveredEdits(TestDistributedLogSplitting.java:209)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:601)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
 {code}
 [~jxiang] or [~jeffreyz] want to take a look at this one?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-4955) Use the official versions of surefire junit

2013-07-22 Thread Nicolas Liochon (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-4955?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13714945#comment-13714945
 ] 

Nicolas Liochon commented on HBASE-4955:


On the precommit env, there is only one application tested at a time. So the 
load should be limited. Or more exactly, this load is suspicious. IIRC there 
are some improvements in the last surefire versions, but I can't say if they 
will work for us.

 Use the official versions of surefire  junit
 -

 Key: HBASE-4955
 URL: https://issues.apache.org/jira/browse/HBASE-4955
 Project: HBase
  Issue Type: Improvement
  Components: test
Affects Versions: 0.94.0
 Environment: all
Reporter: Nicolas Liochon
Assignee: Nicolas Liochon
Priority: Critical
 Attachments: 4955.v1.patch, 4955.v2.patch, 4955.v2.patch, 
 4955.v2.patch, 4955.v2.patch, 4955.v3.patch, 4955.v3.patch, 4955.v3.patch, 
 4955.v4.patch, 4955.v4.patch, 4955.v4.patch, 4955.v4.patch, 4955.v4.patch, 
 4955.v4.patch, 4955.v5.patch, 4955.v6.patch, 8204.v4.patch


 We currently use private versions for Surefire  JUnit since HBASE-4763.
 This JIRA traks what we need to move to official versions.
 Surefire 2.11 is just out, but, after some tests, it does not contain all 
 what we need.
 JUnit. Could be for JUnit 4.11. Issue to monitor:
 https://github.com/KentBeck/junit/issues/359: fixed in our version, no 
 feedback for an integration on trunk
 Surefire: Could be for Surefire 2.12. Issues to monitor are:
 329 (category support): fixed, we use the official implementation from the 
 trunk
 786 (@Category with forkMode=always): fixed, we use the official 
 implementation from the trunk
 791 (incorrect elapsed time on test failure): fixed, we use the official 
 implementation from the trunk
 793 (incorrect time in the XML report): Not fixed (reopen) on trunk, fixed on 
 our version.
 760 (does not take into account the test method): fixed in trunk, not fixed 
 in our version
 798 (print immediately the test class name): not fixed in trunk, not fixed in 
 our version
 799 (Allow test parallelization when forkMode=always): not fixed in trunk, 
 not fixed in our version
 800 (redirectTestOutputToFile not taken into account): not yet fix on trunk, 
 fixed on our version
 800  793 are the more important to monitor, it's the only ones that are 
 fixed in our version but not on trunk.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8705) RS holding META when restarted in a single node setup may hang infinitely without META assignment

2013-07-22 Thread rajeshbabu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13714954#comment-13714954
 ] 

rajeshbabu commented on HBASE-8705:
---

[~ram_krish],[~jxiang]
if we stop all region servers by hbase-deamon.sh stop command then .META. is 
never online with 10 seconds delay in starting a region server.
In single node cluster, many times .META. in transition after stopping the RS.
Can we retry meta assignment even after max attempts until we find some online 
region server. Otherwise we need to run hbck everytime in this common case.
What do you say?
 



 RS holding META when restarted in a single node setup may hang infinitely 
 without META assignment
 -

 Key: HBASE-8705
 URL: https://issues.apache.org/jira/browse/HBASE-8705
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.95.1
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan
Priority: Minor
 Fix For: 0.98.0, 0.95.2

 Attachments: HBASE-8705_1.patch, HBASE-8705_2.patch, HBASE-8705.patch


 This bug may be minor as it likely to happen in a single node setup.
 I restarted the RS holding META. The master tried assigning META using 
 MetaSSH. But tried this before the new RS came up.
 So as not region plan is found 
 {code}
  if (plan == null) {
 LOG.warn(Unable to determine a plan to assign  + region);
 if (tomActivated){
   this.timeoutMonitor.setAllRegionServersOffline(true);
 } else {
   regionStates.updateRegionState(region, 
 RegionState.State.FAILED_OPEN);
 }
 return;
   }
 {code}
 we just return without assigment.  And this being the META the small cluster 
 just hangs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8414) Hbck still refers to -ROOT- table to locate .META.

2013-07-22 Thread rajeshbabu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13714957#comment-13714957
 ] 

rajeshbabu commented on HBASE-8414:
---

[~himan...@cloudera.com]
I have recently faced this problem and prepared patch with test case. 
If you are not working on this I will upload.


 Hbck still refers to -ROOT- table to locate .META.
 --

 Key: HBASE-8414
 URL: https://issues.apache.org/jira/browse/HBASE-8414
 Project: HBase
  Issue Type: Bug
  Components: hbck
Affects Versions: 0.95.0
Reporter: Himanshu Vashishtha
Assignee: Himanshu Vashishtha

 In the current ROOT-less trunk, hbck still tries to fix meta by looking its 
 location in the .ROOT. table. This happens if there is no .META. assigned 
 when hbck is ran.
 HbaseFsck.java:
 {code}
 boolean checkMetaRegion() {
 ...
   HRegionLocation rootLocation = connection.locateRegion(
 HConstants.ROOT_TABLE_NAME, HConstants.EMPTY_START_ROW);
 ...
 }
 {code}
 Running hbck while meta is in transition:
 {code}
 bin/hbase hbck
 Version: 0.95.0-SNAPSHOT
 ERROR: META region or some of its attributes are null.
 ERROR: Fatal error: unable to get root region location. Exiting...
 Summary:
 2 inconsistencies detected.
 Status: INCONSISTENT
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8414) Hbck still refers to -ROOT- table to locate .META.

2013-07-22 Thread Anoop Sam John (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13714963#comment-13714963
 ] 

Anoop Sam John commented on HBASE-8414:
---

Rajesh
 Have a look at HBASE-8627.  I have uploaded a patch there which solves 
this issue also I guess. Can u pls test that once?

 Hbck still refers to -ROOT- table to locate .META.
 --

 Key: HBASE-8414
 URL: https://issues.apache.org/jira/browse/HBASE-8414
 Project: HBase
  Issue Type: Bug
  Components: hbck
Affects Versions: 0.95.0
Reporter: Himanshu Vashishtha
Assignee: Himanshu Vashishtha

 In the current ROOT-less trunk, hbck still tries to fix meta by looking its 
 location in the .ROOT. table. This happens if there is no .META. assigned 
 when hbck is ran.
 HbaseFsck.java:
 {code}
 boolean checkMetaRegion() {
 ...
   HRegionLocation rootLocation = connection.locateRegion(
 HConstants.ROOT_TABLE_NAME, HConstants.EMPTY_START_ROW);
 ...
 }
 {code}
 Running hbck while meta is in transition:
 {code}
 bin/hbase hbck
 Version: 0.95.0-SNAPSHOT
 ERROR: META region or some of its attributes are null.
 ERROR: Fatal error: unable to get root region location. Exiting...
 Summary:
 2 inconsistencies detected.
 Status: INCONSISTENT
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8627) HBCK can not fix meta not assigned issue

2013-07-22 Thread Anoop Sam John (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13714966#comment-13714966
 ] 

Anoop Sam John commented on HBASE-8627:
---

You mean test cases for that scenarios also [~jxiang]?  Sorry for the late 
reply. May be I need to remove those code in deleteMetaRegion() which handles 
regionInfoOnly and hdfs. What do you say?  Is this test case enough for this 
issue?  This patch solves what is mentioned in HBASE-8414 also

 HBCK can not fix meta not assigned issue
 

 Key: HBASE-8627
 URL: https://issues.apache.org/jira/browse/HBASE-8627
 Project: HBase
  Issue Type: Bug
  Components: hbck
Affects Versions: 0.95.0
Reporter: Anoop Sam John
Assignee: Anoop Sam John
 Attachments: HBASE-8627_Trunk.patch, HBASE-8627_Trunk-V2.patch


 When meta table region is not assigned to any RS, HBCK run will get 
 exception. I can see code added in checkMetaRegion() to solve this issue but 
 it wont work. It still refers to ROOT region!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8780) a column Family can have VERSIONS less than zero

2013-07-22 Thread Anoop Sam John (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13714968#comment-13714968
 ] 

Anoop Sam John commented on HBASE-8780:
---

V2 lgtm. Test cases are passing now.

 a column Family can have VERSIONS less than zero 
 -

 Key: HBASE-8780
 URL: https://issues.apache.org/jira/browse/HBASE-8780
 Project: HBase
  Issue Type: Bug
  Components: Client
Affects Versions: 0.94.8
Reporter: Demai Ni
Assignee: Demai Ni
Priority: Trivial
 Attachments: HBASE-8780-0.94.8-v0.patch, HBASE-8780-test.txt, 
 HBASE-8780-trunk.patch, HBASE-8780-trunk.patch, HBASE-8780-trunk-v1.patch, 
 HBASE-8780-trunk-v2.patch


 User can create/alter a columnfam and set its VERSION(aka maxVERSIONS) to a 
 negative or zero value. Although there is a checking in 
 HColumnDesciptor#construtor, hbase shell command will invoke the 
 setter(setMaxVersions and setMinVersions) directly, hence by pass the 
 checking.  For example:
 {code:title=set VERSIONS = -1}
 hbase(main):016:0 create 't5_dn',{NAME='cf1',VERSIONS=-1}
 0 row(s) in 1.0420 seconds
 hbase(main):017:0 put 't5_dn','row1','cf1:q1','row1cf1_v1'
 0 row(s) in 0.0700 seconds
 hbase(main):018:0 scan 't5_dn'
 ROW   COLUMN+CELL 
  
 0 row(s) in 0.0090 seconds
 hbase(main):019:0 describe 't5_dn'
 DESCRIPTION  ENABLED  
   
 't5_dn', {NAME = 'cf1', REPLICATION_SCOPE = '0',  true  
 KEEP_DELETED_CELLS = 'false', COMPRESSION = 'NONE   
  
 ', ENCODE_ON_DISK = 'true', BLOCKCACHE = 'true',
 MIN_VERSIONS = '0', DATA_BLOCK_ENCODING = 'NONE',   
  
  IN_MEMORY = 'false', BLOOMFILTER = 'NONE', TTL =   
  
  '2147483647', VERSIONS = '-1', BLOCKSIZE = '655   
   
 36'}  
 1 row(s) in 0.0410 seconds
 {code}
 above example shows VERSIONS = '-1', and put/scan doesn't keep the data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8414) Hbck still refers to -ROOT- table to locate .META.

2013-07-22 Thread rajeshbabu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13714969#comment-13714969
 ] 

rajeshbabu commented on HBASE-8414:
---

[~anoop.hbase]
Sorry I didnt see that. I will test it. Thanks.

 Hbck still refers to -ROOT- table to locate .META.
 --

 Key: HBASE-8414
 URL: https://issues.apache.org/jira/browse/HBASE-8414
 Project: HBase
  Issue Type: Bug
  Components: hbck
Affects Versions: 0.95.0
Reporter: Himanshu Vashishtha
Assignee: Himanshu Vashishtha

 In the current ROOT-less trunk, hbck still tries to fix meta by looking its 
 location in the .ROOT. table. This happens if there is no .META. assigned 
 when hbck is ran.
 HbaseFsck.java:
 {code}
 boolean checkMetaRegion() {
 ...
   HRegionLocation rootLocation = connection.locateRegion(
 HConstants.ROOT_TABLE_NAME, HConstants.EMPTY_START_ROW);
 ...
 }
 {code}
 Running hbck while meta is in transition:
 {code}
 bin/hbase hbck
 Version: 0.95.0-SNAPSHOT
 ERROR: META region or some of its attributes are null.
 ERROR: Fatal error: unable to get root region location. Exiting...
 Summary:
 2 inconsistencies detected.
 Status: INCONSISTENT
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8947) Thrift 2 : Replace bool writeToWAL with TDurability durability

2013-07-22 Thread Hamed Madani (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715001#comment-13715001
 ] 

Hamed Madani commented on HBASE-8947:
-

For 0.94 we can drop the default for writeToWAL to avoid sending an extra 
field. I just double checked and for 0.94 
{code}
  private boolean writeToWAL = true;
{code}
writeToWAL get initialized to true , so I suggest we drop default value for 
writeToWAL, add Durability to Mutation (Apparently Increment doesn't support it 
since it doesn't extend Mutation in 0.94), then as you suggested we check for 
isSetDurability, if not set then isSetwriteToWAL and if not set then just don't 
set writeToWAL since it gets initialized to true inside Mutation class. What do 
you think ? Also do we need a test for this ? I wasn't sure how to test it.  

 Thrift 2 : Replace bool writeToWAL with TDurability durability 
 ---

 Key: HBASE-8947
 URL: https://issues.apache.org/jira/browse/HBASE-8947
 Project: HBase
  Issue Type: Sub-task
  Components: Thrift
Reporter: Hamed Madani
Assignee: Hamed Madani
 Attachments: HBASE-8947.patch, HBASE-8947-v2.patch


 Introduce new enum *TDurability* to expose more options for *Write To Wal.* 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-9002) TestDistributedLogSplitting.testRecoveredEdits fails

2013-07-22 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715006#comment-13715006
 ] 

Hadoop QA commented on HBASE-9002:
--

{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12593455/hbase-9002.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified tests.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

  {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6422//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6422//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6422//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6422//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6422//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6422//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6422//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6422//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6422//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6422//console

This message is automatically generated.

 TestDistributedLogSplitting.testRecoveredEdits fails
 

 Key: HBASE-9002
 URL: https://issues.apache.org/jira/browse/HBASE-9002
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: Jeffrey Zhong
 Fix For: 0.95.2

 Attachments: hbase-9002.patch


 https://builds.apache.org/job/hbase-0.95/342/testReport/org.apache.hadoop.hbase.master/TestDistributedLogSplitting/testRecoveredEdits/
 {code}
 java.lang.AssertionError: expected:1000 but was:0
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.master.TestDistributedLogSplitting.testRecoveredEdits(TestDistributedLogSplitting.java:209)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:601)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
 {code}
 [~jxiang] or [~jeffreyz] want to take a look at this one?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, 

[jira] [Commented] (HBASE-8627) HBCK can not fix meta not assigned issue

2013-07-22 Thread rajeshbabu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8627?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715035#comment-13715035
 ] 

rajeshbabu commented on HBASE-8627:
---

[~anoop.hbase]
If meta server znode is deleted(in metaSSH), then patch is not able to fix this 
issue.
{code}
  public void assignMeta() throws KeeperException {
MetaRegionTracker.deleteMetaLocation(this.watcher);
assign(HRegionInfo.FIRST_META_REGIONINFO, true);
  }
{code}

In recordMetaRegion we will read meta location from zk, in this case we will 
get false and returing without fixing.
{code}
// get regions according to what is online on each RegionServer
loadDeployedRegions();
// check whether .META. is deployed and online
if (!recordMetaRegion()) {
  // Will remove later if we can fix it
  errors.reportError(Fatal error: unable to get .META. region location. 
Exiting...);
  return -2;
}
{code}

I think in checkMetaRegion only we need to read from zk and deployed regions 
list. If it is not present in any case we need to deploy it.

Here is the test to reproduce the scenario. FYI
{code}
  @Test
  public void testFixAssignmentsWhenMETAinTransition() throws Exception {
MiniHBaseCluster cluster = TEST_UTIL.getHBaseCluster();
HBaseAdmin admin = null;
try {
  admin = new HBaseAdmin(TEST_UTIL.getConfiguration());
  admin.closeRegion(cluster.getServerHoldingMeta(),
  HRegionInfo.FIRST_META_REGIONINFO);
} finally {
  if (admin != null) {
admin.close();
  }
}
regionStates.regionOffline(HRegionInfo.FIRST_META_REGIONINFO);
MetaRegionTracker.deleteMetaLocation(cluster.getMaster().getZooKeeper());
assertFalse(regionStates
.isRegionAssigned(HRegionInfo.FIRST_META_REGIONINFO));
HBaseFsck hbck = doFsck(conf, true);
assertErrors(hbck, new ERROR_CODE[] { ERROR_CODE.NULL_META_REGION,
ERROR_CODE.UNKNOWN });
assertNoErrors(doFsck(conf, false));
  }
{code}


 HBCK can not fix meta not assigned issue
 

 Key: HBASE-8627
 URL: https://issues.apache.org/jira/browse/HBASE-8627
 Project: HBase
  Issue Type: Bug
  Components: hbck
Affects Versions: 0.95.0
Reporter: Anoop Sam John
Assignee: Anoop Sam John
 Attachments: HBASE-8627_Trunk.patch, HBASE-8627_Trunk-V2.patch


 When meta table region is not assigned to any RS, HBCK run will get 
 exception. I can see code added in checkMetaRegion() to solve this issue but 
 it wont work. It still refers to ROOT region!

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8705) RS holding META when restarted in a single node setup may hang infinitely without META assignment

2013-07-22 Thread ramkrishna.s.vasudevan (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715064#comment-13715064
 ] 

ramkrishna.s.vasudevan commented on HBASE-8705:
---

In single node cluster, many times .META. in transition after stopping the RS.
Is there any easy way to identify a single node cluster?  Or you mean to say if 
online servers are 0 keep waiting till it becomes atleast 1?

 RS holding META when restarted in a single node setup may hang infinitely 
 without META assignment
 -

 Key: HBASE-8705
 URL: https://issues.apache.org/jira/browse/HBASE-8705
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.95.1
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan
Priority: Minor
 Fix For: 0.98.0, 0.95.2

 Attachments: HBASE-8705_1.patch, HBASE-8705_2.patch, HBASE-8705.patch


 This bug may be minor as it likely to happen in a single node setup.
 I restarted the RS holding META. The master tried assigning META using 
 MetaSSH. But tried this before the new RS came up.
 So as not region plan is found 
 {code}
  if (plan == null) {
 LOG.warn(Unable to determine a plan to assign  + region);
 if (tomActivated){
   this.timeoutMonitor.setAllRegionServersOffline(true);
 } else {
   regionStates.updateRegionState(region, 
 RegionState.State.FAILED_OPEN);
 }
 return;
   }
 {code}
 we just return without assigment.  And this being the META the small cluster 
 just hangs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8705) RS holding META when restarted in a single node setup may hang infinitely without META assignment

2013-07-22 Thread rajeshbabu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715076#comment-13715076
 ] 

rajeshbabu commented on HBASE-8705:
---

I mean we can retry until we find one online server to assign META.
After reaching max attempts if we reset i value to 0(only for META region), 
then we can check for online servers even after max attempts. If any one RS is 
online then META will be assinged.
{code}
  if (plan == null) {
LOG.warn(Unable to determine a plan to assign  + region);
if (tomActivated){
  this.timeoutMonitor.setAllRegionServersOffline(true);
} else {
  if (region.isMetaRegion()) {
try {
  if (i == maximumAttempts) {
i = 0;
  }
  Thread.sleep(this.sleepTimeBeforeRetryingMetaAssignment);
  continue;
} catch (InterruptedException e) {
  LOG.error(Got exception while waiting for META assignment);
  Thread.currentThread().interrupt();
}
  }
  regionStates.updateRegionState(region, RegionState.State.FAILED_OPEN);
}
return;
  }
{code}

 RS holding META when restarted in a single node setup may hang infinitely 
 without META assignment
 -

 Key: HBASE-8705
 URL: https://issues.apache.org/jira/browse/HBASE-8705
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.95.1
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan
Priority: Minor
 Fix For: 0.98.0, 0.95.2

 Attachments: HBASE-8705_1.patch, HBASE-8705_2.patch, HBASE-8705.patch


 This bug may be minor as it likely to happen in a single node setup.
 I restarted the RS holding META. The master tried assigning META using 
 MetaSSH. But tried this before the new RS came up.
 So as not region plan is found 
 {code}
  if (plan == null) {
 LOG.warn(Unable to determine a plan to assign  + region);
 if (tomActivated){
   this.timeoutMonitor.setAllRegionServersOffline(true);
 } else {
   regionStates.updateRegionState(region, 
 RegionState.State.FAILED_OPEN);
 }
 return;
   }
 {code}
 we just return without assigment.  And this being the META the small cluster 
 just hangs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8015) Support for Namespaces

2013-07-22 Thread Francis Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715085#comment-13715085
 ] 

Francis Liu commented on HBASE-8015:


@enis it looks like ':' is not in this list of not-allowed characters in ZK. I 
naively tried it and it works. Just curious I'm not familiar with the HBase on 
windows effort. They won't be running HBase on HDFS? But on a windows DFS?

@stack I believe enis is suggesting on using a different delimiter when storing 
FQTN as part of filenames on hdfs.

 Support for Namespaces
 --

 Key: HBASE-8015
 URL: https://issues.apache.org/jira/browse/HBASE-8015
 Project: HBase
  Issue Type: New Feature
Reporter: Francis Liu
Assignee: Francis Liu
 Attachments: HBASE-8015_draft_94.patch, Namespace Design.pdf




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8662) [rest] support impersonation

2013-07-22 Thread Francis Liu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715090#comment-13715090
 ] 

Francis Liu commented on HBASE-8662:


I believe the 0.94 sample patch I provided is backwards compatible (including 
the rpc). I only need to update it with the minor differences in the trunk 
patch.

 [rest] support impersonation
 

 Key: HBASE-8662
 URL: https://issues.apache.org/jira/browse/HBASE-8662
 Project: HBase
  Issue Type: Sub-task
  Components: REST, security
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
 Fix For: 0.98.0, 0.95.2

 Attachments: method_doas.patch, secure_rest.patch, trunk-8662.patch, 
 trunk-8662_v2.patch, trunk-8662_v3.patch, trunk-8662_v4.patch, 
 trunk-8662_v5.patch, trunk-8662_v6.patch, trunk-8662_v7.1.patch, 
 trunk-8662_v7.2.patch, trunk-8662_v7.3.patch, trunk-8662_v7.patch


 Currently, our client API uses a fixed user: the current user. It should 
 accept a user passed in, if authenticated.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8705) RS holding META when restarted in a single node setup may hang infinitely without META assignment

2013-07-22 Thread ramkrishna.s.vasudevan (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715094#comment-13715094
 ] 

ramkrishna.s.vasudevan commented on HBASE-8705:
---

I got your point. If you are running a single cluster setup
{code}
sleepTimeBeforeRetryingMetaAssignment
{code}
Can you increase this config and if needed the maxTries also. Should a fix in 
the code needed here?

 RS holding META when restarted in a single node setup may hang infinitely 
 without META assignment
 -

 Key: HBASE-8705
 URL: https://issues.apache.org/jira/browse/HBASE-8705
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.95.1
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan
Priority: Minor
 Fix For: 0.98.0, 0.95.2

 Attachments: HBASE-8705_1.patch, HBASE-8705_2.patch, HBASE-8705.patch


 This bug may be minor as it likely to happen in a single node setup.
 I restarted the RS holding META. The master tried assigning META using 
 MetaSSH. But tried this before the new RS came up.
 So as not region plan is found 
 {code}
  if (plan == null) {
 LOG.warn(Unable to determine a plan to assign  + region);
 if (tomActivated){
   this.timeoutMonitor.setAllRegionServersOffline(true);
 } else {
   regionStates.updateRegionState(region, 
 RegionState.State.FAILED_OPEN);
 }
 return;
   }
 {code}
 we just return without assigment.  And this being the META the small cluster 
 just hangs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8705) RS holding META when restarted in a single node setup may hang infinitely without META assignment

2013-07-22 Thread rajeshbabu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715100#comment-13715100
 ] 

rajeshbabu commented on HBASE-8705:
---

bq. Can you increase this config and if needed the maxTries also. 
We can increase these values, but may not know the delay before starting new RS.
bq. Should a fix in the code needed here?
It will be better I feel. 

 RS holding META when restarted in a single node setup may hang infinitely 
 without META assignment
 -

 Key: HBASE-8705
 URL: https://issues.apache.org/jira/browse/HBASE-8705
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.95.1
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan
Priority: Minor
 Fix For: 0.98.0, 0.95.2

 Attachments: HBASE-8705_1.patch, HBASE-8705_2.patch, HBASE-8705.patch


 This bug may be minor as it likely to happen in a single node setup.
 I restarted the RS holding META. The master tried assigning META using 
 MetaSSH. But tried this before the new RS came up.
 So as not region plan is found 
 {code}
  if (plan == null) {
 LOG.warn(Unable to determine a plan to assign  + region);
 if (tomActivated){
   this.timeoutMonitor.setAllRegionServersOffline(true);
 } else {
   regionStates.updateRegionState(region, 
 RegionState.State.FAILED_OPEN);
 }
 return;
   }
 {code}
 we just return without assigment.  And this being the META the small cluster 
 just hangs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8705) RS holding META when restarted in a single node setup may hang infinitely without META assignment

2013-07-22 Thread ramkrishna.s.vasudevan (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8705?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715151#comment-13715151
 ] 

ramkrishna.s.vasudevan commented on HBASE-8705:
---

Okie.. Come up with a patch.  If others agree we can commit it.  

 RS holding META when restarted in a single node setup may hang infinitely 
 without META assignment
 -

 Key: HBASE-8705
 URL: https://issues.apache.org/jira/browse/HBASE-8705
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.95.1
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan
Priority: Minor
 Fix For: 0.98.0, 0.95.2

 Attachments: HBASE-8705_1.patch, HBASE-8705_2.patch, HBASE-8705.patch


 This bug may be minor as it likely to happen in a single node setup.
 I restarted the RS holding META. The master tried assigning META using 
 MetaSSH. But tried this before the new RS came up.
 So as not region plan is found 
 {code}
  if (plan == null) {
 LOG.warn(Unable to determine a plan to assign  + region);
 if (tomActivated){
   this.timeoutMonitor.setAllRegionServersOffline(true);
 } else {
   regionStates.updateRegionState(region, 
 RegionState.State.FAILED_OPEN);
 }
 return;
   }
 {code}
 we just return without assigment.  And this being the META the small cluster 
 just hangs.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8900) TestRSKilledWhenMasterInitializing.testCorrectnessWhenMasterFailOver is flakey

2013-07-22 Thread ramkrishna.s.vasudevan (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715150#comment-13715150
 ] 

ramkrishna.s.vasudevan commented on HBASE-8900:
---

Actually here we have got an HDFS exception
{code}
Could not obtain block: 
BP-1276573177-67.195.138.60-1373240501128:blk_-2451087782628280101_1034 
file=/user/jenkins/hbase/testCorrectnessWhenMasterFailOver/fbf43cbd6a7509dbedc9f0fa410c46a9/recovered.edits/002
{code}

also AccessControlException which has prevented the HlogReader to read the file 
after failover.  Hence once of the regions continues to be in RIT and the 
testcase timed out. 

 TestRSKilledWhenMasterInitializing.testCorrectnessWhenMasterFailOver is flakey
 --

 Key: HBASE-8900
 URL: https://issues.apache.org/jira/browse/HBASE-8900
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: ramkrishna.s.vasudevan
 Attachments: 8900.txt


 Failed here:
 https://builds.apache.org/job/hbase-0.95-on-hadoop2/169/testReport/junit/org.apache.hadoop.hbase.regionserver/TestRSKilledWhenMasterInitializing/testCorrectnessWhenMasterFailOver/
 and
 http://54.241.6.143/job/HBase-0.95-Hadoop-2/579/org.apache.hbase$hbase-server/testReport/junit/org.apache.hadoop.hbase.regionserver/TestRSKilledWhenMasterInitializing/org_apache_hadoop_hbase_regionserver_TestRSKilledWhenMasterInitializing/
 {code}
 java.lang.Exception: test timed out after 12 milliseconds
   at java.lang.Thread.sleep(Native Method)
   at 
 org.apache.hadoop.hbase.zookeeper.ZKAssign.blockUntilNoRIT(ZKAssign.java:1002)
   at 
 org.apache.hadoop.hbase.regionserver.TestRSKilledWhenMasterInitializing.testCorrectnessWhenMasterFailOver(TestRSKilledWhenMasterInitializing.java:177)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
 {code}
 and with this:
 {code}
 java.lang.NullPointerException
   at 
 org.apache.hadoop.hbase.regionserver.TestRSKilledWhenMasterInitializing.tearDownAfterClass(TestRSKilledWhenMasterInitializing.java:83)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:33)
   at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
   at org.junit.runners.Suite.runChild(Suite.java:127)
   at org.junit.runners.Suite.runChild(Suite.java:26)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439)
   at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
   at java.util.concurrent.FutureTask.run(FutureTask.java:138)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
   at java.lang.Thread.run(Thread.java:662)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8353) -ROOT-/.META. regions are hanging if master restarted while closing -ROOT-/.META. regions on dead RS

2013-07-22 Thread ramkrishna.s.vasudevan (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8353?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715152#comment-13715152
 ] 

ramkrishna.s.vasudevan commented on HBASE-8353:
---

I will test all the scenarios including rolling restart tomorrow morning IST 
and let you know.
Have you tried this down in a cluster.  

 -ROOT-/.META. regions are hanging if master restarted while closing 
 -ROOT-/.META. regions on dead RS
 

 Key: HBASE-8353
 URL: https://issues.apache.org/jira/browse/HBASE-8353
 Project: HBase
  Issue Type: Bug
  Components: Region Assignment
Affects Versions: 0.94.6
Reporter: rajeshbabu
Assignee: rajeshbabu
 Fix For: 0.94.11

 Attachments: HBASE-8353_94_2.patch, HBASE-8353_94_3.patch, 
 HBASE-8353_94_4.patch, HBASE-8353_94.patch


 ROOT/META are not getting assigned if master restarted while closing 
 ROOT/META.
 Lets suppose catalog table regions in M_ZK_REGION_CLOSING state during master 
 initialization and then just we are adding the them to RIT and waiting for 
 TM. {code}
 if (isOnDeadServer(regionInfo, deadServers) 
 (data.getOrigin() == null || 
 !serverManager.isServerOnline(data.getOrigin( {
   // If was on dead server, its closed now. Force to OFFLINE and this
   // will get it reassigned if appropriate
   forceOffline(regionInfo, data);
 } else {
   // Just insert region into RIT.
   // If this never updates the timeout will trigger new assignment
   regionsInTransition.put(encodedRegionName, new RegionState(
 regionInfo, RegionState.State.CLOSING,
 data.getStamp(), data.getOrigin()));
 }
 {code}
 isOnDeadServer always return false to ROOT/META because deadServers is null.
 Even TM cannot close them properly because its not available in online 
 regions since its not yet assigned.
 {code}
 synchronized (this.regions) {
   // Check if this region is currently assigned
   if (!regions.containsKey(region)) {
 LOG.debug(Attempted to unassign region  +
   region.getRegionNameAsString() +  but it is not  +
   currently assigned anywhere);
 return;
   }
 }
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8947) Thrift 2 : Replace bool writeToWAL with TDurability durability

2013-07-22 Thread Lars George (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715271#comment-13715271
 ] 

Lars George commented on HBASE-8947:


If possible, you could somehow emulate the various settings and check if the 
resulting object reacts the way it should? Somehow we need to make sure that 
the above strategy works.

 Thrift 2 : Replace bool writeToWAL with TDurability durability 
 ---

 Key: HBASE-8947
 URL: https://issues.apache.org/jira/browse/HBASE-8947
 Project: HBase
  Issue Type: Sub-task
  Components: Thrift
Reporter: Hamed Madani
Assignee: Hamed Madani
 Attachments: HBASE-8947.patch, HBASE-8947-v2.patch


 Introduce new enum *TDurability* to expose more options for *Write To Wal.* 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-9002) TestDistributedLogSplitting.testRecoveredEdits fails

2013-07-22 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9002?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715310#comment-13715310
 ] 

stack commented on HBASE-9002:
--

+1  Makes sense.  Thanks [~jeffreyz]

 TestDistributedLogSplitting.testRecoveredEdits fails
 

 Key: HBASE-9002
 URL: https://issues.apache.org/jira/browse/HBASE-9002
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: Jeffrey Zhong
 Fix For: 0.95.2

 Attachments: hbase-9002.patch


 https://builds.apache.org/job/hbase-0.95/342/testReport/org.apache.hadoop.hbase.master/TestDistributedLogSplitting/testRecoveredEdits/
 {code}
 java.lang.AssertionError: expected:1000 but was:0
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.master.TestDistributedLogSplitting.testRecoveredEdits(TestDistributedLogSplitting.java:209)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:601)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
 {code}
 [~jxiang] or [~jeffreyz] want to take a look at this one?

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8984) Test online snapshots with online merge

2013-07-22 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715320#comment-13715320
 ] 

Ted Yu commented on HBASE-8984:
---

Overall looks good.

I don't see where the admin is closed. Please add the close() call.

 Test online snapshots with online merge
 ---

 Key: HBASE-8984
 URL: https://issues.apache.org/jira/browse/HBASE-8984
 Project: HBase
  Issue Type: Test
  Components: snapshots
Affects Versions: 0.95.1
Reporter: Matteo Bertozzi
Assignee: Matteo Bertozzi
Priority: Minor
 Fix For: 0.95.2

 Attachments: HBASE-8984.patch


 Add unit tests to verify that after an online merge:
  * taking a snapshot still works
  * snapshots and cloned tables are still in a good state.
 Also on the same line, fix the unit test to have more than one region, and 
 test multi RS/multi regions online snapshot.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8900) TestRSKilledWhenMasterInitializing.testCorrectnessWhenMasterFailOver is flakey

2013-07-22 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715346#comment-13715346
 ] 

stack commented on HBASE-8900:
--

I am going to remove from trunk too.

Here we have one of those silent failures.  If I compare a list of tests that 
passed on successful run to those that show on this fail I get this difference:

{code}
durruti:trunk stack$ diff /tmp/bad_trunk.txt /tmp/good_trunk.txt
91a92
 Running org.apache.hadoop.hbase.mapreduce.TestMultiTableInputFormat
159a161
 Running 
 org.apache.hadoop.hbase.regionserver.TestRSKilledWhenMasterInitializing
176a179,180
 Running org.apache.hadoop.hbase.regionserver.wal.TestHLogSplitCompressed
 Running org.apache.hadoop.hbase.regionserver.wal.TestLogRollAbort
183,185c187,188
 Running org.apache.hadoop.hbase.replication.TestReplicationKillMasterRS
 Running 
org.apache.hadoop.hbase.replication.TestReplicationKillMasterRSCompressed
 Running org.apache.hadoop.hbase.replication.TestReplicationKillSlaveRS
---
 Running org.apache.hadoop.hbase.replication.TestReplicationQueueFailover
 Running 
 org.apache.hadoop.hbase.replication.TestReplicationQueueFailoverCompressed
200a204
 Running org.apache.hadoop.hbase.rest.client.TestRemoteAdmin
{code}

TestMultiTableInputFormat has been removed already.  
TestReplicationKillMasterRS is likely new since the good run.  etc.

TestRSKilledWhenMasterInitializing is in the list.  I'm going to remove it for 
now until it has been fixed.  It is already removed from 0.95.  Doing same for 
trunk so can get clean builds.



 TestRSKilledWhenMasterInitializing.testCorrectnessWhenMasterFailOver is flakey
 --

 Key: HBASE-8900
 URL: https://issues.apache.org/jira/browse/HBASE-8900
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: ramkrishna.s.vasudevan
 Attachments: 8900.txt


 Failed here:
 https://builds.apache.org/job/hbase-0.95-on-hadoop2/169/testReport/junit/org.apache.hadoop.hbase.regionserver/TestRSKilledWhenMasterInitializing/testCorrectnessWhenMasterFailOver/
 and
 http://54.241.6.143/job/HBase-0.95-Hadoop-2/579/org.apache.hbase$hbase-server/testReport/junit/org.apache.hadoop.hbase.regionserver/TestRSKilledWhenMasterInitializing/org_apache_hadoop_hbase_regionserver_TestRSKilledWhenMasterInitializing/
 {code}
 java.lang.Exception: test timed out after 12 milliseconds
   at java.lang.Thread.sleep(Native Method)
   at 
 org.apache.hadoop.hbase.zookeeper.ZKAssign.blockUntilNoRIT(ZKAssign.java:1002)
   at 
 org.apache.hadoop.hbase.regionserver.TestRSKilledWhenMasterInitializing.testCorrectnessWhenMasterFailOver(TestRSKilledWhenMasterInitializing.java:177)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
 {code}
 and with this:
 {code}
 java.lang.NullPointerException
   at 
 org.apache.hadoop.hbase.regionserver.TestRSKilledWhenMasterInitializing.tearDownAfterClass(TestRSKilledWhenMasterInitializing.java:83)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:33)
   at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
   at org.junit.runners.Suite.runChild(Suite.java:127)
   at org.junit.runners.Suite.runChild(Suite.java:26)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439)
   at 

[jira] [Commented] (HBASE-8662) [rest] support impersonation

2013-07-22 Thread Jimmy Xiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8662?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715348#comment-13715348
 ] 

Jimmy Xiang commented on HBASE-8662:


That wound be great. It is fine with me.

 [rest] support impersonation
 

 Key: HBASE-8662
 URL: https://issues.apache.org/jira/browse/HBASE-8662
 Project: HBase
  Issue Type: Sub-task
  Components: REST, security
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
 Fix For: 0.98.0, 0.95.2

 Attachments: method_doas.patch, secure_rest.patch, trunk-8662.patch, 
 trunk-8662_v2.patch, trunk-8662_v3.patch, trunk-8662_v4.patch, 
 trunk-8662_v5.patch, trunk-8662_v6.patch, trunk-8662_v7.1.patch, 
 trunk-8662_v7.2.patch, trunk-8662_v7.3.patch, trunk-8662_v7.patch


 Currently, our client API uses a fixed user: the current user. It should 
 accept a user passed in, if authenticated.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8900) TestRSKilledWhenMasterInitializing.testCorrectnessWhenMasterFailOver is flakey

2013-07-22 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715347#comment-13715347
 ] 

stack commented on HBASE-8900:
--

Hmm... nvm.  Already removed above.  I was comparing old builds.

 TestRSKilledWhenMasterInitializing.testCorrectnessWhenMasterFailOver is flakey
 --

 Key: HBASE-8900
 URL: https://issues.apache.org/jira/browse/HBASE-8900
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: ramkrishna.s.vasudevan
 Attachments: 8900.txt


 Failed here:
 https://builds.apache.org/job/hbase-0.95-on-hadoop2/169/testReport/junit/org.apache.hadoop.hbase.regionserver/TestRSKilledWhenMasterInitializing/testCorrectnessWhenMasterFailOver/
 and
 http://54.241.6.143/job/HBase-0.95-Hadoop-2/579/org.apache.hbase$hbase-server/testReport/junit/org.apache.hadoop.hbase.regionserver/TestRSKilledWhenMasterInitializing/org_apache_hadoop_hbase_regionserver_TestRSKilledWhenMasterInitializing/
 {code}
 java.lang.Exception: test timed out after 12 milliseconds
   at java.lang.Thread.sleep(Native Method)
   at 
 org.apache.hadoop.hbase.zookeeper.ZKAssign.blockUntilNoRIT(ZKAssign.java:1002)
   at 
 org.apache.hadoop.hbase.regionserver.TestRSKilledWhenMasterInitializing.testCorrectnessWhenMasterFailOver(TestRSKilledWhenMasterInitializing.java:177)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
 {code}
 and with this:
 {code}
 java.lang.NullPointerException
   at 
 org.apache.hadoop.hbase.regionserver.TestRSKilledWhenMasterInitializing.tearDownAfterClass(TestRSKilledWhenMasterInitializing.java:83)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:33)
   at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
   at org.junit.runners.Suite.runChild(Suite.java:127)
   at org.junit.runners.Suite.runChild(Suite.java:26)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439)
   at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
   at java.util.concurrent.FutureTask.run(FutureTask.java:138)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
   at java.lang.Thread.run(Thread.java:662)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8984) Test online snapshots with online merge

2013-07-22 Thread Matteo Bertozzi (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715338#comment-13715338
 ] 

Matteo Bertozzi commented on HBASE-8984:


the admin instance is owned by the HBaseTestingUtility and closed by 
shutdownMiniHBaseCluster()
part of the @AfterClass cleanupTest() that contains UTIL.shutdownMiniCluster();

 Test online snapshots with online merge
 ---

 Key: HBASE-8984
 URL: https://issues.apache.org/jira/browse/HBASE-8984
 Project: HBase
  Issue Type: Test
  Components: snapshots
Affects Versions: 0.95.1
Reporter: Matteo Bertozzi
Assignee: Matteo Bertozzi
Priority: Minor
 Fix For: 0.95.2

 Attachments: HBASE-8984.patch


 Add unit tests to verify that after an online merge:
  * taking a snapshot still works
  * snapshots and cloned tables are still in a good state.
 Also on the same line, fix the unit test to have more than one region, and 
 test multi RS/multi regions online snapshot.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HBASE-9012) TestBlockReorder.testBlockLocationReorder fails

2013-07-22 Thread stack (JIRA)
stack created HBASE-9012:


 Summary: TestBlockReorder.testBlockLocationReorder fails
 Key: HBASE-9012
 URL: https://issues.apache.org/jira/browse/HBASE-9012
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack


http://54.241.6.143/job/HBase-0.95/669/org.apache.hbase$hbase-server/testReport/junit/org.apache.hadoop.hbase.fs/TestBlockReorder/testBlockLocationReorder/

java.net.BindException: Address already in use
at java.net.PlainSocketImpl.socketBind(Native Method)
at java.net.PlainSocketImpl.bind(PlainSocketImpl.java:383)
at java.net.ServerSocket.bind(ServerSocket.java:328)
at java.net.ServerSocket.init(ServerSocket.java:194)
at java.net.ServerSocket.init(ServerSocket.java:106)
at 
org.apache.hadoop.hbase.fs.TestBlockReorder.testBlockLocationReorder(TestBlockReorder.java:182)



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8900) TestRSKilledWhenMasterInitializing.testCorrectnessWhenMasterFailOver is flakey

2013-07-22 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8900?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715306#comment-13715306
 ] 

stack commented on HBASE-8900:
--

[~ram_krish] Anything in the log on what happened to the block?  On the ACE 
issue, was the log from before I disabled short-circuit read for all unit tests 
do you know?  Thanks for taking a look.

 TestRSKilledWhenMasterInitializing.testCorrectnessWhenMasterFailOver is flakey
 --

 Key: HBASE-8900
 URL: https://issues.apache.org/jira/browse/HBASE-8900
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: ramkrishna.s.vasudevan
 Attachments: 8900.txt


 Failed here:
 https://builds.apache.org/job/hbase-0.95-on-hadoop2/169/testReport/junit/org.apache.hadoop.hbase.regionserver/TestRSKilledWhenMasterInitializing/testCorrectnessWhenMasterFailOver/
 and
 http://54.241.6.143/job/HBase-0.95-Hadoop-2/579/org.apache.hbase$hbase-server/testReport/junit/org.apache.hadoop.hbase.regionserver/TestRSKilledWhenMasterInitializing/org_apache_hadoop_hbase_regionserver_TestRSKilledWhenMasterInitializing/
 {code}
 java.lang.Exception: test timed out after 12 milliseconds
   at java.lang.Thread.sleep(Native Method)
   at 
 org.apache.hadoop.hbase.zookeeper.ZKAssign.blockUntilNoRIT(ZKAssign.java:1002)
   at 
 org.apache.hadoop.hbase.regionserver.TestRSKilledWhenMasterInitializing.testCorrectnessWhenMasterFailOver(TestRSKilledWhenMasterInitializing.java:177)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
 {code}
 and with this:
 {code}
 java.lang.NullPointerException
   at 
 org.apache.hadoop.hbase.regionserver.TestRSKilledWhenMasterInitializing.tearDownAfterClass(TestRSKilledWhenMasterInitializing.java:83)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:33)
   at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
   at org.junit.runners.Suite.runChild(Suite.java:127)
   at org.junit.runners.Suite.runChild(Suite.java:26)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439)
   at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
   at java.util.concurrent.FutureTask.run(FutureTask.java:138)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
   at java.lang.Thread.run(Thread.java:662)
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8780) a column Family can have VERSIONS less than zero

2013-07-22 Thread Demai Ni (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715367#comment-13715367
 ] 

Demai Ni commented on HBASE-8780:
-

Anoop, Ted and Chunhui, many thanks for the help and review. Is this patch 
ready to be committed to trunk? any action I should take at this moment? 
thanks... Demai

 a column Family can have VERSIONS less than zero 
 -

 Key: HBASE-8780
 URL: https://issues.apache.org/jira/browse/HBASE-8780
 Project: HBase
  Issue Type: Bug
  Components: Client
Affects Versions: 0.94.8
Reporter: Demai Ni
Assignee: Demai Ni
Priority: Trivial
 Attachments: HBASE-8780-0.94.8-v0.patch, HBASE-8780-test.txt, 
 HBASE-8780-trunk.patch, HBASE-8780-trunk.patch, HBASE-8780-trunk-v1.patch, 
 HBASE-8780-trunk-v2.patch


 User can create/alter a columnfam and set its VERSION(aka maxVERSIONS) to a 
 negative or zero value. Although there is a checking in 
 HColumnDesciptor#construtor, hbase shell command will invoke the 
 setter(setMaxVersions and setMinVersions) directly, hence by pass the 
 checking.  For example:
 {code:title=set VERSIONS = -1}
 hbase(main):016:0 create 't5_dn',{NAME='cf1',VERSIONS=-1}
 0 row(s) in 1.0420 seconds
 hbase(main):017:0 put 't5_dn','row1','cf1:q1','row1cf1_v1'
 0 row(s) in 0.0700 seconds
 hbase(main):018:0 scan 't5_dn'
 ROW   COLUMN+CELL 
  
 0 row(s) in 0.0090 seconds
 hbase(main):019:0 describe 't5_dn'
 DESCRIPTION  ENABLED  
   
 't5_dn', {NAME = 'cf1', REPLICATION_SCOPE = '0',  true  
 KEEP_DELETED_CELLS = 'false', COMPRESSION = 'NONE   
  
 ', ENCODE_ON_DISK = 'true', BLOCKCACHE = 'true',
 MIN_VERSIONS = '0', DATA_BLOCK_ENCODING = 'NONE',   
  
  IN_MEMORY = 'false', BLOOMFILTER = 'NONE', TTL =   
  
  '2147483647', VERSIONS = '-1', BLOCKSIZE = '655   
   
 36'}  
 1 row(s) in 0.0410 seconds
 {code}
 above example shows VERSIONS = '-1', and put/scan doesn't keep the data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8414) Hbck still refers to -ROOT- table to locate .META.

2013-07-22 Thread Himanshu Vashishtha (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8414?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715365#comment-13715365
 ] 

Himanshu Vashishtha commented on HBASE-8414:


I didn't know until today that 8627 was not committed. That one has a patch and 
reviews looong time ago.

 Hbck still refers to -ROOT- table to locate .META.
 --

 Key: HBASE-8414
 URL: https://issues.apache.org/jira/browse/HBASE-8414
 Project: HBase
  Issue Type: Bug
  Components: hbck
Affects Versions: 0.95.0
Reporter: Himanshu Vashishtha
Assignee: Himanshu Vashishtha

 In the current ROOT-less trunk, hbck still tries to fix meta by looking its 
 location in the .ROOT. table. This happens if there is no .META. assigned 
 when hbck is ran.
 HbaseFsck.java:
 {code}
 boolean checkMetaRegion() {
 ...
   HRegionLocation rootLocation = connection.locateRegion(
 HConstants.ROOT_TABLE_NAME, HConstants.EMPTY_START_ROW);
 ...
 }
 {code}
 Running hbck while meta is in transition:
 {code}
 bin/hbase hbck
 Version: 0.95.0-SNAPSHOT
 ERROR: META region or some of its attributes are null.
 ERROR: Fatal error: unable to get root region location. Exiting...
 Summary:
 2 inconsistencies detected.
 Status: INCONSISTENT
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-9012) TestBlockReorder.testBlockLocationReorder fails

2013-07-22 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715371#comment-13715371
 ] 

stack commented on HBASE-9012:
--

Test was added by HBASE-6435.

[~nkeywal] What you think?  The DN had not gone down by the time we call:

ServerSocket ss = new ServerSocket(port);// We're taking the port to have a 
timeout issue later.






 TestBlockReorder.testBlockLocationReorder fails
 ---

 Key: HBASE-9012
 URL: https://issues.apache.org/jira/browse/HBASE-9012
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack

 http://54.241.6.143/job/HBase-0.95/669/org.apache.hbase$hbase-server/testReport/junit/org.apache.hadoop.hbase.fs/TestBlockReorder/testBlockLocationReorder/
 java.net.BindException: Address already in use
   at java.net.PlainSocketImpl.socketBind(Native Method)
   at java.net.PlainSocketImpl.bind(PlainSocketImpl.java:383)
   at java.net.ServerSocket.bind(ServerSocket.java:328)
   at java.net.ServerSocket.init(ServerSocket.java:194)
   at java.net.ServerSocket.init(ServerSocket.java:106)
   at 
 org.apache.hadoop.hbase.fs.TestBlockReorder.testBlockLocationReorder(TestBlockReorder.java:182)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-9012) TestBlockReorder.testBlockLocationReorder fails

2013-07-22 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-9012:
-

Attachment: 9012.txt

Retry bindexceptions.  What you think [~nkeywal]  If good, I'll commit (come to 
think of it, @nkeywal is on vacation this week...)

 TestBlockReorder.testBlockLocationReorder fails
 ---

 Key: HBASE-9012
 URL: https://issues.apache.org/jira/browse/HBASE-9012
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
 Attachments: 9012.txt


 http://54.241.6.143/job/HBase-0.95/669/org.apache.hbase$hbase-server/testReport/junit/org.apache.hadoop.hbase.fs/TestBlockReorder/testBlockLocationReorder/
 java.net.BindException: Address already in use
   at java.net.PlainSocketImpl.socketBind(Native Method)
   at java.net.PlainSocketImpl.bind(PlainSocketImpl.java:383)
   at java.net.ServerSocket.bind(ServerSocket.java:328)
   at java.net.ServerSocket.init(ServerSocket.java:194)
   at java.net.ServerSocket.init(ServerSocket.java:106)
   at 
 org.apache.hadoop.hbase.fs.TestBlockReorder.testBlockLocationReorder(TestBlockReorder.java:182)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8984) Test online snapshots with online merge

2013-07-22 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8984?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715343#comment-13715343
 ] 

Ted Yu commented on HBASE-8984:
---

Good.

 Test online snapshots with online merge
 ---

 Key: HBASE-8984
 URL: https://issues.apache.org/jira/browse/HBASE-8984
 Project: HBase
  Issue Type: Test
  Components: snapshots
Affects Versions: 0.95.1
Reporter: Matteo Bertozzi
Assignee: Matteo Bertozzi
Priority: Minor
 Fix For: 0.95.2

 Attachments: HBASE-8984.patch


 Add unit tests to verify that after an online merge:
  * taking a snapshot still works
  * snapshots and cloned tables are still in a good state.
 Also on the same line, fix the unit test to have more than one region, and 
 test multi RS/multi regions online snapshot.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-9012) TestBlockReorder.testBlockLocationReorder fails

2013-07-22 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-9012:
-

Assignee: stack
  Status: Patch Available  (was: Open)

 TestBlockReorder.testBlockLocationReorder fails
 ---

 Key: HBASE-9012
 URL: https://issues.apache.org/jira/browse/HBASE-9012
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: stack
 Attachments: 9012.txt


 http://54.241.6.143/job/HBase-0.95/669/org.apache.hbase$hbase-server/testReport/junit/org.apache.hadoop.hbase.fs/TestBlockReorder/testBlockLocationReorder/
 java.net.BindException: Address already in use
   at java.net.PlainSocketImpl.socketBind(Native Method)
   at java.net.PlainSocketImpl.bind(PlainSocketImpl.java:383)
   at java.net.ServerSocket.bind(ServerSocket.java:328)
   at java.net.ServerSocket.init(ServerSocket.java:194)
   at java.net.ServerSocket.init(ServerSocket.java:106)
   at 
 org.apache.hadoop.hbase.fs.TestBlockReorder.testBlockLocationReorder(TestBlockReorder.java:182)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8983) HBaseConnection#deleteAllConnections does not always delete

2013-07-22 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715414#comment-13715414
 ] 

Ted Yu commented on HBASE-8983:
---

v3 is fine with me too.

 HBaseConnection#deleteAllConnections does not always delete
 ---

 Key: HBASE-8983
 URL: https://issues.apache.org/jira/browse/HBASE-8983
 Project: HBase
  Issue Type: Bug
  Components: Client
Affects Versions: 0.95.1
Reporter: Nicolas Liochon
Assignee: Nicolas Liochon
Priority: Trivial
 Fix For: 0.98.0, 0.95.2

 Attachments: 8983.v1.patch, 8983.v2.patch, 8983.v3.patch, 8983-v4.txt


 Cf; mailing list 
 http://search-hadoop.com/m/wurpu1s8Fhs/liochonsubj=Re+Connection+reference+counting

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8983) HBaseConnection#deleteAllConnections does not always delete

2013-07-22 Thread Jean-Daniel Cryans (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715405#comment-13715405
 ] 

Jean-Daniel Cryans commented on HBASE-8983:
---

[~yuzhih...@gmail.com] what was wrong with v3?

 HBaseConnection#deleteAllConnections does not always delete
 ---

 Key: HBASE-8983
 URL: https://issues.apache.org/jira/browse/HBASE-8983
 Project: HBase
  Issue Type: Bug
  Components: Client
Affects Versions: 0.95.1
Reporter: Nicolas Liochon
Assignee: Nicolas Liochon
Priority: Trivial
 Fix For: 0.98.0, 0.95.2

 Attachments: 8983.v1.patch, 8983.v2.patch, 8983.v3.patch, 8983-v4.txt


 Cf; mailing list 
 http://search-hadoop.com/m/wurpu1s8Fhs/liochonsubj=Re+Connection+reference+counting

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8954) TestSplitLogWorker#testPreemptTask failed

2013-07-22 Thread Jimmy Xiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-8954:
---

Status: Open  (was: Patch Available)

Looked the trace again.  The log split worker wasn't started at all at that 
moment.

 TestSplitLogWorker#testPreemptTask failed
 -

 Key: HBASE-8954
 URL: https://issues.apache.org/jira/browse/HBASE-8954
 Project: HBase
  Issue Type: Test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Attachments: trunk-8954-2.patch, trunk-8954-v1.patch, 
 trunk-8954_v2.patch


 http://54.241.6.143/job/HBase-0.95-Hadoop-2/org.apache.hbase$hbase-server/627/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitLogWorker/testPreemptTask/
 {noformat}
 java.lang.AssertionError: expected:1 but was:0
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.regionserver.TestSplitLogWorker.testPreemptTask(TestSplitLogWorker.java:211)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
   at 
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
   at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
   at org.junit.runners.Suite.runChild(Suite.java:127)
   at org.junit.runners.Suite.runChild(Suite.java:26)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439)
   at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
   at java.util.concurrent.FutureTask.run(FutureTask.java:138)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
   at java.lang.Thread.run(Thread.java:662)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HBASE-9014) TestHLog.testAppendClose fails

2013-07-22 Thread stack (JIRA)
stack created HBASE-9014:


 Summary: TestHLog.testAppendClose fails
 Key: HBASE-9014
 URL: https://issues.apache.org/jira/browse/HBASE-9014
 Project: HBase
  Issue Type: Bug
Reporter: stack


http://54.241.6.143/job/HBase-0.95/665/org.apache.hbase$hbase-server/testReport/org.apache.hadoop.hbase.regionserver.wal/TestHLog/testAppendClose/

{code}
Error Message

Problem binding to localhost/127.0.0.1:37036 : Address already in use
Stacktrace

java.net.BindException: Problem binding to localhost/127.0.0.1:37036 : Address 
already in use
at org.apache.hadoop.ipc.Server.bind(Server.java:228)
at org.apache.hadoop.ipc.Server$Listener.init(Server.java:302)
at org.apache.hadoop.ipc.Server.init(Server.java:1488)
at org.apache.hadoop.ipc.RPC$Server.init(RPC.java:560)
at org.apache.hadoop.ipc.RPC.getServer(RPC.java:521)
at 
org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:302)
at 
org.apache.hadoop.hdfs.server.namenode.NameNode.init(NameNode.java:536)
at 
org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1410)
at org.apache.hadoop.hdfs.MiniDFSCluster.init(MiniDFSCluster.java:278)
at 
org.apache.hadoop.hbase.HBaseTestingUtility.startMiniDFSClusterForTestHLog(HBaseTestingUtility.java:525)
...
{code}


This testAppendClose stops hdfs and starts it again.  It looks problematic.  
Has waits of 7 seconds for the hdfs cluster to go down but in this test it 
seems like it needs even more time.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HBASE-9013) Backport HBASE-8994 (Adding log to chaos monkey actions to show what're performed) to 0.94

2013-07-22 Thread Andrew Purtell (JIRA)
Andrew Purtell created HBASE-9013:
-

 Summary: Backport HBASE-8994 (Adding log to chaos monkey actions 
to show what're performed) to 0.94
 Key: HBASE-9013
 URL: https://issues.apache.org/jira/browse/HBASE-9013
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.11
Reporter: Andrew Purtell
Assignee: Andrew Purtell
Priority: Minor


Keep the debugability of the 0.94 chaos monkey in line with 0.95/trunk.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-9013) Backport HBASE-8994 (Adding log to chaos monkey actions to show what're performed) to 0.94

2013-07-22 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9013?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715460#comment-13715460
 ] 

stack commented on HBASE-9013:
--

+1

 Backport HBASE-8994 (Adding log to chaos monkey actions to show what're 
 performed) to 0.94
 --

 Key: HBASE-9013
 URL: https://issues.apache.org/jira/browse/HBASE-9013
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.94.11
Reporter: Andrew Purtell
Assignee: Andrew Purtell
Priority: Minor

 Keep the debugability of the 0.94 chaos monkey in line with 0.95/trunk.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-9014) TestHLog.testAppendClose fails

2013-07-22 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9014?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715463#comment-13715463
 ] 

stack commented on HBASE-9014:
--

I tried a few things.  The big long 7 second wait seems pretty necessary 
otherwise I get:

{code}
 3 Tests run: 13, Failures: 0, Errors: 5, Skipped: 0, Time elapsed: 52.038 sec 
 FAILURE!
  4 testAppendClose(org.apache.hadoop.hbase.regionserver.wal.TestHLog)  Time 
elapsed: 40.121 sec   ERROR!
  5 java.io.IOException: Cannot lock storage 
/Users/stack/checkouts/trunk/hbase-server/target/test-data/b7723583-fda7-46c7-a3b5-bde04f2f9b77/dfscluster_945339f9-1cd2-416f-a3e1-0e8a89a4e10a/dfs/name1.
 T#
  6 ,...at 
org.apache.hadoop.hdfs.server.common.Storage$StorageDirectory.lock(Storage.java:599)
  7 ,...at 
org.apache.hadoop.hdfs.server.common.Storage$StorageDirectory.analyzeStorage(Storage.java:452)
  8 ,...at 
org.apache.hadoop.hdfs.server.namenode.FSImage.recoverTransitionRead(FSImage.java:298)
  9 ,...at 
org.apache.hadoop.hdfs.server.namenode.FSDirectory.loadFSImage(FSDirectory.java:100)
 10 ,...at 
org.apache.hadoop.hdfs.server.namenode.FSNamesystem.initialize(FSNamesystem.java:411)
 11 ,...at 
org.apache.hadoop.hdfs.server.namenode.FSNamesystem.init(FSNamesystem.java:379)
 12 ,...at 
org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:284)
 13 ,...at 
org.apache.hadoop.hdfs.server.namenode.NameNode.init(NameNode.java:536)
 14 ,...at 
org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1410)
...
{code}

I am tempted to disable this test but it is kinda important.

Leaving open for now to keep an eye on it.  Any input appreciated.

 TestHLog.testAppendClose fails
 --

 Key: HBASE-9014
 URL: https://issues.apache.org/jira/browse/HBASE-9014
 Project: HBase
  Issue Type: Bug
Reporter: stack

 http://54.241.6.143/job/HBase-0.95/665/org.apache.hbase$hbase-server/testReport/org.apache.hadoop.hbase.regionserver.wal/TestHLog/testAppendClose/
 {code}
 Error Message
 Problem binding to localhost/127.0.0.1:37036 : Address already in use
 Stacktrace
 java.net.BindException: Problem binding to localhost/127.0.0.1:37036 : 
 Address already in use
   at org.apache.hadoop.ipc.Server.bind(Server.java:228)
   at org.apache.hadoop.ipc.Server$Listener.init(Server.java:302)
   at org.apache.hadoop.ipc.Server.init(Server.java:1488)
   at org.apache.hadoop.ipc.RPC$Server.init(RPC.java:560)
   at org.apache.hadoop.ipc.RPC.getServer(RPC.java:521)
   at 
 org.apache.hadoop.hdfs.server.namenode.NameNode.initialize(NameNode.java:302)
   at 
 org.apache.hadoop.hdfs.server.namenode.NameNode.init(NameNode.java:536)
   at 
 org.apache.hadoop.hdfs.server.namenode.NameNode.createNameNode(NameNode.java:1410)
   at org.apache.hadoop.hdfs.MiniDFSCluster.init(MiniDFSCluster.java:278)
   at 
 org.apache.hadoop.hbase.HBaseTestingUtility.startMiniDFSClusterForTestHLog(HBaseTestingUtility.java:525)
 ...
 {code}
 This testAppendClose stops hdfs and starts it again.  It looks problematic.  
 Has waits of 7 seconds for the hdfs cluster to go down but in this test it 
 seems like it needs even more time.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HBASE-9015) Client side coprocessor framework

2013-07-22 Thread Andrew Purtell (JIRA)
Andrew Purtell created HBASE-9015:
-

 Summary: Client side coprocessor framework
 Key: HBASE-9015
 URL: https://issues.apache.org/jira/browse/HBASE-9015
 Project: HBase
  Issue Type: New Feature
Affects Versions: 0.98.0
Reporter: Andrew Purtell


Create a set of pre and post operation hooks within the client where user code 
can be mixed in via upcalls, similar to how server side coprocessors work. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8954) TestSplitLogWorker#testPreemptTask failed

2013-07-22 Thread Jimmy Xiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-8954:
---

Attachment: trunk-8954-3.patch

Attached patch 3, removed creating hconnection when starting the worker.  At 
least this is not needed for distributed log splitting.  It's introduced for 
log replay (HBASE-8680).  It means each worker will hold a HConnection even it 
is not needed all the time.  Can we just create it on demand?

[~jeffreyz], do we really need to create the connection in starting the worker 
for log replay?

 TestSplitLogWorker#testPreemptTask failed
 -

 Key: HBASE-8954
 URL: https://issues.apache.org/jira/browse/HBASE-8954
 Project: HBase
  Issue Type: Test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Attachments: trunk-8954-2.patch, trunk-8954-3.patch, 
 trunk-8954-v1.patch, trunk-8954_v2.patch


 http://54.241.6.143/job/HBase-0.95-Hadoop-2/org.apache.hbase$hbase-server/627/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitLogWorker/testPreemptTask/
 {noformat}
 java.lang.AssertionError: expected:1 but was:0
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.regionserver.TestSplitLogWorker.testPreemptTask(TestSplitLogWorker.java:211)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
   at 
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
   at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
   at org.junit.runners.Suite.runChild(Suite.java:127)
   at org.junit.runners.Suite.runChild(Suite.java:26)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439)
   at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
   at java.util.concurrent.FutureTask.run(FutureTask.java:138)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
   at java.lang.Thread.run(Thread.java:662)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-9015) Client side coprocessor framework

2013-07-22 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9015?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715485#comment-13715485
 ] 

Andrew Purtell commented on HBASE-9015:
---

This was a request I heard multiple times at HBaseCon.

Observers for the client should come first and cover HTableInterface and 
HBaseAdmin, then consider going deeper into HConnection. 

As follow on work, it might be possible to have transparent remote service 
invocation if both ends of a coprocessor Service can be implemented by 
extension code, e.g. a client side Exec coprocessor can be registered via 
configuration, and a factory could build GeneratedHTable01234 implements 
HTableInterface with extra methods from those registered extensions. 

 Client side coprocessor framework
 -

 Key: HBASE-9015
 URL: https://issues.apache.org/jira/browse/HBASE-9015
 Project: HBase
  Issue Type: New Feature
Affects Versions: 0.98.0
Reporter: Andrew Purtell

 Create a set of pre and post operation hooks within the client where user 
 code can be mixed in via upcalls, similar to how server side coprocessors 
 work. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-9012) TestBlockReorder.testBlockLocationReorder fails

2013-07-22 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9012?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715480#comment-13715480
 ] 

Hadoop QA commented on HBASE-9012:
--

{color:green}+1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12593551/9012.txt
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified tests.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

  {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6423//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6423//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6423//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6423//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6423//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6423//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6423//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6423//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6423//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6423//console

This message is automatically generated.

 TestBlockReorder.testBlockLocationReorder fails
 ---

 Key: HBASE-9012
 URL: https://issues.apache.org/jira/browse/HBASE-9012
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: stack
 Attachments: 9012.txt


 http://54.241.6.143/job/HBase-0.95/669/org.apache.hbase$hbase-server/testReport/junit/org.apache.hadoop.hbase.fs/TestBlockReorder/testBlockLocationReorder/
 java.net.BindException: Address already in use
   at java.net.PlainSocketImpl.socketBind(Native Method)
   at java.net.PlainSocketImpl.bind(PlainSocketImpl.java:383)
   at java.net.ServerSocket.bind(ServerSocket.java:328)
   at java.net.ServerSocket.init(ServerSocket.java:194)
   at java.net.ServerSocket.init(ServerSocket.java:106)
   at 
 org.apache.hadoop.hbase.fs.TestBlockReorder.testBlockLocationReorder(TestBlockReorder.java:182)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-9008) Reenable TestReplicationKillSlaveRS.killOneSlaveRS

2013-07-22 Thread Jean-Daniel Cryans (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715498#comment-13715498
 ] 

Jean-Daniel Cryans commented on HBASE-9008:
---

[~stack] that this is failing means either tailing HLogs is broken or that we 
have data loss in 0.96.

Looking at this run 
http://54.241.6.143/job/HBase-TRUNK/org.apache.hbase$hbase-server/428/testReport/org.apache.hadoop.hbase.replication/TestReplicationKillSlaveRS/killOneSlaveRS/,
 I can see that at the end there's 3 rows missing. All the test is doing is 
inserting data in one cluster that replicates to another, and during that time 
we kill one RS on the latter cluster.

 Reenable TestReplicationKillSlaveRS.killOneSlaveRS
 --

 Key: HBASE-9008
 URL: https://issues.apache.org/jira/browse/HBASE-9008
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: Jean-Daniel Cryans

 hbase-9007 disabled it as flakey.  See it for listing of fails and 
 HBASE-8961.  Assigning to [~jdcryans] to take a looksee.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8780) a column Family can have VERSIONS less than zero

2013-07-22 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715503#comment-13715503
 ] 

Ted Yu commented on HBASE-8780:
---

I am having trouble checking in code because of LDAP issue which is being 
worked on by infrastructure team.

 a column Family can have VERSIONS less than zero 
 -

 Key: HBASE-8780
 URL: https://issues.apache.org/jira/browse/HBASE-8780
 Project: HBase
  Issue Type: Bug
  Components: Client
Affects Versions: 0.94.8
Reporter: Demai Ni
Assignee: Demai Ni
Priority: Trivial
 Attachments: HBASE-8780-0.94.8-v0.patch, HBASE-8780-test.txt, 
 HBASE-8780-trunk.patch, HBASE-8780-trunk.patch, HBASE-8780-trunk-v1.patch, 
 HBASE-8780-trunk-v2.patch


 User can create/alter a columnfam and set its VERSION(aka maxVERSIONS) to a 
 negative or zero value. Although there is a checking in 
 HColumnDesciptor#construtor, hbase shell command will invoke the 
 setter(setMaxVersions and setMinVersions) directly, hence by pass the 
 checking.  For example:
 {code:title=set VERSIONS = -1}
 hbase(main):016:0 create 't5_dn',{NAME='cf1',VERSIONS=-1}
 0 row(s) in 1.0420 seconds
 hbase(main):017:0 put 't5_dn','row1','cf1:q1','row1cf1_v1'
 0 row(s) in 0.0700 seconds
 hbase(main):018:0 scan 't5_dn'
 ROW   COLUMN+CELL 
  
 0 row(s) in 0.0090 seconds
 hbase(main):019:0 describe 't5_dn'
 DESCRIPTION  ENABLED  
   
 't5_dn', {NAME = 'cf1', REPLICATION_SCOPE = '0',  true  
 KEEP_DELETED_CELLS = 'false', COMPRESSION = 'NONE   
  
 ', ENCODE_ON_DISK = 'true', BLOCKCACHE = 'true',
 MIN_VERSIONS = '0', DATA_BLOCK_ENCODING = 'NONE',   
  
  IN_MEMORY = 'false', BLOOMFILTER = 'NONE', TTL =   
  
  '2147483647', VERSIONS = '-1', BLOCKSIZE = '655   
   
 36'}  
 1 row(s) in 0.0410 seconds
 {code}
 above example shows VERSIONS = '-1', and put/scan doesn't keep the data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8954) TestSplitLogWorker#testPreemptTask failed

2013-07-22 Thread Jeffrey Zhong (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715511#comment-13715511
 ] 

Jeffrey Zhong commented on HBASE-8954:
--

[~jxiang] I think you're referring to the following code 
{code}
  // initialize a new connection for splitlogworker configuration
  HConnectionManager.getConnection(conf);
{code}

This reason to do a pre-initialization is that during test I found it take 
about 3+ seconds to initialize a connection. Since we want to recovery happen 
immediately, so we make the connection ready ASAP. The connection is cached for 
a configuration instance so there will be only one connection instance for the 
region server.

While you can add a boolean to skip the pre-initialization if 
distributedLogReplay is turn off. Thanks. 

  

 TestSplitLogWorker#testPreemptTask failed
 -

 Key: HBASE-8954
 URL: https://issues.apache.org/jira/browse/HBASE-8954
 Project: HBase
  Issue Type: Test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Attachments: trunk-8954-2.patch, trunk-8954-3.patch, 
 trunk-8954-v1.patch, trunk-8954_v2.patch


 http://54.241.6.143/job/HBase-0.95-Hadoop-2/org.apache.hbase$hbase-server/627/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitLogWorker/testPreemptTask/
 {noformat}
 java.lang.AssertionError: expected:1 but was:0
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.regionserver.TestSplitLogWorker.testPreemptTask(TestSplitLogWorker.java:211)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
   at 
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
   at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
   at org.junit.runners.Suite.runChild(Suite.java:127)
   at org.junit.runners.Suite.runChild(Suite.java:26)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439)
   at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
   at java.util.concurrent.FutureTask.run(FutureTask.java:138)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
   at java.lang.Thread.run(Thread.java:662)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8778) Region assigments scan table directory making them slow for huge tables

2013-07-22 Thread Lars Hofhansl (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-8778:
-

Attachment: 8778-dirmodtime.txt

Trivial patch that does the table dir modtime check before the table dir is 
enumerated.

 Region assigments scan table directory making them slow for huge tables
 ---

 Key: HBASE-8778
 URL: https://issues.apache.org/jira/browse/HBASE-8778
 Project: HBase
  Issue Type: Improvement
Reporter: Dave Latham
Assignee: Dave Latham
 Fix For: 0.98.0, 0.95.2, 0.94.11

 Attachments: 8778-dirmodtime.txt, HBASE-8778-0.94.5.patch, 
 HBASE-8778-0.94.5-v2.patch


 On a table with 130k regions it takes about 3 seconds for a region server to 
 open a region once it has been assigned.
 Watching the threads for a region server running 0.94.5 that is opening many 
 such regions shows the thread opening the reigon in code like this:
 {noformat}
 PRI IPC Server handler 4 on 60020 daemon prio=10 tid=0x2aaac07e9000 
 nid=0x6566 runnable [0x4c46d000]
java.lang.Thread.State: RUNNABLE
 at java.lang.String.indexOf(String.java:1521)
 at java.net.URI$Parser.scan(URI.java:2912)
 at java.net.URI$Parser.parse(URI.java:3004)
 at java.net.URI.init(URI.java:736)
 at org.apache.hadoop.fs.Path.initialize(Path.java:145)
 at org.apache.hadoop.fs.Path.init(Path.java:126)
 at org.apache.hadoop.fs.Path.init(Path.java:50)
 at 
 org.apache.hadoop.hdfs.protocol.HdfsFileStatus.getFullPath(HdfsFileStatus.java:215)
 at 
 org.apache.hadoop.hdfs.DistributedFileSystem.makeQualified(DistributedFileSystem.java:252)
 at 
 org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:311)
 at 
 org.apache.hadoop.fs.FilterFileSystem.listStatus(FilterFileSystem.java:159)
 at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:842)
 at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:867)
 at org.apache.hadoop.hbase.util.FSUtils.listStatus(FSUtils.java:1168)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.getTableInfoPath(FSTableDescriptors.java:269)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.getTableInfoPath(FSTableDescriptors.java:255)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.getTableInfoModtime(FSTableDescriptors.java:368)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.get(FSTableDescriptors.java:155)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.get(FSTableDescriptors.java:126)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.openRegion(HRegionServer.java:2834)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.openRegion(HRegionServer.java:2807)
 at sun.reflect.GeneratedMethodAccessor64.invoke(Unknown Source)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
 at java.lang.reflect.Method.invoke(Method.java:597)
 at 
 org.apache.hadoop.hbase.ipc.WritableRpcEngine$Server.call(WritableRpcEngine.java:320)
 at 
 org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:1426)
 {noformat}
 To open the region, the region server first loads the latest 
 HTableDescriptor.  Since HBASE-4553 HTableDescriptor's are stored in the file 
 system at /hbase/tableDir/.tableinfo.sequenceNum.  The file with the 
 largest sequenceNum is the current descriptor.  This is done so that the 
 current descirptor is updated atomically.  However, since the filename is not 
 known in advance FSTableDescriptors it has to do a FileSystem.listStatus 
 operation which has to list all files in the directory to find it.  The 
 directory also contains all the region directories, so in our case it has to 
 load 130k FileStatus objects.  Even using a globStatus matching function 
 still transfers all the objects to the client before performing the pattern 
 matching.  Furthermore HDFS uses a default of transferring 1000 directory 
 entries in each RPC call, so it requires 130 roundtrips to the namenode to 
 fetch all the directory entries.
 Consequently, to reassign all the regions of a table (or a constant fraction 
 thereof) requires time proportional to the square of the number of regions.
 In our case, if a region server fails with 200 such regions, it takes 10+ 
 minutes for them all to be reassigned, after the zk expiration and log 
 splitting.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8778) Region assigments scan table directory making them slow for huge tables

2013-07-22 Thread Lars Hofhansl (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-8778:
-

Status: Patch Available  (was: Open)

Let's get a test run with that.

 Region assigments scan table directory making them slow for huge tables
 ---

 Key: HBASE-8778
 URL: https://issues.apache.org/jira/browse/HBASE-8778
 Project: HBase
  Issue Type: Improvement
Reporter: Dave Latham
Assignee: Dave Latham
 Fix For: 0.98.0, 0.95.2, 0.94.11

 Attachments: 8778-dirmodtime.txt, HBASE-8778-0.94.5.patch, 
 HBASE-8778-0.94.5-v2.patch


 On a table with 130k regions it takes about 3 seconds for a region server to 
 open a region once it has been assigned.
 Watching the threads for a region server running 0.94.5 that is opening many 
 such regions shows the thread opening the reigon in code like this:
 {noformat}
 PRI IPC Server handler 4 on 60020 daemon prio=10 tid=0x2aaac07e9000 
 nid=0x6566 runnable [0x4c46d000]
java.lang.Thread.State: RUNNABLE
 at java.lang.String.indexOf(String.java:1521)
 at java.net.URI$Parser.scan(URI.java:2912)
 at java.net.URI$Parser.parse(URI.java:3004)
 at java.net.URI.init(URI.java:736)
 at org.apache.hadoop.fs.Path.initialize(Path.java:145)
 at org.apache.hadoop.fs.Path.init(Path.java:126)
 at org.apache.hadoop.fs.Path.init(Path.java:50)
 at 
 org.apache.hadoop.hdfs.protocol.HdfsFileStatus.getFullPath(HdfsFileStatus.java:215)
 at 
 org.apache.hadoop.hdfs.DistributedFileSystem.makeQualified(DistributedFileSystem.java:252)
 at 
 org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:311)
 at 
 org.apache.hadoop.fs.FilterFileSystem.listStatus(FilterFileSystem.java:159)
 at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:842)
 at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:867)
 at org.apache.hadoop.hbase.util.FSUtils.listStatus(FSUtils.java:1168)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.getTableInfoPath(FSTableDescriptors.java:269)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.getTableInfoPath(FSTableDescriptors.java:255)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.getTableInfoModtime(FSTableDescriptors.java:368)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.get(FSTableDescriptors.java:155)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.get(FSTableDescriptors.java:126)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.openRegion(HRegionServer.java:2834)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.openRegion(HRegionServer.java:2807)
 at sun.reflect.GeneratedMethodAccessor64.invoke(Unknown Source)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
 at java.lang.reflect.Method.invoke(Method.java:597)
 at 
 org.apache.hadoop.hbase.ipc.WritableRpcEngine$Server.call(WritableRpcEngine.java:320)
 at 
 org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:1426)
 {noformat}
 To open the region, the region server first loads the latest 
 HTableDescriptor.  Since HBASE-4553 HTableDescriptor's are stored in the file 
 system at /hbase/tableDir/.tableinfo.sequenceNum.  The file with the 
 largest sequenceNum is the current descriptor.  This is done so that the 
 current descirptor is updated atomically.  However, since the filename is not 
 known in advance FSTableDescriptors it has to do a FileSystem.listStatus 
 operation which has to list all files in the directory to find it.  The 
 directory also contains all the region directories, so in our case it has to 
 load 130k FileStatus objects.  Even using a globStatus matching function 
 still transfers all the objects to the client before performing the pattern 
 matching.  Furthermore HDFS uses a default of transferring 1000 directory 
 entries in each RPC call, so it requires 130 roundtrips to the namenode to 
 fetch all the directory entries.
 Consequently, to reassign all the regions of a table (or a constant fraction 
 thereof) requires time proportional to the square of the number of regions.
 In our case, if a region server fails with 200 such regions, it takes 10+ 
 minutes for them all to be reassigned, after the zk expiration and log 
 splitting.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8954) TestSplitLogWorker#testPreemptTask failed

2013-07-22 Thread Jimmy Xiang (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715535#comment-13715535
 ] 

Jimmy Xiang commented on HBASE-8954:


bq. While you can add a boolean to skip the pre-initialization if 
distributedLogReplay is turn off.
Yes, that's my second choice in my mind.

bq. This reason to do a pre-initialization is that during test I found it take 
about 3+ seconds to initialize a connection.
Do we have a breakdown on where the time was spent?


 TestSplitLogWorker#testPreemptTask failed
 -

 Key: HBASE-8954
 URL: https://issues.apache.org/jira/browse/HBASE-8954
 Project: HBase
  Issue Type: Test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Attachments: trunk-8954-2.patch, trunk-8954-3.patch, 
 trunk-8954-v1.patch, trunk-8954_v2.patch


 http://54.241.6.143/job/HBase-0.95-Hadoop-2/org.apache.hbase$hbase-server/627/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitLogWorker/testPreemptTask/
 {noformat}
 java.lang.AssertionError: expected:1 but was:0
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.regionserver.TestSplitLogWorker.testPreemptTask(TestSplitLogWorker.java:211)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
   at 
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
   at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
   at org.junit.runners.Suite.runChild(Suite.java:127)
   at org.junit.runners.Suite.runChild(Suite.java:26)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439)
   at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
   at java.util.concurrent.FutureTask.run(FutureTask.java:138)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
   at java.lang.Thread.run(Thread.java:662)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-9007) TestReplicationKillSlaveRS.killOneSlaveRS fails

2013-07-22 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715536#comment-13715536
 ] 

stack commented on HBASE-9007:
--

[~jdcryans] Thanks for looking.  They both fail actually and I'm thinking of 
filing a new issue to disable the test mentioned above.  Let me fix the title 
here though meantime.  Thanks for pointing out my messup.

 TestReplicationKillSlaveRS.killOneSlaveRS fails
 ---

 Key: HBASE-9007
 URL: https://issues.apache.org/jira/browse/HBASE-9007
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: Jean-Daniel Cryans
 Fix For: 0.95.2

 Attachments: 9007.txt


 http://54.241.6.143/job/HBase-TRUNK/org.apache.hbase$hbase-server/428/testReport/org.apache.hadoop.hbase.replication/TestReplicationKillSlaveRS/killOneSlaveRS/
 {code}
 java.lang.AssertionError: Waited too much time for queueFailover replication. 
 Waited 13733ms.
   at org.junit.Assert.fail(Assert.java:88)
   at 
 org.apache.hadoop.hbase.replication.TestReplicationKillRS.loadTableAndKillRS(TestReplicationKillRS.java:89)
   at 
 org.apache.hadoop.hbase.replication.TestReplicationKillSlaveRS.killOneSlaveRS(TestReplicationKillSlaveRS.java:33)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
 {code}
 I'm going to disable for now.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-9007) TestReplicationKillSlaveRS.killOneSlaveRS fails

2013-07-22 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9007?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715539#comment-13715539
 ] 

stack commented on HBASE-9007:
--

Rather, let me fix the patch.

 TestReplicationKillSlaveRS.killOneSlaveRS fails
 ---

 Key: HBASE-9007
 URL: https://issues.apache.org/jira/browse/HBASE-9007
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: Jean-Daniel Cryans
 Fix For: 0.95.2

 Attachments: 9007.txt


 http://54.241.6.143/job/HBase-TRUNK/org.apache.hbase$hbase-server/428/testReport/org.apache.hadoop.hbase.replication/TestReplicationKillSlaveRS/killOneSlaveRS/
 {code}
 java.lang.AssertionError: Waited too much time for queueFailover replication. 
 Waited 13733ms.
   at org.junit.Assert.fail(Assert.java:88)
   at 
 org.apache.hadoop.hbase.replication.TestReplicationKillRS.loadTableAndKillRS(TestReplicationKillRS.java:89)
   at 
 org.apache.hadoop.hbase.replication.TestReplicationKillSlaveRS.killOneSlaveRS(TestReplicationKillSlaveRS.java:33)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
 {code}
 I'm going to disable for now.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HBASE-9016) Cleanup of HRegion (javadoc, unnecessary inits, unnecessary unboxing)

2013-07-22 Thread Alex Newman (JIRA)
Alex Newman created HBASE-9016:
--

 Summary: Cleanup of HRegion (javadoc, unnecessary inits, 
unnecessary unboxing)
 Key: HBASE-9016
 URL: https://issues.apache.org/jira/browse/HBASE-9016
 Project: HBase
  Issue Type: Bug
Reporter: Alex Newman
Assignee: Alex Newman
Priority: Trivial




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-9016) Cleanup of HRegion (javadoc, unnecessary inits, unnecessary unboxing)

2013-07-22 Thread Alex Newman (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Newman updated HBASE-9016:
---

Status: Patch Available  (was: Open)

 Cleanup of HRegion (javadoc, unnecessary inits, unnecessary unboxing)
 -

 Key: HBASE-9016
 URL: https://issues.apache.org/jira/browse/HBASE-9016
 Project: HBase
  Issue Type: Bug
Reporter: Alex Newman
Assignee: Alex Newman
Priority: Trivial
 Attachments: 9016.txt




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-9016) Cleanup of HRegion (javadoc, unnecessary inits, unnecessary unboxing)

2013-07-22 Thread Alex Newman (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9016?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Alex Newman updated HBASE-9016:
---

Attachment: 9016.txt

 Cleanup of HRegion (javadoc, unnecessary inits, unnecessary unboxing)
 -

 Key: HBASE-9016
 URL: https://issues.apache.org/jira/browse/HBASE-9016
 Project: HBase
  Issue Type: Bug
Reporter: Alex Newman
Assignee: Alex Newman
Priority: Trivial
 Attachments: 9016.txt




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8954) TestSplitLogWorker#testPreemptTask failed

2013-07-22 Thread Jimmy Xiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-8954:
---

Fix Version/s: 0.95.2
   0.98.0
   Status: Patch Available  (was: Open)

 TestSplitLogWorker#testPreemptTask failed
 -

 Key: HBASE-8954
 URL: https://issues.apache.org/jira/browse/HBASE-8954
 Project: HBase
  Issue Type: Test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.95.2

 Attachments: trunk-8954-2.patch, trunk-8954-3.patch, 
 trunk-8954-4.patch, trunk-8954-v1.patch, trunk-8954_v2.patch


 http://54.241.6.143/job/HBase-0.95-Hadoop-2/org.apache.hbase$hbase-server/627/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitLogWorker/testPreemptTask/
 {noformat}
 java.lang.AssertionError: expected:1 but was:0
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.regionserver.TestSplitLogWorker.testPreemptTask(TestSplitLogWorker.java:211)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
   at 
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
   at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
   at org.junit.runners.Suite.runChild(Suite.java:127)
   at org.junit.runners.Suite.runChild(Suite.java:26)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439)
   at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
   at java.util.concurrent.FutureTask.run(FutureTask.java:138)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
   at java.lang.Thread.run(Thread.java:662)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HBASE-9017) Consolidate ChaosMonkey's random objects

2013-07-22 Thread Jimmy Xiang (JIRA)
Jimmy Xiang created HBASE-9017:
--

 Summary: Consolidate ChaosMonkey's random objects
 Key: HBASE-9017
 URL: https://issues.apache.org/jira/browse/HBASE-9017
 Project: HBase
  Issue Type: Test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor


ChaosMonkey has several random objects.  We can just share the same random 
object.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8954) TestSplitLogWorker#testPreemptTask failed

2013-07-22 Thread Jimmy Xiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8954?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-8954:
---

Attachment: trunk-8954-4.patch

Attached v4 which just skips hconnection init in case of distributed log 
splitting instead of replay.

 TestSplitLogWorker#testPreemptTask failed
 -

 Key: HBASE-8954
 URL: https://issues.apache.org/jira/browse/HBASE-8954
 Project: HBase
  Issue Type: Test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Attachments: trunk-8954-2.patch, trunk-8954-3.patch, 
 trunk-8954-4.patch, trunk-8954-v1.patch, trunk-8954_v2.patch


 http://54.241.6.143/job/HBase-0.95-Hadoop-2/org.apache.hbase$hbase-server/627/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitLogWorker/testPreemptTask/
 {noformat}
 java.lang.AssertionError: expected:1 but was:0
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.regionserver.TestSplitLogWorker.testPreemptTask(TestSplitLogWorker.java:211)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
   at 
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
   at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
   at org.junit.runners.Suite.runChild(Suite.java:127)
   at org.junit.runners.Suite.runChild(Suite.java:26)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439)
   at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
   at java.util.concurrent.FutureTask.run(FutureTask.java:138)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
   at java.lang.Thread.run(Thread.java:662)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8954) TestSplitLogWorker#testPreemptTask failed

2013-07-22 Thread Himanshu Vashishtha (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715574#comment-13715574
 ] 

Himanshu Vashishtha commented on HBASE-8954:


+1 looks good as long as log replay is turned off by default. Otherwise, the 
test would become flaky again.

 TestSplitLogWorker#testPreemptTask failed
 -

 Key: HBASE-8954
 URL: https://issues.apache.org/jira/browse/HBASE-8954
 Project: HBase
  Issue Type: Test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.95.2

 Attachments: trunk-8954-2.patch, trunk-8954-3.patch, 
 trunk-8954-4.patch, trunk-8954-v1.patch, trunk-8954_v2.patch


 http://54.241.6.143/job/HBase-0.95-Hadoop-2/org.apache.hbase$hbase-server/627/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitLogWorker/testPreemptTask/
 {noformat}
 java.lang.AssertionError: expected:1 but was:0
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.regionserver.TestSplitLogWorker.testPreemptTask(TestSplitLogWorker.java:211)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
   at 
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
   at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
   at org.junit.runners.Suite.runChild(Suite.java:127)
   at org.junit.runners.Suite.runChild(Suite.java:26)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439)
   at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
   at java.util.concurrent.FutureTask.run(FutureTask.java:138)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
   at java.lang.Thread.run(Thread.java:662)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8778) Region assigments scan table directory making them slow for huge tables

2013-07-22 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715575#comment-13715575
 ] 

Hadoop QA commented on HBASE-8778:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12593576/8778-dirmodtime.txt
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

  {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6424//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6424//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6424//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6424//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6424//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6424//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6424//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6424//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6424//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6424//console

This message is automatically generated.

 Region assigments scan table directory making them slow for huge tables
 ---

 Key: HBASE-8778
 URL: https://issues.apache.org/jira/browse/HBASE-8778
 Project: HBase
  Issue Type: Improvement
Reporter: Dave Latham
Assignee: Dave Latham
 Fix For: 0.98.0, 0.95.2, 0.94.11

 Attachments: 8778-dirmodtime.txt, HBASE-8778-0.94.5.patch, 
 HBASE-8778-0.94.5-v2.patch


 On a table with 130k regions it takes about 3 seconds for a region server to 
 open a region once it has been assigned.
 Watching the threads for a region server running 0.94.5 that is opening many 
 such regions shows the thread opening the reigon in code like this:
 {noformat}
 PRI IPC Server handler 4 on 60020 daemon prio=10 tid=0x2aaac07e9000 
 nid=0x6566 runnable [0x4c46d000]
java.lang.Thread.State: RUNNABLE
 at java.lang.String.indexOf(String.java:1521)
 at java.net.URI$Parser.scan(URI.java:2912)
 at java.net.URI$Parser.parse(URI.java:3004)
 at java.net.URI.init(URI.java:736)
 at org.apache.hadoop.fs.Path.initialize(Path.java:145)
 at org.apache.hadoop.fs.Path.init(Path.java:126)
 at org.apache.hadoop.fs.Path.init(Path.java:50)
 at 
 org.apache.hadoop.hdfs.protocol.HdfsFileStatus.getFullPath(HdfsFileStatus.java:215)
 at 
 org.apache.hadoop.hdfs.DistributedFileSystem.makeQualified(DistributedFileSystem.java:252)
 at 
 org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:311)
 at 
 org.apache.hadoop.fs.FilterFileSystem.listStatus(FilterFileSystem.java:159)
 at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:842)
 at 

[jira] [Commented] (HBASE-8954) TestSplitLogWorker#testPreemptTask failed

2013-07-22 Thread Jeffrey Zhong (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715584#comment-13715584
 ] 

Jeffrey Zhong commented on HBASE-8954:
--

{code}
Do we have a breakdown on where the time was spent?
{code}
No, I didn't profile the connection creation call.

 TestSplitLogWorker#testPreemptTask failed
 -

 Key: HBASE-8954
 URL: https://issues.apache.org/jira/browse/HBASE-8954
 Project: HBase
  Issue Type: Test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.95.2

 Attachments: trunk-8954-2.patch, trunk-8954-3.patch, 
 trunk-8954-4.patch, trunk-8954-v1.patch, trunk-8954_v2.patch


 http://54.241.6.143/job/HBase-0.95-Hadoop-2/org.apache.hbase$hbase-server/627/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitLogWorker/testPreemptTask/
 {noformat}
 java.lang.AssertionError: expected:1 but was:0
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.regionserver.TestSplitLogWorker.testPreemptTask(TestSplitLogWorker.java:211)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
   at 
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
   at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
   at org.junit.runners.Suite.runChild(Suite.java:127)
   at org.junit.runners.Suite.runChild(Suite.java:26)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439)
   at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
   at java.util.concurrent.FutureTask.run(FutureTask.java:138)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
   at java.lang.Thread.run(Thread.java:662)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8984) Test online snapshots with online merge

2013-07-22 Thread Matteo Bertozzi (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8984?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matteo Bertozzi updated HBASE-8984:
---

   Resolution: Fixed
Fix Version/s: 0.98.0
   Status: Resolved  (was: Patch Available)

committed to 0.95 and trunk, thanks for the review!

 Test online snapshots with online merge
 ---

 Key: HBASE-8984
 URL: https://issues.apache.org/jira/browse/HBASE-8984
 Project: HBase
  Issue Type: Test
  Components: snapshots
Affects Versions: 0.95.1
Reporter: Matteo Bertozzi
Assignee: Matteo Bertozzi
Priority: Minor
 Fix For: 0.98.0, 0.95.2

 Attachments: HBASE-8984.patch


 Add unit tests to verify that after an online merge:
  * taking a snapshot still works
  * snapshots and cloned tables are still in a good state.
 Also on the same line, fix the unit test to have more than one region, and 
 test multi RS/multi regions online snapshot.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8778) Region assigments scan table directory making them slow for huge tables

2013-07-22 Thread Dave Latham (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715608#comment-13715608
 ] 

Dave Latham commented on HBASE-8778:


I began working on a trunk/0.96 patch that would move the table descriptor 
files to a known sub directory as well as take the refactoring, cleanup and 
documentation from the 0.94.5 patch above but adding a one time migration 
instead of the locking or rolling upgrade support.  One issue I ran into is 
support for snapshots.  The snapshot code calls into FSTableDescriptors to 
write a table descriptor file in the snapshot directory.  How should this work 
when FSTableDescriptors is putting descriptors into a known subdir?

Options I see:
 - Snapshots behave just like actual table directories and put their 
descriptors into a known subdir.  During migration, snapshots are migrated to 
move their descriptor into their known subdir.
 - New snapshots put table descriptors into known subdir.  Reading snapshots 
support finding the table descriptor in the subdir or the orig directory so no 
migration of snapshots are needed.
 - Snapshots continue to store table descriptors directly in the snapshot 
directory and FSTableDescriptors is refactored to share code to write sequenced 
descriptors in any directory.

Thoughts?  I'm leaning toward the last option.

 Region assigments scan table directory making them slow for huge tables
 ---

 Key: HBASE-8778
 URL: https://issues.apache.org/jira/browse/HBASE-8778
 Project: HBase
  Issue Type: Improvement
Reporter: Dave Latham
Assignee: Dave Latham
 Fix For: 0.98.0, 0.95.2, 0.94.11

 Attachments: 8778-dirmodtime.txt, HBASE-8778-0.94.5.patch, 
 HBASE-8778-0.94.5-v2.patch


 On a table with 130k regions it takes about 3 seconds for a region server to 
 open a region once it has been assigned.
 Watching the threads for a region server running 0.94.5 that is opening many 
 such regions shows the thread opening the reigon in code like this:
 {noformat}
 PRI IPC Server handler 4 on 60020 daemon prio=10 tid=0x2aaac07e9000 
 nid=0x6566 runnable [0x4c46d000]
java.lang.Thread.State: RUNNABLE
 at java.lang.String.indexOf(String.java:1521)
 at java.net.URI$Parser.scan(URI.java:2912)
 at java.net.URI$Parser.parse(URI.java:3004)
 at java.net.URI.init(URI.java:736)
 at org.apache.hadoop.fs.Path.initialize(Path.java:145)
 at org.apache.hadoop.fs.Path.init(Path.java:126)
 at org.apache.hadoop.fs.Path.init(Path.java:50)
 at 
 org.apache.hadoop.hdfs.protocol.HdfsFileStatus.getFullPath(HdfsFileStatus.java:215)
 at 
 org.apache.hadoop.hdfs.DistributedFileSystem.makeQualified(DistributedFileSystem.java:252)
 at 
 org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:311)
 at 
 org.apache.hadoop.fs.FilterFileSystem.listStatus(FilterFileSystem.java:159)
 at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:842)
 at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:867)
 at org.apache.hadoop.hbase.util.FSUtils.listStatus(FSUtils.java:1168)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.getTableInfoPath(FSTableDescriptors.java:269)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.getTableInfoPath(FSTableDescriptors.java:255)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.getTableInfoModtime(FSTableDescriptors.java:368)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.get(FSTableDescriptors.java:155)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.get(FSTableDescriptors.java:126)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.openRegion(HRegionServer.java:2834)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.openRegion(HRegionServer.java:2807)
 at sun.reflect.GeneratedMethodAccessor64.invoke(Unknown Source)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
 at java.lang.reflect.Method.invoke(Method.java:597)
 at 
 org.apache.hadoop.hbase.ipc.WritableRpcEngine$Server.call(WritableRpcEngine.java:320)
 at 
 org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:1426)
 {noformat}
 To open the region, the region server first loads the latest 
 HTableDescriptor.  Since HBASE-4553 HTableDescriptor's are stored in the file 
 system at /hbase/tableDir/.tableinfo.sequenceNum.  The file with the 
 largest sequenceNum is the current descriptor.  This is done so that the 
 current descirptor is updated atomically.  However, since the filename is not 
 known in advance FSTableDescriptors it has to do a 

[jira] [Commented] (HBASE-8995) Add hadoop-1.2 profile

2013-07-22 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715616#comment-13715616
 ] 

Ted Yu commented on HBASE-8995:
---

Integrated to 0.94

Thanks for the review, Lars.

 Add hadoop-1.2 profile
 --

 Key: HBASE-8995
 URL: https://issues.apache.org/jira/browse/HBASE-8995
 Project: HBase
  Issue Type: Improvement
Affects Versions: 0.94.9
Reporter: Ted Yu
Assignee: Ted Yu
 Attachments: 8995-v1.txt


 Hadoop 1.2.0 is Beta release.
 We should add profile in pom.xml for this release.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8780) A column Family can have VERSIONS less than zero

2013-07-22 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-8780:
--

Summary: A column Family can have VERSIONS less than zero   (was: a column 
Family can have VERSIONS less than zero )

 A column Family can have VERSIONS less than zero 
 -

 Key: HBASE-8780
 URL: https://issues.apache.org/jira/browse/HBASE-8780
 Project: HBase
  Issue Type: Bug
  Components: Client
Affects Versions: 0.94.8
Reporter: Demai Ni
Assignee: Demai Ni
Priority: Trivial
 Attachments: HBASE-8780-0.94.8-v0.patch, HBASE-8780-test.txt, 
 HBASE-8780-trunk.patch, HBASE-8780-trunk.patch, HBASE-8780-trunk-v1.patch, 
 HBASE-8780-trunk-v2.patch


 User can create/alter a columnfam and set its VERSION(aka maxVERSIONS) to a 
 negative or zero value. Although there is a checking in 
 HColumnDesciptor#construtor, hbase shell command will invoke the 
 setter(setMaxVersions and setMinVersions) directly, hence by pass the 
 checking.  For example:
 {code:title=set VERSIONS = -1}
 hbase(main):016:0 create 't5_dn',{NAME='cf1',VERSIONS=-1}
 0 row(s) in 1.0420 seconds
 hbase(main):017:0 put 't5_dn','row1','cf1:q1','row1cf1_v1'
 0 row(s) in 0.0700 seconds
 hbase(main):018:0 scan 't5_dn'
 ROW   COLUMN+CELL 
  
 0 row(s) in 0.0090 seconds
 hbase(main):019:0 describe 't5_dn'
 DESCRIPTION  ENABLED  
   
 't5_dn', {NAME = 'cf1', REPLICATION_SCOPE = '0',  true  
 KEEP_DELETED_CELLS = 'false', COMPRESSION = 'NONE   
  
 ', ENCODE_ON_DISK = 'true', BLOCKCACHE = 'true',
 MIN_VERSIONS = '0', DATA_BLOCK_ENCODING = 'NONE',   
  
  IN_MEMORY = 'false', BLOOMFILTER = 'NONE', TTL =   
  
  '2147483647', VERSIONS = '-1', BLOCKSIZE = '655   
   
 36'}  
 1 row(s) in 0.0410 seconds
 {code}
 above example shows VERSIONS = '-1', and put/scan doesn't keep the data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8780) A column Family can have VERSIONS less than zero

2013-07-22 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8780?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-8780:
--

Fix Version/s: 0.98.0
 Hadoop Flags: Reviewed

Integrated to trunk - removed the else in the second check because exception is 
thrown in the first check.

Thanks for the patch, Demai.

Thanks for the reviews, Chunhui and Anoop.

 A column Family can have VERSIONS less than zero 
 -

 Key: HBASE-8780
 URL: https://issues.apache.org/jira/browse/HBASE-8780
 Project: HBase
  Issue Type: Bug
  Components: Client
Affects Versions: 0.94.8
Reporter: Demai Ni
Assignee: Demai Ni
Priority: Trivial
 Fix For: 0.98.0

 Attachments: HBASE-8780-0.94.8-v0.patch, HBASE-8780-test.txt, 
 HBASE-8780-trunk.patch, HBASE-8780-trunk.patch, HBASE-8780-trunk-v1.patch, 
 HBASE-8780-trunk-v2.patch


 User can create/alter a columnfam and set its VERSION(aka maxVERSIONS) to a 
 negative or zero value. Although there is a checking in 
 HColumnDesciptor#construtor, hbase shell command will invoke the 
 setter(setMaxVersions and setMinVersions) directly, hence by pass the 
 checking.  For example:
 {code:title=set VERSIONS = -1}
 hbase(main):016:0 create 't5_dn',{NAME='cf1',VERSIONS=-1}
 0 row(s) in 1.0420 seconds
 hbase(main):017:0 put 't5_dn','row1','cf1:q1','row1cf1_v1'
 0 row(s) in 0.0700 seconds
 hbase(main):018:0 scan 't5_dn'
 ROW   COLUMN+CELL 
  
 0 row(s) in 0.0090 seconds
 hbase(main):019:0 describe 't5_dn'
 DESCRIPTION  ENABLED  
   
 't5_dn', {NAME = 'cf1', REPLICATION_SCOPE = '0',  true  
 KEEP_DELETED_CELLS = 'false', COMPRESSION = 'NONE   
  
 ', ENCODE_ON_DISK = 'true', BLOCKCACHE = 'true',
 MIN_VERSIONS = '0', DATA_BLOCK_ENCODING = 'NONE',   
  
  IN_MEMORY = 'false', BLOOMFILTER = 'NONE', TTL =   
  
  '2147483647', VERSIONS = '-1', BLOCKSIZE = '655   
   
 36'}  
 1 row(s) in 0.0410 seconds
 {code}
 above example shows VERSIONS = '-1', and put/scan doesn't keep the data

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8954) TestSplitLogWorker#testPreemptTask failed

2013-07-22 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715640#comment-13715640
 ] 

Hadoop QA commented on HBASE-8954:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12593581/trunk-8954-4.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

  {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6425//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6425//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6425//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6425//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6425//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6425//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6425//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6425//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6425//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6425//console

This message is automatically generated.

 TestSplitLogWorker#testPreemptTask failed
 -

 Key: HBASE-8954
 URL: https://issues.apache.org/jira/browse/HBASE-8954
 Project: HBase
  Issue Type: Test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.95.2

 Attachments: trunk-8954-2.patch, trunk-8954-3.patch, 
 trunk-8954-4.patch, trunk-8954-v1.patch, trunk-8954_v2.patch


 http://54.241.6.143/job/HBase-0.95-Hadoop-2/org.apache.hbase$hbase-server/627/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitLogWorker/testPreemptTask/
 {noformat}
 java.lang.AssertionError: expected:1 but was:0
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.regionserver.TestSplitLogWorker.testPreemptTask(TestSplitLogWorker.java:211)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 

[jira] [Updated] (HBASE-9007) TestReplicationKillSlaveRS.killOneSlaveRS fails

2013-07-22 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9007?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-9007:
-

Attachment: 9007.addendum.txt

Made patch agree w/ the subject.

This class of tests all seems to have the same invisible fail problem.

 TestReplicationKillSlaveRS.killOneSlaveRS fails
 ---

 Key: HBASE-9007
 URL: https://issues.apache.org/jira/browse/HBASE-9007
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: Jean-Daniel Cryans
 Fix For: 0.95.2

 Attachments: 9007.addendum.txt, 9007.txt


 http://54.241.6.143/job/HBase-TRUNK/org.apache.hbase$hbase-server/428/testReport/org.apache.hadoop.hbase.replication/TestReplicationKillSlaveRS/killOneSlaveRS/
 {code}
 java.lang.AssertionError: Waited too much time for queueFailover replication. 
 Waited 13733ms.
   at org.junit.Assert.fail(Assert.java:88)
   at 
 org.apache.hadoop.hbase.replication.TestReplicationKillRS.loadTableAndKillRS(TestReplicationKillRS.java:89)
   at 
 org.apache.hadoop.hbase.replication.TestReplicationKillSlaveRS.killOneSlaveRS(TestReplicationKillSlaveRS.java:33)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
 {code}
 I'm going to disable for now.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8954) TestSplitLogWorker#testPreemptTask failed

2013-07-22 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8954?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715644#comment-13715644
 ] 

stack commented on HBASE-8954:
--

+1 for now.

 TestSplitLogWorker#testPreemptTask failed
 -

 Key: HBASE-8954
 URL: https://issues.apache.org/jira/browse/HBASE-8954
 Project: HBase
  Issue Type: Test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.95.2

 Attachments: trunk-8954-2.patch, trunk-8954-3.patch, 
 trunk-8954-4.patch, trunk-8954-v1.patch, trunk-8954_v2.patch


 http://54.241.6.143/job/HBase-0.95-Hadoop-2/org.apache.hbase$hbase-server/627/testReport/junit/org.apache.hadoop.hbase.regionserver/TestSplitLogWorker/testPreemptTask/
 {noformat}
 java.lang.AssertionError: expected:1 but was:0
   at org.junit.Assert.fail(Assert.java:88)
   at org.junit.Assert.failNotEquals(Assert.java:743)
   at org.junit.Assert.assertEquals(Assert.java:118)
   at org.junit.Assert.assertEquals(Assert.java:555)
   at org.junit.Assert.assertEquals(Assert.java:542)
   at 
 org.apache.hadoop.hbase.regionserver.TestSplitLogWorker.testPreemptTask(TestSplitLogWorker.java:211)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.RunBefores.evaluate(RunBefores.java:26)
   at 
 org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:27)
   at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:271)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:70)
   at 
 org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:50)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:63)
   at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:236)
   at org.junit.runners.ParentRunner.access$000(ParentRunner.java:53)
   at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:229)
   at org.junit.runners.ParentRunner.run(ParentRunner.java:309)
   at org.junit.runners.Suite.runChild(Suite.java:127)
   at org.junit.runners.Suite.runChild(Suite.java:26)
   at org.junit.runners.ParentRunner$3.run(ParentRunner.java:238)
   at 
 java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:439)
   at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:303)
   at java.util.concurrent.FutureTask.run(FutureTask.java:138)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecutor.java:895)
   at 
 java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:918)
   at java.lang.Thread.run(Thread.java:662)
 {noformat}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-9012) TestBlockReorder.testBlockLocationReorder fails

2013-07-22 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9012?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-9012:
-

Fix Version/s: 0.95.2

Committed this patch to trunk and 0.95.  Lets see how it does going forward.

 TestBlockReorder.testBlockLocationReorder fails
 ---

 Key: HBASE-9012
 URL: https://issues.apache.org/jira/browse/HBASE-9012
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: stack
 Fix For: 0.95.2

 Attachments: 9012.txt


 http://54.241.6.143/job/HBase-0.95/669/org.apache.hbase$hbase-server/testReport/junit/org.apache.hadoop.hbase.fs/TestBlockReorder/testBlockLocationReorder/
 java.net.BindException: Address already in use
   at java.net.PlainSocketImpl.socketBind(Native Method)
   at java.net.PlainSocketImpl.bind(PlainSocketImpl.java:383)
   at java.net.ServerSocket.bind(ServerSocket.java:328)
   at java.net.ServerSocket.init(ServerSocket.java:194)
   at java.net.ServerSocket.init(ServerSocket.java:106)
   at 
 org.apache.hadoop.hbase.fs.TestBlockReorder.testBlockLocationReorder(TestBlockReorder.java:182)

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HBASE-9018) Add timeouts on all tests in TestHLogSplit

2013-07-22 Thread stack (JIRA)
stack created HBASE-9018:


 Summary: Add timeouts on all tests in TestHLogSplit
 Key: HBASE-9018
 URL: https://issues.apache.org/jira/browse/HBASE-9018
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: stack
 Fix For: 0.95.2


This test runs twice, once as plane TestHLogSplit and then again as 
TestHLogSplitCompressed.  It shows up from time to time as an 'invisible' test 
-- a test that is not listed as one of the running tests in a failed build (but 
is present in builds that pass).  Let me try setting timeouts on the 30odd 
tests in this class to see if that makes it fail fast rather than as an 
'invisible'.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8778) Region assigments scan table directory making them slow for huge tables

2013-07-22 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715656#comment-13715656
 ] 

Lars Hofhansl commented on HBASE-8778:
--

Either #1 or #3. If the layout is the same between table and snapshot, maybe #1 
makes the most sense.

What about the directory modtime check, should we just commit this to 0.94 
(provided it solves your problem as you said)?


 Region assigments scan table directory making them slow for huge tables
 ---

 Key: HBASE-8778
 URL: https://issues.apache.org/jira/browse/HBASE-8778
 Project: HBase
  Issue Type: Improvement
Reporter: Dave Latham
Assignee: Dave Latham
 Fix For: 0.98.0, 0.95.2, 0.94.11

 Attachments: 8778-dirmodtime.txt, HBASE-8778-0.94.5.patch, 
 HBASE-8778-0.94.5-v2.patch


 On a table with 130k regions it takes about 3 seconds for a region server to 
 open a region once it has been assigned.
 Watching the threads for a region server running 0.94.5 that is opening many 
 such regions shows the thread opening the reigon in code like this:
 {noformat}
 PRI IPC Server handler 4 on 60020 daemon prio=10 tid=0x2aaac07e9000 
 nid=0x6566 runnable [0x4c46d000]
java.lang.Thread.State: RUNNABLE
 at java.lang.String.indexOf(String.java:1521)
 at java.net.URI$Parser.scan(URI.java:2912)
 at java.net.URI$Parser.parse(URI.java:3004)
 at java.net.URI.init(URI.java:736)
 at org.apache.hadoop.fs.Path.initialize(Path.java:145)
 at org.apache.hadoop.fs.Path.init(Path.java:126)
 at org.apache.hadoop.fs.Path.init(Path.java:50)
 at 
 org.apache.hadoop.hdfs.protocol.HdfsFileStatus.getFullPath(HdfsFileStatus.java:215)
 at 
 org.apache.hadoop.hdfs.DistributedFileSystem.makeQualified(DistributedFileSystem.java:252)
 at 
 org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:311)
 at 
 org.apache.hadoop.fs.FilterFileSystem.listStatus(FilterFileSystem.java:159)
 at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:842)
 at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:867)
 at org.apache.hadoop.hbase.util.FSUtils.listStatus(FSUtils.java:1168)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.getTableInfoPath(FSTableDescriptors.java:269)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.getTableInfoPath(FSTableDescriptors.java:255)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.getTableInfoModtime(FSTableDescriptors.java:368)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.get(FSTableDescriptors.java:155)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.get(FSTableDescriptors.java:126)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.openRegion(HRegionServer.java:2834)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.openRegion(HRegionServer.java:2807)
 at sun.reflect.GeneratedMethodAccessor64.invoke(Unknown Source)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
 at java.lang.reflect.Method.invoke(Method.java:597)
 at 
 org.apache.hadoop.hbase.ipc.WritableRpcEngine$Server.call(WritableRpcEngine.java:320)
 at 
 org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:1426)
 {noformat}
 To open the region, the region server first loads the latest 
 HTableDescriptor.  Since HBASE-4553 HTableDescriptor's are stored in the file 
 system at /hbase/tableDir/.tableinfo.sequenceNum.  The file with the 
 largest sequenceNum is the current descriptor.  This is done so that the 
 current descirptor is updated atomically.  However, since the filename is not 
 known in advance FSTableDescriptors it has to do a FileSystem.listStatus 
 operation which has to list all files in the directory to find it.  The 
 directory also contains all the region directories, so in our case it has to 
 load 130k FileStatus objects.  Even using a globStatus matching function 
 still transfers all the objects to the client before performing the pattern 
 matching.  Furthermore HDFS uses a default of transferring 1000 directory 
 entries in each RPC call, so it requires 130 roundtrips to the namenode to 
 fetch all the directory entries.
 Consequently, to reassign all the regions of a table (or a constant fraction 
 thereof) requires time proportional to the square of the number of regions.
 In our case, if a region server fails with 200 such regions, it takes 10+ 
 minutes for them all to be reassigned, after the zk expiration and log 
 splitting.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA 

[jira] [Commented] (HBASE-9016) Cleanup of HRegion (javadoc, unnecessary inits, unnecessary unboxing)

2013-07-22 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9016?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715657#comment-13715657
 ] 

Hadoop QA commented on HBASE-9016:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12593579/9016.txt
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

  {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6426//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6426//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6426//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6426//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6426//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6426//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6426//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6426//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6426//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/6426//console

This message is automatically generated.

 Cleanup of HRegion (javadoc, unnecessary inits, unnecessary unboxing)
 -

 Key: HBASE-9016
 URL: https://issues.apache.org/jira/browse/HBASE-9016
 Project: HBase
  Issue Type: Bug
Reporter: Alex Newman
Assignee: Alex Newman
Priority: Trivial
 Attachments: 9016.txt




--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (HBASE-9018) Add timeouts on all tests in TestHLogSplit

2013-07-22 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-9018.
--

Resolution: Fixed

Committed to trunk and to 0.95.

 Add timeouts on all tests in TestHLogSplit
 --

 Key: HBASE-9018
 URL: https://issues.apache.org/jira/browse/HBASE-9018
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: stack
 Fix For: 0.95.2

 Attachments: 9018.txt


 This test runs twice, once as plane TestHLogSplit and then again as 
 TestHLogSplitCompressed.  It shows up from time to time as an 'invisible' 
 test -- a test that is not listed as one of the running tests in a failed 
 build (but is present in builds that pass).  Let me try setting timeouts on 
 the 30odd tests in this class to see if that makes it fail fast rather than 
 as an 'invisible'.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-9018) Add timeouts on all tests in TestHLogSplit

2013-07-22 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9018?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-9018:
-

Attachment: 9018.txt

Adds timer on all tests.

 Add timeouts on all tests in TestHLogSplit
 --

 Key: HBASE-9018
 URL: https://issues.apache.org/jira/browse/HBASE-9018
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: stack
 Fix For: 0.95.2

 Attachments: 9018.txt


 This test runs twice, once as plane TestHLogSplit and then again as 
 TestHLogSplitCompressed.  It shows up from time to time as an 'invisible' 
 test -- a test that is not listed as one of the running tests in a failed 
 build (but is present in builds that pass).  Let me try setting timeouts on 
 the 30odd tests in this class to see if that makes it fail fast rather than 
 as an 'invisible'.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8935) IntegrationTestBigLinkedList fails under load on 0.94 due to some scan issues

2013-07-22 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-8935:
--

Resolution: Fixed
Status: Resolved  (was: Patch Available)

 IntegrationTestBigLinkedList fails under load on 0.94 due to some scan issues
 -

 Key: HBASE-8935
 URL: https://issues.apache.org/jira/browse/HBASE-8935
 Project: HBase
  Issue Type: Bug
Reporter: Sergey Shelukhin
Assignee: Sergey Shelukhin
 Attachments: HBASE-8935-v0-94-logging.patch, HBASE-8935-v0.patch


 We observe occasional failures (4-5k undefined and unreferenced nodes in the 
 list, running) when the cluster is stressed while running 
 IntegrationTestBigLinkedList on 0.94. Unfortunately debug logging is disabled.
 If Verify is rerun after the fact, the data is there, so it's not data loss. 
 All the missing keys come from very narrow range from the very beginning of  
 one region; most of this region is not affected.
 In the case I'm looking at, region range is
 {code}\xD0\x0C`=%\xA2\xD3\xA8\x0A\xF8\x917\x05\x94\x1D .. 
 \xD4\x0Bx\xAF\x88\x0C\xF47\x9A\x9F{\xCE\x0E\x8A{code}
 and problematic renage
 {code}\xD0\x0C`QLD\xED2\xD5c\x8D\xDB5\x01\xD2H] ... 
 \xD0\x0D\x89)\x11\x0E8\xC5\x08o\xD7\xE3$\xB3\xAAu{code}
 There are no splits and compactions on the region during MR job; there are no 
 compactions after that could have affected the data, although there is one 
 flush that happened long after, and region was moved and reopened (before I 
 ran verify manually that showed that data is in HBase).
 One thing that happened was that the scanners used by all the map tasks had 
 lease expirations during the MR job that had missing data, some of them twice.
 First scanner expiration for each task I looked at was exactly 1 minute after 
 Processing split log message, when the scanner is opened.
 Only the tasks where the scanners have expired twice (2 of them) logged any 
 errors; presumably one expiration in each task happened after the reading was 
 finished, because everything was slow and scanner wasn't closed in time - no 
 errors or exceptions are logged in the tasks where scanner only expired once.
 The job I ran afterwards had no scanner expirations so it does close them 
 correctly in normal case...
 The task that was reading the problematic range had one scanner expiration 
 and didn't log any errors.
 It's a little bit special (or it may be a total red herring) in that in other 
 tasks, scanner expired while the task was splitting partial outputs (which 
 may mean end of input reading?), whereas in the task that lost data spilling 
 started long (~40s) after the scanner expired.
 However there's one another such task (spilling started 25s after scanner 
 expired) and it didn't log any errors and didn't lose data.
 At this point I am not sure about the root cause but I suspect it might be 
 related to scanner expiration handling, given the patterns.
 One thing for sure is that there isn't enough logging... so I will start with 
 adding that.
  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8874) PutCombiner is skipping KeyValues while combining puts of same row during bulkload

2013-07-22 Thread Ted Yu (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715667#comment-13715667
 ] 

Ted Yu commented on HBASE-8874:
---

Shall we set the default value to 1GB ?

 PutCombiner is skipping KeyValues while combining puts of same row during 
 bulkload
 --

 Key: HBASE-8874
 URL: https://issues.apache.org/jira/browse/HBASE-8874
 Project: HBase
  Issue Type: Bug
  Components: mapreduce
Affects Versions: 0.95.0, 0.95.1
Reporter: rajeshbabu
Assignee: rajeshbabu
Priority: Critical
 Fix For: 0.98.0, 0.95.2

 Attachments: HBASE-8874_trunk.patch


 While combining puts of same row in map phase we are using below logic in 
 PutCombiner#reduce. In for loop first time we will add one Put object to puts 
 map. Next time onwards we are just overriding key values of a family with key 
 values of the same family in other put. So we are mostly writing one Put 
 object to map output and remaining will be skipped(data loss).
 {code}
 Mapbyte[], Put puts = new TreeMapbyte[], Put(Bytes.BYTES_COMPARATOR);
 for (Put p : vals) {
   cnt++;
   if (!puts.containsKey(p.getRow())) {
 puts.put(p.getRow(), p);
   } else {
 puts.get(p.getRow()).getFamilyMap().putAll(p.getFamilyMap());
   }
 }
 {code}
 We need to change logic similar as below because we are sure the rowkey of 
 all the puts will be same.
 {code}
 Put finalPut = null;
 Mapbyte[], List? extends Cell familyMap = null;
 for (Put p : vals) {
  cnt++;
   if (finalPut==null) {
 finalPut = p;
 familyMap = finalPut.getFamilyMap();
   } else {
 for (Entrybyte[], List? extends Cell entry : 
 p.getFamilyMap().entrySet()) {
   List? extends Cell list = familyMap.get(entry.getKey());
   if (list == null) {
 familyMap.put(entry.getKey(), entry.getValue());
   } else {
 (((ListKeyValue)list)).addAll((ListKeyValue)entry.getValue());
   }
 }
   }
 }
 context.write(row, finalPut);
 {code}
 Also need to implement TODOs mentioned by Nick 
 {code}
 // TODO: would be better if we knew codeK row/code and Put rowkey were
 // identical. Then this whole Put buffering business goes away.
 // TODO: Could use HeapSize to create an upper bound on the memory size of
 // the puts map and flush some portion of the content while looping. This
 // flush could result in multiple Puts for a single rowkey. That is
 // acceptable because Combiner is run as an optimization and it's not
 // critical that all Puts are grouped perfectly.
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (HBASE-8059) Enhance test-patch.sh so that patch can specify hadoop-2.0 as the default profile

2013-07-22 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8059?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu resolved HBASE-8059.
---

Resolution: Fixed

 Enhance test-patch.sh so that patch can specify hadoop-2.0 as the default 
 profile
 -

 Key: HBASE-8059
 URL: https://issues.apache.org/jira/browse/HBASE-8059
 Project: HBase
  Issue Type: Improvement
Reporter: Ted Yu
 Fix For: 0.98.0

 Attachments: 7904-v5-hadoop-2.0.txt, 8059-v1.txt, 8059-v2.txt, 
 hadoop-2.0-template-pom.xml


 Over in HBASE-7904, I produced a patch which uses hadoop-2.0 as the default 
 profile.
 However, when QA tries to validate compilation against hadoop-2.0:
 {code}
 [ERROR] The build could not read 1 project - [Help 1]
 [ERROR]
 [ERROR]   The project org.apache.hbase:hbase:0.97-SNAPSHOT 
 (/Users/tyu/trunk/pom.xml) has 2 errors
 [ERROR] 'dependencyManagement.dependencies.dependency.artifactId' for 
 org.apache.hbase:${compat.module}:jar with value '${compat.module}' does not 
 match a valid id pattern. @ line 979, column 21
 [ERROR] 'dependencyManagement.dependencies.dependency.artifactId' for 
 org.apache.hbase:${compat.module}:test-jar with value '${compat.module}' does 
 not match a valid id pattern. @ line 984, column 21
 {code}
 We should enhance test-patch.sh so that patch with hadoop-2.0 as default 
 profile doesn't go through validation step against hadoop-2.0
 Ideally, the changes in various pom.xml files should be saved as template. 
 User can specify the hadoop profile to test against in the header of patch 
 file.
 e.g.
 {code}
 This patch uses hadoop-2.0 as default profile
 Index: 
 hbase-server/src/test/java/org/apache/hadoop/hbase/HBaseTestingUtility.java
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8778) Region assigments scan table directory making them slow for huge tables

2013-07-22 Thread Dave Latham (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8778?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715673#comment-13715673
 ] 

Dave Latham commented on HBASE-8778:


The directory modtime check looks good to me.  For us we will probably stick 
with the patch I have above for awhile, but I think the modtime would help 
others.

For snapshots I can't find a good reference for their actual hdfs layout, but 
from a brief look at the code I think it tries to mirror an actual table 
directory except for using hfile refs.  For #1 I wonder if there are guarantees 
that would make sure we can find all snapshots during a migration.

 Region assigments scan table directory making them slow for huge tables
 ---

 Key: HBASE-8778
 URL: https://issues.apache.org/jira/browse/HBASE-8778
 Project: HBase
  Issue Type: Improvement
Reporter: Dave Latham
Assignee: Dave Latham
 Fix For: 0.98.0, 0.95.2, 0.94.11

 Attachments: 8778-dirmodtime.txt, HBASE-8778-0.94.5.patch, 
 HBASE-8778-0.94.5-v2.patch


 On a table with 130k regions it takes about 3 seconds for a region server to 
 open a region once it has been assigned.
 Watching the threads for a region server running 0.94.5 that is opening many 
 such regions shows the thread opening the reigon in code like this:
 {noformat}
 PRI IPC Server handler 4 on 60020 daemon prio=10 tid=0x2aaac07e9000 
 nid=0x6566 runnable [0x4c46d000]
java.lang.Thread.State: RUNNABLE
 at java.lang.String.indexOf(String.java:1521)
 at java.net.URI$Parser.scan(URI.java:2912)
 at java.net.URI$Parser.parse(URI.java:3004)
 at java.net.URI.init(URI.java:736)
 at org.apache.hadoop.fs.Path.initialize(Path.java:145)
 at org.apache.hadoop.fs.Path.init(Path.java:126)
 at org.apache.hadoop.fs.Path.init(Path.java:50)
 at 
 org.apache.hadoop.hdfs.protocol.HdfsFileStatus.getFullPath(HdfsFileStatus.java:215)
 at 
 org.apache.hadoop.hdfs.DistributedFileSystem.makeQualified(DistributedFileSystem.java:252)
 at 
 org.apache.hadoop.hdfs.DistributedFileSystem.listStatus(DistributedFileSystem.java:311)
 at 
 org.apache.hadoop.fs.FilterFileSystem.listStatus(FilterFileSystem.java:159)
 at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:842)
 at org.apache.hadoop.fs.FileSystem.listStatus(FileSystem.java:867)
 at org.apache.hadoop.hbase.util.FSUtils.listStatus(FSUtils.java:1168)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.getTableInfoPath(FSTableDescriptors.java:269)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.getTableInfoPath(FSTableDescriptors.java:255)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.getTableInfoModtime(FSTableDescriptors.java:368)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.get(FSTableDescriptors.java:155)
 at 
 org.apache.hadoop.hbase.util.FSTableDescriptors.get(FSTableDescriptors.java:126)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.openRegion(HRegionServer.java:2834)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.openRegion(HRegionServer.java:2807)
 at sun.reflect.GeneratedMethodAccessor64.invoke(Unknown Source)
 at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
 at java.lang.reflect.Method.invoke(Method.java:597)
 at 
 org.apache.hadoop.hbase.ipc.WritableRpcEngine$Server.call(WritableRpcEngine.java:320)
 at 
 org.apache.hadoop.hbase.ipc.HBaseServer$Handler.run(HBaseServer.java:1426)
 {noformat}
 To open the region, the region server first loads the latest 
 HTableDescriptor.  Since HBASE-4553 HTableDescriptor's are stored in the file 
 system at /hbase/tableDir/.tableinfo.sequenceNum.  The file with the 
 largest sequenceNum is the current descriptor.  This is done so that the 
 current descirptor is updated atomically.  However, since the filename is not 
 known in advance FSTableDescriptors it has to do a FileSystem.listStatus 
 operation which has to list all files in the directory to find it.  The 
 directory also contains all the region directories, so in our case it has to 
 load 130k FileStatus objects.  Even using a globStatus matching function 
 still transfers all the objects to the client before performing the pattern 
 matching.  Furthermore HDFS uses a default of transferring 1000 directory 
 entries in each RPC call, so it requires 130 roundtrips to the namenode to 
 fetch all the directory entries.
 Consequently, to reassign all the regions of a table (or a constant fraction 
 thereof) requires time proportional to the square of the number of regions.
 In our case, if a region server fails with 200 

[jira] [Resolved] (HBASE-7841) Parallelize offline snapshot in DisabledTableSnapshotHandler

2013-07-22 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-7841?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu resolved HBASE-7841.
---

Resolution: Won't Fix

 Parallelize offline snapshot in DisabledTableSnapshotHandler
 

 Key: HBASE-7841
 URL: https://issues.apache.org/jira/browse/HBASE-7841
 Project: HBase
  Issue Type: Sub-task
Reporter: Ted Yu
Assignee: Ted Yu
 Fix For: 0.98.0

 Attachments: 7841.txt, 7841-v2.txt, 7841-v3.txt, 7841-v4.txt


 In DisabledTableSnapshotHandler, there is TODO:
 {code}
   // TODO consider parallelizing these operations since they are independent. 
 Right now its just
   // easier to keep them serial though
   @Override
   public void snapshotRegions(ListPairHRegionInfo, ServerName 
 regionsAndLocations) throws IOException,
 {code}

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8935) IntegrationTestBigLinkedList fails under load on 0.94 due to some scan issues - add logging

2013-07-22 Thread Sergey Shelukhin (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sergey Shelukhin updated HBASE-8935:


Summary: IntegrationTestBigLinkedList fails under load on 0.94 due to some 
scan issues - add logging  (was: IntegrationTestBigLinkedList fails under load 
on 0.94 due to some scan issues)

 IntegrationTestBigLinkedList fails under load on 0.94 due to some scan issues 
 - add logging
 ---

 Key: HBASE-8935
 URL: https://issues.apache.org/jira/browse/HBASE-8935
 Project: HBase
  Issue Type: Bug
Reporter: Sergey Shelukhin
Assignee: Sergey Shelukhin
 Attachments: HBASE-8935-v0-94-logging.patch, HBASE-8935-v0.patch


 We observe occasional failures (4-5k undefined and unreferenced nodes in the 
 list, running) when the cluster is stressed while running 
 IntegrationTestBigLinkedList on 0.94. Unfortunately debug logging is disabled.
 If Verify is rerun after the fact, the data is there, so it's not data loss. 
 All the missing keys come from very narrow range from the very beginning of  
 one region; most of this region is not affected.
 In the case I'm looking at, region range is
 {code}\xD0\x0C`=%\xA2\xD3\xA8\x0A\xF8\x917\x05\x94\x1D .. 
 \xD4\x0Bx\xAF\x88\x0C\xF47\x9A\x9F{\xCE\x0E\x8A{code}
 and problematic renage
 {code}\xD0\x0C`QLD\xED2\xD5c\x8D\xDB5\x01\xD2H] ... 
 \xD0\x0D\x89)\x11\x0E8\xC5\x08o\xD7\xE3$\xB3\xAAu{code}
 There are no splits and compactions on the region during MR job; there are no 
 compactions after that could have affected the data, although there is one 
 flush that happened long after, and region was moved and reopened (before I 
 ran verify manually that showed that data is in HBase).
 One thing that happened was that the scanners used by all the map tasks had 
 lease expirations during the MR job that had missing data, some of them twice.
 First scanner expiration for each task I looked at was exactly 1 minute after 
 Processing split log message, when the scanner is opened.
 Only the tasks where the scanners have expired twice (2 of them) logged any 
 errors; presumably one expiration in each task happened after the reading was 
 finished, because everything was slow and scanner wasn't closed in time - no 
 errors or exceptions are logged in the tasks where scanner only expired once.
 The job I ran afterwards had no scanner expirations so it does close them 
 correctly in normal case...
 The task that was reading the problematic range had one scanner expiration 
 and didn't log any errors.
 It's a little bit special (or it may be a total red herring) in that in other 
 tasks, scanner expired while the task was splitting partial outputs (which 
 may mean end of input reading?), whereas in the task that lost data spilling 
 started long (~40s) after the scanner expired.
 However there's one another such task (spilling started 25s after scanner 
 expired) and it didn't log any errors and didn't lose data.
 At this point I am not sure about the root cause but I suspect it might be 
 related to scanner expiration handling, given the patterns.
 One thing for sure is that there isn't enough logging... so I will start with 
 adding that.
  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8690) Reduce unnecessary getFileStatus hdfs calls in TTL hfile and hlog cleanners

2013-07-22 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-8690:
--

Resolution: Fixed
Status: Resolved  (was: Patch Available)

 Reduce unnecessary getFileStatus hdfs calls in TTL hfile and hlog cleanners
 ---

 Key: HBASE-8690
 URL: https://issues.apache.org/jira/browse/HBASE-8690
 Project: HBase
  Issue Type: Improvement
  Components: master
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Minor
 Attachments: 8690-trunk-v3.patch, 8690-trunk-v4.patch, 
 HBASE-8690-0.94-v1.patch, HBASE-8690-trunk-v1.patch, HBASE-8690-trunk-v2.patch


 For each in file in archive dir, the TimeToLiveHFileCleaner need call 
 getFileStatus to get the modify time of file. Actually the CleanerChore have 
 had the file status when listing its parent dir. 
 When we set the TTL to 7 days in our cluster for data security, the number of 
 files left in archive dir is up to 65 thousands. In each clean period, 
 TimeToLiveHFileCleaner will generate ten thousand getFileStatus call in a 
 short time, which is very heavy for hdfs namenode.
 Fix: Change the path param to FileStatus in isFileDeletable method and reduce 
 unnecessary getFileStatus hdfs calls in TTL cleaners.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8935) IntegrationTestBigLinkedList fails under load on 0.94 due to some scan issues - add logging

2013-07-22 Thread Sergey Shelukhin (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8935?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sergey Shelukhin updated HBASE-8935:


Fix Version/s: 0.94.10
   0.95.2

 IntegrationTestBigLinkedList fails under load on 0.94 due to some scan issues 
 - add logging
 ---

 Key: HBASE-8935
 URL: https://issues.apache.org/jira/browse/HBASE-8935
 Project: HBase
  Issue Type: Bug
Reporter: Sergey Shelukhin
Assignee: Sergey Shelukhin
 Fix For: 0.95.2, 0.94.10

 Attachments: HBASE-8935-v0-94-logging.patch, HBASE-8935-v0.patch


 We observe occasional failures (4-5k undefined and unreferenced nodes in the 
 list, running) when the cluster is stressed while running 
 IntegrationTestBigLinkedList on 0.94. Unfortunately debug logging is disabled.
 If Verify is rerun after the fact, the data is there, so it's not data loss. 
 All the missing keys come from very narrow range from the very beginning of  
 one region; most of this region is not affected.
 In the case I'm looking at, region range is
 {code}\xD0\x0C`=%\xA2\xD3\xA8\x0A\xF8\x917\x05\x94\x1D .. 
 \xD4\x0Bx\xAF\x88\x0C\xF47\x9A\x9F{\xCE\x0E\x8A{code}
 and problematic renage
 {code}\xD0\x0C`QLD\xED2\xD5c\x8D\xDB5\x01\xD2H] ... 
 \xD0\x0D\x89)\x11\x0E8\xC5\x08o\xD7\xE3$\xB3\xAAu{code}
 There are no splits and compactions on the region during MR job; there are no 
 compactions after that could have affected the data, although there is one 
 flush that happened long after, and region was moved and reopened (before I 
 ran verify manually that showed that data is in HBase).
 One thing that happened was that the scanners used by all the map tasks had 
 lease expirations during the MR job that had missing data, some of them twice.
 First scanner expiration for each task I looked at was exactly 1 minute after 
 Processing split log message, when the scanner is opened.
 Only the tasks where the scanners have expired twice (2 of them) logged any 
 errors; presumably one expiration in each task happened after the reading was 
 finished, because everything was slow and scanner wasn't closed in time - no 
 errors or exceptions are logged in the tasks where scanner only expired once.
 The job I ran afterwards had no scanner expirations so it does close them 
 correctly in normal case...
 The task that was reading the problematic range had one scanner expiration 
 and didn't log any errors.
 It's a little bit special (or it may be a total red herring) in that in other 
 tasks, scanner expired while the task was splitting partial outputs (which 
 may mean end of input reading?), whereas in the task that lost data spilling 
 started long (~40s) after the scanner expired.
 However there's one another such task (spilling started 25s after scanner 
 expired) and it didn't log any errors and didn't lose data.
 At this point I am not sure about the root cause but I suspect it might be 
 related to scanner expiration handling, given the patterns.
 One thing for sure is that there isn't enough logging... so I will start with 
 adding that.
  

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-9008) Reenable TestReplicationKillSlaveRS.killOneSlaveRS

2013-07-22 Thread Jean-Daniel Cryans (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jean-Daniel Cryans updated HBASE-9008:
--

Attachment: HBASE-9008.patch

This patch re-enables the test and changes the way we kill region servers. I 
think there's a bug in the way we read from HLogs with half-written edits, so 
instead of forcefully expiring the session I'm doing a clean shutdown. I have 
to write more low-level unit tests to verify this, but in the mean time I'm 
hoping this will make the tests less flakey.

I'm also adding more stats to when we are done recovering a queue so that we 
can see up to where it replicated.

 Reenable TestReplicationKillSlaveRS.killOneSlaveRS
 --

 Key: HBASE-9008
 URL: https://issues.apache.org/jira/browse/HBASE-9008
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: Jean-Daniel Cryans
 Attachments: HBASE-9008.patch


 hbase-9007 disabled it as flakey.  See it for listing of fails and 
 HBASE-8961.  Assigning to [~jdcryans] to take a looksee.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-9008) Reenable TestReplicationKillSlaveRS.killOneSlaveRS

2013-07-22 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715692#comment-13715692
 ] 

stack commented on HBASE-9008:
--

+1 on patch.  Thanks [~jdcryans]

 Reenable TestReplicationKillSlaveRS.killOneSlaveRS
 --

 Key: HBASE-9008
 URL: https://issues.apache.org/jira/browse/HBASE-9008
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: Jean-Daniel Cryans
 Attachments: HBASE-9008.patch


 hbase-9007 disabled it as flakey.  See it for listing of fails and 
 HBASE-8961.  Assigning to [~jdcryans] to take a looksee.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8642) [Snapshot] List and delete snapshot by table

2013-07-22 Thread Matteo Bertozzi (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8642?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715694#comment-13715694
 ] 

Matteo Bertozzi commented on HBASE-8642:


patch looks good to me, now doesn't apply cleanly but the fix is trivial.
my main concern with this feature is related to the rename semantic, where you 
rename a table and listing/deleting with the old rename results in including 
the snapshots made on before the rename table. Which may be fine, since may be 
considered not totally wrong and we don't have rename support in the core.

so, I'm +1 on the code and +0 on the merge, I'll let you decide.

 [Snapshot] List and delete snapshot by table
 

 Key: HBASE-8642
 URL: https://issues.apache.org/jira/browse/HBASE-8642
 Project: HBase
  Issue Type: Improvement
  Components: snapshots
Affects Versions: 0.98.0, 0.95.0, 0.95.1, 0.95.2
Reporter: Julian Zhou
Assignee: Julian Zhou
Priority: Minor
 Fix For: 0.98.0, 0.95.0, 0.95.2

 Attachments: 8642-trunk-0.95-v0.patch, 8642-trunk-0.95-v1.patch, 
 8642-trunk-0.95-v2.patch


 Support list and delete snapshot by table name.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8065) bulkload can load the hfile into hbase table,but this mechanism can't remove prior data

2013-07-22 Thread Ted Yu (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-8065:
--

Status: Open  (was: Patch Available)

 bulkload can load the hfile into hbase table,but this mechanism can't remove 
 prior data
 ---

 Key: HBASE-8065
 URL: https://issues.apache.org/jira/browse/HBASE-8065
 Project: HBase
  Issue Type: Improvement
  Components: IPC/RPC, mapreduce, regionserver
Affects Versions: 0.94.0
 Environment: hadoop-1.0.2、hbase-0.94.0
Reporter: Yuan Kang
Assignee: Yuan Kang
Priority: Critical
 Attachments: LoadIncrementalHFiles-bulkload-can-clean-olddata.patch


 this patch can do bulkload for one more parameter ‘need to refresh’,when this 
 parameter is true.bulkload can clean the old date in the hbase table ,then do 
 the new date load

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8690) Reduce unnecessary getFileStatus hdfs calls in TTL hfile and hlog cleanners

2013-07-22 Thread Lars Hofhansl (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lars Hofhansl updated HBASE-8690:
-

Fix Version/s: 0.98.0

 Reduce unnecessary getFileStatus hdfs calls in TTL hfile and hlog cleanners
 ---

 Key: HBASE-8690
 URL: https://issues.apache.org/jira/browse/HBASE-8690
 Project: HBase
  Issue Type: Improvement
  Components: master
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Minor
 Fix For: 0.98.0

 Attachments: 8690-trunk-v3.patch, 8690-trunk-v4.patch, 
 HBASE-8690-0.94-v1.patch, HBASE-8690-trunk-v1.patch, HBASE-8690-trunk-v2.patch


 For each in file in archive dir, the TimeToLiveHFileCleaner need call 
 getFileStatus to get the modify time of file. Actually the CleanerChore have 
 had the file status when listing its parent dir. 
 When we set the TTL to 7 days in our cluster for data security, the number of 
 files left in archive dir is up to 65 thousands. In each clean period, 
 TimeToLiveHFileCleaner will generate ten thousand getFileStatus call in a 
 short time, which is very heavy for hdfs namenode.
 Fix: Change the path param to FileStatus in isFileDeletable method and reduce 
 unnecessary getFileStatus hdfs calls in TTL cleaners.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-8690) Reduce unnecessary getFileStatus hdfs calls in TTL hfile and hlog cleanners

2013-07-22 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8690?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715735#comment-13715735
 ] 

Lars Hofhansl commented on HBASE-8690:
--

Seems both 0.94 and 0.95 would benefit from this.

 Reduce unnecessary getFileStatus hdfs calls in TTL hfile and hlog cleanners
 ---

 Key: HBASE-8690
 URL: https://issues.apache.org/jira/browse/HBASE-8690
 Project: HBase
  Issue Type: Improvement
  Components: master
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Minor
 Fix For: 0.98.0

 Attachments: 8690-trunk-v3.patch, 8690-trunk-v4.patch, 
 HBASE-8690-0.94-v1.patch, HBASE-8690-trunk-v1.patch, HBASE-8690-trunk-v2.patch


 For each in file in archive dir, the TimeToLiveHFileCleaner need call 
 getFileStatus to get the modify time of file. Actually the CleanerChore have 
 had the file status when listing its parent dir. 
 When we set the TTL to 7 days in our cluster for data security, the number of 
 files left in archive dir is up to 65 thousands. In each clean period, 
 TimeToLiveHFileCleaner will generate ten thousand getFileStatus call in a 
 short time, which is very heavy for hdfs namenode.
 Fix: Change the path param to FileStatus in isFileDeletable method and reduce 
 unnecessary getFileStatus hdfs calls in TTL cleaners.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-9017) Consolidate ChaosMonkey's random objects

2013-07-22 Thread Jimmy Xiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-9017:
---

Attachment: trunk-9017.patch

 Consolidate ChaosMonkey's random objects
 

 Key: HBASE-9017
 URL: https://issues.apache.org/jira/browse/HBASE-9017
 Project: HBase
  Issue Type: Test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Attachments: trunk-9017.patch


 ChaosMonkey has several random objects.  We can just share the same random 
 object.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-9017) Consolidate ChaosMonkey's random objects

2013-07-22 Thread Jimmy Xiang (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9017?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-9017:
---

Fix Version/s: 0.95.2
   0.98.0
   Status: Patch Available  (was: Open)

 Consolidate ChaosMonkey's random objects
 

 Key: HBASE-9017
 URL: https://issues.apache.org/jira/browse/HBASE-9017
 Project: HBase
  Issue Type: Test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.95.2

 Attachments: trunk-9017.patch


 ChaosMonkey has several random objects.  We can just share the same random 
 object.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Created] (HBASE-9019) Port HBASE-8690: Reduce unnecessary getFileStatus hdfs calls in TTL hfile and hlog cleanners to 0.94

2013-07-22 Thread Ted Yu (JIRA)
Ted Yu created HBASE-9019:
-

 Summary: Port HBASE-8690: Reduce unnecessary getFileStatus hdfs 
calls in TTL hfile and hlog cleanners to 0.94
 Key: HBASE-9019
 URL: https://issues.apache.org/jira/browse/HBASE-9019
 Project: HBase
  Issue Type: Improvement
Reporter: Ted Yu
Priority: Minor


See comment here: 
https://issues.apache.org/jira/browse/HBASE-8690?focusedCommentId=13715735page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13715735

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-9008) Reenable TestReplicationKillSlaveRS.killOneSlaveRS

2013-07-22 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9008?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715739#comment-13715739
 ] 

stack commented on HBASE-9008:
--

bq. stack that this is failing means either tailing HLogs is broken or that we 
have data loss in 0.96.

I am trying to get us to a state where clean builds are the default.  Toward 
that end, I am disabling tests that no one is looking at or that folks are 
looking at but meantime the build is failing because of the test so I can move 
on to find the next obstacle to a clean build.  Most tests I try to triage.  
Tests I can't make sense of w/o spending a good bit of time on, I disable and 
try and get an expert to look into it.

It'd be cool if you were able to find we are losing edits via this unit test.  
Thanks for looking.

 Reenable TestReplicationKillSlaveRS.killOneSlaveRS
 --

 Key: HBASE-9008
 URL: https://issues.apache.org/jira/browse/HBASE-9008
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: Jean-Daniel Cryans
 Attachments: HBASE-9008.patch


 hbase-9007 disabled it as flakey.  See it for listing of fails and 
 HBASE-8961.  Assigning to [~jdcryans] to take a looksee.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Resolved] (HBASE-9019) Port HBASE-8690: Reduce unnecessary getFileStatus hdfs calls in TTL hfile and hlog cleanners to 0.94

2013-07-22 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-9019.
--

Resolution: Invalid

Resolving issue ill-specified.  When I click on the link it takes me to the end 
of the issue; I should not have to read a whole issue to figure out what a new 
issue is about.

If you don't know how to make an issue, don't do it [~ted_yu]

 Port HBASE-8690: Reduce unnecessary getFileStatus hdfs calls in TTL hfile and 
 hlog cleanners to 0.94
 

 Key: HBASE-9019
 URL: https://issues.apache.org/jira/browse/HBASE-9019
 Project: HBase
  Issue Type: Improvement
Reporter: Ted Yu
Priority: Minor

 See comment here: 
 https://issues.apache.org/jira/browse/HBASE-8690?focusedCommentId=13715735page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-13715735

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Commented] (HBASE-9017) Consolidate ChaosMonkey's random objects

2013-07-22 Thread stack (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-9017?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13715747#comment-13715747
 ] 

stack commented on HBASE-9017:
--

lgtm

 Consolidate ChaosMonkey's random objects
 

 Key: HBASE-9017
 URL: https://issues.apache.org/jira/browse/HBASE-9017
 Project: HBase
  Issue Type: Test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.95.2

 Attachments: trunk-9017.patch


 ChaosMonkey has several random objects.  We can just share the same random 
 object.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-9008) Reenable TestReplicationKillSlaveRS.killOneSlaveRS

2013-07-22 Thread Jean-Daniel Cryans (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-9008?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jean-Daniel Cryans updated HBASE-9008:
--

Status: Patch Available  (was: Open)

 Reenable TestReplicationKillSlaveRS.killOneSlaveRS
 --

 Key: HBASE-9008
 URL: https://issues.apache.org/jira/browse/HBASE-9008
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: stack
Assignee: Jean-Daniel Cryans
 Attachments: HBASE-9008.patch


 hbase-9007 disabled it as flakey.  See it for listing of fails and 
 HBASE-8961.  Assigning to [~jdcryans] to take a looksee.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


[jira] [Updated] (HBASE-8690) Reduce unnecessary getFileStatus hdfs calls in TTL hfile and hlog cleanners

2013-07-22 Thread stack (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-8690:
-

Attachment: 8690v4.095.txt

What I applied to 0.95.

(Trunk patch didn't apply.  Interfaces have had the 'public' removed by LarsF 
from method names etc., so would not apply as is to 0.94)

 Reduce unnecessary getFileStatus hdfs calls in TTL hfile and hlog cleanners
 ---

 Key: HBASE-8690
 URL: https://issues.apache.org/jira/browse/HBASE-8690
 Project: HBase
  Issue Type: Improvement
  Components: master
Reporter: Liu Shaohui
Assignee: Liu Shaohui
Priority: Minor
 Fix For: 0.98.0

 Attachments: 8690-trunk-v3.patch, 8690-trunk-v4.patch, 
 8690v4.095.txt, HBASE-8690-0.94-v1.patch, HBASE-8690-trunk-v1.patch, 
 HBASE-8690-trunk-v2.patch


 For each in file in archive dir, the TimeToLiveHFileCleaner need call 
 getFileStatus to get the modify time of file. Actually the CleanerChore have 
 had the file status when listing its parent dir. 
 When we set the TTL to 7 days in our cluster for data security, the number of 
 files left in archive dir is up to 65 thousands. In each clean period, 
 TimeToLiveHFileCleaner will generate ten thousand getFileStatus call in a 
 short time, which is very heavy for hdfs namenode.
 Fix: Change the path param to FileStatus in isFileDeletable method and reduce 
 unnecessary getFileStatus hdfs calls in TTL cleaners.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira


  1   2   >