[jira] [Commented] (HBASE-11915) Document and test 0.94 - 1.0.0 update
[ https://issues.apache.org/jira/browse/HBASE-11915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14181072#comment-14181072 ]

Hudson commented on HBASE-11915:

FAILURE: Integrated in HBase-1.0 #347 (See [https://builds.apache.org/job/HBase-1.0/347/])
HBASE-11915 Document and test 0.94 - 1.0.0 update -- ADDENDUM (stack: rev 46e4bffc2c7209b8d6620bdd187d9b43200770b8)
* hbase-server/src/main/java/org/apache/hadoop/hbase/util/FSUtils.java

Document and test 0.94 - 1.0.0 update
-------------------------------------

Key: HBASE-11915
URL: https://issues.apache.org/jira/browse/HBASE-11915
Project: HBase
Issue Type: Sub-task
Reporter: Enis Soztutar
Assignee: stack
Priority: Critical
Fix For: 0.99.2
Attachments: 11915.addendum.txt, 11915.txt, upgrade.txt

We explicitly did not remove some of the upgrade-related stuff in branch-1 for the possibility of supporting 0.94 - 1.0, similar to 0.94 - 0.98 support. We should document and test this support before 1.0 comes.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
[jira] [Commented] (HBASE-11915) Document and test 0.94 - 1.0.0 update
[ https://issues.apache.org/jira/browse/HBASE-11915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14181081#comment-14181081 ]

Hudson commented on HBASE-11915:

FAILURE: Integrated in HBase-0.98 #628 (See [https://builds.apache.org/job/HBase-0.98/628/])
HBASE-11915 Document and test 0.94 - 1.0.0 update -- ADDENDUM (stack: rev 046c4ce62da624f1a98a78e3fcd5204a7a420585)
* hbase-server/src/main/java/org/apache/hadoop/hbase/util/FSUtils.java
[jira] [Commented] (HBASE-11915) Document and test 0.94 - 1.0.0 update
[ https://issues.apache.org/jira/browse/HBASE-11915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14181087#comment-14181087 ]

Hudson commented on HBASE-11915:

SUCCESS: Integrated in HBase-TRUNK #5693 (See [https://builds.apache.org/job/HBase-TRUNK/5693/])
HBASE-11915 Document and test 0.94 - 1.0.0 update -- ADDENDUM (stack: rev 96f84594eee58b4e9a9347541baa3343a4ed3b97)
* hbase-server/src/main/java/org/apache/hadoop/hbase/util/FSUtils.java
[jira] [Updated] (HBASE-12325) Add Utility to remove snapshot from a directory
[ https://issues.apache.org/jira/browse/HBASE-12325?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Matteo Bertozzi updated HBASE-12325:

Attachment: DeleteRemoteSnapshotTool.java

I had a tool that I wrote a long time ago; I haven't checked if it works now, but it should. Anyway, you are using export in a wrong way :) The initial design was exporting from one HBase cluster to another, not from HBase to a backup disk, mainly because you need someone that takes care of cleaning unused files. So in case you want to export to disk you can:
* Create one folder per snapshot, which allows you to use rm -rf to drop the snapshot, but you don't get delta updates.
* Create a month-x folder and drop all the snapshots of month x into it, which allows you to get the delta updates, and allows you to drop all the snapshots of month-x with a simple rm -rf.

...or, you can use the tool. But the tool requires coordination: you cannot run the tool and export snapshots at the same time, otherwise the tool may remove files from an export still in progress. So, in my opinion, this tool doesn't belong in hbase-core, because once your files are no longer under HBase's control it is your responsibility to do the coordination (so, don't try to propose a zk-lock taken by both the tool and export, or similar).

Add Utility to remove snapshot from a directory
-----------------------------------------------

Key: HBASE-12325
URL: https://issues.apache.org/jira/browse/HBASE-12325
Project: HBase
Issue Type: Bug
Affects Versions: 2.0.0
Reporter: Elliott Clark
Assignee: Elliott Clark
Attachments: DeleteRemoteSnapshotTool.java

If there are several snapshots exported to a single directory, it's nice to be able to remove the oldest one. Since snapshots in the same directory can share files, it's not as simple as just removing all files in a snapshot.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
[jira] [Created] (HBASE-12327) MetricsHBaseServerSourceFactory#createContextName has wrong conditions
Sanghyun Yun created HBASE-12327:

Summary: MetricsHBaseServerSourceFactory#createContextName has wrong conditions
Key: HBASE-12327
URL: https://issues.apache.org/jira/browse/HBASE-12327
Project: HBase
Issue Type: Bug
Reporter: Sanghyun Yun

MetricsHBaseServerSourceFactory#createContextName has wrong conditions. It checks whether serverName contains "HMaster" or "HRegion".

{code:title=MetricsHBaseServerSourceFactory.java}
...
protected static String createContextName(String serverName) {
  if (serverName.contains("HMaster")) {
    return "Master";
  } else if (serverName.contains("HRegion")) {
    return "RegionServer";
  }
  return "IPC";
}
...
{code}

But the serverName actually passed in contains "master" or "regionserver", via HMaster#getProcessName and HRegionServer#getProcessName.

{code:title=HMaster.java}
...
// MASTER is name of the webapp and the attribute name used stuffing this
// instance into web context.
public static final String MASTER = "master";
...
protected String getProcessName() {
  return MASTER;
}
...
{code}

{code:title=HRegionServer.java}
...
/** region server process name */
public static final String REGIONSERVER = "regionserver";
...
protected String getProcessName() {
  return REGIONSERVER;
}
...
{code}

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
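[Editor's note] A minimal standalone sketch of the fix the report implies: matching on the lowercase process names that getProcessName actually returns. This is not the attached HBASE-12327.patch; the method body mirrors the quoted snippet, but the surrounding class and the example serverName values are hypothetical.

```java
// Standalone sketch (not the actual HBase patch): createContextName
// matching the lowercase "master"/"regionserver" strings that
// HMaster#getProcessName and HRegionServer#getProcessName return.
public class CreateContextNameSketch {
    static String createContextName(String serverName) {
        if (serverName.contains("master")) {
            return "Master";
        } else if (serverName.contains("regionserver")) {
            return "RegionServer";
        }
        return "IPC";
    }

    public static void main(String[] args) {
        // The old checks for "HMaster"/"HRegion" never match these inputs,
        // so every caller fell through to the "IPC" context.
        System.out.println(createContextName("master/host:16000"));
        System.out.println(createContextName("regionserver/host:16020"));
        System.out.println(createContextName("something-else"));
    }
}
```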
[jira] [Commented] (HBASE-11915) Document and test 0.94 - 1.0.0 update
[ https://issues.apache.org/jira/browse/HBASE-11915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14181103#comment-14181103 ]

Hudson commented on HBASE-11915:

SUCCESS: Integrated in HBase-0.98-on-Hadoop-1.1 #599 (See [https://builds.apache.org/job/HBase-0.98-on-Hadoop-1.1/599/])
HBASE-11915 Document and test 0.94 - 1.0.0 update -- ADDENDUM (stack: rev 046c4ce62da624f1a98a78e3fcd5204a7a420585)
* hbase-server/src/main/java/org/apache/hadoop/hbase/util/FSUtils.java
[jira] [Updated] (HBASE-12327) MetricsHBaseServerSourceFactory#createContextName has wrong conditions
[ https://issues.apache.org/jira/browse/HBASE-12327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sanghyun Yun updated HBASE-12327:

Attachment: HBASE-12327.patch

So, I changed some conditions.
[jira] [Updated] (HBASE-12327) MetricsHBaseServerSourceFactory#createContextName has wrong conditions
[ https://issues.apache.org/jira/browse/HBASE-12327?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sanghyun Yun updated HBASE-12327:

Status: Patch Available (was: Open)
[jira] [Created] (HBASE-12328) Need to separate JvmMetrics for Master and RegionServer
Sanghyun Yun created HBASE-12328:

Summary: Need to separate JvmMetrics for Master and RegionServer
Key: HBASE-12328
URL: https://issues.apache.org/jira/browse/HBASE-12328
Project: HBase
Issue Type: Improvement
Reporter: Sanghyun Yun
Priority: Minor

tag.ProcessName of JvmMetrics is IPC. It is the same for both Master and RegionServer.

{code:title=HBase (Master and RegionServer) Metrics Dump}
...
name: Hadoop:service=HBase,name=JvmMetrics,
modelerType: JvmMetrics,
tag.Context: jvm,
tag.ProcessName: IPC,
tag.SessionId: ,
...
{code}

When I use HBase with Ganglia, I set tagsForPrefix.jvm=ProcessName in hadoop-metrics2-hbase.properties.

{code:title=hadoop-metrics2-hbase.properties}
...
*.sink.ganglia.class=org.apache.hadoop.metrics2.sink.ganglia.GangliaSink31
hbase.sink.ganglia.tagsForPrefix.jvm=ProcessName
...
{code}

But Ganglia generates only one RRD file because tag.ProcessName is IPC for both Master and RegionServer. I think JvmMetrics needs to be separated for Master and RegionServer.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
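[Editor's note] Why one RRD file results: with tagsForPrefix.jvm=ProcessName, the Ganglia sink folds the selected tag into the metric name, so two daemons reporting the same ProcessName produce identical names. The helper below is a hypothetical illustration of that naming collision, not Ganglia's actual name-construction code.

```java
// Hypothetical illustration (not GangliaSink31's real code): when the
// metric name is built from the record prefix plus the tags selected by
// tagsForPrefix, identical ProcessName tags collide into one name,
// hence one RRD file for two different daemons.
public class MetricNameSketch {
    static String gangliaName(String prefix, String processName, String metric) {
        return String.join(".", prefix, processName, metric);
    }

    public static void main(String[] args) {
        String fromMaster = gangliaName("jvm", "IPC", "MemHeapUsedM");
        String fromRegionServer = gangliaName("jvm", "IPC", "MemHeapUsedM");
        // Identical names -> a single RRD file for two processes.
        System.out.println(fromMaster.equals(fromRegionServer));
        // With the proposed separation, the names diverge:
        System.out.println(gangliaName("jvm", "Master", "MemHeapUsedM"));
        System.out.println(gangliaName("jvm", "RegionServer", "MemHeapUsedM"));
    }
}
```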
[jira] [Updated] (HBASE-12328) Need to separate JvmMetrics for Master and RegionServer
[ https://issues.apache.org/jira/browse/HBASE-12328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sanghyun Yun updated HBASE-12328:

Attachment: HBASE-12328.patch

I changed some code to create separate JvmMetrics:
Master's tag.ProcessName = Master
RegionServer's tag.ProcessName = RegionServer
[jira] [Updated] (HBASE-12328) Need to separate JvmMetrics for Master and RegionServer
[ https://issues.apache.org/jira/browse/HBASE-12328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Sanghyun Yun updated HBASE-12328:

Status: Patch Available (was: Open)
[jira] [Updated] (HBASE-11683) Metrics for MOB
[ https://issues.apache.org/jira/browse/HBASE-11683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Li Jiajia updated HBASE-11683:

Attachment: HBASE-11683-V6.diff

Updated the patch (HBASE-11683-V6) based on Jon's comments.

Metrics for MOB
---------------

Key: HBASE-11683
URL: https://issues.apache.org/jira/browse/HBASE-11683
Project: HBase
Issue Type: Sub-task
Components: regionserver, Scanners
Affects Versions: 2.0.0
Reporter: Jonathan Hsieh
Assignee: Jingcheng Du
Attachments: HBASE-11683-V2.diff, HBASE-11683-V3.diff, HBASE-11683-V4.diff, HBASE-11683-V5.diff, HBASE-11683-V6.diff, HBASE-11683.diff

We need to make sure to capture metrics about MOBs. Some basic ones include:
# of mob writes
# of mob reads
# avg size of mob (?)
# mob files
# of mob compactions / sweeps

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
[jira] [Commented] (HBASE-12328) Need to separate JvmMetrics for Master and RegionServer
[ https://issues.apache.org/jira/browse/HBASE-12328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14181154#comment-14181154 ]

Hadoop QA commented on HBASE-12328:

{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12676542/HBASE-12328.patch
against trunk revision .

ATTACHMENT ID: 12676542

{color:green}+1 @author{color}. The patch does not contain any @author tags.
{color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
{color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings.
{color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages.
{color:green}+1 checkstyle{color}. The applied patch does not increase the total number of checkstyle errors.
{color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings.
{color:red}-1 release audit{color}. The applied patch generated 1 release audit warnings (more than the trunk's current 0 warnings).
{color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100.
{color:green}+1 site{color}. The mvn site goal succeeds with this patch.
{color:red}-1 core tests{color}. The patch failed these unit tests:
org.apache.hadoop.hbase.ipc.TestRpcMetrics

Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/11452//testReport/
Release audit warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11452//artifact/patchprocess/patchReleaseAuditWarnings.txt
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11452//artifact/patchprocess/newPatchFindbugsWarningshbase-rest.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11452//artifact/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11452//artifact/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11452//artifact/patchprocess/newPatchFindbugsWarningshbase-annotations.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11452//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11452//artifact/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11452//artifact/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11452//artifact/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11452//artifact/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11452//artifact/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11452//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Checkstyle Errors: https://builds.apache.org/job/PreCommit-HBASE-Build/11452//artifact/patchprocess/checkstyle-aggregate.html
Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/11452//console

This message is automatically generated.
[jira] [Commented] (HBASE-11368) Multi-column family BulkLoad fails if compactions go on too long
[ https://issues.apache.org/jira/browse/HBASE-11368?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14181168#comment-14181168 ]

Qiang Tian commented on HBASE-11368:

Initial YCSB test:

Env:
---
hadoop 2.2.0
YCSB 1.0.4 (Andrew's branch)
3 nodes, 1 master, 2 RS // ignore cluster details since this is just to evaluate the new lock

Steps:
---
Followed Andrew's steps (see http://search-hadoop.com/m/DHED4hl7pC/). The seed table has 3 CFs, pre-split to 20 regions. Load 1 million rows to CF 'f1' using workloada, then run 3 iterations for workloadc and workloada respectively. The parameters in each run:
bq. -p columnfamily=f1 -p operationcount=100 -s -threads 10

Results:
---
0.98.5:
workload c:
[READ], AverageLatency(us), 496.225811
[READ], AverageLatency(us), 510.206831
[READ], AverageLatency(us), 501.256123
workload a:
[READ], AverageLatency(us), 676.4527555821747
[READ], AverageLatency(us), 622.5544771452717
[READ], AverageLatency(us), 628.1365657163067

0.98.5+patch:
workload c:
[READ], AverageLatency(us), 536.334437
[READ], AverageLatency(us), 508.40
[READ], AverageLatency(us), 491.416182
workload a:
[READ], AverageLatency(us), 640.3625218319231
[READ], AverageLatency(us), 642.9719823488798
[READ], AverageLatency(us), 631.7491770928287

It looks like there is little performance penalty. I also ran PE in the cluster; since the test table has only 1 CF, the new lock is actually not used. Interestingly, with the patch the performance is even a bit better...

Multi-column family BulkLoad fails if compactions go on too long
----------------------------------------------------------------

Key: HBASE-11368
URL: https://issues.apache.org/jira/browse/HBASE-11368
Project: HBase
Issue Type: Bug
Reporter: stack
Assignee: Qiang Tian
Attachments: hbase-11368-0.98.5.patch

Compactions take a read lock. In a multi-column family region, before bulk loading, we want to take a write lock on the region. If the compaction takes too long, the bulk load fails.

Various recipes include:
+ Making smaller regions (lame)
+ [~victorunique] suggests major compacting just before bulk loading over in HBASE-10882 as a workaround.

Does the compaction need a read lock for that long? Does the bulk load need a full write lock when multiple column families? Can we fail more gracefully at least?

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
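[Editor's note] The failure mode described in this issue can be demonstrated with a plain java.util.concurrent read-write lock. This is a generic sketch, not HBase's region lock code; the method name and the millisecond values are made up for illustration. A "compaction" thread holds the read lock for a while; a "bulk load" that only waits a bounded time for the write lock gives up and fails.

```java
import java.util.concurrent.TimeUnit;
import java.util.concurrent.locks.ReentrantReadWriteLock;

// Generic sketch of the HBASE-11368 failure mode (not HBase code): a
// long-held read lock (the compaction) makes a bounded-wait write-lock
// attempt (the multi-CF bulk load) time out and fail.
public class BulkLoadLockSketch {
    // Returns whether a "bulk load" that waits writeWaitMs for the write
    // lock succeeds while a "compaction" thread holds the read lock for
    // readHoldMs.
    static boolean bulkLoadSucceeds(long readHoldMs, long writeWaitMs) {
        ReentrantReadWriteLock lock = new ReentrantReadWriteLock();
        Thread compaction = new Thread(() -> {
            lock.readLock().lock();
            try {
                Thread.sleep(readHoldMs);   // simulated long compaction
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            } finally {
                lock.readLock().unlock();
            }
        });
        try {
            compaction.start();
            Thread.sleep(20);  // let the "compaction" take the read lock first
            boolean got = lock.writeLock().tryLock(writeWaitMs, TimeUnit.MILLISECONDS);
            if (got) {
                lock.writeLock().unlock();
            }
            compaction.join();
            return got;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
    }

    public static void main(String[] args) {
        // Long compaction, short patience: the bulk load fails.
        System.out.println(bulkLoadSucceeds(500, 100));
        // Patient enough to outlast the compaction: it succeeds.
        System.out.println(bulkLoadSucceeds(50, 1000));
    }
}
```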
[jira] [Commented] (HBASE-12327) MetricsHBaseServerSourceFactory#createContextName has wrong conditions
[ https://issues.apache.org/jira/browse/HBASE-12327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14181178#comment-14181178 ]

Hadoop QA commented on HBASE-12327:

{color:red}-1 overall{color}. Here are the results of testing the latest attachment
http://issues.apache.org/jira/secure/attachment/12676540/HBASE-12327.patch
against trunk revision .

ATTACHMENT ID: 12676540

{color:green}+1 @author{color}. The patch does not contain any @author tags.
{color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch.
{color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings.
{color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages.
{color:green}+1 checkstyle{color}. The applied patch does not increase the total number of checkstyle errors.
{color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings.
{color:red}-1 release audit{color}. The applied patch generated 1 release audit warnings (more than the trunk's current 0 warnings).
{color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100.
{color:green}+1 site{color}. The mvn site goal succeeds with this patch.
{color:green}+1 core tests{color}. The patch passed unit tests in .

Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/11451//testReport/
Release audit warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11451//artifact/patchprocess/patchReleaseAuditWarnings.txt
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11451//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11451//artifact/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11451//artifact/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11451//artifact/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11451//artifact/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11451//artifact/patchprocess/newPatchFindbugsWarningshbase-rest.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11451//artifact/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11451//artifact/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11451//artifact/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11451//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11451//artifact/patchprocess/newPatchFindbugsWarningshbase-annotations.html
Checkstyle Errors: https://builds.apache.org/job/PreCommit-HBASE-Build/11451//artifact/patchprocess/checkstyle-aggregate.html
Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/11451//console

This message is automatically generated.
[jira] [Commented] (HBASE-12324) Improve compaction speed and process for immutable short lived datasets
[ https://issues.apache.org/jira/browse/HBASE-12324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14181317#comment-14181317 ]

Sean Busbey commented on HBASE-12324:

In the case where we have a table-wide TTL, is there any reason not to just do a delete-only optimization in the general compaction policy? We could add to the fixed trailer the newest timestamp of all the cells in the HFile.

Improve compaction speed and process for immutable short lived datasets
-----------------------------------------------------------------------

Key: HBASE-12324
URL: https://issues.apache.org/jira/browse/HBASE-12324
Project: HBase
Issue Type: New Feature
Components: Compaction
Affects Versions: 0.98.0, 0.96.0
Reporter: Sheetal Dolas

We have seen multiple cases where HBase is used to store immutable data and the data lives for a short period of time (a few days). On very high volume systems, major compactions become very costly and slow down ingestion rates. In all such use cases (immutable data, high write rate, moderate read rates, and shorter TTL), avoiding compactions entirely and just deleting old data brings a lot of performance benefits. We should have a compaction policy that can only delete/archive files older than the TTL and not compact any files. Also attaching a patch that can do so.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
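[Editor's note] The delete-only selection suggested in the comment above can be sketched in a few lines. This is not the attached patch; all names here are hypothetical. The idea: if the newest cell timestamp recorded per HFile (e.g. in the fixed trailer, as proposed) is older than the TTL, every cell in that file is expired, so the file can be archived outright instead of being rewritten.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of a delete-only compaction selection: a file whose
// newest cell timestamp is past the TTL contains only expired cells and
// can be archived without rewriting anything.
public class TtlArchiveSketch {
    // newestTimestamps[i]: newest cell timestamp recorded for HFile i.
    static List<Integer> filesToArchive(long[] newestTimestamps, long ttlMs, long nowMs) {
        List<Integer> expired = new ArrayList<>();
        for (int i = 0; i < newestTimestamps.length; i++) {
            if (nowMs - newestTimestamps[i] > ttlMs) {
                expired.add(i);  // every cell in this file is past its TTL
            }
        }
        return expired;
    }

    public static void main(String[] args) {
        long now = 1_000_000L;
        long ttl = 100_000L;
        long[] newest = {850_000L, 950_000L, 600_000L};
        // Files 0 and 2 are wholly expired; file 1 still has live cells.
        System.out.println(filesToArchive(newest, ttl, now));  // [0, 2]
    }
}
```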
[jira] [Commented] (HBASE-12327) MetricsHBaseServerSourceFactory#createContextName has wrong conditions
[ https://issues.apache.org/jira/browse/HBASE-12327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14181373#comment-14181373 ]

Ted Yu commented on HBASE-12327:

Why are both HMaster and master checked? Thanks
[jira] [Commented] (HBASE-12287) Add retry runners to the most commonly failing tests
[ https://issues.apache.org/jira/browse/HBASE-12287?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181426#comment-14181426 ]

Alex Newman commented on HBASE-12287:

3 out of 5 makes sense. I think it might also make sense if we were still alerted of it, and if we could track it some way. Ideas?

Add retry runners to the most commonly failing tests
----------------------------------------------------

Key: HBASE-12287
URL: https://issues.apache.org/jira/browse/HBASE-12287
Project: HBase
Issue Type: Sub-task
Reporter: Alex Newman
Assignee: Alex Newman
Attachments: HBASE-12287.patch

Many of our tests have nondeterministic behavior due to inter-test interference. Usually restarting the test is enough to verify whether it is test interference or a broken test. Let's use a retry runner which runs the after-test/before-test steps and reruns the tests 10 times.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
[jira] [Created] (HBASE-12329) Table create with duplicate column family names quietly succeeds
Sean Busbey created HBASE-12329:

Summary: Table create with duplicate column family names quietly succeeds
Key: HBASE-12329
URL: https://issues.apache.org/jira/browse/HBASE-12329
Project: HBase
Issue Type: Bug
Components: Client, shell
Reporter: Sean Busbey
Priority: Minor

From the mailing list:

{quote}
I was expecting that it is forbidden, **but** this call does not throw any exception
{code}
String[] families = {"cf", "cf"};
HTableDescriptor desc = new HTableDescriptor(name);
for (String cf : families) {
  HColumnDescriptor coldef = new HColumnDescriptor(cf);
  desc.addFamily(coldef);
}
try {
  admin.createTable(desc);
} catch (TableExistsException e) {
  throw new IOException("table '" + name + "' already exists");
}
{code}
{quote}

And Ted's follow-up replicates in the shell:

{quote}
hbase(main):001:0> create 't2', {NAME => 'f1'}, {NAME => 'f1'}

The table got created - with 1 column family:

hbase(main):002:0> describe 't2'
DESCRIPTION ENABLED
't2', {NAME => 'f1', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER => 'ROW', REPLICATION_SCOPE => '0 true
', VERSIONS => '1', COMPRESSION => 'NONE', MIN_VERSIONS => '0', TTL => '2147483647', KEEP_DELETED
_CELLS => 'false', BLOCKSIZE => '65536', IN_MEMORY => 'false', BLOCKCACHE => 'true'}
1 row(s) in 0.1000 seconds
{quote}

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
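[Editor's note] The mailing-list snippet succeeds silently because the descriptor keeps families keyed by name, so adding "cf" twice collapses to one family. A minimal client-side validation sketch follows; it is not HBase's eventual fix, and the class and method names are hypothetical.

```java
import java.util.HashSet;
import java.util.Set;

// Standalone sketch (not HBase's fix for HBASE-12329): reject duplicate
// column family names up front, since adding the same name twice to a
// table descriptor silently collapses to a single family.
public class DuplicateFamilyCheckSketch {
    static void validateFamilies(String[] families) {
        Set<String> seen = new HashSet<>();
        for (String cf : families) {
            if (!seen.add(cf)) {  // add() returns false on a duplicate
                throw new IllegalArgumentException("duplicate column family: " + cf);
            }
        }
    }

    public static void main(String[] args) {
        validateFamilies(new String[] {"f1", "f2"});  // distinct names: fine
        try {
            validateFamilies(new String[] {"cf", "cf"});
        } catch (IllegalArgumentException e) {
            System.out.println(e.getMessage());  // duplicate column family: cf
        }
    }
}
```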
[jira] [Updated] (HBASE-12329) Table create with duplicate column family names quietly succeeds
[ https://issues.apache.org/jira/browse/HBASE-12329?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Busbey updated HBASE-12329: Description: From the mailing list {quote} I was expecting that it is forbidden, **but** this call does not throw any exception {code} String[] families = {cf, cf}; HTableDescriptor desc = new HTableDescriptor(name); for (String cf : families) { HColumnDescriptor coldef = new HColumnDescriptor(cf); desc.addFamily(coldef); } try { admin.createTable(desc); } catch (TableExistsException e) { throw new IOException("table '" + name + "' already exists"); } {code} {quote} And Ted's follow-up replicates in the shell {code} hbase(main):001:0> create 't2', {NAME => 'f1'}, {NAME => 'f1'} The table got created - with 1 column family: hbase(main):002:0> describe 't2' DESCRIPTION ENABLED 't2', {NAME => 'f1', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER => 'ROW', REPLICATION_SCOPE => '0 true ', VERSIONS => '1', COMPRESSION => 'NONE', MIN_VERSIONS => '0', TTL => '2147483647', KEEP_DELETED_CELLS => 'false', BLOCKSIZE => '65536', IN_MEMORY => 'false', BLOCKCACHE => 'true'} 1 row(s) in 0.1000 seconds {code} was: From the mailing list {quote} I was expecting that it is forbidden, **but** this call does not throw any exception {code} String[] families = {cf, cf}; HTableDescriptor desc = new HTableDescriptor(name); for (String cf : families) { HColumnDescriptor coldef = new HColumnDescriptor(cf); desc.addFamily(coldef); } try { admin.createTable(desc); } catch (TableExistsException e) { throw new IOException("table '" + name + "' already exists"); } {code} {quote} And Ted's follow-up replicates in the shell {quote} hbase(main):001:0> create 't2', {NAME => 'f1'}, {NAME => 'f1'} The table got created - with 1 column family: hbase(main):002:0> describe 't2' DESCRIPTION ENABLED 't2', {NAME => 'f1', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER => 'ROW', REPLICATION_SCOPE => '0 true ', VERSIONS => '1', COMPRESSION => 'NONE', MIN_VERSIONS => '0', TTL => '2147483647',
KEEP_DELETED_CELLS => 'false', BLOCKSIZE => '65536', IN_MEMORY => 'false', BLOCKCACHE => 'true'} 1 row(s) in 0.1000 seconds {quote} Table create with duplicate column family names quietly succeeds Key: HBASE-12329 URL: https://issues.apache.org/jira/browse/HBASE-12329 Project: HBase Issue Type: Bug Components: Client, shell Reporter: Sean Busbey Priority: Minor From the mailing list {quote} I was expecting that it is forbidden, **but** this call does not throw any exception {code} String[] families = {cf, cf}; HTableDescriptor desc = new HTableDescriptor(name); for (String cf : families) { HColumnDescriptor coldef = new HColumnDescriptor(cf); desc.addFamily(coldef); } try { admin.createTable(desc); } catch (TableExistsException e) { throw new IOException("table '" + name + "' already exists"); } {code} {quote} And Ted's follow-up replicates in the shell {code} hbase(main):001:0> create 't2', {NAME => 'f1'}, {NAME => 'f1'} The table got created - with 1 column family: hbase(main):002:0> describe 't2' DESCRIPTION ENABLED 't2', {NAME => 'f1', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER => 'ROW', REPLICATION_SCOPE => '0 true ', VERSIONS => '1', COMPRESSION => 'NONE', MIN_VERSIONS => '0', TTL => '2147483647', KEEP_DELETED_CELLS => 'false', BLOCKSIZE => '65536', IN_MEMORY => 'false', BLOCKCACHE => 'true'} 1 row(s) in 0.1000 seconds {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12329) Table create with duplicate column family names quietly succeeds
[ https://issues.apache.org/jira/browse/HBASE-12329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181434#comment-14181434 ] Sean Busbey commented on HBASE-12329: - I'd be curious what happens if you give different per-CF options. It might be safe to just issue a WARN in the shell case and have the call to ``desc.addFamily`` throw. Opinions? Table create with duplicate column family names quietly succeeds Key: HBASE-12329 URL: https://issues.apache.org/jira/browse/HBASE-12329 Project: HBase Issue Type: Bug Components: Client, shell Reporter: Sean Busbey Priority: Minor From the mailing list {quote} I was expecting that it is forbidden, **but** this call does not throw any exception {code} String[] families = {cf, cf}; HTableDescriptor desc = new HTableDescriptor(name); for (String cf : families) { HColumnDescriptor coldef = new HColumnDescriptor(cf); desc.addFamily(coldef); } try { admin.createTable(desc); } catch (TableExistsException e) { throw new IOException("table '" + name + "' already exists"); } {code} {quote} And Ted's follow-up replicates in the shell {code} hbase(main):001:0> create 't2', {NAME => 'f1'}, {NAME => 'f1'} The table got created - with 1 column family: hbase(main):002:0> describe 't2' DESCRIPTION ENABLED 't2', {NAME => 'f1', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER => 'ROW', REPLICATION_SCOPE => '0 true ', VERSIONS => '1', COMPRESSION => 'NONE', MIN_VERSIONS => '0', TTL => '2147483647', KEEP_DELETED_CELLS => 'false', BLOCKSIZE => '65536', IN_MEMORY => 'false', BLOCKCACHE => 'true'} 1 row(s) in 0.1000 seconds {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12325) Add Utility to remove snapshot from a directory
[ https://issues.apache.org/jira/browse/HBASE-12325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181489#comment-14181489 ] Elliott Clark commented on HBASE-12325: --- bq.The initial design was exporting from one hbase cluster to another, Export to HDFS has been there for a long time now. What the initial design was doesn't matter much. It's all about what's useful now. bq.anyway, you are using export in a wrong way Disagree. This is an easy way to get differential backups that are easily importable and clean up is pretty easy. If HDFS ever gets hard links then this can go away. But that doesn't look like it will ever happen. So users can get differential backups right now, by: # Export all of your snapshots into the same directory on a remote HDFS. # Keep N snapshots, removing the oldest after each export. # That's it. There are no other steps required. bq.or, you can use the tool. but the tool requires coordination. I'm working on a tool that doesn't require any coordination. It just removes the files from the oldest snapshot that are not referenced any more. So as long as the oldest snapshot is not still in transition (pretty easy if you're keeping more than 2 snapshots), you can run clean up and snapshot in parallel. bq.in my opinion this tool doesn't belong to hbase-core Users of HBase can benefit from a way of doing differential backups. Right now this is the best way to do that. Add Utility to remove snapshot from a directory --- Key: HBASE-12325 URL: https://issues.apache.org/jira/browse/HBASE-12325 Project: HBase Issue Type: Bug Affects Versions: 2.0.0 Reporter: Elliott Clark Assignee: Elliott Clark Attachments: DeleteRemoteSnapshotTool.java If there are several snapshots exported to a single directory, it's nice to be able to remove the oldest one. Since snapshots in the same directory can share files it's not as simple as just removing all files in a snapshot.
-- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12328) Need to separate JvmMetrics for Master and RegionServer
[ https://issues.apache.org/jira/browse/HBASE-12328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181492#comment-14181492 ] Elliott Clark commented on HBASE-12328: --- Failure looks related. Need to separate JvmMetrics for Master and RegionServer --- Key: HBASE-12328 URL: https://issues.apache.org/jira/browse/HBASE-12328 Project: HBase Issue Type: Improvement Reporter: Sanghyun Yun Priority: Minor Attachments: HBASE-12328.patch tag.ProcessName of JvmMetrics is IPC. It is the same for both Master and RegionServer. {code:title=HBase(Master and RegionServer)'s Metrics Dump} ... name: Hadoop:service=HBase,name=JvmMetrics, modelerType: JvmMetrics, tag.Context: jvm, tag.ProcessName: IPC, tag.SessionId: , ... {code} When I use HBase with Ganglia, I wrote tagsForPrefix.jvm=ProcessName in hadoop-metrics2-hbase.properties. {code:title=hadoop-metrics2-hbase.properties} ... *.sink.ganglia.class=org.apache.hadoop.metrics2.sink.ganglia.GangliaSink31 hbase.sink.ganglia.tagsForPrefix.jvm=ProcessName ... {code} But Ganglia generates only one RRD file because tag.ProcessName is IPC for both Master and RegionServer. I think we need to separate JvmMetrics for Master and RegionServer. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12325) Add Utility to remove snapshot from a directory
[ https://issues.apache.org/jira/browse/HBASE-12325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181504#comment-14181504 ] Matteo Bertozzi commented on HBASE-12325: - {quote} I'm working on a tool that doesn't require any coordination. It just removes the files from the oldest snapshot that are not referenced any more. So as long as the oldest snapshot is not still in transition (pretty easy if you're keeping more than 2 snapshots). Then you can run clean up and snapshot in parallel. {quote} on the last sentence you are implying coordination (export and clean, not snapshot... snapshot are hbase local and don't impact what you do on the remote copy) if you look at what I have attached it is basically what you are describing. Add Utility to remove snapshot from a directory --- Key: HBASE-12325 URL: https://issues.apache.org/jira/browse/HBASE-12325 Project: HBase Issue Type: Bug Affects Versions: 2.0.0 Reporter: Elliott Clark Assignee: Elliott Clark Attachments: DeleteRemoteSnapshotTool.java If there are several snapshots exported to a single directory, it's nice to be able to remove the oldest one. Since snapshots in the same directory can share files it's not as simple as just removing all files in a snapshot. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12325) Add Utility to remove snapshot from a directory
[ https://issues.apache.org/jira/browse/HBASE-12325?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181515#comment-14181515 ] Elliott Clark commented on HBASE-12325: --- bq.on the last sentence you are implying coordination (export and clean, not snapshot... snapshot are hbase local and don't impact what you do on the remote copy) Export, snapshot, and clean up can all run in parallel as long as you are keeping more than one snapshot. Add Utility to remove snapshot from a directory --- Key: HBASE-12325 URL: https://issues.apache.org/jira/browse/HBASE-12325 Project: HBase Issue Type: Bug Affects Versions: 2.0.0 Reporter: Elliott Clark Assignee: Elliott Clark Attachments: DeleteRemoteSnapshotTool.java If there are several snapshots exported to a single directory, it's nice to be able to remove the oldest one. Since snapshots in the same directory can share files it's not as simple as just removing all files in a snapshot. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
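The coordination-free cleanup Elliott describes reduces to set arithmetic over file references: a file in the oldest snapshot is safe to delete only if no newer snapshot in the same export directory still references it. An illustrative sketch under that assumption; `SnapshotCleanupSketch` and its file names are hypothetical, not the attached DeleteRemoteSnapshotTool:

```java
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Sketch of reference-based snapshot cleanup: snapshots exported into one
// directory can share hfiles, so deleting the oldest snapshot means deleting
// only the files no newer snapshot still points at.
public class SnapshotCleanupSketch {

    // Returns the subset of the oldest snapshot's files that nothing newer references.
    public static Set<String> deletableFiles(Set<String> oldestSnapshotFiles,
                                             List<Set<String>> newerSnapshots) {
        Set<String> stillReferenced = new HashSet<>();
        for (Set<String> snapshot : newerSnapshots) {
            stillReferenced.addAll(snapshot);
        }
        Set<String> deletable = new HashSet<>(oldestSnapshotFiles);
        deletable.removeAll(stillReferenced); // shared hfiles must survive
        return deletable;
    }

    public static void main(String[] args) {
        Set<String> oldest = new HashSet<>(Arrays.asList("hfile-a", "hfile-b"));
        List<Set<String>> newer = Arrays.asList(
            new HashSet<>(Arrays.asList("hfile-b", "hfile-c")));
        // Only hfile-a is safe to delete; hfile-b is shared with a newer snapshot.
        System.out.println(deletableFiles(oldest, newer)); // prints [hfile-a]
    }
}
```

Because the computation only ever touches the oldest snapshot's files, an export of a new snapshot can proceed in parallel, which is the no-coordination property being debated in the thread.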
[jira] [Commented] (HBASE-12328) Need to separate JvmMetrics for Master and RegionServer
[ https://issues.apache.org/jira/browse/HBASE-12328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181517#comment-14181517 ] stack commented on HBASE-12328: --- Does name : Hadoop:service=HBase,name=IPC,sub=IPC have the same issue? Thanks. Need to separate JvmMetrics for Master and RegionServer --- Key: HBASE-12328 URL: https://issues.apache.org/jira/browse/HBASE-12328 Project: HBase Issue Type: Improvement Reporter: Sanghyun Yun Priority: Minor Attachments: HBASE-12328.patch tag.ProcessName of JvmMetrics is IPC. It is the same for both Master and RegionServer. {code:title=HBase(Master and RegionServer)'s Metrics Dump} ... name: Hadoop:service=HBase,name=JvmMetrics, modelerType: JvmMetrics, tag.Context: jvm, tag.ProcessName: IPC, tag.SessionId: , ... {code} When I use HBase with Ganglia, I wrote tagsForPrefix.jvm=ProcessName in hadoop-metrics2-hbase.properties. {code:title=hadoop-metrics2-hbase.properties} ... *.sink.ganglia.class=org.apache.hadoop.metrics2.sink.ganglia.GangliaSink31 hbase.sink.ganglia.tagsForPrefix.jvm=ProcessName ... {code} But Ganglia generates only one RRD file because tag.ProcessName is IPC for both Master and RegionServer. I think we need to separate JvmMetrics for Master and RegionServer. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HBASE-6635) Refactor HFile version selection and exception handling.
[ https://issues.apache.org/jira/browse/HBASE-6635?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HBASE-6635: - Attachment: hfile.png Let's also squash all this hfilereaderv2 and v3 and abstract hfilereader and scannerv3 + abstractscanner + scannerv2, etc. See the attached diagram for a view of some of the convolutions we've made. Refactor HFile version selection and exception handling. Key: HBASE-6635 URL: https://issues.apache.org/jira/browse/HBASE-6635 Project: HBase Issue Type: Bug Reporter: Jonathan Hsieh Attachments: hfile.png Trunk and 0.94's HFile code has some fairly convoluted code for bypassing checksums and has mixed usage of runtime and io exceptions when error conditions arise. This jira would clean up the code to have better encapsulation and be more explicit about what kinds of exceptions are thrown and what they mean. (This was partially spurred by comments in reviews of HBASE-6586). -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12285) Builds are failing, possibly because of SUREFIRE-1091
[ https://issues.apache.org/jira/browse/HBASE-12285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181622#comment-14181622 ] stack commented on HBASE-12285: --- [~dimaspivak] Am game for trying any experiment you want. Make a patch for the no reuse and I'll shove it in along w/ WARN to see if it helps. Builds are failing, possibly because of SUREFIRE-1091 - Key: HBASE-12285 URL: https://issues.apache.org/jira/browse/HBASE-12285 Project: HBase Issue Type: Bug Affects Versions: 1.0.0 Reporter: Dima Spivak Assignee: Dima Spivak Priority: Blocker Attachments: HBASE-12285_branch-1_v1.patch Our branch-1 builds on builds.apache.org have been failing in recent days after we switched over to an official version of Surefire a few days back (HBASE-4955). The version we're using, 2.17, is hit by a bug ([SUREFIRE-1091|https://jira.codehaus.org/browse/SUREFIRE-1091]) that results in an IOException, which looks like what we're seeing on Jenkins. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HBASE-12312) Another couple of createTable race conditions
[ https://issues.apache.org/jira/browse/HBASE-12312?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HBASE-12312: -- Attachment: HBASE-12312_master_v3 (1).patch Another couple of createTable race conditions - Key: HBASE-12312 URL: https://issues.apache.org/jira/browse/HBASE-12312 Project: HBase Issue Type: Bug Reporter: Dima Spivak Assignee: Dima Spivak Attachments: HBASE-12312_master_v1.patch, HBASE-12312_master_v2.patch, HBASE-12312_master_v3 (1).patch, HBASE-12312_master_v3.patch, HBASE-12312_master_v3.patch, HBASE-12312_master_v3.patch Found a couple more failing tests in TestAccessController and TestScanEarlyTermination caused by my favorite race condition. :) Will post a patch in a second. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12285) Builds are failing, possibly because of SUREFIRE-1091
[ https://issues.apache.org/jira/browse/HBASE-12285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181626#comment-14181626 ] Dima Spivak commented on HBASE-12285: - The reuseForks=false could just be set in the mvn command line in the branch-1 Jenkins job. Builds are failing, possibly because of SUREFIRE-1091 - Key: HBASE-12285 URL: https://issues.apache.org/jira/browse/HBASE-12285 Project: HBase Issue Type: Bug Affects Versions: 1.0.0 Reporter: Dima Spivak Assignee: Dima Spivak Priority: Blocker Attachments: HBASE-12285_branch-1_v1.patch Our branch-1 builds on builds.apache.org have been failing in recent days after we switched over to an official version of Surefire a few days back (HBASE-4955). The version we're using, 2.17, is hit by a bug ([SUREFIRE-1091|https://jira.codehaus.org/browse/SUREFIRE-1091]) that results in an IOException, which looks like what we're seeing on Jenkins. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-11125) Introduce a higher level interface for registering interest in coprocessor upcalls
[ https://issues.apache.org/jira/browse/HBASE-11125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181625#comment-14181625 ] stack commented on HBASE-11125: --- Should we remove this issue from 1.0 since it has no assignee and is seeing no progress? Introduce a higher level interface for registering interest in coprocessor upcalls -- Key: HBASE-11125 URL: https://issues.apache.org/jira/browse/HBASE-11125 Project: HBase Issue Type: New Feature Reporter: Andrew Purtell Priority: Critical Fix For: 0.99.2 We should introduce a higher level interface for managing the registration of 'user' code for execution from the low level hooks. It should not be necessary for coprocessor implementers to learn the universe of available low level hooks and the subtleties of their placement within HBase core code. Instead the higher level API should allow the implementer to describe their intent and then this API should choose the appropriate low level hook placement. A very desirable side effect is a layer of indirection between coprocessor implementers and the actual hooks. This will address the perennial complaint that the low level hooks change too much from release to release, as recently discussed during the RM panel at HBaseCon. If we try to avoid changing the particular placement and arguments of hook functions in response to those complaints, this can be an onerous constraint on necessary internals evolution. Instead we can direct coprocessor implementers to consider the new API and provide the same interface stability guarantees there as we do for client API, -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HBASE-9864) Notifications bus for use by cluster members keeping up-to-date on changes
[ https://issues.apache.org/jira/browse/HBASE-9864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HBASE-9864: - Fix Version/s: (was: 0.99.2) Notifications bus for use by cluster members keeping up-to-date on changes -- Key: HBASE-9864 URL: https://issues.apache.org/jira/browse/HBASE-9864 Project: HBase Issue Type: Brainstorming Reporter: stack Priority: Blocker In namespaces and acls, zk callbacks are used so all participating servers are notified when there is a change in acls/namespaces list. The new visibility tags feature coming in copies the same model of using zk with listeners for the features' particular notifications. Three systems each w/ their own implementation of the notifications all using zk w/ their own feature-specific watchers. Should probably unify. Do we have to go via zk? Seems like all want to be notified when an hbase table is updated. Could we tell servers directly rather than go via zk? -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-9864) Notifications bus for use by cluster members keeping up-to-date on changes
[ https://issues.apache.org/jira/browse/HBASE-9864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181634#comment-14181634 ] stack commented on HBASE-9864: -- Removed from branch-1 because no assignee and not being worked on. Notifications bus for use by cluster members keeping up-to-date on changes -- Key: HBASE-9864 URL: https://issues.apache.org/jira/browse/HBASE-9864 Project: HBase Issue Type: Brainstorming Reporter: stack Priority: Blocker In namespaces and acls, zk callbacks are used so all participating servers are notified when there is a change in acls/namespaces list. The new visibility tags feature coming in copies the same model of using zk with listeners for the features' particular notifications. Three systems each w/ their own implementation of the notifications all using zk w/ their own feature-specific watchers. Should probably unify. Do we have to go via zk? Seems like all want to be notified when an hbase table is updated. Could we tell servers directly rather than go via zk? -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-9864) Notifications bus for use by cluster members keeping up-to-date on changes
[ https://issues.apache.org/jira/browse/HBASE-9864?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181636#comment-14181636 ] stack commented on HBASE-9864: -- bq not being worked on. ... in time for 1.0 release (to my knowledge -- correct me if I am wrong) Notifications bus for use by cluster members keeping up-to-date on changes -- Key: HBASE-9864 URL: https://issues.apache.org/jira/browse/HBASE-9864 Project: HBase Issue Type: Brainstorming Reporter: stack Priority: Blocker In namespaces and acls, zk callbacks are used so all participating servers are notified when there is a change in acls/namespaces list. The new visibility tags feature coming in copies the same model of using zk with listeners for the features' particular notifications. Three systems each w/ their own implementation of the notifications all using zk w/ their own feature-specific watchers. Should probably unify. Do we have to go via zk? Seems like all want to be notified when an hbase table is updated. Could we tell servers directly rather than go via zk? -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (HBASE-12330) compaction bandwidth limit
Ming Chen created HBASE-12330: - Summary: compaction bandwidth limit Key: HBASE-12330 URL: https://issues.apache.org/jira/browse/HBASE-12330 Project: HBase Issue Type: New Feature Components: Admin, Compaction Affects Versions: 0.89-fb Environment: software platform Reporter: Ming Chen Priority: Minor Fix For: 0.89-fb Compaction runs at full speed. This change provides two knobs to limit the compaction bandwidth: 1. compactBwLimit: the desired max bandwidth per compaction. If the compaction thread runs too fast, it will be put to sleep for a while; 2. numOfFilesDisableCompactLimit: limiting the compaction speed increases the number of files per store, which increases read latency. This parameter disables the compaction limit when the number of files in a store exceeds the value. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
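The two knobs could interact roughly as follows. This is a sketch under stated assumptions: the parameter names follow the issue text, but the throttle arithmetic and class are illustrative, not the 0.89-fb patch:

```java
// Illustrative compaction throttle: sleep when the observed write rate
// exceeds compactBwLimit, and bypass the limit entirely once a store has
// accumulated too many files (numOfFilesDisableCompactLimit).
public class CompactionThrottleSketch {
    private final long compactBwLimit;               // bytes per second
    private final int numOfFilesDisableCompactLimit; // file-count escape hatch

    public CompactionThrottleSketch(long compactBwLimit, int numOfFilesDisableCompactLimit) {
        this.compactBwLimit = compactBwLimit;
        this.numOfFilesDisableCompactLimit = numOfFilesDisableCompactLimit;
    }

    // How long (ms) the compaction thread should sleep after writing
    // bytesWritten in elapsedMs, given the store's current file count.
    public long throttleMillis(long bytesWritten, long elapsedMs, int storeFileCount) {
        if (storeFileCount >= numOfFilesDisableCompactLimit) {
            return 0; // too many files already: let compaction run at full speed
        }
        // Time the write should have taken at the limit, minus the actual time.
        long expectedMs = bytesWritten * 1000 / compactBwLimit;
        return Math.max(0, expectedMs - elapsedMs);
    }

    public static void main(String[] args) {
        CompactionThrottleSketch t = new CompactionThrottleSketch(10_000_000, 20);
        // Wrote 10 MB in 500 ms (20 MB/s) against a 10 MB/s limit: sleep 500 ms.
        System.out.println(t.throttleMillis(10_000_000, 500, 5)); // prints 500
        // Same write, but the store already holds 20 files: limit disabled.
        System.out.println(t.throttleMillis(10_000_000, 500, 20)); // prints 0
    }
}
```

The escape hatch captures the trade-off stated in the issue: throttling trades compaction speed for read latency, so once file counts climb the limit stops applying.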
[jira] [Commented] (HBASE-10856) Prep for 1.0
[ https://issues.apache.org/jira/browse/HBASE-10856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181644#comment-14181644 ] stack commented on HBASE-10856: --- I removed links from HBASE-11122 Annotate coprocessor APIs, HBASE-11124 Pluggable major compaction strategy, HBASE-9864 Notifications bus for use by cluster members keeping up-to-date on changes because not being worked on (to best of my knowledge -- at least not in time for a 1.0 -- correct me if wrong). I also removed HBASE-11125 Introduce a higher level interface for registering interest in coprocessor upcalls though it still has fix version of 0.99.2 so it is still related to 1.0. Prep for 1.0 Key: HBASE-10856 URL: https://issues.apache.org/jira/browse/HBASE-10856 Project: HBase Issue Type: Umbrella Reporter: stack Fix For: 0.99.2 Tasks for 1.0 copied here from our '1.0.0' mailing list discussion. Idea is to file subtasks off this one. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-10856) Prep for 1.0
[ https://issues.apache.org/jira/browse/HBASE-10856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181652#comment-14181652 ] stack commented on HBASE-10856: --- I also remove HBASE-5827 [Coprocessors] Observer notifications on exceptions as a link to this issue. Prep for 1.0 Key: HBASE-10856 URL: https://issues.apache.org/jira/browse/HBASE-10856 Project: HBase Issue Type: Umbrella Reporter: stack Fix For: 0.99.2 Tasks for 1.0 copied here from our '1.0.0' mailing list discussion. Idea is to file subtasks off this one. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-11964) Improve spreading replication load from failed regionservers
[ https://issues.apache.org/jira/browse/HBASE-11964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181657#comment-14181657 ] Andrew Purtell commented on HBASE-11964: Cool, going to commit unless objection later today. Improve spreading replication load from failed regionservers Key: HBASE-11964 URL: https://issues.apache.org/jira/browse/HBASE-11964 Project: HBase Issue Type: Sub-task Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 2.0.0, 0.98.8, 0.94.25, 0.99.2 Attachments: HBASE-11964.patch, HBASE-11964.patch, HBASE-11964.patch Improve replication source thread handling. Improve fanout when transferring queues. Ensure replication sources terminate properly. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-11125) Introduce a higher level interface for registering interest in coprocessor upcalls
[ https://issues.apache.org/jira/browse/HBASE-11125?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181659#comment-14181659 ] Andrew Purtell commented on HBASE-11125: bq. Should we remove this issue from 1.0 since it has no assignee and is seeing no progress? Yes. Maybe for 1.1 or 2.0 Introduce a higher level interface for registering interest in coprocessor upcalls -- Key: HBASE-11125 URL: https://issues.apache.org/jira/browse/HBASE-11125 Project: HBase Issue Type: New Feature Reporter: Andrew Purtell Priority: Critical Fix For: 0.99.2 We should introduce a higher level interface for managing the registration of 'user' code for execution from the low level hooks. It should not be necessary for coprocessor implementers to learn the universe of available low level hooks and the subtleties of their placement within HBase core code. Instead the higher level API should allow the implementer to describe their intent and then this API should choose the appropriate low level hook placement. A very desirable side effect is a layer of indirection between coprocessor implementers and the actual hooks. This will address the perennial complaint that the low level hooks change too much from release to release, as recently discussed during the RM panel at HBaseCon. If we try to avoid changing the particular placement and arguments of hook functions in response to those complaints, this can be an onerous constraint on necessary internals evolution. Instead we can direct coprocessor implementers to consider the new API and provide the same interface stability guarantees there as we do for client API, -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HBASE-11125) Introduce a higher level interface for registering interest in coprocessor upcalls
[ https://issues.apache.org/jira/browse/HBASE-11125?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HBASE-11125: -- Fix Version/s: (was: 0.99.2) Introduce a higher level interface for registering interest in coprocessor upcalls -- Key: HBASE-11125 URL: https://issues.apache.org/jira/browse/HBASE-11125 Project: HBase Issue Type: New Feature Reporter: Andrew Purtell Priority: Critical We should introduce a higher level interface for managing the registration of 'user' code for execution from the low level hooks. It should not be necessary for coprocessor implementers to learn the universe of available low level hooks and the subtleties of their placement within HBase core code. Instead the higher level API should allow the implementer to describe their intent and then this API should choose the appropriate low level hook placement. A very desirable side effect is a layer of indirection between coprocessor implementers and the actual hooks. This will address the perennial complaint that the low level hooks change too much from release to release, as recently discussed during the RM panel at HBaseCon. If we try to avoid changing the particular placement and arguments of hook functions in response to those complaints, this can be an onerous constraint on necessary internals evolution. Instead we can direct coprocessor implementers to consider the new API and provide the same interface stability guarantees there as we do for client API, -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12285) Builds are failing, possibly because of SUREFIRE-1091
[ https://issues.apache.org/jira/browse/HBASE-12285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181669#comment-14181669 ] stack commented on HBASE-12285: --- Done. Made it this in the branch-1 build: -PrunAllTests -DreuseForks=false -Dmaven.test.redirectTestOutputToFile=true install -Dsurefire.secondPartThreadCount=2 -Dit.test=noItTest Why trunk mostly pass and branch-1 does not? Surefire is same for both? Let this run a while (I started a build). Will put in the WARN later after this change has baked a while. Builds are failing, possibly because of SUREFIRE-1091 - Key: HBASE-12285 URL: https://issues.apache.org/jira/browse/HBASE-12285 Project: HBase Issue Type: Bug Affects Versions: 1.0.0 Reporter: Dima Spivak Assignee: Dima Spivak Priority: Blocker Attachments: HBASE-12285_branch-1_v1.patch Our branch-1 builds on builds.apache.org have been failing in recent days after we switched over to an official version of Surefire a few days back (HBASE-4955). The version we're using, 2.17, is hit by a bug ([SUREFIRE-1091|https://jira.codehaus.org/browse/SUREFIRE-1091]) that results in an IOException, which looks like what we're seeing on Jenkins. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-11870) Optimization : Avoid copy of key and value for tags addition in AC and VC
[ https://issues.apache.org/jira/browse/HBASE-11870?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181689#comment-14181689 ] Andrew Purtell commented on HBASE-11870: lgtm Optimization : Avoid copy of key and value for tags addition in AC and VC - Key: HBASE-11870 URL: https://issues.apache.org/jira/browse/HBASE-11870 Project: HBase Issue Type: Improvement Components: Performance, security Affects Versions: 0.98.0 Reporter: Anoop Sam John Assignee: Anoop Sam John Attachments: HBASE-11870.patch In AC and VC we have to add the per cell ACL tags/ visibility tags to Cells. We get KeyValue objects and which need one backing array with key,value and tags. So in order to add a tag we have to recreate buffer the and copy the entire key , value and tags. We can avoid this Create a new Cell impl which wraps the original Cell and fro the non tag parts just refer this old buffer. This will contain a byte[] state for the tags part. Also we have to ensure we deal with Cells n write path not KV. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12142) Truncate command does not preserve ACLs table
[ https://issues.apache.org/jira/browse/HBASE-12142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181691#comment-14181691 ] Andrew Purtell commented on HBASE-12142: bq. Which parts would have to be applied to branch-1 and master? None it seems, I was looking at the 0.98 patch Truncate command does not preserve ACLs table - Key: HBASE-12142 URL: https://issues.apache.org/jira/browse/HBASE-12142 Project: HBase Issue Type: Bug Affects Versions: 0.98.6 Reporter: Vandana Ayyalasomayajula Assignee: Vandana Ayyalasomayajula Priority: Minor Labels: security Fix For: 2.0.0, 0.98.8, 0.99.2 Attachments: HBASE-12142_0.patch, HBASE-12142_1.patch, HBASE-12142_2.patch, HBASE-12142_98.patch, HBASE-12142_branch_1.patch, HBASE-12142_master_addendum.patch The current truncate command does not preserve acls on a table. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HBASE-11164) Document and test rolling updates from 0.98 - 1.0
[ https://issues.apache.org/jira/browse/HBASE-11164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HBASE-11164: -- Assignee: stack Document and test rolling updates from 0.98 - 1.0 -- Key: HBASE-11164 URL: https://issues.apache.org/jira/browse/HBASE-11164 Project: HBase Issue Type: Sub-task Reporter: Enis Soztutar Assignee: stack Priority: Critical Fix For: 0.99.2 I think 1.0 should be rolling upgradable from 0.98 unless we break it intentionally for a specific reason. Unless there is such an issue, lets document that 1.0 and 0.98 should be rolling upgrade compatible. We should also test this before the 0.99 release. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-11164) Document and test rolling updates from 0.98 - 1.0
[ https://issues.apache.org/jira/browse/HBASE-11164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181694#comment-14181694 ] stack commented on HBASE-11164: --- Trying to go from 0.96 to 0.99. I stopped the 0.96 master in a 0.96 cluster and tried to start the 0.99 master and got below: {code} 2014-10-23 11:12:37,537 FATAL [c2020:16020.activeMasterManager] master.HMaster: Unhandled exception. Starting shutdown. org.apache.hadoop.hbase.TableNotFoundException: hbase:namespace at org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.locateRegionInMeta(ConnectionManager.java:1189) at org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.locateRegion(ConnectionManager.java:1090) at org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.locateRegion(ConnectionManager.java:1074) at org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.locateRegion(ConnectionManager.java:1031) at org.apache.hadoop.hbase.client.ConnectionManager$HConnectionImplementation.getRegionLocation(ConnectionManager.java:865) at org.apache.hadoop.hbase.client.RegionServerCallable.prepare(RegionServerCallable.java:78) at org.apache.hadoop.hbase.client.RpcRetryingCaller.callWithRetries(RpcRetryingCaller.java:110) at org.apache.hadoop.hbase.client.HTable.get(HTable.java:847) at org.apache.hadoop.hbase.master.TableNamespaceManager.get(TableNamespaceManager.java:138) at org.apache.hadoop.hbase.master.TableNamespaceManager.isTableAvailableAndInitialized(TableNamespaceManager.java:268) at org.apache.hadoop.hbase.master.TableNamespaceManager.start(TableNamespaceManager.java:109) at org.apache.hadoop.hbase.master.HMaster.initNamespace(HMaster.java:749) at org.apache.hadoop.hbase.master.HMaster.finishActiveMasterInitialization(HMaster.java:629) at org.apache.hadoop.hbase.master.HMaster.access$500(HMaster.java:157) at org.apache.hadoop.hbase.master.HMaster$1.run(HMaster.java:1290) at 
java.lang.Thread.run(Thread.java:745) {code} Document and test rolling updates from 0.98 - 1.0 -- Key: HBASE-11164 URL: https://issues.apache.org/jira/browse/HBASE-11164 Project: HBase Issue Type: Sub-task Reporter: Enis Soztutar Assignee: stack Priority: Critical Fix For: 0.99.2 I think 1.0 should be rolling upgradable from 0.98 unless we break it intentionally for a specific reason. Unless there is such an issue, lets document that 1.0 and 0.98 should be rolling upgrade compatible. We should also test this before the 0.99 release. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12142) Truncate command does not preserve ACLs table
[ https://issues.apache.org/jira/browse/HBASE-12142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181695#comment-14181695 ] Andrew Purtell commented on HBASE-12142: bq. Let me upload a new patch. Thank you [~avandana] Truncate command does not preserve ACLs table - Key: HBASE-12142 URL: https://issues.apache.org/jira/browse/HBASE-12142 Project: HBase Issue Type: Bug Affects Versions: 0.98.6 Reporter: Vandana Ayyalasomayajula Assignee: Vandana Ayyalasomayajula Priority: Minor Labels: security Fix For: 2.0.0, 0.98.8, 0.99.2 Attachments: HBASE-12142_0.patch, HBASE-12142_1.patch, HBASE-12142_2.patch, HBASE-12142_98.patch, HBASE-12142_branch_1.patch, HBASE-12142_master_addendum.patch The current truncate command does not preserve acls on a table. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-11645) Snapshot for MOB
[ https://issues.apache.org/jira/browse/HBASE-11645?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181702#comment-14181702 ] Jonathan Hsieh commented on HBASE-11645: Hey folks, I'm going to commit to the branch even though I have some concerns with it because there is a lot of good stuff in here. We can fix the concerns in the branch. I'll file some follow on issues. (should use FileLink to handle read retry, tests have a lot of duplication and take a long time to run) Snapshot for MOB Key: HBASE-11645 URL: https://issues.apache.org/jira/browse/HBASE-11645 Project: HBase Issue Type: Sub-task Components: snapshots Reporter: Jingcheng Du Assignee: Jingcheng Du Attachments: HBASE-11645-V2.diff, HBASE-11645-V3.diff, HBASE-11645-V4.diff, HBASE-11645.diff Add snapshot support for MOB. In the initial implementation, taking a table snapshot does not preserve the mob data. This issue will make sure that when a snapshot is taken, mob data is properly preserved and is restorable. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (HBASE-12331) Shorten the mob snapshot unit tests
Jonathan Hsieh created HBASE-12331: -- Summary: Shorten the mob snapshot unit tests Key: HBASE-12331 URL: https://issues.apache.org/jira/browse/HBASE-12331 Project: HBase Issue Type: Sub-task Components: mob Affects Versions: hbase-11339 Reporter: Jonathan Hsieh The mob snapshot patch introduced a whole lot of tests that take a long time to run and would be better as integration tests. {code} --- T E S T S --- Running org.apache.hadoop.hbase.client.TestMobRestoreSnapshotFromClientWithRegionReplicas Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 394.803 sec - in org.apache.hadoop.hbase.client.TestMobRestoreSnapshotFromClientWithRegionReplicas Running org.apache.hadoop.hbase.client.TestMobRestoreSnapshotFromClient Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 212.377 sec - in org.apache.hadoop.hbase.client.TestMobRestoreSnapshotFromClient Running org.apache.hadoop.hbase.client.TestMobSnapshotFromClientWithRegionReplicas Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 49.463 sec - in org.apache.hadoop.hbase.client.TestMobSnapshotFromClientWithRegionReplicas Running org.apache.hadoop.hbase.client.TestMobSnapshotFromClient Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 46.724 sec - in org.apache.hadoop.hbase.client.TestMobSnapshotFromClient Running org.apache.hadoop.hbase.client.TestMobCloneSnapshotFromClient Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 204.03 sec - in org.apache.hadoop.hbase.client.TestMobCloneSnapshotFromClient Running org.apache.hadoop.hbase.client.TestMobCloneSnapshotFromClientWithRegionReplicas Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 214.052 sec - in org.apache.hadoop.hbase.client.TestMobCloneSnapshotFromClientWithRegionReplicas Running org.apache.hadoop.hbase.client.TestMobSnapshotCloneIndependence Tests run: 6, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 105.139 sec - in 
org.apache.hadoop.hbase.client.TestMobSnapshotCloneIndependence Running org.apache.hadoop.hbase.regionserver.TestMobStoreScanner Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 24.42 sec - in org.apache.hadoop.hbase.regionserver.TestMobStoreScanner Running org.apache.hadoop.hbase.regionserver.TestDeleteMobTable Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 18.136 sec - in org.apache.hadoop.hbase.regionserver.TestDeleteMobTable Running org.apache.hadoop.hbase.regionserver.TestHMobStore Tests run: 7, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 6.09 sec - in org.apache.hadoop.hbase.regionserver.TestHMobStore Running org.apache.hadoop.hbase.regionserver.TestMobCompaction Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 20.629 sec - in org.apache.hadoop.hbase.regionserver.TestMobCompaction Running org.apache.hadoop.hbase.mob.TestCachedMobFile Tests run: 3, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 5.301 sec - in org.apache.hadoop.hbase.mob.TestCachedMobFile Running org.apache.hadoop.hbase.mob.mapreduce.TestMobSweepJob Tests run: 3, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 14.752 sec - in org.apache.hadoop.hbase.mob.mapreduce.TestMobSweepJob Running org.apache.hadoop.hbase.mob.mapreduce.TestMobSweepReducer Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 16.276 sec - in org.apache.hadoop.hbase.mob.mapreduce.TestMobSweepReducer Running org.apache.hadoop.hbase.mob.mapreduce.TestMobSweepMapper Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 13.46 sec - in org.apache.hadoop.hbase.mob.mapreduce.TestMobSweepMapper Running org.apache.hadoop.hbase.mob.mapreduce.TestMobSweeper Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 173.05 sec - in org.apache.hadoop.hbase.mob.mapreduce.TestMobSweeper Running org.apache.hadoop.hbase.mob.TestMobDataBlockEncoding Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 20.86 sec - in 
org.apache.hadoop.hbase.mob.TestMobDataBlockEncoding Running org.apache.hadoop.hbase.mob.TestExpiredMobFileCleaner Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 16.029 sec - in org.apache.hadoop.hbase.mob.TestExpiredMobFileCleaner Running org.apache.hadoop.hbase.mob.TestMobFile Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 3.562 sec - in org.apache.hadoop.hbase.mob.TestMobFile Running org.apache.hadoop.hbase.mob.TestMobFileCache Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 5.173 sec - in org.apache.hadoop.hbase.mob.TestMobFileCache Running org.apache.hadoop.hbase.mob.TestDefaultMobStoreFlusher Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 16.586 sec - in org.apache.hadoop.hbase.mob.TestDefaultMobStoreFlusher Running
[jira] [Commented] (HBASE-12285) Builds are failing, possibly because of SUREFIRE-1091
[ https://issues.apache.org/jira/browse/HBASE-12285?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181712#comment-14181712 ] Elliott Clark commented on HBASE-12285: --- Do we know anyone in the surefire community? At FB we're running the snapshot version of surefire and it's fixed everything for us. Builds are failing, possibly because of SUREFIRE-1091 - Key: HBASE-12285 URL: https://issues.apache.org/jira/browse/HBASE-12285 Project: HBase Issue Type: Bug Affects Versions: 1.0.0 Reporter: Dima Spivak Assignee: Dima Spivak Priority: Blocker Attachments: HBASE-12285_branch-1_v1.patch Our branch-1 builds on builds.apache.org have been failing in recent days after we switched over to an official version of Surefire a few days back (HBASE-4955). The version we're using, 2.17, is hit by a bug ([SUREFIRE-1091|https://jira.codehaus.org/browse/SUREFIRE-1091]) that results in an IOException, which looks like what we're seeing on Jenkins. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (HBASE-12332) [mob] use filelink instead of retry when resolving an hfilelink.
Jonathan Hsieh created HBASE-12332: -- Summary: [mob] use filelink instead of retry when resolving an hfilelink. Key: HBASE-12332 URL: https://issues.apache.org/jira/browse/HBASE-12332 Project: HBase Issue Type: Sub-task Components: mob Affects Versions: hbase-11339 Reporter: Jonathan Hsieh In the snapshot code, HMobStore was modified to traverse an hfile link to a mob. Ideally this should use the transparent filelink code to read the data. Also there will likely be some issues with the mob file cache with these links. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
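HBase's FileLink works by trying a list of candidate physical locations in order (typically the data directory, then the archive) and transparently opening whichever exists. A simplified sketch of that resolution pattern; the existence check is injected so the sketch stays self-contained, and none of these names are the actual FileLink API:

```java
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.function.Predicate;

// Simplified illustration of the FileLink idea: a logical file backed by
// several candidate physical locations, resolved at access time.
class SimpleFileLink {
    private final Predicate<Path> exists;
    private final Path[] candidates;

    SimpleFileLink(Predicate<Path> exists, Path... candidates) {
        this.exists = exists;
        this.candidates = candidates;
    }

    // Return the first candidate that exists. A real implementation also
    // re-resolves on read failure, since a compaction can move the file
    // from the data directory to the archive between open and read --
    // that transparent retry is what HMobStore would get for free.
    Path resolve() {
        for (Path p : candidates) {
            if (exists.test(p)) {
                return p;
            }
        }
        throw new IllegalStateException("No link target exists");
    }
}
```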
[jira] [Updated] (HBASE-12142) Truncate command does not preserve ACLs table
[ https://issues.apache.org/jira/browse/HBASE-12142?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Vandana Ayyalasomayajula updated HBASE-12142: - Attachment: HBASE-12142_98_2.patch Patch without any changes to create and delete table handlers. Truncate command does not preserve ACLs table - Key: HBASE-12142 URL: https://issues.apache.org/jira/browse/HBASE-12142 Project: HBase Issue Type: Bug Affects Versions: 0.98.6 Reporter: Vandana Ayyalasomayajula Assignee: Vandana Ayyalasomayajula Priority: Minor Labels: security Fix For: 2.0.0, 0.98.8, 0.99.2 Attachments: HBASE-12142_0.patch, HBASE-12142_1.patch, HBASE-12142_2.patch, HBASE-12142_98.patch, HBASE-12142_98_2.patch, HBASE-12142_branch_1.patch, HBASE-12142_master_addendum.patch The current truncate command does not preserve acls on a table. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Resolved] (HBASE-11645) Snapshot for MOB
[ https://issues.apache.org/jira/browse/HBASE-11645?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Hsieh resolved HBASE-11645. Resolution: Fixed Fix Version/s: hbase-11339 Hadoop Flags: Reviewed Snapshot for MOB Key: HBASE-11645 URL: https://issues.apache.org/jira/browse/HBASE-11645 Project: HBase Issue Type: Sub-task Components: snapshots Reporter: Jingcheng Du Assignee: Jingcheng Du Fix For: hbase-11339 Attachments: HBASE-11645-V2.diff, HBASE-11645-V3.diff, HBASE-11645-V4.diff, HBASE-11645.diff Add snapshot support for MOB. In the initial implementation, taking a table snapshot does not preserve the mob data. This issue will make sure that when a snapshot is taken, mob data is properly preserved and is restorable. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12142) Truncate command does not preserve ACLs table
[ https://issues.apache.org/jira/browse/HBASE-12142?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181733#comment-14181733 ] Hadoop QA commented on HBASE-12142: --- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12676659/HBASE-12142_98_2.patch against trunk revision . ATTACHMENT ID: 12676659 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 12 new or modified tests. {color:red}-1 patch{color}. The patch command could not apply the patch. Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/11454//console This message is automatically generated. Truncate command does not preserve ACLs table - Key: HBASE-12142 URL: https://issues.apache.org/jira/browse/HBASE-12142 Project: HBase Issue Type: Bug Affects Versions: 0.98.6 Reporter: Vandana Ayyalasomayajula Assignee: Vandana Ayyalasomayajula Priority: Minor Labels: security Fix For: 2.0.0, 0.98.8, 0.99.2 Attachments: HBASE-12142_0.patch, HBASE-12142_1.patch, HBASE-12142_2.patch, HBASE-12142_98.patch, HBASE-12142_98_2.patch, HBASE-12142_branch_1.patch, HBASE-12142_master_addendum.patch The current truncate command does not preserve acls on a table. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12319) Inconsistencies during region recovery due to close/open of a region during recovery
[ https://issues.apache.org/jira/browse/HBASE-12319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181756#comment-14181756 ] Jeffrey Zhong commented on HBASE-12319: --- This issue is due to region opening being canceled while the AM doesn't wait for the cancel to complete and reassigns the region immediately, as shown in the following log lines. Therefore, the previous region open operation may overlap the new region assignment. This issue happens in 0.98 and branch-1. {noformat} hbase-hbase-master-hor9n01.gq1.ygridcore.net.log:Caused by: org.apache.hadoop.hbase.ipc.RemoteWithExtrasException(org.apache.hadoop.hbase.NotServingRegionException): org.apache.hadoop.hbase.NotServingRegionException: The region 51af4bd23dc32a940ad2dd5435f00e1d was opening but not yet served. Opening is cancelled. hbase-hbase-master-hor9n01.gq1.ygridcore.net.log:2014-10-14 13:45:30,564 INFO [AM.-pool1-t8] master.RegionStates: Transitioned {51af4bd23dc32a940ad2dd5435f00e1d state=OPENING, ts=1413294330350, server=hor9n10.gq1.ygridcore.net,60020,1413293516978} to {51af4bd23dc32a940ad2dd5435f00e1d state=OFFLINE, ts=1413294330564, server=hor9n10.gq1.ygridcore.net,60020,1413293516978} hbase-hbase-master-hor9n01.gq1.ygridcore.net.log:2014-10-14 13:45:30,566 DEBUG [AM.-pool1-t8] master.AssignmentManager: No previous transition plan found (or ignoring an existing plan) for IntegrationTestIngest,5994,1413293958381.51af4bd23dc32a940ad2dd5435f00e1d.; generated random plan=hri=IntegrationTestIngest,5994,1413293958381.51af4bd23dc32a940ad2dd5435f00e1d., src=, dest=hor9n01.gq1.ygridcore.net,60020,1413294323616; 4 (online=4, available=4) available servers, forceNewPlan=true hbase-hbase-master-hor9n01.gq1.ygridcore.net.log:2014-10-14 13:45:30,566 DEBUG [AM.-pool1-t8] zookeeper.ZKAssign: master:6-0x3490b3b07a1085e, quorum=hor9n08.gq1.ygridcore.net:2181,hor9n01.gq1.ygridcore.net:2181,hor9n10.gq1.ygridcore.net:2181, baseZNode=/hbase Creating (or updating) unassigned node 
51af4bd23dc32a940ad2dd5435f00e1d with OFFLINE state hbase-hbase-master-hor9n01.gq1.ygridcore.net.log:2014-10-14 13:45:30,589 INFO [AM.-pool1-t8] master.AssignmentManager: Assigning IntegrationTestIngest,5994,1413293958381.51af4bd23dc32a940ad2dd5435f00e1d. to hor9n01.gq1.ygridcore.net,60020,1413294323616 {noformat} Inconsistencies during region recovery due to close/open of a region during recovery Key: HBASE-12319 URL: https://issues.apache.org/jira/browse/HBASE-12319 Project: HBase Issue Type: Bug Reporter: Devaraj Das Assignee: Jeffrey Zhong In one of my test runs, I saw the following: {noformat} 2014-10-14 13:45:30,782 DEBUG [StoreOpener-51af4bd23dc32a940ad2dd5435f00e1d-1] regionserver.HStore: loaded hdfs://hor9n01.gq1.ygridcore.net:8020/apps/hbase/data/data/default/IntegrationTestIngest/51af4bd23dc32a940ad2dd5435f00e1d/test_cf/d6df5cfe15ca41d68c619489fbde4d04, isReference=false, isBulkLoadResult=false, seqid=141197, majorCompaction=true 2014-10-14 13:45:30,788 DEBUG [RS_OPEN_REGION-hor9n01:60020-1] regionserver.HRegion: Found 3 recovered edits file(s) under hdfs://hor9n01.gq1.ygridcore.net:8020/apps/hbase/data/data/default/IntegrationTestIngest/51af4bd23dc32a940ad2dd5435f00e1d . . 2014-10-14 13:45:31,916 WARN [RS_OPEN_REGION-hor9n01:60020-1] regionserver.HRegion: Null or non-existent edits file: hdfs://hor9n01.gq1.ygridcore.net:8020/apps/hbase/data/data/default/IntegrationTestIngest/51af4bd23dc32a940ad2dd5435f00e1d/recovered.edits/0198080 {noformat} The above logs is from a regionserver, say RS2. From the initial analysis it seemed like the master asked a certain regionserver to open the region (let's say RS1) and for some reason asked it to close soon after. The open was still proceeding on RS1 but the master reassigned the region to RS2. 
This also started the recovery but it ended up seeing an inconsistent view of the recovered-edits files (it reports missing files as per the logs above) since the first regionserver (RS1) deleted some files after it completed the recovery. When RS2 really opens the region, it might not see the recent data that was written by flushes on hor9n10 during the recovery process. Reads of that data would have inconsistencies. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HBASE-12319) Inconsistencies during region recovery due to close/open of a region during recovery
[ https://issues.apache.org/jira/browse/HBASE-12319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeffrey Zhong updated HBASE-12319: -- Affects Version/s: 0.98.7 0.99.1 Inconsistencies during region recovery due to close/open of a region during recovery Key: HBASE-12319 URL: https://issues.apache.org/jira/browse/HBASE-12319 Project: HBase Issue Type: Bug Affects Versions: 0.98.7, 0.99.1 Reporter: Devaraj Das Assignee: Jeffrey Zhong Attachments: HBASE-12319.patch In one of my test runs, I saw the following: {noformat} 2014-10-14 13:45:30,782 DEBUG [StoreOpener-51af4bd23dc32a940ad2dd5435f00e1d-1] regionserver.HStore: loaded hdfs://hor9n01.gq1.ygridcore.net:8020/apps/hbase/data/data/default/IntegrationTestIngest/51af4bd23dc32a940ad2dd5435f00e1d/test_cf/d6df5cfe15ca41d68c619489fbde4d04, isReference=false, isBulkLoadResult=false, seqid=141197, majorCompaction=true 2014-10-14 13:45:30,788 DEBUG [RS_OPEN_REGION-hor9n01:60020-1] regionserver.HRegion: Found 3 recovered edits file(s) under hdfs://hor9n01.gq1.ygridcore.net:8020/apps/hbase/data/data/default/IntegrationTestIngest/51af4bd23dc32a940ad2dd5435f00e1d . . 2014-10-14 13:45:31,916 WARN [RS_OPEN_REGION-hor9n01:60020-1] regionserver.HRegion: Null or non-existent edits file: hdfs://hor9n01.gq1.ygridcore.net:8020/apps/hbase/data/data/default/IntegrationTestIngest/51af4bd23dc32a940ad2dd5435f00e1d/recovered.edits/0198080 {noformat} The above logs is from a regionserver, say RS2. From the initial analysis it seemed like the master asked a certain regionserver to open the region (let's say RS1) and for some reason asked it to close soon after. The open was still proceeding on RS1 but the master reassigned the region to RS2. This also started the recovery but it ended up seeing an inconsistent view of the recovered-edits files (it reports missing files as per the logs above) since the first regionserver (RS1) deleted some files after it completed the recovery. 
When RS2 really opens the region, it might not see the recent data that was written by flushes on hor9n10 during the recovery process. Reads of that data would have inconsistencies. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HBASE-12319) Inconsistencies during region recovery due to close/open of a region during recovery
[ https://issues.apache.org/jira/browse/HBASE-12319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jeffrey Zhong updated HBASE-12319: -- Attachment: HBASE-12319.patch This patch lets the AM wait for the region open to be cancelled before re-assigning the region. [~jxiang] Could you please take a look? Thanks. Inconsistencies during region recovery due to close/open of a region during recovery Key: HBASE-12319 URL: https://issues.apache.org/jira/browse/HBASE-12319 Project: HBase Issue Type: Bug Reporter: Devaraj Das Assignee: Jeffrey Zhong Attachments: HBASE-12319.patch In one of my test runs, I saw the following: {noformat} 2014-10-14 13:45:30,782 DEBUG [StoreOpener-51af4bd23dc32a940ad2dd5435f00e1d-1] regionserver.HStore: loaded hdfs://hor9n01.gq1.ygridcore.net:8020/apps/hbase/data/data/default/IntegrationTestIngest/51af4bd23dc32a940ad2dd5435f00e1d/test_cf/d6df5cfe15ca41d68c619489fbde4d04, isReference=false, isBulkLoadResult=false, seqid=141197, majorCompaction=true 2014-10-14 13:45:30,788 DEBUG [RS_OPEN_REGION-hor9n01:60020-1] regionserver.HRegion: Found 3 recovered edits file(s) under hdfs://hor9n01.gq1.ygridcore.net:8020/apps/hbase/data/data/default/IntegrationTestIngest/51af4bd23dc32a940ad2dd5435f00e1d . . 2014-10-14 13:45:31,916 WARN [RS_OPEN_REGION-hor9n01:60020-1] regionserver.HRegion: Null or non-existent edits file: hdfs://hor9n01.gq1.ygridcore.net:8020/apps/hbase/data/data/default/IntegrationTestIngest/51af4bd23dc32a940ad2dd5435f00e1d/recovered.edits/0198080 {noformat} The above log is from a regionserver, say RS2. From the initial analysis it seemed like the master asked a certain regionserver to open the region (let's say RS1) and for some reason asked it to close soon after. The open was still proceeding on RS1 but the master reassigned the region to RS2. 
This also started the recovery but it ended up seeing an inconsistent view of the recovered-edits files (it reports missing files as per the logs above) since the first regionserver (RS1) deleted some files after it completed the recovery. When RS2 really opens the region, it might not see the recent data that was written by flushes on hor9n10 during the recovery process. Reads of that data would have inconsistencies. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
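The synchronization the patch describes, waiting for the region open to actually cancel before reassigning, can be sketched with a latch. This is an illustration of the idea only; the real change lives in the AssignmentManager and these names are hypothetical:

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.TimeUnit;

// Sketch: the master must not pick a new regionserver until the old open
// has fully unwound, otherwise two servers can process recovered.edits
// concurrently and one may delete files the other still expects to read.
class RegionAssignSketch {
    private final CountDownLatch openCancelled = new CountDownLatch(1);

    // Called by the regionserver side once it has fully abandoned the
    // open, including any recovered-edits handling.
    void onOpenCancelled() {
        openCancelled.countDown();
    }

    // Returns true only once the previous open is known to be cancelled;
    // until then the caller should retry later instead of racing.
    boolean reassign(long timeoutMs) {
        try {
            if (!openCancelled.await(timeoutMs, TimeUnit.MILLISECONDS)) {
                return false; // still opening; do not reassign yet
            }
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            return false;
        }
        return true; // safe to assign to a new regionserver
    }
}
```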
[jira] [Commented] (HBASE-12312) Another couple of createTable race conditions
[ https://issues.apache.org/jira/browse/HBASE-12312?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181808#comment-14181808 ] Hadoop QA commented on HBASE-12312: --- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12676635/HBASE-12312_master_v3%20%281%29.patch against trunk revision . ATTACHMENT ID: 12676635 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 15 new or modified tests. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages. {color:green}+1 checkstyle{color}. The applied patch does not increase the total number of checkstyle errors {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:red}-1 release audit{color}. The applied patch generated 1 release audit warnings (more than the trunk's current 0 warnings). {color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100 {color:green}+1 site{color}. The mvn site goal succeeds with this patch. {color:red}-1 core tests{color}. 
The patch failed these unit tests: org.apache.hadoop.hbase.replication.TestReplicationDisableInactivePeer Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/11453//testReport/ Release audit warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11453//artifact/patchprocess/patchReleaseAuditWarnings.txt Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11453//artifact/patchprocess/newPatchFindbugsWarningshbase-client.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11453//artifact/patchprocess/newPatchFindbugsWarningshbase-annotations.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11453//artifact/patchprocess/newPatchFindbugsWarningshbase-thrift.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11453//artifact/patchprocess/newPatchFindbugsWarningshbase-server.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11453//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11453//artifact/patchprocess/newPatchFindbugsWarningshbase-protocol.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11453//artifact/patchprocess/newPatchFindbugsWarningshbase-examples.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11453//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11453//artifact/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11453//artifact/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11453//artifact/patchprocess/newPatchFindbugsWarningshbase-rest.html Checkstyle Errors: 
https://builds.apache.org/job/PreCommit-HBASE-Build/11453//artifact/patchprocess/checkstyle-aggregate.html Console output: https://builds.apache.org/job/PreCommit-HBASE-Build/11453//console This message is automatically generated. Another couple of createTable race conditions - Key: HBASE-12312 URL: https://issues.apache.org/jira/browse/HBASE-12312 Project: HBase Issue Type: Bug Reporter: Dima Spivak Assignee: Dima Spivak Attachments: HBASE-12312_master_v1.patch, HBASE-12312_master_v2.patch, HBASE-12312_master_v3 (1).patch, HBASE-12312_master_v3.patch, HBASE-12312_master_v3.patch, HBASE-12312_master_v3.patch Found a couple more failing tests in TestAccessController and TestScanEarlyTermination caused by my favorite race condition. :) Will post a patch in a second. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-11964) Improve spreading replication load from failed regionservers
[ https://issues.apache.org/jira/browse/HBASE-11964?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181835#comment-14181835 ] Lars Hofhansl commented on HBASE-11964: --- +1 Improve spreading replication load from failed regionservers Key: HBASE-11964 URL: https://issues.apache.org/jira/browse/HBASE-11964 Project: HBase Issue Type: Sub-task Reporter: Andrew Purtell Assignee: Andrew Purtell Fix For: 2.0.0, 0.98.8, 0.94.25, 0.99.2 Attachments: HBASE-11964.patch, HBASE-11964.patch, HBASE-11964.patch Improve replication source thread handling. Improve fanout when transferring queues. Ensure replication sources terminate properly. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HBASE-8610) Introduce interfaces to support MultiWAL
[ https://issues.apache.org/jira/browse/HBASE-8610?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Busbey updated HBASE-8610: --- Resolution: Duplicate Fix Version/s: (was: 0.99.2) Assignee: (was: ramkrishna.s.vasudevan) Status: Resolved (was: Patch Available) Current version of HBASE-10378 provides all the updates needed to do multiwal. Introduce interfaces to support MultiWAL Key: HBASE-8610 URL: https://issues.apache.org/jira/browse/HBASE-8610 Project: HBase Issue Type: Improvement Components: wal Reporter: ramkrishna.s.vasudevan Attachments: HBASE-8610_firstcut.patch As the heading says this JIRA is specific to adding interfaces to support MultiWAL. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HBASE-10378) Divide HLog interface into User and Implementor specific interfaces
[ https://issues.apache.org/jira/browse/HBASE-10378?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Busbey updated HBASE-10378: Fix Version/s: 0.99.2 2.0.0 Status: Patch Available (was: In Progress) A patch is up on Review Board that keeps enough compatibility that I think it would work for branch-1. (ping [~enis]) The current set of changes to the API should be sufficient to allow multiple WALs per region server, so I subsumed HBASE-8610 and attached its last targeted version. Divide HLog interface into User and Implementor specific interfaces --- Key: HBASE-10378 URL: https://issues.apache.org/jira/browse/HBASE-10378 Project: HBase Issue Type: Sub-task Components: wal Reporter: Himanshu Vashishtha Assignee: Sean Busbey Fix For: 2.0.0, 0.99.2 Attachments: 10378-1.patch, 10378-2.patch HBASE-5937 introduced the HLog interface as a first step toward supporting multiple WAL implementations. This interface is a good start, but it has some limitations/drawbacks in its current state, such as: 1) There is no clear distinction between User and Implementor APIs; it provides APIs both for WAL users (append, sync, etc.) and for WAL implementors (Reader/Writer interfaces, etc.). Some APIs are very much implementation specific (getFileNum, etc.), and a user such as a RegionServer shouldn't know about them. 2) There are about 14 methods in FSHLog which are not present in the HLog interface but are used in several places in the unit test code. These tests typecast HLog to FSHLog, which makes it very difficult to test multiple WAL implementations without doing some ugly checks. I'd like to propose some changes to the HLog interface that would ease the multi-WAL story: 1) Have two interfaces, WAL and WALService. WAL provides APIs for implementors. WALService provides APIs for users (such as the RegionServer). 2) A skeleton implementation of the above two interfaces as the base class for other WAL implementations (AbstractWAL).
It provides the required fields for all subclasses (fs, conf, log dir, etc.). Define a minimal set of test-only methods and add them to AbstractWAL. 3) HLogFactory returns a WALService reference when creating a WAL instance; if a user needs to access impl-specific APIs (there are unit tests which get a WAL from an HRegionServer and then call impl-specific APIs), cast to AbstractWAL. 4) Make TestHLog abstract and let all implementors provide their respective test class which extends TestHLog (TestFSHLog, for example). -- This message was sent by Atlassian JIRA (v6.3.4#6332)
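The proposed user/implementor split can be sketched roughly as follows. This is a minimal illustration, not the actual patch: only the names WAL, WALService, AbstractWAL, append, and sync come from the proposal above; everything else (rollWriter's signature, the sequence-id bookkeeping, InMemoryWAL) is a stand-in for the sketch.

```java
// User-facing API: what a RegionServer programs against.
interface WALService {
    long append(byte[] entry);   // returns the sequence id assigned to the entry
    void sync();                 // make pending appends durable
}

// Implementor-facing API: adds hooks a WAL implementation needs
// but a user should never call directly.
interface WAL extends WALService {
    void rollWriter();           // implementation-specific maintenance (stand-in signature)
}

// Skeleton base class holding state common to all implementations
// (the real AbstractWAL would hold fs, conf, log dir, etc.).
abstract class AbstractWAL implements WAL {
    protected long nextSeqId = 1;
    protected boolean synced = false;

    @Override public long append(byte[] entry) { synced = false; return nextSeqId++; }
    @Override public void sync() { synced = true; }
}

// A trivial concrete implementation for the sketch.
class InMemoryWAL extends AbstractWAL {
    @Override public void rollWriter() { /* no-op in this sketch */ }
}
```

A factory would then hand callers a WALService reference, while tests that need implementor-only state cast down to AbstractWAL, which is exactly item 3 of the proposal.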
[jira] [Commented] (HBASE-5699) Run with 1 WAL in HRegionServer
[ https://issues.apache.org/jira/browse/HBASE-5699?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181895#comment-14181895 ] Sean Busbey commented on HBASE-5699: I have a patch implementing this on top of the refactoring in HBASE-10378. Any objections to me taking over the issue and posting it? Run with 1 WAL in HRegionServer - Key: HBASE-5699 URL: https://issues.apache.org/jira/browse/HBASE-5699 Project: HBase Issue Type: Improvement Components: Performance, wal Reporter: binlijin Assignee: Li Pi Priority: Critical Attachments: PerfHbase.txt -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12319) Inconsistencies during region recovery due to close/open of a region during recovery
[ https://issues.apache.org/jira/browse/HBASE-12319?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181901#comment-14181901 ] Jimmy Xiang commented on HBASE-12319: - +1. Looks good to me. Inconsistencies during region recovery due to close/open of a region during recovery Key: HBASE-12319 URL: https://issues.apache.org/jira/browse/HBASE-12319 Project: HBase Issue Type: Bug Affects Versions: 0.98.7, 0.99.1 Reporter: Devaraj Das Assignee: Jeffrey Zhong Attachments: HBASE-12319.patch In one of my test runs, I saw the following: {noformat} 2014-10-14 13:45:30,782 DEBUG [StoreOpener-51af4bd23dc32a940ad2dd5435f00e1d-1] regionserver.HStore: loaded hdfs://hor9n01.gq1.ygridcore.net:8020/apps/hbase/data/data/default/IntegrationTestIngest/51af4bd23dc32a940ad2dd5435f00e1d/test_cf/d6df5cfe15ca41d68c619489fbde4d04, isReference=false, isBulkLoadResult=false, seqid=141197, majorCompaction=true 2014-10-14 13:45:30,788 DEBUG [RS_OPEN_REGION-hor9n01:60020-1] regionserver.HRegion: Found 3 recovered edits file(s) under hdfs://hor9n01.gq1.ygridcore.net:8020/apps/hbase/data/data/default/IntegrationTestIngest/51af4bd23dc32a940ad2dd5435f00e1d . . 2014-10-14 13:45:31,916 WARN [RS_OPEN_REGION-hor9n01:60020-1] regionserver.HRegion: Null or non-existent edits file: hdfs://hor9n01.gq1.ygridcore.net:8020/apps/hbase/data/data/default/IntegrationTestIngest/51af4bd23dc32a940ad2dd5435f00e1d/recovered.edits/0198080 {noformat} The above log is from a regionserver, say RS2. From the initial analysis it seemed like the master asked a certain regionserver (let's say RS1) to open the region and for some reason asked it to close soon after. The open was still proceeding on RS1 but the master reassigned the region to RS2. RS2 also started recovery, but it ended up seeing an inconsistent view of the recovered-edits files (it reports missing files as per the logs above) since the first regionserver (RS1) deleted some files after it completed the recovery.
When RS2 really opens the region, it might not see the recent data that was written by flushes on hor9n10 during the recovery process. Reads of that data would have inconsistencies. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-11164) Document and test rolling updates from 0.98 - 1.0
[ https://issues.apache.org/jira/browse/HBASE-11164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14181911#comment-14181911 ] stack commented on HBASE-11164: --- Tried to restart the 0.96 master and got this: {code} 2014-10-23 11:19:32,251 FATAL [master-c2020.halxg.cloudera.com,6,1414088359278] master.HMaster: Unhandled exception. Starting shutdown. java.lang.NullPointerException at org.apache.hadoop.hbase.util.Bytes.toBytes(Bytes.java:441) at org.apache.hadoop.hbase.zookeeper.ClusterId.setClusterId(ClusterId.java:72) at org.apache.hadoop.hbase.master.HMaster.finishInitialization(HMaster.java:581) at org.apache.hadoop.hbase.master.HMaster.run(HMaster.java:433) at java.lang.Thread.run(Thread.java:745) {code} Second time around it worked. On a good note, the branch-1 shell works with 0.96 cluster though it can't 'see' the namespace table. Can scan hbase:meta but not hbase:namespace. Digging. Document and test rolling updates from 0.98 - 1.0 -- Key: HBASE-11164 URL: https://issues.apache.org/jira/browse/HBASE-11164 Project: HBase Issue Type: Sub-task Reporter: Enis Soztutar Assignee: stack Priority: Critical Fix For: 0.99.2 I think 1.0 should be rolling upgradable from 0.98 unless we break it intentionally for a specific reason. Unless there is such an issue, lets document that 1.0 and 0.98 should be rolling upgrade compatible. We should also test this before the 0.99 release. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-10780) HFilePrettyPrinter#processFile should return immediately if file does not exist.
[ https://issues.apache.org/jira/browse/HBASE-10780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182017#comment-14182017 ] Ted Yu commented on HBASE-10780: @Ashish: Mind attaching an up-to-date patch? HFilePrettyPrinter#processFile should return immediately if file does not exist. - Key: HBASE-10780 URL: https://issues.apache.org/jira/browse/HBASE-10780 Project: HBase Issue Type: Bug Components: HFile Affects Versions: 0.94.11 Reporter: Ashish Singhi Priority: Minor Attachments: HBASE-10780.patch HFilePrettyPrinter#processFile should return immediately if the file does not exist, like HLogPrettyPrinter#run does: {code} if (!fs.exists(file)) { System.err.println("ERROR, file doesn't exist: " + file); }{code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
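The requested fix is an early return plus a non-zero status when the file is missing. A self-contained sketch of that pattern follows; it uses java.nio.file instead of Hadoop's FileSystem, and the -2 status code is an assumption (the issue only says to pick a value distinct from the existing -1), so the shape is illustrative rather than the actual patch.

```java
import java.nio.file.Files;
import java.nio.file.Path;

class FilePrinterSketch {
    // Returns 0 on success and a negative code when the file is absent,
    // mirroring the early-return behavior requested for processFile.
    static int processFile(Path file) {
        if (!Files.exists(file)) {
            System.err.println("ERROR, file doesn't exist: " + file);
            return -2;  // hypothetical code, distinct from the -1 used for an invalid row
        }
        // ... the real method would pretty-print the HFile here ...
        return 0;
    }
}
```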
[jira] [Created] (HBASE-12333) Add Integration Test Running which is more friendly
Manukranth Kolloju created HBASE-12333: -- Summary: Add Integration Test Running which is more friendly Key: HBASE-12333 URL: https://issues.apache.org/jira/browse/HBASE-12333 Project: HBase Issue Type: New Feature Components: test Affects Versions: 2.0.0 Reporter: Manukranth Kolloju Assignee: Manukranth Kolloju Fix For: 2.0.0 This Jira is intended to add a Driver class which would run a list of integration tests on an actual HBase cluster, generate a machine-readable JSON results file, and put it on HDFS. The idea is to make it easy to run the driver class with a long list of appropriate command line params and wait for the JSON file on HDFS. This will help in plugging into external automation and makes it easier to maintain continuous integration scripts. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12322) Add clean up command to ITBLL
[ https://issues.apache.org/jira/browse/HBASE-12322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182137#comment-14182137 ] Manukranth Kolloju commented on HBASE-12322: +1, the change looks good. Looking forward to using this :) Add clean up command to ITBLL - Key: HBASE-12322 URL: https://issues.apache.org/jira/browse/HBASE-12322 Project: HBase Issue Type: Bug Components: test Affects Versions: 2.0.0 Reporter: Elliott Clark Assignee: Elliott Clark Attachments: HBASE-12322.patch Right now ITBLL can leave a table and some files on HDFS. It's then up to the user to clean them up. This can be a little messy. Let's give a single command to do that. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-10780) HFilePrettyPrinter#processFile should return immediately if file does not exist.
[ https://issues.apache.org/jira/browse/HBASE-10780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182174#comment-14182174 ] Hadoop QA commented on HBASE-10780: --- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12635240/HBASE-10780.patch against trunk revision . ATTACHMENT ID: 12635240 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:red}-1 tests included{color}. The patch doesn't appear to include any new or modified tests. Please justify why no new tests are needed for this patch. Also please list what manual steps were performed to verify this patch. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages. {color:green}+1 checkstyle{color}. The applied patch does not increase the total number of checkstyle errors {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:red}-1 release audit{color}. The applied patch generated 1 release audit warnings (more than the trunk's current 0 warnings). {color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100 {color:green}+1 site{color}. The mvn site goal succeeds with this patch. {color:green}+1 core tests{color}. The patch passed unit tests in . 
Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/11456//testReport/ Release audit warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11456//artifact/patchprocess/patchReleaseAuditWarnings.txt Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11456//artifact/patchprocess/newPatchFindbugsWarningshbase-protocol.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11456//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11456//artifact/patchprocess/newPatchFindbugsWarningshbase-thrift.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11456//artifact/patchprocess/newPatchFindbugsWarningshbase-server.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11456//artifact/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11456//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11456//artifact/patchprocess/newPatchFindbugsWarningshbase-rest.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11456//artifact/patchprocess/newPatchFindbugsWarningshbase-examples.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11456//artifact/patchprocess/newPatchFindbugsWarningshbase-client.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11456//artifact/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11456//artifact/patchprocess/newPatchFindbugsWarningshbase-annotations.html Checkstyle Errors: https://builds.apache.org/job/PreCommit-HBASE-Build/11456//artifact/patchprocess/checkstyle-aggregate.html Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/11456//console This message is automatically generated. HFilePrettyPrinter#processFile should return immediately if file does not exists. - Key: HBASE-10780 URL: https://issues.apache.org/jira/browse/HBASE-10780 Project: HBase Issue Type: Bug Components: HFile Affects Versions: 0.94.11 Reporter: Ashish Singhi Priority: Minor Attachments: HBASE-10780.patch HFilePrettyPrinter#processFile should return immediately if file does not exists same like HLogPrettyPrinter#run {code} if (!fs.exists(file)) { System.err.println(ERROR, file doesnt exist: + file); }{code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-10780) HFilePrettyPrinter#processFile should return immediately if file does not exist.
[ https://issues.apache.org/jira/browse/HBASE-10780?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182180#comment-14182180 ] Ted Yu commented on HBASE-10780: Currently we have the following when checking option 'w': {code} System.err.println("Invalid row is specified."); System.exit(-1); {code} You can use a different negative value. HFilePrettyPrinter#processFile should return immediately if file does not exist. - Key: HBASE-10780 URL: https://issues.apache.org/jira/browse/HBASE-10780 Project: HBase Issue Type: Bug Components: HFile Affects Versions: 0.94.11 Reporter: Ashish Singhi Priority: Minor Attachments: HBASE-10780.patch HFilePrettyPrinter#processFile should return immediately if the file does not exist, like HLogPrettyPrinter#run does: {code} if (!fs.exists(file)) { System.err.println("ERROR, file doesn't exist: " + file); }{code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12277) Refactor bulkLoad methods in AccessController to its own interface
[ https://issues.apache.org/jira/browse/HBASE-12277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182211#comment-14182211 ] Jeffrey Zhong commented on HBASE-12277: --- +1. Looks good to me. Please fix the javadoc warnings and make sure the unit tests pass. Thanks. Refactor bulkLoad methods in AccessController to its own interface -- Key: HBASE-12277 URL: https://issues.apache.org/jira/browse/HBASE-12277 Project: HBase Issue Type: Bug Reporter: Madhan Neethiraj Attachments: 0001-HBASE-12277-Refactored-bulk-load-methods-from-Access.patch, 0002-HBASE-12277-License-text-added-to-the-newly-created-.patch, HBASE-12277-v2.patch, HBASE-12277-v3.patch, HBASE-12277.patch SecureBulkLoadEndPoint references a couple of methods, prePrepareBulkLoad() and preCleanupBulkLoad(), implemented in AccessController, i.e. there is direct coupling between the AccessController and SecureBulkLoadEndPoint classes. SecureBulkLoadEndPoint assumes the presence of AccessController in a secure cluster. If HBase is configured with another coprocessor for access control, SecureBulkLoadEndPoint fails with an NPE. To remove this direct coupling, the bulk-load related methods in AccessController should be refactored into an interface, and AccessController should implement this interface. SecureBulkLoadEndPoint should then look for coprocessors that implement this interface, instead of directly looking for AccessController. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
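The decoupling described above follows a standard pattern: extract the two bulk-load hooks into a small interface, have AccessController implement it, and have the endpoint scan loaded coprocessors for any implementor rather than for the concrete class. A toy, self-contained sketch of that pattern, assuming simplified signatures (the class names ToyAccessController and ToyBulkLoadEndpoint and the String parameter are invented for the example; they are not the actual HBase patch):

```java
import java.util.ArrayList;
import java.util.List;

// The extracted interface: only the hooks the endpoint needs.
interface BulkLoadObserver {
    void prePrepareBulkLoad(String table);
    void preCleanupBulkLoad(String table);
}

// One access-control implementation among possibly many.
class ToyAccessController implements BulkLoadObserver {
    final List<String> calls = new ArrayList<>();
    @Override public void prePrepareBulkLoad(String table) { calls.add("prepare:" + table); }
    @Override public void preCleanupBulkLoad(String table) { calls.add("cleanup:" + table); }
}

class ToyBulkLoadEndpoint {
    // Instead of looking for a concrete AccessController, scan the
    // loaded coprocessors for anything implementing the interface.
    static List<BulkLoadObserver> findObservers(List<Object> coprocessors) {
        List<BulkLoadObserver> found = new ArrayList<>();
        for (Object cp : coprocessors) {
            if (cp instanceof BulkLoadObserver) {
                found.add((BulkLoadObserver) cp);
            }
        }
        return found;
    }
}
```

With this shape, a deployment that swaps in a different access-control coprocessor keeps working as long as that coprocessor implements the interface, which is exactly the NPE scenario the report describes.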
[jira] [Commented] (HBASE-12333) Add Integration Test Running which is more friendly
[ https://issues.apache.org/jira/browse/HBASE-12333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182227#comment-14182227 ] Nick Dimiduk commented on HBASE-12333: -- Have a look at the existing tool: IntegrationTestsDriver. Also: HBASE-12262 Add Integration Test Running which is more friendly --- Key: HBASE-12333 URL: https://issues.apache.org/jira/browse/HBASE-12333 Project: HBase Issue Type: New Feature Components: test Affects Versions: 2.0.0 Reporter: Manukranth Kolloju Assignee: Manukranth Kolloju Fix For: 2.0.0 Original Estimate: 48h Remaining Estimate: 48h This Jira is intended to add a Driver class which would run a list of Integration tests on an actual hbase cluster. And generate a machine readable results file in JSON and put it on HDFS. The idea is to make it easy to run driver class using a long list of appropriate command line params and wait for the JSON file on the HDFS. This will help in plugging into external automation and makes it easier to maintain Continuous Integration Scripts. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12333) Add Integration Test Running which is more friendly
[ https://issues.apache.org/jira/browse/HBASE-12333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182241#comment-14182241 ] Manukranth Kolloju commented on HBASE-12333: The existing test runner doesn't take any configuration parameters to customize the way the jobs can be run. Also, we're losing a lot of information and finally returning only a 0 or 1. So I am planning to enhance the existing tool to support some of the things that I mentioned above. Add Integration Test Running which is more friendly --- Key: HBASE-12333 URL: https://issues.apache.org/jira/browse/HBASE-12333 Project: HBase Issue Type: New Feature Components: test Affects Versions: 2.0.0 Reporter: Manukranth Kolloju Assignee: Manukranth Kolloju Fix For: 2.0.0 Original Estimate: 48h Remaining Estimate: 48h This Jira is intended to add a Driver class which would run a list of integration tests on an actual HBase cluster, generate a machine-readable JSON results file, and put it on HDFS. The idea is to make it easy to run the driver class with a long list of appropriate command line params and wait for the JSON file on HDFS. This will help in plugging into external automation and makes it easier to maintain continuous integration scripts. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
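The issue does not specify a results format beyond "machine readable" and "JSON", but the file the runner would drop on HDFS might look something like the following. Every field name here is hypothetical, purely to illustrate what "not losing information to a 0 or 1 exit code" could mean:

```json
{
  "cluster": "test-cluster-1",
  "startedAt": "2014-10-24T10:00:00Z",
  "tests": [
    { "name": "IntegrationTestBigLinkedList", "status": "PASSED", "durationSec": 3812 },
    { "name": "IntegrationTestIngest", "status": "FAILED", "durationSec": 940, "reason": "verification mismatch" }
  ]
}
```

An external automation job could then poll HDFS for this file and report per-test results instead of a single pass/fail bit.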
[jira] [Updated] (HBASE-12333) Add Integration Test Runner which is more friendly
[ https://issues.apache.org/jira/browse/HBASE-12333?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Manukranth Kolloju updated HBASE-12333: --- Summary: Add Integration Test Runner which is more friendly (was: Add Integration Test Running which is more friendly) Add Integration Test Runner which is more friendly -- Key: HBASE-12333 URL: https://issues.apache.org/jira/browse/HBASE-12333 Project: HBase Issue Type: New Feature Components: test Affects Versions: 2.0.0 Reporter: Manukranth Kolloju Assignee: Manukranth Kolloju Fix For: 2.0.0 Original Estimate: 48h Remaining Estimate: 48h This Jira is intended to add a Driver class which would run a list of Integration tests on an actual hbase cluster. And generate a machine readable results file in JSON and put it on HDFS. The idea is to make it easy to run driver class using a long list of appropriate command line params and wait for the JSON file on the HDFS. This will help in plugging into external automation and makes it easier to maintain Continuous Integration Scripts. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HBASE-12328) Need to separate JvmMetrics for Master and RegionServer
[ https://issues.apache.org/jira/browse/HBASE-12328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sanghyun Yun updated HBASE-12328: - Attachment: HBASE-12328.2.patch Thanks for your review, [~eclark]. I added test code. Need to separate JvmMetrics for Master and RegionServer --- Key: HBASE-12328 URL: https://issues.apache.org/jira/browse/HBASE-12328 Project: HBase Issue Type: Improvement Reporter: Sanghyun Yun Priority: Minor Attachments: HBASE-12328.2.patch, HBASE-12328.patch tag.ProcessName of JvmMetrics is IPC. It is the same for both Master and RegionServer. {code:title=HBase(Master and RegionServer)'s Metrics Dump} ... name: Hadoop:service=HBase,name=JvmMetrics, modelerType: JvmMetrics, tag.Context: jvm, tag.ProcessName: IPC, tag.SessionId: , ... {code} When I use HBase with Ganglia, I set tagsForPrefix.jvm=ProcessName in hadoop-metrics2-hbase.properties. {code:title=hadoop-metrics2-hbase.properties} ... *.sink.ganglia.class=org.apache.hadoop.metrics2.sink.ganglia.GangliaSink31 hbase.sink.ganglia.tagsForPrefix.jvm=ProcessName ... {code} But Ganglia generates only one RRD file because tag.ProcessName is IPC for both Master and RegionServer. I think we need to separate JvmMetrics for Master and RegionServer. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12328) Need to separate JvmMetrics for Master and RegionServer
[ https://issues.apache.org/jira/browse/HBASE-12328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182263#comment-14182263 ] Sanghyun Yun commented on HBASE-12328: -- Thanks for your review, [~stack]. I think it's same issue. It will be changed to Hadoop:service=HBase,name=Master,sub=IPC. {code} name: Hadoop:service=HBase,name=Master,sub=IPC, modelerType: Master,sub=IPC, tag.Context: master, {code} Need to separate JvmMetrics for Master and RegionServer --- Key: HBASE-12328 URL: https://issues.apache.org/jira/browse/HBASE-12328 Project: HBase Issue Type: Improvement Reporter: Sanghyun Yun Priority: Minor Attachments: HBASE-12328.2.patch, HBASE-12328.patch tag.ProcessName of JvmMetrics is IPC. It is same both Master and RegionServer. {code:title=HBase(Master and RegionServer)'s Metrics Dump} ... name: Hadoop:service=HBase,name=JvmMetrics, modelerType: JvmMetrics, tag.Context: jvm, tag.ProcessName: IPC, tag.SessionId: , ... {code} When I use HBase with Ganglia, I wrote tagsForPrefix.jvm=ProcessName in hadoop-metrics2-hbase.properties. {code:title=hadoop-metrics2-hbase.properties} ... *.sink.ganglia.class=org.apache.hadoop.metrics2.sink.ganglia.GangliaSink31 hbase.sink.ganglia.tagsForPrefix.jvm=ProcessName ... {code} But, Ganglia generate only one RRD file because tag.ProcessName is IPC both Master and Regionserver. I think it need to separate JvmMetrics for Master and RegionServer. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (HBASE-12334) Handling of DeserializationException causes needless retry on failure
James Taylor created HBASE-12334: Summary: Handling of DeserializationException causes needless retry on failure Key: HBASE-12334 URL: https://issues.apache.org/jira/browse/HBASE-12334 Project: HBase Issue Type: Bug Affects Versions: 0.98.7 Reporter: James Taylor If an unexpected exception occurs while deserializing a custom filter, the exception gets wrapped in a DeserializationException. Since this exception is in turn wrapped in an IOException, the client's retry loop kicks in. The net effect is that the same deserialization error occurs again and again as the retries happen, just making the client wait needlessly. IMO, either the parseFrom methods should be allowed to throw whatever type of IOException they'd like, in which case they could throw a DoNotRetryIOException, or a DeserializationException should be wrapped in a DoNotRetryIOException. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
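The two options in the report amount to choosing where the non-retriable marker is attached. A self-contained sketch of the second option, wrapping the deserialization failure in a non-retriable exception type, using stand-in classes (HBase's real classes are DeserializationException and DoNotRetryIOException; the Toy* classes below only mimic the hierarchy, and the parse logic is a placeholder):

```java
import java.io.IOException;

// Stand-ins that mimic the HBase exception hierarchy for illustration.
class ToyDeserializationException extends Exception {
    ToyDeserializationException(String msg) { super(msg); }
}
class ToyDoNotRetryIOException extends IOException {
    ToyDoNotRetryIOException(Throwable cause) { super(cause); }
}

class FilterParser {
    // Wrap parse failures in a non-retriable IOException so a retry loop
    // gives up immediately instead of replaying the same deterministic
    // failure on every attempt.
    static byte[] parseFrom(byte[] bytes) throws IOException {
        try {
            if (bytes == null || bytes.length == 0) {
                throw new ToyDeserializationException("empty filter bytes");
            }
            return bytes; // real code would deserialize a Filter here
        } catch (ToyDeserializationException e) {
            throw new ToyDoNotRetryIOException(e);
        }
    }
}
```

A retry loop that treats the non-retriable subtype specially then fails fast, which is the behavior the report asks for.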
[jira] [Commented] (HBASE-12332) [mob] use filelink instead of retry when resolving an hfilelink.
[ https://issues.apache.org/jira/browse/HBASE-12332?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182317#comment-14182317 ] Jingcheng Du commented on HBASE-12332: -- Do we need to do this for snapshots? Actually, in the mob code the hfilelinks are not used in the read path. After a snapshot, the mob read path has four candidate paths (mob working, mob archive, source mob working, and source mob archive), and these paths are known without parsing the hfilelink. So is it necessary to detour through the hfilelink? One more thing: HFileLink can only resolve a file name to an HBase working/archive dir; letting it resolve a file name to a mob dir is a new problem. Please advise. Thanks. [mob] use filelink instead of retry when resolving an hfilelink. --- Key: HBASE-12332 URL: https://issues.apache.org/jira/browse/HBASE-12332 Project: HBase Issue Type: Sub-task Components: mob Affects Versions: hbase-11339 Reporter: Jonathan Hsieh Fix For: hbase-11339 In the snapshot code, HMobStore was modified to traverse an hfile link to a mob. Ideally this should use the transparent filelink code to read the data. Also, there will likely be some issues with the mob file cache with these links. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12334) Handling of DeserializationException causes needless retry on failure
[ https://issues.apache.org/jira/browse/HBASE-12334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182321#comment-14182321 ] Lars Hofhansl commented on HBASE-12334: --- I agree. Do you have a stack trace? I find it a bit hard to find the exact place where the DeserializationException is wrapped. Handling of DeserializationException causes needless retry on failure - Key: HBASE-12334 URL: https://issues.apache.org/jira/browse/HBASE-12334 Project: HBase Issue Type: Bug Affects Versions: 0.98.7 Reporter: James Taylor Labels: Phoenix If an unexpected exception occurs when deserialization occurs for a custom filter, the exception gets wrapped in a DeserializationException. Since this exception is in turn wrapped in an IOException, the many loop retry logic kicks in. The net effect is that this same deserialization error occurs again and again as the retries occur, just causing the client to wait needlessly. IMO, either the parseFrom methods should be allowed to throw whatever type of IOException they'd like, in which case they could throw a DoNotRetryIOException, or a DeserializationException should be wrapped in a DoNotRetryIOException. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Created] (HBASE-12335) IntegrationTestRegionReplicaPerf is flakey
Nick Dimiduk created HBASE-12335: Summary: IntegrationTestRegionReplicaPerf is flakey Key: HBASE-12335 URL: https://issues.apache.org/jira/browse/HBASE-12335 Project: HBase Issue Type: Test Components: test Reporter: Nick Dimiduk I find that this test often fails; the assertion that running with read replicas should complete faster than without is usually false. I need to investigate further as to why this is the case and how we should tune it. In the mean time, I'd like to change the test to assert instead on the average of the stdev across all the test runs in each category. Meaning, enabling this feature should reduce the overall latency variance experienced by the client. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12335) IntegrationTestRegionReplicaPerf is flakey
[ https://issues.apache.org/jira/browse/HBASE-12335?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182324#comment-14182324 ] Nick Dimiduk commented on HBASE-12335: -- It *should* also be the case that the average of the 99.9pct latency should be lower with this feature enabled. I'm not seeing this consistently, at least with a cluster of 3 RS's with a region replica factor of 3. IntegrationTestRegionReplicaPerf is flakey -- Key: HBASE-12335 URL: https://issues.apache.org/jira/browse/HBASE-12335 Project: HBase Issue Type: Test Components: test Reporter: Nick Dimiduk I find that this test often fails; the assertion that running with read replicas should complete faster than without is usually false. I need to investigate further as to why this is the case and how we should tune it. In the mean time, I'd like to change the test to assert instead on the average of the stdev across all the test runs in each category. Meaning, enabling this feature should reduce the overall latency variance experienced by the client. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
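The proposed assertion, comparing the mean of per-run standard deviations between the replica-enabled and baseline categories, can be sketched in plain Java. This is a sketch of the statistic only, not the actual test code; the class and method names are invented:

```java
class VarianceCheck {
    // Population standard deviation of one run's latency samples.
    static double stdev(double[] xs) {
        double mean = 0;
        for (double x : xs) mean += x;
        mean /= xs.length;
        double ss = 0;
        for (double x : xs) ss += (x - mean) * (x - mean);
        return Math.sqrt(ss / xs.length);
    }

    // Mean of the per-run stdevs across all runs in one category.
    static double meanOfStdevs(double[][] runs) {
        double sum = 0;
        for (double[] run : runs) sum += stdev(run);
        return sum / runs.length;
    }
}
```

The test would then assert that meanOfStdevs over the replica-enabled runs is below the same statistic over the baseline runs, i.e. that the feature reduces latency variance rather than raw latency.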
[jira] [Updated] (HBASE-12335) IntegrationTestRegionReplicaPerf is flaky
[ https://issues.apache.org/jira/browse/HBASE-12335?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Dimiduk updated HBASE-12335: - Summary: IntegrationTestRegionReplicaPerf is flaky (was: IntegrationTestRegionReplicaPerf is flakey) IntegrationTestRegionReplicaPerf is flaky - Key: HBASE-12335 URL: https://issues.apache.org/jira/browse/HBASE-12335 Project: HBase Issue Type: Test Components: test Reporter: Nick Dimiduk I find that this test often fails; the assertion that running with read replicas should complete faster than without is usually false. I need to investigate further as to why this is the case and how we should tune it. In the mean time, I'd like to change the test to assert instead on the average of the stdev across all the test runs in each category. Meaning, enabling this feature should reduce the overall latency variance experienced by the client. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-11164) Document and test rolling updates from 0.98 - 1.0
[ https://issues.apache.org/jira/browse/HBASE-11164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182330#comment-14182330 ] stack commented on HBASE-11164: --- There is no support in 0.96 for HBASE-4811 (Support reverse Scan); it was backported to 0.94 but not to 0.96. HBASE-10018 (Remove region location prefetching) added code that uses reverse scanning to do lookups against meta. This means we cannot do a rolling restart from 0.96 to 1.0. Let me add a note to the refguide. Document and test rolling updates from 0.98 - 1.0 -- Key: HBASE-11164 URL: https://issues.apache.org/jira/browse/HBASE-11164 Project: HBase Issue Type: Sub-task Reporter: Enis Soztutar Assignee: stack Priority: Critical Fix For: 0.99.2 I think 1.0 should be rolling upgradable from 0.98 unless we break it intentionally for a specific reason. Unless there is such an issue, let's document that 1.0 and 0.98 should be rolling upgrade compatible. We should also test this before the 0.99 release. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12328) Need to separate JvmMetrics for Master and RegionServer
[ https://issues.apache.org/jira/browse/HBASE-12328?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182340#comment-14182340 ] Hadoop QA commented on HBASE-12328: --- {color:red}-1 overall{color}. Here are the results of testing the latest attachment http://issues.apache.org/jira/secure/attachment/12676799/HBASE-12328.2.patch against trunk revision . ATTACHMENT ID: 12676799 {color:green}+1 @author{color}. The patch does not contain any @author tags. {color:green}+1 tests included{color}. The patch appears to include 3 new or modified tests. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javac{color}. The applied patch does not increase the total number of javac compiler warnings. {color:green}+1 javadoc{color}. The javadoc tool did not generate any warning messages. {color:green}+1 checkstyle{color}. The applied patch does not increase the total number of checkstyle errors {color:green}+1 findbugs{color}. The patch does not introduce any new Findbugs (version 2.0.3) warnings. {color:red}-1 release audit{color}. The applied patch generated 1 release audit warnings (more than the trunk's current 0 warnings). {color:green}+1 lineLengths{color}. The patch does not introduce lines longer than 100 {color:green}+1 site{color}. The mvn site goal succeeds with this patch. {color:green}+1 core tests{color}. The patch passed unit tests in . 
Test results: https://builds.apache.org/job/PreCommit-HBASE-Build/11457//testReport/ Release audit warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11457//artifact/patchprocess/patchReleaseAuditWarnings.txt Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11457//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11457//artifact/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11457//artifact/patchprocess/newPatchFindbugsWarningshbase-examples.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11457//artifact/patchprocess/newPatchFindbugsWarningshbase-server.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11457//artifact/patchprocess/newPatchFindbugsWarningshbase-common.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11457//artifact/patchprocess/newPatchFindbugsWarningshbase-rest.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11457//artifact/patchprocess/newPatchFindbugsWarningshbase-protocol.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11457//artifact/patchprocess/newPatchFindbugsWarningshbase-client.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11457//artifact/patchprocess/newPatchFindbugsWarningshbase-thrift.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11457//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html Findbugs warnings: https://builds.apache.org/job/PreCommit-HBASE-Build/11457//artifact/patchprocess/newPatchFindbugsWarningshbase-annotations.html Checkstyle Errors: https://builds.apache.org/job/PreCommit-HBASE-Build/11457//artifact/patchprocess/checkstyle-aggregate.html Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/11457//console This message is automatically generated. Need to separate JvmMetrics for Master and RegionServer --- Key: HBASE-12328 URL: https://issues.apache.org/jira/browse/HBASE-12328 Project: HBase Issue Type: Improvement Reporter: Sanghyun Yun Priority: Minor Attachments: HBASE-12328.2.patch, HBASE-12328.patch tag.ProcessName of JvmMetrics is IPC. It is the same for both Master and RegionServer. {code:title=HBase(Master and RegionServer)'s Metrics Dump} ... name: Hadoop:service=HBase,name=JvmMetrics, modelerType: JvmMetrics, tag.Context: jvm, tag.ProcessName: IPC, tag.SessionId: , ... {code} When I use HBase with Ganglia, I set tagsForPrefix.jvm=ProcessName in hadoop-metrics2-hbase.properties. {code:title=hadoop-metrics2-hbase.properties} ... *.sink.ganglia.class=org.apache.hadoop.metrics2.sink.ganglia.GangliaSink31 hbase.sink.ganglia.tagsForPrefix.jvm=ProcessName ... {code} But Ganglia generates only one RRD file because tag.ProcessName is IPC for both Master and RegionServer. I think we need to separate JvmMetrics for Master and RegionServer. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12331) Shorten the mob snapshot unit tests
[ https://issues.apache.org/jira/browse/HBASE-12331?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182349#comment-14182349 ] Li Jiajia commented on HBASE-12331: --- [~jmhsieh] Should all the snapshot tests become integration tests, or only some of them (the UTs that take more than 100s)? Shorten the mob snapshot unit tests --- Key: HBASE-12331 URL: https://issues.apache.org/jira/browse/HBASE-12331 Project: HBase Issue Type: Sub-task Components: mob Affects Versions: hbase-11339 Reporter: Jonathan Hsieh Fix For: hbase-11339 The mob snapshot patch introduced a whole lot of tests that take a long time to run and would be better as integration tests. {code} --- T E S T S --- Running org.apache.hadoop.hbase.client.TestMobRestoreSnapshotFromClientWithRegionReplicas Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 394.803 sec - in org.apache.hadoop.hbase.client.TestMobRestoreSnapshotFromClientWithRegionReplicas Running org.apache.hadoop.hbase.client.TestMobRestoreSnapshotFromClient Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 212.377 sec - in org.apache.hadoop.hbase.client.TestMobRestoreSnapshotFromClient Running org.apache.hadoop.hbase.client.TestMobSnapshotFromClientWithRegionReplicas Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 49.463 sec - in org.apache.hadoop.hbase.client.TestMobSnapshotFromClientWithRegionReplicas Running org.apache.hadoop.hbase.client.TestMobSnapshotFromClient Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 46.724 sec - in org.apache.hadoop.hbase.client.TestMobSnapshotFromClient Running org.apache.hadoop.hbase.client.TestMobCloneSnapshotFromClient Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 204.03 sec - in org.apache.hadoop.hbase.client.TestMobCloneSnapshotFromClient Running org.apache.hadoop.hbase.client.TestMobCloneSnapshotFromClientWithRegionReplicas Tests run: 5, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 214.052 sec - in
org.apache.hadoop.hbase.client.TestMobCloneSnapshotFromClientWithRegionReplicas Running org.apache.hadoop.hbase.client.TestMobSnapshotCloneIndependence Tests run: 6, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 105.139 sec - in org.apache.hadoop.hbase.client.TestMobSnapshotCloneIndependence Running org.apache.hadoop.hbase.regionserver.TestMobStoreScanner Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 24.42 sec - in org.apache.hadoop.hbase.regionserver.TestMobStoreScanner Running org.apache.hadoop.hbase.regionserver.TestDeleteMobTable Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 18.136 sec - in org.apache.hadoop.hbase.regionserver.TestDeleteMobTable Running org.apache.hadoop.hbase.regionserver.TestHMobStore Tests run: 7, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 6.09 sec - in org.apache.hadoop.hbase.regionserver.TestHMobStore Running org.apache.hadoop.hbase.regionserver.TestMobCompaction Tests run: 4, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 20.629 sec - in org.apache.hadoop.hbase.regionserver.TestMobCompaction Running org.apache.hadoop.hbase.mob.TestCachedMobFile Tests run: 3, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 5.301 sec - in org.apache.hadoop.hbase.mob.TestCachedMobFile Running org.apache.hadoop.hbase.mob.mapreduce.TestMobSweepJob Tests run: 3, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 14.752 sec - in org.apache.hadoop.hbase.mob.mapreduce.TestMobSweepJob Running org.apache.hadoop.hbase.mob.mapreduce.TestMobSweepReducer Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 16.276 sec - in org.apache.hadoop.hbase.mob.mapreduce.TestMobSweepReducer Running org.apache.hadoop.hbase.mob.mapreduce.TestMobSweepMapper Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 13.46 sec - in org.apache.hadoop.hbase.mob.mapreduce.TestMobSweepMapper Running org.apache.hadoop.hbase.mob.mapreduce.TestMobSweeper Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 173.05 sec 
- in org.apache.hadoop.hbase.mob.mapreduce.TestMobSweeper Running org.apache.hadoop.hbase.mob.TestMobDataBlockEncoding Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 20.86 sec - in org.apache.hadoop.hbase.mob.TestMobDataBlockEncoding Running org.apache.hadoop.hbase.mob.TestExpiredMobFileCleaner Tests run: 1, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 16.029 sec - in org.apache.hadoop.hbase.mob.TestExpiredMobFileCleaner Running org.apache.hadoop.hbase.mob.TestMobFile Tests run: 2, Failures: 0, Errors: 0, Skipped: 0, Time elapsed: 3.562 sec - in org.apache.hadoop.hbase.mob.TestMobFile Running
[jira] [Commented] (HBASE-12334) Handling of DeserializationException causes needless retry on failure
[ https://issues.apache.org/jira/browse/HBASE-12334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182371#comment-14182371 ] James Taylor commented on HBASE-12334: -- Not sure which one you want - I'm tracing it in a debugger. There's enough throwing and catching there to make the Giants proud! :-) Here's a few of them: {code} Daemon Thread [defaultRpcServer.handler=4,queue=0,port=50594] (Suspended) ProtobufUtil.toFilter(FilterProtos$Filter) line: 1362 FilterList.parseFrom(byte[]) line: 403 NativeMethodAccessorImpl.invoke0(Method, Object, Object[]) line: not available [native method] NativeMethodAccessorImpl.invoke(Object, Object[]) line: 39 DelegatingMethodAccessorImpl.invoke(Object, Object[]) line: 25 Method.invoke(Object, Object...) line: 597 ProtobufUtil.toFilter(FilterProtos$Filter) line: 1360 ProtobufUtil.toScan(ClientProtos$Scan) line: 917 MiniHBaseCluster$MiniHBaseClusterRegionServer(HRegionServer).scan(RpcController, ClientProtos$ScanRequest) line: 3078 ClientProtos$ClientService$2.callBlockingMethod(Descriptors$MethodDescriptor, RpcController, Message) line: 29497 RpcServer.call(BlockingService, MethodDescriptor, Message, CellScanner, long, MonitoredRPCHandler) line: 2027 CallRunner.run() line: 98 {code} and then this: {code} Daemon Thread [defaultRpcServer.handler=4,queue=0,port=50594] (Suspended) FilterList.parseFrom(byte[]) line: 406 NativeMethodAccessorImpl.invoke0(Method, Object, Object[]) line: not available [native method] NativeMethodAccessorImpl.invoke(Object, Object[]) line: 39 DelegatingMethodAccessorImpl.invoke(Object, Object[]) line: 25 Method.invoke(Object, Object...) 
line: 597 ProtobufUtil.toFilter(FilterProtos$Filter) line: 1360 ProtobufUtil.toScan(ClientProtos$Scan) line: 917 MiniHBaseCluster$MiniHBaseClusterRegionServer(HRegionServer).scan(RpcController, ClientProtos$ScanRequest) line: 3078 ClientProtos$ClientService$2.callBlockingMethod(Descriptors$MethodDescriptor, RpcController, Message) line: 29497 RpcServer.call(BlockingService, MethodDescriptor, Message, CellScanner, long, MonitoredRPCHandler) line: 2027 CallRunner.run() line: 98 {code} and finally this: {code} Daemon Thread [defaultRpcServer.handler=4,queue=0,port=50594] (Suspended) MiniHBaseCluster$MiniHBaseClusterRegionServer(HRegionServer).scan(RpcController, ClientProtos$ScanRequest) line: 3239 ClientProtos$ClientService$2.callBlockingMethod(Descriptors$MethodDescriptor, RpcController, Message) line: 29497 RpcServer.call(BlockingService, MethodDescriptor, Message, CellScanner, long, MonitoredRPCHandler) line: 2027 CallRunner.run() line: 98 {code} Handling of DeserializationException causes needless retry on failure - Key: HBASE-12334 URL: https://issues.apache.org/jira/browse/HBASE-12334 Project: HBase Issue Type: Bug Affects Versions: 0.98.7 Reporter: James Taylor Labels: Phoenix If an unexpected exception occurs when deserialization occurs for a custom filter, the exception gets wrapped in a DeserializationException. Since this exception is in turn wrapped in an IOException, the many loop retry logic kicks in. The net effect is that this same deserialization error occurs again and again as the retries occur, just causing the client to wait needlessly. IMO, either the parseFrom methods should be allowed to throw whatever type of IOException they'd like, in which case they could throw a DoNotRetryIOException, or a DeserializationException should be wrapped in a DoNotRetryIOException. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
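The second option the reporter proposes — wrapping the DeserializationException in a DoNotRetryIOException — can be sketched as below. This is a self-contained illustration, not the actual HBase patch: the two exception classes are minimal stand-ins for HBase's DeserializationException and DoNotRetryIOException, and parseFrom is a hypothetical filter parser standing in for FilterList.parseFrom.

```java
import java.io.IOException;

// Hypothetical sketch: convert a deserialization failure into a
// non-retriable exception so the client fails fast instead of
// looping through its retry budget on an error that cannot succeed.
public class DeserializationRetrySketch {
    static class DeserializationException extends Exception {
        DeserializationException(String msg) { super(msg); }
    }
    static class DoNotRetryIOException extends IOException {
        DoNotRetryIOException(Throwable cause) { super(cause); }
    }

    // Stand-in for ProtobufUtil.toFilter(): the checked
    // DeserializationException is wrapped in DoNotRetryIOException
    // rather than a plain IOException.
    public static Object toFilter(byte[] bytes) throws IOException {
        try {
            return parseFrom(bytes);
        } catch (DeserializationException de) {
            throw new DoNotRetryIOException(de); // fail fast, no retry loop
        }
    }

    // Stand-in for a filter's parseFrom(): rejects empty input.
    static Object parseFrom(byte[] bytes) throws DeserializationException {
        if (bytes == null || bytes.length == 0) {
            throw new DeserializationException("empty filter bytes");
        }
        return new Object();
    }
}
```

With this shape, the retry logic in the client never engages for a malformed filter, which is the behavior the report asks for.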
[jira] [Updated] (HBASE-12277) Refactor bulkLoad methods in AccessController to its own interface
[ https://issues.apache.org/jira/browse/HBASE-12277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Madhan Neethiraj updated HBASE-12277: - Status: Open (was: Patch Available) Refactor bulkLoad methods in AccessController to its own interface -- Key: HBASE-12277 URL: https://issues.apache.org/jira/browse/HBASE-12277 Project: HBase Issue Type: Bug Reporter: Madhan Neethiraj Attachments: 0001-HBASE-12277-Refactored-bulk-load-methods-from-Access.patch, 0002-HBASE-12277-License-text-added-to-the-newly-created-.patch, HBASE-12277-v2.patch, HBASE-12277-v3.patch, HBASE-12277.patch SecureBulkLoadEndPoint references a couple of methods, prePrepareBulkLoad() and preCleanupBulkLoad(), implemented in AccessController, i.e. there is direct coupling between the AccessController and SecureBulkLoadEndPoint classes. SecureBulkLoadEndPoint assumes the presence of AccessController in a secure cluster. If HBase is configured with another coprocessor for access control, SecureBulkLoadEndPoint fails with an NPE. To remove this direct coupling, the bulk-load related methods in AccessController should be refactored into an interface, and AccessController should implement this interface. SecureBulkLoadEndPoint should then look for coprocessors that implement this interface, instead of directly looking for AccessController. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
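The decoupling described in the issue can be sketched as follows. This is an illustrative, self-contained model — the interface and class names are hypothetical, not the actual HBase API: bulk-load hooks move into their own interface, the access controller implements it, and the endpoint discovers any implementor instead of looking up AccessController by class.

```java
import java.util.ArrayList;
import java.util.List;

public class BulkLoadRefactorSketch {
    // The extracted interface holding the bulk-load hooks.
    interface BulkLoadObserver {
        void prePrepareBulkLoad(String table);
        void preCleanupBulkLoad(String table);
    }

    // AccessController — or any other access-control coprocessor —
    // implements the interface. Calls are recorded for illustration.
    static class AccessControllerLike implements BulkLoadObserver {
        final List<String> calls = new ArrayList<>();
        public void prePrepareBulkLoad(String table) { calls.add("prepare:" + table); }
        public void preCleanupBulkLoad(String table) { calls.add("cleanup:" + table); }
    }

    // The endpoint scans loaded coprocessors for the interface instead
    // of casting to a specific class, so a non-AccessController
    // implementation no longer causes an NPE.
    static void notifyPrepare(List<Object> coprocessors, String table) {
        for (Object cp : coprocessors) {
            if (cp instanceof BulkLoadObserver) {
                ((BulkLoadObserver) cp).prePrepareBulkLoad(table);
            }
        }
    }
}
```

Coprocessors that do not implement the interface are simply skipped, which mirrors the "look for implementors" behavior the issue proposes.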
[jira] [Updated] (HBASE-12277) Refactor bulkLoad methods in AccessController to its own interface
[ https://issues.apache.org/jira/browse/HBASE-12277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Madhan Neethiraj updated HBASE-12277: - Attachment: HBASE-12277-v4.patch Fixes for javadoc warnings reported by Hadoop QA. Refactor bulkLoad methods in AccessController to its own interface -- Key: HBASE-12277 URL: https://issues.apache.org/jira/browse/HBASE-12277 Project: HBase Issue Type: Bug Reporter: Madhan Neethiraj Attachments: 0001-HBASE-12277-Refactored-bulk-load-methods-from-Access.patch, 0002-HBASE-12277-License-text-added-to-the-newly-created-.patch, HBASE-12277-v2.patch, HBASE-12277-v3.patch, HBASE-12277-v4.patch, HBASE-12277.patch SecureBulkLoadEndPoint references a couple of methods, prePrepareBulkLoad() and preCleanupBulkLoad(), implemented in AccessController, i.e. there is direct coupling between the AccessController and SecureBulkLoadEndPoint classes. SecureBulkLoadEndPoint assumes the presence of AccessController in a secure cluster. If HBase is configured with another coprocessor for access control, SecureBulkLoadEndPoint fails with an NPE. To remove this direct coupling, the bulk-load related methods in AccessController should be refactored into an interface, and AccessController should implement this interface. SecureBulkLoadEndPoint should then look for coprocessors that implement this interface, instead of directly looking for AccessController. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HBASE-12277) Refactor bulkLoad methods in AccessController to its own interface
[ https://issues.apache.org/jira/browse/HBASE-12277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Madhan Neethiraj updated HBASE-12277: - Status: Patch Available (was: Open) Fixed javadoc warnings. Refactor bulkLoad methods in AccessController to its own interface -- Key: HBASE-12277 URL: https://issues.apache.org/jira/browse/HBASE-12277 Project: HBase Issue Type: Bug Reporter: Madhan Neethiraj Attachments: 0001-HBASE-12277-Refactored-bulk-load-methods-from-Access.patch, 0002-HBASE-12277-License-text-added-to-the-newly-created-.patch, HBASE-12277-v2.patch, HBASE-12277-v3.patch, HBASE-12277-v4.patch, HBASE-12277.patch SecureBulkLoadEndPoint references a couple of methods, prePrepareBulkLoad() and preCleanupBulkLoad(), implemented in AccessController, i.e. there is direct coupling between the AccessController and SecureBulkLoadEndPoint classes. SecureBulkLoadEndPoint assumes the presence of AccessController in a secure cluster. If HBase is configured with another coprocessor for access control, SecureBulkLoadEndPoint fails with an NPE. To remove this direct coupling, the bulk-load related methods in AccessController should be refactored into an interface, and AccessController should implement this interface. SecureBulkLoadEndPoint should then look for coprocessors that implement this interface, instead of directly looking for AccessController. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12329) Table create with duplicate column family names quietly succeeds
[ https://issues.apache.org/jira/browse/HBASE-12329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182381#comment-14182381 ] Jingcheng Du commented on HBASE-12329: -- Hi [~busbey], can I work on this? Table create with duplicate column family names quietly succeeds Key: HBASE-12329 URL: https://issues.apache.org/jira/browse/HBASE-12329 Project: HBase Issue Type: Bug Components: Client, shell Reporter: Sean Busbey Priority: Minor From the mailing list {quote} I was expecting that this is forbidden, **but** the following call does not throw any exception {code} String[] families = {"cf", "cf"}; HTableDescriptor desc = new HTableDescriptor(name); for (String cf : families) { HColumnDescriptor coldef = new HColumnDescriptor(cf); desc.addFamily(coldef); } try { admin.createTable(desc); } catch (TableExistsException e) { throw new IOException("table '" + name + "' already exists"); } {code} {quote} And Ted's follow up replicates in the shell {code} hbase(main):001:0> create 't2', {NAME => 'f1'}, {NAME => 'f1'} The table got created - with 1 column family: hbase(main):002:0> describe 't2' DESCRIPTION ENABLED 't2', {NAME => 'f1', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER => 'ROW', REPLICATION_SCOPE => '0 true ', VERSIONS => '1', COMPRESSION => 'NONE', MIN_VERSIONS => '0', TTL => '2147483647', KEEP_DELETED_CELLS => 'false', BLOCKSIZE => '65536', IN_MEMORY => 'false', BLOCKCACHE => 'true'} 1 row(s) in 0.1000 seconds {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
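The missing validation the report implies could look like the sketch below. It is a standalone illustration, not the actual HBase fix: the table descriptor is modeled as a plain list of family names, and validateFamilies is a hypothetical helper.

```java
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Illustrative sketch: reject a table definition that names the same
// column family twice, instead of silently collapsing the duplicates.
public class DuplicateFamilyCheckSketch {
    public static void validateFamilies(List<String> families) {
        Set<String> seen = new HashSet<>();
        for (String cf : families) {
            // Set.add returns false when the element is already present.
            if (!seen.add(cf)) {
                throw new IllegalArgumentException("Duplicate column family: " + cf);
            }
        }
    }
}
```

Whether such a check belongs in the client (HTableDescriptor.addFamily), the shell, or the master's createTable path is exactly the design question the issue leaves open.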
[jira] [Updated] (HBASE-12322) Add clean up command to ITBLL
[ https://issues.apache.org/jira/browse/HBASE-12322?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HBASE-12322: -- Resolution: Fixed Fix Version/s: 0.99.2 2.0.0 Hadoop Flags: Reviewed Status: Resolved (was: Patch Available) Committed to branch-1+ for [~eclark] Add clean up command to ITBLL - Key: HBASE-12322 URL: https://issues.apache.org/jira/browse/HBASE-12322 Project: HBase Issue Type: Bug Components: test Affects Versions: 2.0.0 Reporter: Elliott Clark Assignee: Elliott Clark Fix For: 2.0.0, 0.99.2 Attachments: HBASE-12322.patch Right now ITBLL can leave a table and some files on HDFS. It's then up to the user to clean them up. This can be a little messy. Lets give a single command to do that. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12334) Handling of DeserializationException causes needless retry on failure
[ https://issues.apache.org/jira/browse/HBASE-12334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182408#comment-14182408 ] Lars Hofhansl commented on HBASE-12334: --- Ah, that's why I didn't see it. The invoke throws an InvocationTargetException. Handling of DeserializationException causes needless retry on failure - Key: HBASE-12334 URL: https://issues.apache.org/jira/browse/HBASE-12334 Project: HBase Issue Type: Bug Affects Versions: 0.98.7 Reporter: James Taylor Labels: Phoenix If an unexpected exception occurs when deserialization occurs for a custom filter, the exception gets wrapped in a DeserializationException. Since this exception is in turn wrapped in an IOException, the many loop retry logic kicks in. The net effect is that this same deserialization error occurs again and again as the retries occur, just causing the client to wait needlessly. IMO, either the parseFrom methods should be allowed to throw whatever type of IOException they'd like, in which case they could throw a DoNotRetryIOException, or a DeserializationException should be wrapped in a DoNotRetryIOException. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HBASE-12334) Handling of DeserializationException causes needless retry on failure
[ https://issues.apache.org/jira/browse/HBASE-12334?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Lars Hofhansl updated HBASE-12334: -- Attachment: 12334-0.98.txt Maybe just this. The last exception is probably just downstream from a toFilter call. Handling of DeserializationException causes needless retry on failure - Key: HBASE-12334 URL: https://issues.apache.org/jira/browse/HBASE-12334 Project: HBase Issue Type: Bug Affects Versions: 0.98.7 Reporter: James Taylor Labels: Phoenix Attachments: 12334-0.98.txt If an unexpected exception occurs when deserialization occurs for a custom filter, the exception gets wrapped in a DeserializationException. Since this exception is in turn wrapped in an IOException, the many loop retry logic kicks in. The net effect is that this same deserialization error occurs again and again as the retries occur, just causing the client to wait needlessly. IMO, either the parseFrom methods should be allowed to throw whatever type of IOException they'd like, in which case they could throw a DoNotRetryIOException, or a DeserializationException should be wrapped in a DoNotRetryIOException. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12327) MetricsHBaseServerSourceFactory#createContextName has wrong conditions
[ https://issues.apache.org/jira/browse/HBASE-12327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182416#comment-14182416 ] Sanghyun Yun commented on HBASE-12327: -- Thanks for your review, [~tedyu]. I'm concerned about side effects. Do you think it's better to remove the old conditions? MetricsHBaseServerSourceFactory#createContextName has wrong conditions -- Key: HBASE-12327 URL: https://issues.apache.org/jira/browse/HBASE-12327 Project: HBase Issue Type: Bug Reporter: Sanghyun Yun Attachments: HBASE-12327.patch MetricsHBaseServerSourceFactory#createContextName has wrong conditions. It checks whether serverName contains "HMaster" or "HRegion". {code:title=MetricsHBaseServerSourceFactory.java} ... protected static String createContextName(String serverName) { if (serverName.contains("HMaster")) { return "Master"; } else if (serverName.contains("HRegion")) { return "RegionServer"; } return "IPC"; } ... {code} But the serverName actually passed contains "master" or "regionserver", per HMaster#getProcessName and HRegionServer#getProcessName. {code:title=HMaster.java} ... // MASTER is name of the webapp and the attribute name used stuffing this //instance into web context. public static final String MASTER = "master"; ... protected String getProcessName() { return MASTER; } ... {code} {code:title=HRegionServer.java} ... /** region server process name */ public static final String REGIONSERVER = "regionserver"; ... protected String getProcessName() { return REGIONSERVER; } ... {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
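A corrected method along the lines the report implies might look like this. It is a standalone sketch only — whether the old "HMaster"/"HRegion" conditions should be kept as a fallback is still being debated in the comments.

```java
// Sketch of the implied fix: match the names the daemons actually pass
// ("master"/"regionserver", from getProcessName()) rather than the
// class-name fragments "HMaster"/"HRegion".
public class ContextNameSketch {
    public static String createContextName(String serverName) {
        if (serverName.contains("master")) {
            return "Master";
        } else if (serverName.contains("regionserver")) {
            return "RegionServer";
        }
        return "IPC"; // fallback, unchanged from the current code
    }
}
```

Note that String.contains is case-sensitive, which is exactly why the original conditions never match the lowercase process names.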
[jira] [Commented] (HBASE-12329) Table create with duplicate column family names quietly succeeds
[ https://issues.apache.org/jira/browse/HBASE-12329?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182423#comment-14182423 ] Sean Busbey commented on HBASE-12329: - yes, please do! I'd be happy to provide a review when you have something. Table create with duplicate column family names quietly succeeds Key: HBASE-12329 URL: https://issues.apache.org/jira/browse/HBASE-12329 Project: HBase Issue Type: Bug Components: Client, shell Reporter: Sean Busbey Priority: Minor From the mailing list {quote} I was expecting that this is forbidden, **but** the following call does not throw any exception {code} String[] families = {"cf", "cf"}; HTableDescriptor desc = new HTableDescriptor(name); for (String cf : families) { HColumnDescriptor coldef = new HColumnDescriptor(cf); desc.addFamily(coldef); } try { admin.createTable(desc); } catch (TableExistsException e) { throw new IOException("table '" + name + "' already exists"); } {code} {quote} And Ted's follow up replicates in the shell {code} hbase(main):001:0> create 't2', {NAME => 'f1'}, {NAME => 'f1'} The table got created - with 1 column family: hbase(main):002:0> describe 't2' DESCRIPTION ENABLED 't2', {NAME => 'f1', DATA_BLOCK_ENCODING => 'NONE', BLOOMFILTER => 'ROW', REPLICATION_SCOPE => '0 true ', VERSIONS => '1', COMPRESSION => 'NONE', MIN_VERSIONS => '0', TTL => '2147483647', KEEP_DELETED_CELLS => 'false', BLOCKSIZE => '65536', IN_MEMORY => 'false', BLOCKCACHE => 'true'} 1 row(s) in 0.1000 seconds {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Resolved] (HBASE-11164) Document and test rolling updates from 0.98 - 1.0
[ https://issues.apache.org/jira/browse/HBASE-11164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack resolved HBASE-11164. --- Resolution: Fixed Hadoop Flags: Reviewed I just ran the rolling upgrade script and it worked. Did it on a small cluster. No exceptions in logs other than the expected ones when RSs go away for a while as they are rolling upgraded. Did this after changing the symlink to point at the new software: HADOOP_HOME=~/hadoop-2.6.0-CRC-SNAPSHOT ~/hbase/bin/rolling-restart.sh --config ~/conf_hbase Added more doc into upgrade on what a rolling upgrade is and added some to the 0.98 to 1.0 section. Resolving as done. Document and test rolling updates from 0.98 - 1.0 -- Key: HBASE-11164 URL: https://issues.apache.org/jira/browse/HBASE-11164 Project: HBase Issue Type: Sub-task Reporter: Enis Soztutar Assignee: stack Priority: Critical Fix For: 0.99.2 I think 1.0 should be rolling upgradable from 0.98 unless we break it intentionally for a specific reason. Unless there is such an issue, lets document that 1.0 and 0.98 should be rolling upgrade compatible. We should also test this before the 0.99 release. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-10856) Prep for 1.0
[ https://issues.apache.org/jira/browse/HBASE-10856?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182426#comment-14182426 ] stack commented on HBASE-10856: --- Removed HBASE-10403 Simplify offheap cache configuration link as it has fix version 2.0. Ditto for HBASE-10504 Define Replication Interface Prep for 1.0 Key: HBASE-10856 URL: https://issues.apache.org/jira/browse/HBASE-10856 Project: HBase Issue Type: Umbrella Reporter: stack Fix For: 0.99.2 Tasks for 1.0 copied here from our '1.0.0' mailing list discussion. Idea is to file subtasks off this one. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12334) Handling of DeserializationException causes needless retry on failure
[ https://issues.apache.org/jira/browse/HBASE-12334?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182427#comment-14182427 ] stack commented on HBASE-12334: --- +1 That'll fix James' issue. Could do w/ a more general soln doing this for all parseFroms but.. how about getting this in for now? Handling of DeserializationException causes needless retry on failure - Key: HBASE-12334 URL: https://issues.apache.org/jira/browse/HBASE-12334 Project: HBase Issue Type: Bug Affects Versions: 0.98.7 Reporter: James Taylor Labels: Phoenix Attachments: 12334-0.98.txt If an unexpected exception occurs when deserialization occurs for a custom filter, the exception gets wrapped in a DeserializationException. Since this exception is in turn wrapped in an IOException, the many loop retry logic kicks in. The net effect is that this same deserialization error occurs again and again as the retries occur, just causing the client to wait needlessly. IMO, either the parseFrom methods should be allowed to throw whatever type of IOException they'd like, in which case they could throw a DoNotRetryIOException, or a DeserializationException should be wrapped in a DoNotRetryIOException. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12327) MetricsHBaseServerSourceFactory#createContextName has wrong conditions
[ https://issues.apache.org/jira/browse/HBASE-12327?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182433#comment-14182433 ] Ted Yu commented on HBASE-12327: Can you add (temporary) logs to see whether the old ones are ever passed? Thanks MetricsHBaseServerSourceFactory#createContextName has wrong conditions -- Key: HBASE-12327 URL: https://issues.apache.org/jira/browse/HBASE-12327 Project: HBase Issue Type: Bug Reporter: Sanghyun Yun Attachments: HBASE-12327.patch MetricsHBaseServerSourceFactory#createContextName has wrong conditions. It checks whether serverName contains "HMaster" or "HRegion". {code:title=MetricsHBaseServerSourceFactory.java} ... protected static String createContextName(String serverName) { if (serverName.contains("HMaster")) { return "Master"; } else if (serverName.contains("HRegion")) { return "RegionServer"; } return "IPC"; } ... {code} But the serverName actually passed contains "master" or "regionserver", per HMaster#getProcessName and HRegionServer#getProcessName. {code:title=HMaster.java} ... // MASTER is name of the webapp and the attribute name used stuffing this //instance into web context. public static final String MASTER = "master"; ... protected String getProcessName() { return MASTER; } ... {code} {code:title=HRegionServer.java} ... /** region server process name */ public static final String REGIONSERVER = "regionserver"; ... protected String getProcessName() { return REGIONSERVER; } ... {code} -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Updated] (HBASE-12328) Need to separate JvmMetrics for Master and RegionServer
[ https://issues.apache.org/jira/browse/HBASE-12328?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HBASE-12328: -- Resolution: Fixed Fix Version/s: 0.99.2 2.0.0 Status: Resolved (was: Patch Available) Committed to branch-1. Thank you for the patch [~yunsh] Need to separate JvmMetrics for Master and RegionServer --- Key: HBASE-12328 URL: https://issues.apache.org/jira/browse/HBASE-12328 Project: HBase Issue Type: Improvement Reporter: Sanghyun Yun Priority: Minor Fix For: 2.0.0, 0.99.2 Attachments: HBASE-12328.2.patch, HBASE-12328.patch tag.ProcessName of JvmMetrics is IPC. It is the same for both Master and RegionServer. {code:title=HBase(Master and RegionServer)'s Metrics Dump} ... name: Hadoop:service=HBase,name=JvmMetrics, modelerType: JvmMetrics, tag.Context: jvm, tag.ProcessName: IPC, tag.SessionId: , ... {code} When I use HBase with Ganglia, I set tagsForPrefix.jvm=ProcessName in hadoop-metrics2-hbase.properties. {code:title=hadoop-metrics2-hbase.properties} ... *.sink.ganglia.class=org.apache.hadoop.metrics2.sink.ganglia.GangliaSink31 hbase.sink.ganglia.tagsForPrefix.jvm=ProcessName ... {code} But Ganglia generates only one RRD file because tag.ProcessName is IPC for both Master and RegionServer. I think we need to separate JvmMetrics for Master and RegionServer. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-11164) Document and test rolling updates from 0.98 - 1.0
[ https://issues.apache.org/jira/browse/HBASE-11164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182445#comment-14182445 ] Hudson commented on HBASE-11164: FAILURE: Integrated in HBase-TRUNK #5694 (See [https://builds.apache.org/job/HBase-TRUNK/5694/]) HBASE-11164 Document and test rolling updates from 0.98 - 1.0; Add note on why can't go from 0.96 to 1.0 and define what rolling upgrade is (stack: rev eae0f202cec535bfa0fc8dff1401b7afbe11f33f) * src/main/docbkx/upgrading.xml Document and test rolling updates from 0.98 - 1.0 -- Key: HBASE-11164 URL: https://issues.apache.org/jira/browse/HBASE-11164 Project: HBase Issue Type: Sub-task Reporter: Enis Soztutar Assignee: stack Priority: Critical Fix For: 0.99.2 I think 1.0 should be rolling upgradable from 0.98 unless we break it intentionally for a specific reason. Unless there is such an issue, lets document that 1.0 and 0.98 should be rolling upgrade compatible. We should also test this before the 0.99 release. -- This message was sent by Atlassian JIRA (v6.3.4#6332)
[jira] [Commented] (HBASE-12322) Add clean up command to ITBLL
[ https://issues.apache.org/jira/browse/HBASE-12322?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14182446#comment-14182446 ] Hudson commented on HBASE-12322: FAILURE: Integrated in HBase-TRUNK #5694 (See [https://builds.apache.org/job/HBase-TRUNK/5694/]) HBASE-12322 Add Clean command to ITBLL (stack: rev 11638a8cf294d57883928c91ab7d8d3d2eea6c7d) * hbase-it/src/test/java/org/apache/hadoop/hbase/test/IntegrationTestBigLinkedList.java Add clean up command to ITBLL - Key: HBASE-12322 URL: https://issues.apache.org/jira/browse/HBASE-12322 Project: HBase Issue Type: Bug Components: test Affects Versions: 2.0.0 Reporter: Elliott Clark Assignee: Elliott Clark Fix For: 2.0.0, 0.99.2 Attachments: HBASE-12322.patch Right now ITBLL can leave a table and some files on HDFS. It's then up to the user to clean them up. This can be a little messy. Lets give a single command to do that. -- This message was sent by Atlassian JIRA (v6.3.4#6332)