subject:"\[jira\] \[Commented\] \(HBASE\-10201\) Port 'Make flush decisions per column family' to trunk"

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-21 Thread Ted Yu (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14255185#comment-14255185
 ] 

Ted Yu commented on HBASE-10201:


Addendum integrated to master and branch-1

Thanks Duo.

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 2.0.0, 1.1.0

 Attachments: 10201-addendum.txt, 3149-trunk-v1.txt, 
 HBASE-10201-0.98.patch, HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, 
 HBASE-10201-0.99.patch, HBASE-10201-addendum_1.patch, HBASE-10201.patch, 
 HBASE-10201_1.patch, HBASE-10201_10.patch, HBASE-10201_11.patch, 
 HBASE-10201_12.patch, HBASE-10201_13.patch, HBASE-10201_13.patch, 
 HBASE-10201_14.patch, HBASE-10201_15.patch, HBASE-10201_16.patch, 
 HBASE-10201_17.patch, HBASE-10201_18.patch, HBASE-10201_19.patch, 
 HBASE-10201_2.patch, HBASE-10201_3.patch, HBASE-10201_4.patch, 
 HBASE-10201_5.patch, HBASE-10201_6.patch, HBASE-10201_7.patch, 
 HBASE-10201_8.patch, HBASE-10201_9.patch, compactions.png, count.png, io.png, 
 memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-21 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14255212#comment-14255212
 ] 

Hudson commented on HBASE-10201:


SUCCESS: Integrated in HBase-1.1 #16 (See 
[https://builds.apache.org/job/HBase-1.1/16/])
HBASE-10201 Addendum fixes typo of putIfAbsent (Duo Zhang) (tedyu: rev 
fbc852b6809184bdba0bbccb8ef3e1fe848d6f22)
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/wal/FSHLog.java


 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 2.0.0, 1.1.0

 Attachments: 10201-addendum.txt, 3149-trunk-v1.txt, 
 HBASE-10201-0.98.patch, HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, 
 HBASE-10201-0.99.patch, HBASE-10201-addendum_1.patch, HBASE-10201.patch, 
 HBASE-10201_1.patch, HBASE-10201_10.patch, HBASE-10201_11.patch, 
 HBASE-10201_12.patch, HBASE-10201_13.patch, HBASE-10201_13.patch, 
 HBASE-10201_14.patch, HBASE-10201_15.patch, HBASE-10201_16.patch, 
 HBASE-10201_17.patch, HBASE-10201_18.patch, HBASE-10201_19.patch, 
 HBASE-10201_2.patch, HBASE-10201_3.patch, HBASE-10201_4.patch, 
 HBASE-10201_5.patch, HBASE-10201_6.patch, HBASE-10201_7.patch, 
 HBASE-10201_8.patch, HBASE-10201_9.patch, compactions.png, count.png, io.png, 
 memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-21 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14255222#comment-14255222
 ] 

Hudson commented on HBASE-10201:


SUCCESS: Integrated in HBase-TRUNK #5955 (See 
[https://builds.apache.org/job/HBase-TRUNK/5955/])
HBASE-10201 Addendum fixes typo of putIfAbsent (Duo Zhang) (tedyu: rev 
51334fb951232aa56add118d142e6b82da204494)
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/wal/FSHLog.java


 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 2.0.0, 1.1.0

 Attachments: 10201-addendum.txt, 3149-trunk-v1.txt, 
 HBASE-10201-0.98.patch, HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, 
 HBASE-10201-0.99.patch, HBASE-10201-addendum_1.patch, HBASE-10201.patch, 
 HBASE-10201_1.patch, HBASE-10201_10.patch, HBASE-10201_11.patch, 
 HBASE-10201_12.patch, HBASE-10201_13.patch, HBASE-10201_13.patch, 
 HBASE-10201_14.patch, HBASE-10201_15.patch, HBASE-10201_16.patch, 
 HBASE-10201_17.patch, HBASE-10201_18.patch, HBASE-10201_19.patch, 
 HBASE-10201_2.patch, HBASE-10201_3.patch, HBASE-10201_4.patch, 
 HBASE-10201_5.patch, HBASE-10201_6.patch, HBASE-10201_7.patch, 
 HBASE-10201_8.patch, HBASE-10201_9.patch, compactions.png, count.png, io.png, 
 memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-19 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14254418#comment-14254418
 ] 

stack commented on HBASE-10201:
---

[~Apache9] Any chance of your taking a look at the test failure here: 
https://builds.apache.org/job/PreCommit-HBASE-Build/12165//testReport/ It is 
per column family flushing

https://builds.apache.org/job/PreCommit-HBASE-Build/12165/artifact/hbase-server/target/surefire-reports/org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush-output.txt

Says this:

---
Test set: org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush
---
Tests run: 1, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 248.398 sec  
FAILURE! - in org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush
testCompareStoreFileCount(org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush)
  Time elapsed: 53.153 sec   FAILURE!
java.lang.AssertionError: null
at org.junit.Assert.fail(Assert.java:86)
at org.junit.Assert.assertTrue(Assert.java:41)
at org.junit.Assert.assertTrue(Assert.java:52)
at 
org.apache.hadoop.hbase.regionserver.TestPerColumnFamilyFlush.testCompareStoreFileCount(TestPerColumnFamilyFlush.java:589)

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 2.0.0, 1.1.0

 Attachments: 10201-addendum.txt, 3149-trunk-v1.txt, 
 HBASE-10201-0.98.patch, HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, 
 HBASE-10201-0.99.patch, HBASE-10201.patch, HBASE-10201_1.patch, 
 HBASE-10201_10.patch, HBASE-10201_11.patch, HBASE-10201_12.patch, 
 HBASE-10201_13.patch, HBASE-10201_13.patch, HBASE-10201_14.patch, 
 HBASE-10201_15.patch, HBASE-10201_16.patch, HBASE-10201_17.patch, 
 HBASE-10201_18.patch, HBASE-10201_19.patch, HBASE-10201_2.patch, 
 HBASE-10201_3.patch, HBASE-10201_4.patch, HBASE-10201_5.patch, 
 HBASE-10201_6.patch, HBASE-10201_7.patch, HBASE-10201_8.patch, 
 HBASE-10201_9.patch, compactions.png, count.png, io.png, memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-19 Thread zhangduo (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14254432#comment-14254432
 ] 

zhangduo commented on HBASE-10201:
--

[~stack] Yeah, the testcase is flakey.. It is used to confirm that per column 
family flush generates less store files.

But flush is asynchronized, so there maybe a change that the original flush is 
delayed more than the per column family flush scenario and generate less store 
files, it depends on the machine's state that running the testcase...

I think we can make an addendum to remove it for now to get a stable testing 
result. I will try to find a more stable way to confirm that per column family 
flush does work.

Thanks~

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 2.0.0, 1.1.0

 Attachments: 10201-addendum.txt, 3149-trunk-v1.txt, 
 HBASE-10201-0.98.patch, HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, 
 HBASE-10201-0.99.patch, HBASE-10201.patch, HBASE-10201_1.patch, 
 HBASE-10201_10.patch, HBASE-10201_11.patch, HBASE-10201_12.patch, 
 HBASE-10201_13.patch, HBASE-10201_13.patch, HBASE-10201_14.patch, 
 HBASE-10201_15.patch, HBASE-10201_16.patch, HBASE-10201_17.patch, 
 HBASE-10201_18.patch, HBASE-10201_19.patch, HBASE-10201_2.patch, 
 HBASE-10201_3.patch, HBASE-10201_4.patch, HBASE-10201_5.patch, 
 HBASE-10201_6.patch, HBASE-10201_7.patch, HBASE-10201_8.patch, 
 HBASE-10201_9.patch, compactions.png, count.png, io.png, memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-19 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14254450#comment-14254450
 ] 

stack commented on HBASE-10201:
---

Thanks [~Apache9] Do it in new issue when you get a chance since this one is 
long enough already (smile).  Thanks.

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 2.0.0, 1.1.0

 Attachments: 10201-addendum.txt, 3149-trunk-v1.txt, 
 HBASE-10201-0.98.patch, HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, 
 HBASE-10201-0.99.patch, HBASE-10201.patch, HBASE-10201_1.patch, 
 HBASE-10201_10.patch, HBASE-10201_11.patch, HBASE-10201_12.patch, 
 HBASE-10201_13.patch, HBASE-10201_13.patch, HBASE-10201_14.patch, 
 HBASE-10201_15.patch, HBASE-10201_16.patch, HBASE-10201_17.patch, 
 HBASE-10201_18.patch, HBASE-10201_19.patch, HBASE-10201_2.patch, 
 HBASE-10201_3.patch, HBASE-10201_4.patch, HBASE-10201_5.patch, 
 HBASE-10201_6.patch, HBASE-10201_7.patch, HBASE-10201_8.patch, 
 HBASE-10201_9.patch, compactions.png, count.png, io.png, memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-18 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14252596#comment-14252596
 ] 

stack commented on HBASE-10201:
---

Committed to branch-1 so will be in 1.1.0.

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 2.0.0, 1.1.0

 Attachments: 10201-addendum.txt, 3149-trunk-v1.txt, 
 HBASE-10201-0.98.patch, HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, 
 HBASE-10201-0.99.patch, HBASE-10201.patch, HBASE-10201_1.patch, 
 HBASE-10201_10.patch, HBASE-10201_11.patch, HBASE-10201_12.patch, 
 HBASE-10201_13.patch, HBASE-10201_13.patch, HBASE-10201_14.patch, 
 HBASE-10201_15.patch, HBASE-10201_16.patch, HBASE-10201_17.patch, 
 HBASE-10201_18.patch, HBASE-10201_19.patch, HBASE-10201_2.patch, 
 HBASE-10201_3.patch, HBASE-10201_4.patch, HBASE-10201_5.patch, 
 HBASE-10201_6.patch, HBASE-10201_7.patch, HBASE-10201_8.patch, 
 HBASE-10201_9.patch, compactions.png, count.png, io.png, memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-18 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14252761#comment-14252761
 ] 

Hudson commented on HBASE-10201:


SUCCESS: Integrated in HBase-1.1 #5 (See 
[https://builds.apache.org/job/HBase-1.1/5/])
HBASE-10201 Port 'Make flush decisions per column family' to trunk (stack: rev 
e55ef7a663dd9a18fa88a506afd8fe0ced10563d)
* 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/TestWALReplay.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/FlushRequester.java
* hbase-client/src/main/java/org/apache/hadoop/hbase/HTableDescriptor.java
* hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/LogRoller.java
* 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestPerColumnFamilyFlush.java
* hbase-server/src/test/java/org/apache/hadoop/hbase/wal/TestWALFactory.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/FlushLargeStoresPolicy.java
* 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestFlushRegionEntry.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/RSRpcServices.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/FlushAllStoresPolicy.java
* 
hbase-server/src/test/java/org/apache/hadoop/hbase/wal/TestDefaultWALProvider.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/FlushPolicy.java
* hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java
* 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestHeapMemoryManager.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/wal/FSWALEntry.java
* hbase-server/src/test/java/org/apache/hadoop/hbase/TestIOFencing.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegionServer.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/wal/DisabledWALProvider.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/wal/FSHLog.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/FlushPolicyFactory.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/MemStoreFlusher.java
* 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/TestFSHLog.java
* hbase-server/src/main/java/org/apache/hadoop/hbase/wal/WAL.java
* hbase-common/src/main/resources/hbase-default.xml
HBASE-10201 Addendum changes TestPerColumnFamilyFlush to LargeTest (stack: rev 
5d34d2d02af39037a2426fe4fb5be9a447202bd7)
* 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestPerColumnFamilyFlush.java


 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 2.0.0, 1.1.0

 Attachments: 10201-addendum.txt, 3149-trunk-v1.txt, 
 HBASE-10201-0.98.patch, HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, 
 HBASE-10201-0.99.patch, HBASE-10201.patch, HBASE-10201_1.patch, 
 HBASE-10201_10.patch, HBASE-10201_11.patch, HBASE-10201_12.patch, 
 HBASE-10201_13.patch, HBASE-10201_13.patch, HBASE-10201_14.patch, 
 HBASE-10201_15.patch, HBASE-10201_16.patch, HBASE-10201_17.patch, 
 HBASE-10201_18.patch, HBASE-10201_19.patch, HBASE-10201_2.patch, 
 HBASE-10201_3.patch, HBASE-10201_4.patch, HBASE-10201_5.patch, 
 HBASE-10201_6.patch, HBASE-10201_7.patch, HBASE-10201_8.patch, 
 HBASE-10201_9.patch, compactions.png, count.png, io.png, memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-16 Thread Enis Soztutar (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14248724#comment-14248724
 ] 

Enis Soztutar commented on HBASE-10201:
---

I don't think we should have this in 1.0.0. I am planning on cutting the RC 
tomorrow, and this seems to be a huge change for the last minute. Can we target 
1.1 instead? 

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 1.0.0, 2.0.0

 Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch, 
 HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch, 
 HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch, 
 HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch, 
 HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch, 
 HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch, 
 HBASE-10201_19.patch, HBASE-10201_2.patch, HBASE-10201_3.patch, 
 HBASE-10201_4.patch, HBASE-10201_5.patch, HBASE-10201_6.patch, 
 HBASE-10201_7.patch, HBASE-10201_8.patch, HBASE-10201_9.patch, 
 compactions.png, count.png, io.png, memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-16 Thread Jeffrey Zhong (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14248725#comment-14248725
 ] 

Jeffrey Zhong commented on HBASE-10201:
---

Looks good to me(+1) for master branch.  Branch-1 should rely on [~enis]'s 
feedbacks.

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 1.0.0, 2.0.0

 Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch, 
 HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch, 
 HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch, 
 HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch, 
 HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch, 
 HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch, 
 HBASE-10201_19.patch, HBASE-10201_2.patch, HBASE-10201_3.patch, 
 HBASE-10201_4.patch, HBASE-10201_5.patch, HBASE-10201_6.patch, 
 HBASE-10201_7.patch, HBASE-10201_8.patch, HBASE-10201_9.patch, 
 compactions.png, count.png, io.png, memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-16 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14248793#comment-14248793
 ] 

stack commented on HBASE-10201:
---

bq. Can we target 1.1 instead?

Sure.

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 1.0.0, 2.0.0

 Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch, 
 HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch, 
 HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch, 
 HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch, 
 HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch, 
 HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch, 
 HBASE-10201_19.patch, HBASE-10201_2.patch, HBASE-10201_3.patch, 
 HBASE-10201_4.patch, HBASE-10201_5.patch, HBASE-10201_6.patch, 
 HBASE-10201_7.patch, HBASE-10201_8.patch, HBASE-10201_9.patch, 
 compactions.png, count.png, io.png, memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-16 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14248816#comment-14248816
 ] 

stack commented on HBASE-10201:
---

I forgot to say thank you [~Apache9] for your persistence on getting this in.

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 2.0.0

 Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch, 
 HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch, 
 HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch, 
 HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch, 
 HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch, 
 HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch, 
 HBASE-10201_19.patch, HBASE-10201_2.patch, HBASE-10201_3.patch, 
 HBASE-10201_4.patch, HBASE-10201_5.patch, HBASE-10201_6.patch, 
 HBASE-10201_7.patch, HBASE-10201_8.patch, HBASE-10201_9.patch, 
 compactions.png, count.png, io.png, memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-16 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14248943#comment-14248943
 ] 

Hudson commented on HBASE-10201:


FAILURE: Integrated in HBase-TRUNK #5930 (See 
[https://builds.apache.org/job/HBase-TRUNK/5930/])
HBASE-10201 Port 'Make flush decisions per column family' to trunk (stack: rev 
c7fad665f34fd3c17999d5cc60b04d3faff6a7f5)
* 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/TestFSHLog.java
* hbase-server/src/main/java/org/apache/hadoop/hbase/wal/WAL.java
* hbase-common/src/main/resources/hbase-default.xml
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/MemStoreFlusher.java
* 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestFlushRegionEntry.java
* hbase-client/src/main/java/org/apache/hadoop/hbase/HTableDescriptor.java
* 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/TestWALReplay.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegionServer.java
* hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/HRegion.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/RSRpcServices.java
* hbase-server/src/test/java/org/apache/hadoop/hbase/wal/TestWALFactory.java
* 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestHeapMemoryManager.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/FlushLargeStoresPolicy.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/FlushRequester.java
* 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestPerColumnFamilyFlush.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/wal/DisabledWALProvider.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/FlushAllStoresPolicy.java
* hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/LogRoller.java
* 
hbase-server/src/test/java/org/apache/hadoop/hbase/wal/TestDefaultWALProvider.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/FlushPolicy.java
* hbase-server/src/test/java/org/apache/hadoop/hbase/TestIOFencing.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/wal/FSWALEntry.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/wal/FSHLog.java
* 
hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/FlushPolicyFactory.java


 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 2.0.0

 Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch, 
 HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch, 
 HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch, 
 HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch, 
 HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch, 
 HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch, 
 HBASE-10201_19.patch, HBASE-10201_2.patch, HBASE-10201_3.patch, 
 HBASE-10201_4.patch, HBASE-10201_5.patch, HBASE-10201_6.patch, 
 HBASE-10201_7.patch, HBASE-10201_8.patch, HBASE-10201_9.patch, 
 compactions.png, count.png, io.png, memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-16 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14249188#comment-14249188
 ] 

stack commented on HBASE-10201:
---

That'll work. Thanks.

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 2.0.0

 Attachments: 10201-addendum.txt, 3149-trunk-v1.txt, 
 HBASE-10201-0.98.patch, HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, 
 HBASE-10201-0.99.patch, HBASE-10201.patch, HBASE-10201_1.patch, 
 HBASE-10201_10.patch, HBASE-10201_11.patch, HBASE-10201_12.patch, 
 HBASE-10201_13.patch, HBASE-10201_13.patch, HBASE-10201_14.patch, 
 HBASE-10201_15.patch, HBASE-10201_16.patch, HBASE-10201_17.patch, 
 HBASE-10201_18.patch, HBASE-10201_19.patch, HBASE-10201_2.patch, 
 HBASE-10201_3.patch, HBASE-10201_4.patch, HBASE-10201_5.patch, 
 HBASE-10201_6.patch, HBASE-10201_7.patch, HBASE-10201_8.patch, 
 HBASE-10201_9.patch, compactions.png, count.png, io.png, memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-16 Thread Ted Yu (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14249191#comment-14249191
 ] 

Ted Yu commented on HBASE-10201:


Addendum pushed to master branch.

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 2.0.0

 Attachments: 10201-addendum.txt, 3149-trunk-v1.txt, 
 HBASE-10201-0.98.patch, HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, 
 HBASE-10201-0.99.patch, HBASE-10201.patch, HBASE-10201_1.patch, 
 HBASE-10201_10.patch, HBASE-10201_11.patch, HBASE-10201_12.patch, 
 HBASE-10201_13.patch, HBASE-10201_13.patch, HBASE-10201_14.patch, 
 HBASE-10201_15.patch, HBASE-10201_16.patch, HBASE-10201_17.patch, 
 HBASE-10201_18.patch, HBASE-10201_19.patch, HBASE-10201_2.patch, 
 HBASE-10201_3.patch, HBASE-10201_4.patch, HBASE-10201_5.patch, 
 HBASE-10201_6.patch, HBASE-10201_7.patch, HBASE-10201_8.patch, 
 HBASE-10201_9.patch, compactions.png, count.png, io.png, memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-16 Thread zhangduo (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14249199#comment-14249199
 ] 

zhangduo commented on HBASE-10201:
--

{quote}
I forgot to say thank you zhangduo for your persistence on getting this in.
{quote}

It's my pleasure to contribute code to a famous project:)
Thanks


 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 2.0.0

 Attachments: 10201-addendum.txt, 3149-trunk-v1.txt, 
 HBASE-10201-0.98.patch, HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, 
 HBASE-10201-0.99.patch, HBASE-10201.patch, HBASE-10201_1.patch, 
 HBASE-10201_10.patch, HBASE-10201_11.patch, HBASE-10201_12.patch, 
 HBASE-10201_13.patch, HBASE-10201_13.patch, HBASE-10201_14.patch, 
 HBASE-10201_15.patch, HBASE-10201_16.patch, HBASE-10201_17.patch, 
 HBASE-10201_18.patch, HBASE-10201_19.patch, HBASE-10201_2.patch, 
 HBASE-10201_3.patch, HBASE-10201_4.patch, HBASE-10201_5.patch, 
 HBASE-10201_6.patch, HBASE-10201_7.patch, HBASE-10201_8.patch, 
 HBASE-10201_9.patch, compactions.png, count.png, io.png, memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-16 Thread Hudson (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14249275#comment-14249275
 ] 

Hudson commented on HBASE-10201:


FAILURE: Integrated in HBase-TRUNK #5933 (See 
[https://builds.apache.org/job/HBase-TRUNK/5933/])
HBASE-10201 Addendum changes TestPerColumnFamilyFlush to LargeTest (tedyu: rev 
885b065683499540f467cb54086a3f60e64b9c8a)
* 
hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/TestPerColumnFamilyFlush.java


 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 2.0.0

 Attachments: 10201-addendum.txt, 3149-trunk-v1.txt, 
 HBASE-10201-0.98.patch, HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, 
 HBASE-10201-0.99.patch, HBASE-10201.patch, HBASE-10201_1.patch, 
 HBASE-10201_10.patch, HBASE-10201_11.patch, HBASE-10201_12.patch, 
 HBASE-10201_13.patch, HBASE-10201_13.patch, HBASE-10201_14.patch, 
 HBASE-10201_15.patch, HBASE-10201_16.patch, HBASE-10201_17.patch, 
 HBASE-10201_18.patch, HBASE-10201_19.patch, HBASE-10201_2.patch, 
 HBASE-10201_3.patch, HBASE-10201_4.patch, HBASE-10201_5.patch, 
 HBASE-10201_6.patch, HBASE-10201_7.patch, HBASE-10201_8.patch, 
 HBASE-10201_9.patch, compactions.png, count.png, io.png, memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-14 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14246385#comment-14246385
 ] 

stack commented on HBASE-10201:
---

I'm +1 on this going into master branch.  I am +1 on this going into branch-1 
but with it disabled by default as an experimental feature; users would have to 
enable the FlushLargeStoresPolicy explicitly (You ok w/ that [~enis])?

Any chance of more +1s?  [~jeffreyz]? Any other reviews out there? This is an 
old issue, nicely addressed, that can make a nice dent in our i/o profile when 
more than one column family but it would be good to get more eyes on it given 
its messing with sequenceids. Thanks.

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 1.0.0, 2.0.0

 Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch, 
 HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch, 
 HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch, 
 HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch, 
 HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch, 
 HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch, 
 HBASE-10201_19.patch, HBASE-10201_2.patch, HBASE-10201_3.patch, 
 HBASE-10201_4.patch, HBASE-10201_5.patch, HBASE-10201_6.patch, 
 HBASE-10201_7.patch, HBASE-10201_8.patch, HBASE-10201_9.patch, 
 compactions.png, count.png, io.png, memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-13 Thread Hadoop QA (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14245245#comment-14245245
]

Hadoop QA commented on HBASE-10201:
---

{color:green}+1 overall{color}. Here are the results of testing the latest
attachment
http://issues.apache.org/jira/secure/attachment/12687023/HBASE-10201_19.patch
against master branch at commit a0e473730e2cd819e7442dbd2b332d7833755ba2.
ATTACHMENT ID: 12687023

{color:green}+1 @author{color}. The patch does not contain any @author
tags.

{color:green}+1 tests included{color}. The patch appears to include 32 new
or modified tests.

{color:green}+1 javac{color}. The applied patch does not increase the
total number of javac compiler warnings.

{color:green}+1 javadoc{color}. The javadoc tool did not generate any
warning messages.

{color:green}+1 checkstyle{color}. The applied patch does not increase the
total number of checkstyle errors

{color:green}+1 findbugs{color}. The patch does not introduce any new
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}. The applied patch does not increase
the total number of release audit warnings.

{color:green}+1 lineLengths{color}. The patch does not introduce lines
longer than 100

{color:green}+1 site{color}. The mvn site goal succeeds with this patch.

{color:green}+1 core tests{color}. The patch passed unit tests in .

Test results:
https://builds.apache.org/job/PreCommit-HBASE-Build/12065//testReport/
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12065//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12065//artifact/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12065//artifact/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12065//artifact/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12065//artifact/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12065//artifact/patchprocess/newPatchFindbugsWarningshbase-rest.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12065//artifact/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12065//artifact/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12065//artifact/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12065//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12065//artifact/patchprocess/newPatchFindbugsWarningshbase-annotations.html
Checkstyle Errors:
https://builds.apache.org/job/PreCommit-HBASE-Build/12065//artifact/patchprocess/checkstyle-aggregate.html

Console output:
https://builds.apache.org/job/PreCommit-HBASE-Build/12065//console

This message is automatically generated.

Port 'Make flush decisions per column family' to trunk
--

Key: HBASE-10201
URL: https://issues.apache.org/jira/browse/HBASE-10201
Project: HBase
Issue Type: Improvement
Components: wal
Reporter: Ted Yu
Assignee: zhangduo
Fix For: 1.0.0, 2.0.0

Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch,
HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch,
HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch,
HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch,
HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch,
HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch,
HBASE-10201_19.patch, HBASE-10201_2.patch, HBASE-10201_3.patch,
HBASE-10201_4.patch, HBASE-10201_5.patch, HBASE-10201_6.patch,
HBASE-10201_7.patch, HBASE-10201_8.patch, HBASE-10201_9.patch,
compactions.png, count.png, io.png, memstore.png

Currently the flush decision is made using the aggregate size of all column
families. When large and small column families co-exist, this causes many
small flushes of the smaller CF. We need to make per-CF flush decisions.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-12 Thread Hadoop QA (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14245187#comment-14245187
]

Hadoop QA commented on HBASE-10201:
---

{color:red}-1 overall{color}. Here are the results of testing the latest
attachment
http://issues.apache.org/jira/secure/attachment/12687009/HBASE-10201_19.patch
against master branch at commit a0e473730e2cd819e7442dbd2b332d7833755ba2.
ATTACHMENT ID: 12687009

{color:green}+1 @author{color}. The patch does not contain any @author
tags.

{color:green}+1 tests included{color}. The patch appears to include 32 new
or modified tests.

{color:green}+1 javac{color}. The applied patch does not increase the
total number of javac compiler warnings.

{color:red}-1 javadoc{color}. The javadoc tool appears to have generated 1
warning messages.

{color:green}+1 checkstyle{color}. The applied patch does not increase the
total number of checkstyle errors

{color:green}+1 findbugs{color}. The patch does not introduce any new
Findbugs (version 2.0.3) warnings.

{color:green}+1 release audit{color}. The applied patch does not increase
the total number of release audit warnings.

{color:green}+1 lineLengths{color}. The patch does not introduce lines
longer than 100

{color:green}+1 site{color}. The mvn site goal succeeds with this patch.

{color:green}+1 core tests{color}. The patch passed unit tests in .

Test results:
https://builds.apache.org/job/PreCommit-HBASE-Build/12064//testReport/
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12064//artifact/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12064//artifact/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12064//artifact/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12064//artifact/patchprocess/newPatchFindbugsWarningshbase-annotations.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12064//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12064//artifact/patchprocess/newPatchFindbugsWarningshbase-rest.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12064//artifact/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12064//artifact/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12064//artifact/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12064//artifact/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12064//artifact/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Checkstyle Errors:
https://builds.apache.org/job/PreCommit-HBASE-Build/12064//artifact/patchprocess/checkstyle-aggregate.html

Javadoc warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/12064//artifact/patchprocess/patchJavadocWarnings.txt
Console output:
https://builds.apache.org/job/PreCommit-HBASE-Build/12064//console

This message is automatically generated.

Port 'Make flush decisions per column family' to trunk
--

Key: HBASE-10201
URL: https://issues.apache.org/jira/browse/HBASE-10201
Project: HBase
Issue Type: Improvement
Components: wal
Reporter: Ted Yu
Assignee: zhangduo
Fix For: 1.0.0, 2.0.0

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-11 Thread Jeffrey Zhong (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14243274#comment-14243274
 ] 

Jeffrey Zhong commented on HBASE-10201:
---

{quote}
Now I always generate a new flushSeqId and use this as the seqId of flushed 
StoreFiles. And use a maxFlushedSeqId to record completeSequenceId that passed 
to HMaster. Is it OK?
{quote}
Sounds good to me. 

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 1.0.0, 2.0.0

 Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch, 
 HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch, 
 HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch, 
 HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch, 
 HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch, 
 HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch, 
 HBASE-10201_2.patch, HBASE-10201_3.patch, HBASE-10201_4.patch, 
 HBASE-10201_5.patch, HBASE-10201_6.patch, HBASE-10201_7.patch, 
 HBASE-10201_8.patch, HBASE-10201_9.patch, compactions.png, count.png, io.png, 
 memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-11 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14243345#comment-14243345
 ] 

stack commented on HBASE-10201:
---

[~jeffreyz] What about the comment on issue w/ 1. above? See 
https://issues.apache.org/jira/browse/HBASE-10201?focusedCommentId=14240737page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel#comment-14240737

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 1.0.0, 2.0.0

 Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch, 
 HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch, 
 HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch, 
 HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch, 
 HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch, 
 HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch, 
 HBASE-10201_2.patch, HBASE-10201_3.patch, HBASE-10201_4.patch, 
 HBASE-10201_5.patch, HBASE-10201_6.patch, HBASE-10201_7.patch, 
 HBASE-10201_8.patch, HBASE-10201_9.patch, compactions.png, count.png, io.png, 
 memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-11 Thread Jeffrey Zhong (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14243528#comment-14243528
 ] 

Jeffrey Zhong commented on HBASE-10201:
---

[~saint@gmail.com] 
{quote}
Are you referring to the following: Will this mean we drop edits because 
region thinks its sequenceid is higher than it should be?
{quote}
Yes, as of today during replay edits in both modes, we drop WAL edits whose 
seqId less than relating store Seq Ids. There some edge cases(like a new PUT, 
region move to a different RS, DELETE on the new PUT, major compaction, move 
back to the original RS and the RS crashes) we have to know the hFile seqId 
accurately otherwise the PUT may be restored after recovery. 

We need to pass flushed seqIds per store to master so that we can optimize 
recovery process but doesn't impact correctness. 

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 1.0.0, 2.0.0

 Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch, 
 HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch, 
 HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch, 
 HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch, 
 HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch, 
 HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch, 
 HBASE-10201_2.patch, HBASE-10201_3.patch, HBASE-10201_4.patch, 
 HBASE-10201_5.patch, HBASE-10201_6.patch, HBASE-10201_7.patch, 
 HBASE-10201_8.patch, HBASE-10201_9.patch, compactions.png, count.png, io.png, 
 memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-11 Thread stack (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14243546#comment-14243546
]

stack commented on HBASE-10201:
---

[~jeffreyz] I'm referring to the fact that if three column families, and one
has edit #1, another edit #2 (which came later) and the third had edit #3 and
then if the policy decides flush the third CF, we'll write it out with a seqid
of #3 but edits #1 and #2 are still in memory. We report to the master our
lowest number is #1 but master crashes (so we lose info that #1 is earliest
safe edit number). The RS hosting the three column famiilies also crashes. On
recovery, we open the region and see a hfile with seqid #3 so we set the region
current seqid to #4.. even though #1 and #2 were never persisted. This is
possible with this patch as is especially when policy is disconnected from
flush.

bq. We need to pass flushed seqIds per store to master so that we can optimize
recovery process but doesn't impact correctness.

This would not fix the above case? The master might know that #3 was persisted
and that column family 1 and 2 had edits less than #3 but if it crashes, we're
back in the scenario described above (unless we persist the flush reports?)

Thanks.

Port 'Make flush decisions per column family' to trunk
--

Key: HBASE-10201
URL: https://issues.apache.org/jira/browse/HBASE-10201
Project: HBase
Issue Type: Improvement
Components: wal
Reporter: Ted Yu
Assignee: zhangduo
Fix For: 1.0.0, 2.0.0

Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch,
HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch,
HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch,
HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch,
HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch,
HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch,
HBASE-10201_2.patch, HBASE-10201_3.patch, HBASE-10201_4.patch,
HBASE-10201_5.patch, HBASE-10201_6.patch, HBASE-10201_7.patch,
HBASE-10201_8.patch, HBASE-10201_9.patch, compactions.png, count.png, io.png,
memstore.png

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-11 Thread zhangduo (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14243610#comment-14243610
 ] 

zhangduo commented on HBASE-10201:
--

[~stack] In your scenario, I think we will use #1 to skip edits, not #4.
As I see code in replayRecoveredEditsIfAny
{code}
long minSeqIdForTheRegion = -1;
for (Long maxSeqIdInStore : maxSeqIdInStores.values()) {
  if (maxSeqIdInStore  minSeqIdForTheRegion || minSeqIdForTheRegion == -1) 
{
minSeqIdForTheRegion = maxSeqIdInStore;
  }
}
{code}
And this
{code}
  maxSeqId = Math.abs(Long.parseLong(fileName));
  if (maxSeqId = minSeqIdForTheRegion) {
if (LOG.isDebugEnabled()) {
  String msg = Maximum sequenceid for this wal is  + maxSeqId
+  and minimum sequenceid for the region is  + 
minSeqIdForTheRegion
+ , skipped the whole file, path= + edits;
  LOG.debug(msg);
}
continue;
  }
{code}
And in replayRecoveredEdits, we skip edit cells using per store seqId
{code}
// Now, figure if we should skip this edit.
if (key.getLogSeqNum() = maxSeqIdInStores.get(store.getFamily()
.getName())) {
  skippedEdits++;
  continue;
}
{code}

And when splitting log, we use a lastSeqId got from HMaster to skip edits. If 
master crash and loss the information, then we will not skip any edits? I'm not 
sure but I didn't find the code to get lastSeqId from any place other than 
HMaster. [~jeffreyz]

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 1.0.0, 2.0.0

 Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch, 
 HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch, 
 HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch, 
 HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch, 
 HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch, 
 HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch, 
 HBASE-10201_2.patch, HBASE-10201_3.patch, HBASE-10201_4.patch, 
 HBASE-10201_5.patch, HBASE-10201_6.patch, HBASE-10201_7.patch, 
 HBASE-10201_8.patch, HBASE-10201_9.patch, compactions.png, count.png, io.png, 
 memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-11 Thread Jeffrey Zhong (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14243631#comment-14243631
 ] 

Jeffrey Zhong commented on HBASE-10201:
---

[~saint@gmail.com] Besides [~Apache9] mentioned, we skip edits using seqId 
of each relating store, the #4(which is  #3) is only set after region is full 
recovered(i.e all WAL edits are already replayed).

{quote}
 If master crash and loss the information, then we will not skip any edits?
{quote}
yes, we'll lose the info and will replay more edits. 

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 1.0.0, 2.0.0

 Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch, 
 HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch, 
 HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch, 
 HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch, 
 HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch, 
 HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch, 
 HBASE-10201_2.patch, HBASE-10201_3.patch, HBASE-10201_4.patch, 
 HBASE-10201_5.patch, HBASE-10201_6.patch, HBASE-10201_7.patch, 
 HBASE-10201_8.patch, HBASE-10201_9.patch, compactions.png, count.png, io.png, 
 memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-11 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14243714#comment-14243714
 ] 

stack commented on HBASE-10201:
---

Yes. I think it is going to be ok. I missed the 'skip edits using seqid of each 
relating store' bit. My calc was region based.  Thanks for entertaining my 
question.  In my scenario, the first column family that had edit #1 should have 
a store seqid of -1 which would mean we'd not skip edit #1 when it came into 
replayRecoveredEditsIfAny,

I'm wondering how to make a unit test.  One thought was to stand up a single 
HRegion of multiple column families and populate it in various ways, out of 
balance, and then add a means of 'killing' the region.  Then create a 
'recoved.edits' file and reopen the region to verify edits are as expected (and 
do same for DLR replay scenario)?





 

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 1.0.0, 2.0.0

 Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch, 
 HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch, 
 HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch, 
 HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch, 
 HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch, 
 HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch, 
 HBASE-10201_2.patch, HBASE-10201_3.patch, HBASE-10201_4.patch, 
 HBASE-10201_5.patch, HBASE-10201_6.patch, HBASE-10201_7.patch, 
 HBASE-10201_8.patch, HBASE-10201_9.patch, compactions.png, count.png, io.png, 
 memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-11 Thread zhangduo (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14243728#comment-14243728
 ] 

zhangduo commented on HBASE-10201:
--

{quote}
I'm wondering how to make a unit test.
{quote}
TestPerColumnFamilyFlush.testLogReplay has tested log replay for selective 
flush. I think it only misses the things that it does not kill HMaster when log 
replay. I can add a testcase to test the scenario that we can not get up to 
date lastSeqId from HMaster(kill master first, then kill regionserver, then 
restart master). [~stack], is this OK?

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 1.0.0, 2.0.0

 Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch, 
 HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch, 
 HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch, 
 HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch, 
 HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch, 
 HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch, 
 HBASE-10201_2.patch, HBASE-10201_3.patch, HBASE-10201_4.patch, 
 HBASE-10201_5.patch, HBASE-10201_6.patch, HBASE-10201_7.patch, 
 HBASE-10201_8.patch, HBASE-10201_9.patch, compactions.png, count.png, io.png, 
 memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-11 Thread stack (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14243758#comment-14243758
]

stack commented on HBASE-10201:
---

bq. TestPerColumnFamilyFlush.testLogReplay has tested log replay for selective
flush.

Woah. Thats a nice test. How long has that been around? I missed it in
previous reviews if it was present. I think this test is enough to give us
confidence in this radical change. The kill of master so we don't have latest
seqid is a nice to have but not necessary; we just over replay the edits.

Let me go over your last posted patch. Seems like a bunch of new stuff has
shown up (or I was blind last time I read through the patch).

Port 'Make flush decisions per column family' to trunk
--

Key: HBASE-10201
URL: https://issues.apache.org/jira/browse/HBASE-10201
Project: HBase
Issue Type: Improvement
Components: wal
Reporter: Ted Yu
Assignee: zhangduo
Fix For: 1.0.0, 2.0.0

Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch,
HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch,
HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch,
HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch,
HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch,
HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch,
HBASE-10201_2.patch, HBASE-10201_3.patch, HBASE-10201_4.patch,
HBASE-10201_5.patch, HBASE-10201_6.patch, HBASE-10201_7.patch,
HBASE-10201_8.patch, HBASE-10201_9.patch, compactions.png, count.png, io.png,
memstore.png

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-10 Thread zhangduo (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14241121#comment-14241121
 ] 

zhangduo commented on HBASE-10201:
--

I ran the performance test in TestPerColumnFamilyFlush to confirm the patch is 
still work after I changed the behavior of FlushPolicy.

The result is same with previous test

metric_storeCount: 3,
metric_storeFileCount: 9,
metric_memStoreSize: 1272,
metric_storeFileSize: 4509402744,
metric_compactionsCompletedCount: 56,
metric_numBytesCompactedCount: 20654822724,
metric_numFilesCompactedCount: 184,

 Port 'Make flush decisions per column family' to trunk
 --

 Key: HBASE-10201
 URL: https://issues.apache.org/jira/browse/HBASE-10201
 Project: HBase
  Issue Type: Improvement
  Components: wal
Reporter: Ted Yu
Assignee: zhangduo
 Fix For: 1.0.0, 2.0.0

 Attachments: 3149-trunk-v1.txt, HBASE-10201-0.98.patch, 
 HBASE-10201-0.98_1.patch, HBASE-10201-0.98_2.patch, HBASE-10201-0.99.patch, 
 HBASE-10201.patch, HBASE-10201_1.patch, HBASE-10201_10.patch, 
 HBASE-10201_11.patch, HBASE-10201_12.patch, HBASE-10201_13.patch, 
 HBASE-10201_13.patch, HBASE-10201_14.patch, HBASE-10201_15.patch, 
 HBASE-10201_16.patch, HBASE-10201_17.patch, HBASE-10201_18.patch, 
 HBASE-10201_2.patch, HBASE-10201_3.patch, HBASE-10201_4.patch, 
 HBASE-10201_5.patch, HBASE-10201_6.patch, HBASE-10201_7.patch, 
 HBASE-10201_8.patch, HBASE-10201_9.patch, compactions.png, count.png, io.png, 
 memstore.png


 Currently the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (HBASE-10201) Port 'Make flush decisions per column family' to trunk

2014-12-10 Thread Hadoop QA (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14241695#comment-14241695
]

Hadoop QA commented on HBASE-10201:
---

{color:red}-1 overall{color}. Here are the results of testing the latest
attachment
http://issues.apache.org/jira/secure/attachment/12686240/HBASE-10201_18.patch
against master branch at commit 84b41f8029fd5822832255daeee73ff2283a622a.
ATTACHMENT ID: 12686240