date:20131014


[ 
https://issues.apache.org/jira/browse/HBASE-9751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13793943#comment-13793943
 ] 

Hudson commented on HBASE-9751:
---

SUCCESS: Integrated in HBase-TRUNK #4613 (See 
[https://builds.apache.org/job/HBase-TRUNK/4613/])
HBASE-9751 Excessive readpoints checks in StoreFileScanner (larsh: rev 1531790)
* 
/hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/HFile.java
* 
/hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/HFileReaderV2.java
* 
/hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFile.java
* 
/hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFileScanner.java


 Excessive  readpoints checks in StoreFileScanner
 

 Key: HBASE-9751
 URL: https://issues.apache.org/jira/browse/HBASE-9751
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0, 0.94.12, 0.96.0
Reporter: Vladimir Rodionov
Assignee: Lars Hofhansl
 Fix For: 0.98.0, 0.94.13, 0.96.1

 Attachments: 9751-0.94.txt, 9751-0.94-v2.txt, 9751-trunk.txt, 
 9751-trunk-v2.txt


 It seems that usage of skipKVsNewerThanReadpoint in StoreFileScanner can be 
 greatly reduced or even eliminated all together (HFiles are immutable and no 
 new KVs can be inserted after scanner instance is created). The same is true 
 for MemStoreScanner which checks readpoint on every next() and seek(). Each 
 readpoint check is ThreadLocal.get() and it is quite expensive.  



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9751) Excessive readpoints checks in StoreFileScanner


[ 
https://issues.apache.org/jira/browse/HBASE-9751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13793948#comment-13793948
 ] 

Hudson commented on HBASE-9751:
---

SUCCESS: Integrated in hbase-0.96 #137 (See 
[https://builds.apache.org/job/hbase-0.96/137/])
HBASE-9751 Excessive readpoints checks in StoreFileScanner (larsh: rev 1531791)
* 
/hbase/branches/0.96/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/HFile.java
* 
/hbase/branches/0.96/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/HFileReaderV2.java
* 
/hbase/branches/0.96/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFile.java
* 
/hbase/branches/0.96/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFileScanner.java


 Excessive  readpoints checks in StoreFileScanner
 

 Key: HBASE-9751
 URL: https://issues.apache.org/jira/browse/HBASE-9751
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0, 0.94.12, 0.96.0
Reporter: Vladimir Rodionov
Assignee: Lars Hofhansl
 Fix For: 0.98.0, 0.94.13, 0.96.1

 Attachments: 9751-0.94.txt, 9751-0.94-v2.txt, 9751-trunk.txt, 
 9751-trunk-v2.txt


 It seems that usage of skipKVsNewerThanReadpoint in StoreFileScanner can be 
 greatly reduced or even eliminated all together (HFiles are immutable and no 
 new KVs can be inserted after scanner instance is created). The same is true 
 for MemStoreScanner which checks readpoint on every next() and seek(). Each 
 readpoint check is ThreadLocal.get() and it is quite expensive.  



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Assigned] (HBASE-3149) Make flush decisions per column family

2013-10-14 Thread Gaurav Menghani (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Gaurav Menghani reassigned HBASE-3149:
--

Assignee: Gaurav Menghani

Assigning this to myself, since I have committed this change in the 0.89-fb 
branch. Will upload the patch shortly.

 Make flush decisions per column family
 --

 Key: HBASE-3149
 URL: https://issues.apache.org/jira/browse/HBASE-3149
 Project: HBase
  Issue Type: Improvement
  Components: regionserver
Reporter: Karthik Ranganathan
Assignee: Gaurav Menghani
Priority: Critical

 Today, the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-3727) MultiHFileOutputFormat

2013-10-14 Thread Viral Bajaria (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13793980#comment-13793980
 ] 

Viral Bajaria commented on HBASE-3727:
--

+1 to what James says here. We also have a need for something similar. Is it 
possible to include this patch in next release ? Or are there any concerns ?

 MultiHFileOutputFormat
 --

 Key: HBASE-3727
 URL: https://issues.apache.org/jira/browse/HBASE-3727
 Project: HBase
  Issue Type: New Feature
Reporter: Andrew Purtell
Priority: Minor
 Attachments: MultiHFileOutputFormat.java, MultiHFileOutputFormat.java


 Like MultiTableOutputFormat, but outputting HFiles. Key is tablename as an 
 IBW. Creates sub-writers (code cut and pasted from HFileOutputFormat) on 
 demand that produce HFiles in per-table subdirectories of the configured 
 output path. Does not currently support partitioning for existing tables / 
 incremental update.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9605) Allow AggregationClient to skip specifying column family for row count aggregate

2013-10-14 Thread chendihao (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-9605?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

chendihao updated HBASE-9605:
-

Attachment: 0605-0.94.patch

patch for 0.94

 Allow AggregationClient to skip specifying column family for row count 
 aggregate
 

 Key: HBASE-9605
 URL: https://issues.apache.org/jira/browse/HBASE-9605
 Project: HBase
  Issue Type: Improvement
Reporter: Ted Yu
Assignee: Ted Yu
 Fix For: 0.98.0

 Attachments: 0605-0.94.patch, 9605-v1.txt


 For rowcounter job, column family is not required as input parameter.
 AggregationClient requires the specification of one column family:
 {code}
 } else if (scan.getFamilyMap().size() != 1) {
   throw new IOException(There must be only one family.);
 }
 {code}
 We should relax the above requirement for row count aggregate where 
 FirstKeyOnlyFilter would be automatically applied.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9605) Allow AggregationClient to skip specifying column family for row count aggregate


[ 
https://issues.apache.org/jira/browse/HBASE-9605?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794016#comment-13794016
 ] 

Hadoop QA commented on HBASE-9605:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12608258/0605-0.94.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified tests.

{color:red}-1 patch{color}.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7536//console

This message is automatically generated.

 Allow AggregationClient to skip specifying column family for row count 
 aggregate
 

 Key: HBASE-9605
 URL: https://issues.apache.org/jira/browse/HBASE-9605
 Project: HBase
  Issue Type: Improvement
Reporter: Ted Yu
Assignee: Ted Yu
 Fix For: 0.98.0

 Attachments: 0605-0.94.patch, 9605-v1.txt


 For rowcounter job, column family is not required as input parameter.
 AggregationClient requires the specification of one column family:
 {code}
 } else if (scan.getFamilyMap().size() != 1) {
   throw new IOException(There must be only one family.);
 }
 {code}
 We should relax the above requirement for row count aggregate where 
 FirstKeyOnlyFilter would be automatically applied.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9751) Excessive readpoints checks in StoreFileScanner


[ 
https://issues.apache.org/jira/browse/HBASE-9751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794078#comment-13794078
 ] 

Hudson commented on HBASE-9751:
---

SUCCESS: Integrated in hbase-0.96-hadoop2 #87 (See 
[https://builds.apache.org/job/hbase-0.96-hadoop2/87/])
HBASE-9751 Excessive readpoints checks in StoreFileScanner (larsh: rev 1531791)
* 
/hbase/branches/0.96/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/HFile.java
* 
/hbase/branches/0.96/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/HFileReaderV2.java
* 
/hbase/branches/0.96/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFile.java
* 
/hbase/branches/0.96/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFileScanner.java


 Excessive  readpoints checks in StoreFileScanner
 

 Key: HBASE-9751
 URL: https://issues.apache.org/jira/browse/HBASE-9751
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0, 0.94.12, 0.96.0
Reporter: Vladimir Rodionov
Assignee: Lars Hofhansl
 Fix For: 0.98.0, 0.94.13, 0.96.1

 Attachments: 9751-0.94.txt, 9751-0.94-v2.txt, 9751-trunk.txt, 
 9751-trunk-v2.txt


 It seems that usage of skipKVsNewerThanReadpoint in StoreFileScanner can be 
 greatly reduced or even eliminated all together (HFiles are immutable and no 
 new KVs can be inserted after scanner instance is created). The same is true 
 for MemStoreScanner which checks readpoint on every next() and seek(). Each 
 readpoint check is ThreadLocal.get() and it is quite expensive.  



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9751) Excessive readpoints checks in StoreFileScanner


[ 
https://issues.apache.org/jira/browse/HBASE-9751?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794093#comment-13794093
 ] 

Hudson commented on HBASE-9751:
---

SUCCESS: Integrated in HBase-TRUNK-on-Hadoop-2.0.0 #792 (See 
[https://builds.apache.org/job/HBase-TRUNK-on-Hadoop-2.0.0/792/])
HBASE-9751 Excessive readpoints checks in StoreFileScanner (larsh: rev 1531790)
* 
/hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/HFile.java
* 
/hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/io/hfile/HFileReaderV2.java
* 
/hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFile.java
* 
/hbase/trunk/hbase-server/src/main/java/org/apache/hadoop/hbase/regionserver/StoreFileScanner.java


 Excessive  readpoints checks in StoreFileScanner
 

 Key: HBASE-9751
 URL: https://issues.apache.org/jira/browse/HBASE-9751
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0, 0.94.12, 0.96.0
Reporter: Vladimir Rodionov
Assignee: Lars Hofhansl
 Fix For: 0.98.0, 0.94.13, 0.96.1

 Attachments: 9751-0.94.txt, 9751-0.94-v2.txt, 9751-trunk.txt, 
 9751-trunk-v2.txt


 It seems that usage of skipKVsNewerThanReadpoint in StoreFileScanner can be 
 greatly reduced or even eliminated all together (HFiles are immutable and no 
 new KVs can be inserted after scanner instance is created). The same is true 
 for MemStoreScanner which checks readpoint on every next() and seek(). Each 
 readpoint check is ThreadLocal.get() and it is quite expensive.  



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9747) PrefixFilter with OR condition gives wrong results

2013-10-14 Thread Deepa Remesh (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-9747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794131#comment-13794131
 ] 

Deepa Remesh commented on HBASE-9747:
-

The patch works for me too. Thanks.

 PrefixFilter with OR condition gives wrong results
 --

 Key: HBASE-9747
 URL: https://issues.apache.org/jira/browse/HBASE-9747
 Project: HBase
  Issue Type: Bug
  Components: Filters
Affects Versions: 0.95.2, 0.94.9
Reporter: Deepa Remesh
Assignee: Aditya Kishore
 Attachments: HBASE-9747_0.94.patch, HBASE-9747.patch


 PrefixFilter when used with a SingleColumnValueFilter with an OR condition 
 gives wrong results. In below example, each filter when evaluated separately 
 gives 1 row each. The OR condition with the two filters gives 3 rows instead 
 of 2. Repro below:
 create 't1', 'f1'
 put 't1','a1','f1:q2','111'
 put 't1','b1','f1:q1','112'
 put 't1','c1','f1:q1','113'
 hbase(main):020:0 scan 't1', {FILTER = PrefixFilter ('b') OR 
 SingleColumnValueFilter('f1', 'q1', =, 'binary:113')}
 ROWCOLUMN+CELL
  a1column=f1:q2, 
 timestamp=1381468905492, value=111
  b1column=f1:q1, 
 timestamp=1381468905518, value=112
  c1column=f1:q1, 
 timestamp=1381468905549, value=113
 3 row(s) in 0.1020 seconds
 hbase(main):021:0 scan 't1', {FILTER = PrefixFilter ('b')}
 ROWCOLUMN+CELL
  b1column=f1:q1, 
 timestamp=1381468905518, value=112
 1 row(s) in 0.0150 seconds
 hbase(main):002:0 scan 't1', {FILTER = SingleColumnValueFilter('f1', 'q1', 
 =, 'binary:113')}
 ROWCOLUMN+CELL
  c1column=f1:q1, 
 timestamp=1381469178679, value=113
 1 row(s) in 0.0140 seconds



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9137) Add Tag dictionary in WAL compression


[ 
https://issues.apache.org/jira/browse/HBASE-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794140#comment-13794140
 ] 

Ted Yu commented on HBASE-9137:
---

lgtm.

 Add Tag dictionary in WAL compression
 -

 Key: HBASE-9137
 URL: https://issues.apache.org/jira/browse/HBASE-9137
 Project: HBase
  Issue Type: Sub-task
Affects Versions: 0.98.0
Reporter: ramkrishna.s.vasudevan
Assignee: Anoop Sam John
 Fix For: 0.98.0

 Attachments: HBASE-9137.patch, HBASE-9137_V2.patch, 
 HBASE-9137_V3.patch


 We can add tag dictionary like we have one for rowdictionary, 
 familydictionary.  But this has to be done after stabilizing HBASE-7391.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-5487) Generic framework for Master-coordinated tasks


[ 
https://issues.apache.org/jira/browse/HBASE-5487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794232#comment-13794232
 ] 

Jimmy Xiang commented on HBASE-5487:


Good. I think we are on the same page. 

bq.  just using it as a reliable storage.
We probably won't use ZK as a pure storage. Meta table + cache is a good 
alternative.

 Generic framework for Master-coordinated tasks
 --

 Key: HBASE-5487
 URL: https://issues.apache.org/jira/browse/HBASE-5487
 Project: HBase
  Issue Type: New Feature
  Components: master, regionserver, Zookeeper
Affects Versions: 0.94.0
Reporter: Mubarak Seyed
Priority: Critical
 Attachments: Region management in Master.pdf


 Need a framework to execute master-coordinated tasks in a fault-tolerant 
 manner. 
 Master-coordinated tasks such as online-scheme change and delete-range 
 (deleting region(s) based on start/end key) can make use of this framework.
 The advantages of framework are
 1. Eliminate repeated code in Master, ZooKeeper tracker and Region-server for 
 master-coordinated tasks
 2. Ability to abstract the common functions across Master - ZK and RS - ZK
 3. Easy to plugin new master-coordinated tasks without adding code to core 
 components



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-3149) Make flush decisions per column family


[ 
https://issues.apache.org/jira/browse/HBASE-3149?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794254#comment-13794254
 ] 

stack commented on HBASE-3149:
--

Thanks [~gaurav.menghani]

 Make flush decisions per column family
 --

 Key: HBASE-3149
 URL: https://issues.apache.org/jira/browse/HBASE-3149
 Project: HBase
  Issue Type: Improvement
  Components: regionserver
Reporter: Karthik Ranganathan
Assignee: Gaurav Menghani
Priority: Critical

 Today, the flush decision is made using the aggregate size of all column 
 families. When large and small column families co-exist, this causes many 
 small flushes of the smaller CF. We need to make per-CF flush decisions.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HBASE-9757) Reenable fast region move in SlowDeterministicMonkey

Jimmy Xiang created HBASE-9757:
--

 Summary: Reenable fast region move in SlowDeterministicMonkey
 Key: HBASE-9757
 URL: https://issues.apache.org/jira/browse/HBASE-9757
 Project: HBase
  Issue Type: Task
  Components: test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang


HBASE-9338 slows down the region move CM a little so that ITBLL is green for 
0.96.0 RC. We should revert the change and make sure the test is still green.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9757) Reenable fast region move in SlowDeterministicMonkey


[ 
https://issues.apache.org/jira/browse/HBASE-9757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794259#comment-13794259
 ] 

Jimmy Xiang commented on HBASE-9757:


With 0.96 tip, after fast region move re-enabled, ITBLL is green for me.  
However, there seems to be some issue with trunk.

 Reenable fast region move in SlowDeterministicMonkey
 

 Key: HBASE-9757
 URL: https://issues.apache.org/jira/browse/HBASE-9757
 Project: HBase
  Issue Type: Task
  Components: test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang

 HBASE-9338 slows down the region move CM a little so that ITBLL is green for 
 0.96.0 RC. We should revert the change and make sure the test is still green.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Created] (HBASE-9758) Log missing rows and their regions if ITBLL fails

Jimmy Xiang created HBASE-9758:
--

 Summary: Log missing rows and their regions if ITBLL fails
 Key: HBASE-9758
 URL: https://issues.apache.org/jira/browse/HBASE-9758
 Project: HBase
  Issue Type: Task
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor


Currently, when ITBLL fails, the log only shows how many rows are missing. We 
have to go to the MR log to find the rows. The row key is some binary string 
that is hard to map to a region.  It will be helpful to log the missing rows in 
the ITBLL log and the corresponding regions that hold these rows.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9758) Log missing rows and their regions if ITBLL fails


 [ 
https://issues.apache.org/jira/browse/HBASE-9758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-9758:
---

Attachment: trunk-9758.patch

 Log missing rows and their regions if ITBLL fails
 -

 Key: HBASE-9758
 URL: https://issues.apache.org/jira/browse/HBASE-9758
 Project: HBase
  Issue Type: Task
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Attachments: trunk-9758.patch


 Currently, when ITBLL fails, the log only shows how many rows are missing. We 
 have to go to the MR log to find the rows. The row key is some binary string 
 that is hard to map to a region.  It will be helpful to log the missing rows 
 in the ITBLL log and the corresponding regions that hold these rows.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9757) Reenable fast region move in SlowDeterministicMonkey


[ 
https://issues.apache.org/jira/browse/HBASE-9757?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794264#comment-13794264
 ] 

stack commented on HBASE-9757:
--

I'd say lets figure the issues we have first before we up the ante.  After we 
figure whats up and then a  couple of greens, lets up the pressure.  We should 
reenable your fast move as you suggest here.  Should we then try reenabling 
online-config tests as you may have fixed what was up w/ it?  Then we should 
move to kills that do in the RS+DN, not just RS (As Elliott suggested Friday).  
That'll be a new set of issues for sure.  Good on you Jimmy.

 Reenable fast region move in SlowDeterministicMonkey
 

 Key: HBASE-9757
 URL: https://issues.apache.org/jira/browse/HBASE-9757
 Project: HBase
  Issue Type: Task
  Components: test
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang

 HBASE-9338 slows down the region move CM a little so that ITBLL is green for 
 0.96.0 RC. We should revert the change and make sure the test is still green.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9758) Log missing rows and their regions if ITBLL fails


[ 
https://issues.apache.org/jira/browse/HBASE-9758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794269#comment-13794269
 ] 

stack commented on HBASE-9758:
--

+1 on patch for trunk and 0.96.  I suppose the region could have changed since 
the fail but this is way better than what we currently have.

 Log missing rows and their regions if ITBLL fails
 -

 Key: HBASE-9758
 URL: https://issues.apache.org/jira/browse/HBASE-9758
 Project: HBase
  Issue Type: Task
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Attachments: trunk-9758.patch


 Currently, when ITBLL fails, the log only shows how many rows are missing. We 
 have to go to the MR log to find the rows. The row key is some binary string 
 that is hard to map to a region.  It will be helpful to log the missing rows 
 in the ITBLL log and the corresponding regions that hold these rows.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9758) Log missing rows and their regions if ITBLL fails


 [ 
https://issues.apache.org/jira/browse/HBASE-9758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-9758:
---

Status: Patch Available  (was: Open)

This patch logs at most 50 missing rows and the corresponding regions that hold 
these rows.

 Log missing rows and their regions if ITBLL fails
 -

 Key: HBASE-9758
 URL: https://issues.apache.org/jira/browse/HBASE-9758
 Project: HBase
  Issue Type: Task
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Attachments: trunk-9758.patch


 Currently, when ITBLL fails, the log only shows how many rows are missing. We 
 have to go to the MR log to find the rows. The row key is some binary string 
 that is hard to map to a region.  It will be helpful to log the missing rows 
 in the ITBLL log and the corresponding regions that hold these rows.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-7379) Port '[89-fb] prevent OOM possibility due to per connection responseQueue being unbounded' to trunk


[ 
https://issues.apache.org/jira/browse/HBASE-7379?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794279#comment-13794279
 ] 

Ted Yu commented on HBASE-7379:
---

Looks like this is no longer needed.

 Port '[89-fb] prevent OOM possibility due to per connection responseQueue 
 being unbounded' to trunk
 ---

 Key: HBASE-7379
 URL: https://issues.apache.org/jira/browse/HBASE-7379
 Project: HBase
  Issue Type: Sub-task
Reporter: Ted Yu
Priority: Minor
 Attachments: 7379-trunk.txt, 7379-trunk-v2.txt


 HBASE-6728 ported '[89-fb] prevent OOM possibility due to per connection 
 responseQueue being unbounded' to 0.94 branch
 This issue is to port it to trunk



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9420) Math.max() on syncedTillHere lacks synchronization


 [ 
https://issues.apache.org/jira/browse/HBASE-9420?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-9420:
--

Status: Open  (was: Patch Available)

 Math.max() on syncedTillHere lacks synchronization
 --

 Key: HBASE-9420
 URL: https://issues.apache.org/jira/browse/HBASE-9420
 Project: HBase
  Issue Type: Bug
Reporter: Ted Yu
Assignee: Ted Yu
Priority: Trivial
 Fix For: 0.98.0

 Attachments: 9420-v1.txt, 9420-v2.txt


 In FSHlog#syncer(), around line 1080:
 {code}
   this.syncedTillHere = Math.max(this.syncedTillHere, doneUpto);
 {code}
 Assignment to syncedTillHere after computing max value is not protected by 
 proper synchronization.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9755) Cannot run classes in hbase-server tests jar from command line


[ 
https://issues.apache.org/jira/browse/HBASE-9755?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794291#comment-13794291
 ] 

Nick Dimiduk commented on HBASE-9755:
-

Works for me too.

+1

 Cannot run classes in hbase-server tests jar from command line 
 ---

 Key: HBASE-9755
 URL: https://issues.apache.org/jira/browse/HBASE-9755
 Project: HBase
  Issue Type: Bug
Reporter: Enis Soztutar
Assignee: Enis Soztutar
 Fix For: 0.98.0, 0.96.1

 Attachments: hbase-9755_v1.patch


 cached_classpath.txt no longer contains references to 
 hbase-server-version-tests.jar. 
 This prevents to run classes under hbase-server/src/test from bin/hbase 
 script. 



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-5487) Generic framework for Master-coordinated tasks

2013-10-14 Thread Sergey Shelukhin (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-5487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794304#comment-13794304
 ] 

Sergey Shelukhin commented on HBASE-5487:
-

[~jxiang] by janitor, I mean not timeout monitor, but something picking up 
timeouts of non-master ops like open.
It's a rare case and probably never happens in int tests, but there can be a 
case where RS is taking too long to open.

 Generic framework for Master-coordinated tasks
 --

 Key: HBASE-5487
 URL: https://issues.apache.org/jira/browse/HBASE-5487
 Project: HBase
  Issue Type: New Feature
  Components: master, regionserver, Zookeeper
Affects Versions: 0.94.0
Reporter: Mubarak Seyed
Priority: Critical
 Attachments: Region management in Master.pdf


 Need a framework to execute master-coordinated tasks in a fault-tolerant 
 manner. 
 Master-coordinated tasks such as online-scheme change and delete-range 
 (deleting region(s) based on start/end key) can make use of this framework.
 The advantages of framework are
 1. Eliminate repeated code in Master, ZooKeeper tracker and Region-server for 
 master-coordinated tasks
 2. Ability to abstract the common functions across Master - ZK and RS - ZK
 3. Easy to plugin new master-coordinated tasks without adding code to core 
 components



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9692) TestOrderedBytes doesn't display the arrays correctly


 [ 
https://issues.apache.org/jira/browse/HBASE-9692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Dimiduk updated HBASE-9692:


Component/s: (was: test)

 TestOrderedBytes doesn't display the arrays correctly
 -

 Key: HBASE-9692
 URL: https://issues.apache.org/jira/browse/HBASE-9692
 Project: HBase
  Issue Type: Test
Affects Versions: 0.98.0, 0.96.0
Reporter: Jean-Marc Spaggiari
Assignee: Jean-Marc Spaggiari
Priority: Minor
 Attachments: HBASE-9692-v0-trunk.patch, HBASE-9692-v1-trunk.patch, 
 HBASE-9692-v1-trunk.patch


 TestOrderedBytes should call Arrays.toString(decoded) instead of directly 
 output decoded.
 Today output will be something like:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [B@4a8c1dd9, [B@34fbb7cb, ASCENDING: actual array was null
 We want it to be:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [], [], ASCENDING: actual array was null



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9692) TestOrderedBytes doesn't display the arrays correctly


 [ 
https://issues.apache.org/jira/browse/HBASE-9692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Dimiduk updated HBASE-9692:


Issue Type: Test  (was: Bug)

 TestOrderedBytes doesn't display the arrays correctly
 -

 Key: HBASE-9692
 URL: https://issues.apache.org/jira/browse/HBASE-9692
 Project: HBase
  Issue Type: Test
Affects Versions: 0.98.0, 0.96.0
Reporter: Jean-Marc Spaggiari
Assignee: Jean-Marc Spaggiari
Priority: Minor
 Attachments: HBASE-9692-v0-trunk.patch, HBASE-9692-v1-trunk.patch, 
 HBASE-9692-v1-trunk.patch


 TestOrderedBytes should call Arrays.toString(decoded) instead of directly 
 output decoded.
 Today output will be something like:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [B@4a8c1dd9, [B@34fbb7cb, ASCENDING: actual array was null
 We want it to be:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [], [], ASCENDING: actual array was null



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9692) TestOrderedBytes doesn't display the arrays correctly


 [ 
https://issues.apache.org/jira/browse/HBASE-9692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Dimiduk updated HBASE-9692:


Component/s: test

 TestOrderedBytes doesn't display the arrays correctly
 -

 Key: HBASE-9692
 URL: https://issues.apache.org/jira/browse/HBASE-9692
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.98.0, 0.96.0
Reporter: Jean-Marc Spaggiari
Assignee: Jean-Marc Spaggiari
Priority: Minor
 Attachments: HBASE-9692-v0-trunk.patch, HBASE-9692-v1-trunk.patch, 
 HBASE-9692-v1-trunk.patch


 TestOrderedBytes should call Arrays.toString(decoded) instead of directly 
 output decoded.
 Today output will be something like:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [B@4a8c1dd9, [B@34fbb7cb, ASCENDING: actual array was null
 We want it to be:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [], [], ASCENDING: actual array was null



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9692) TestOrderedBytes doesn't display the arrays correctly


 [ 
https://issues.apache.org/jira/browse/HBASE-9692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Dimiduk updated HBASE-9692:


Fix Version/s: 0.96.1
   0.98.0

 TestOrderedBytes doesn't display the arrays correctly
 -

 Key: HBASE-9692
 URL: https://issues.apache.org/jira/browse/HBASE-9692
 Project: HBase
  Issue Type: Test
Affects Versions: 0.98.0, 0.96.0
Reporter: Jean-Marc Spaggiari
Assignee: Jean-Marc Spaggiari
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: HBASE-9692-v0-trunk.patch, HBASE-9692-v1-trunk.patch, 
 HBASE-9692-v1-trunk.patch


 TestOrderedBytes should call Arrays.toString(decoded) instead of directly 
 output decoded.
 Today output will be something like:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [B@4a8c1dd9, [B@34fbb7cb, ASCENDING: actual array was null
 We want it to be:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [], [], ASCENDING: actual array was null



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9692) TestOrderedBytes doesn't display the arrays correctly


 [ 
https://issues.apache.org/jira/browse/HBASE-9692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Dimiduk updated HBASE-9692:


  Resolution: Fixed
Hadoop Flags: Reviewed
  Status: Resolved  (was: Patch Available)

 TestOrderedBytes doesn't display the arrays correctly
 -

 Key: HBASE-9692
 URL: https://issues.apache.org/jira/browse/HBASE-9692
 Project: HBase
  Issue Type: Test
Affects Versions: 0.98.0, 0.96.0
Reporter: Jean-Marc Spaggiari
Assignee: Jean-Marc Spaggiari
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: HBASE-9692-v0-trunk.patch, HBASE-9692-v1-trunk.patch, 
 HBASE-9692-v1-trunk.patch


 TestOrderedBytes should call Arrays.toString(decoded) instead of directly 
 output decoded.
 Today output will be something like:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [B@4a8c1dd9, [B@34fbb7cb, ASCENDING: actual array was null
 We want it to be:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [], [], ASCENDING: actual array was null



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9692) TestOrderedBytes doesn't display the arrays correctly


[ 
https://issues.apache.org/jira/browse/HBASE-9692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794307#comment-13794307
 ] 

Nick Dimiduk commented on HBASE-9692:
-

Committed to 0.96 and trunk. Thanks for the patch JMS.

 TestOrderedBytes doesn't display the arrays correctly
 -

 Key: HBASE-9692
 URL: https://issues.apache.org/jira/browse/HBASE-9692
 Project: HBase
  Issue Type: Test
Affects Versions: 0.98.0, 0.96.0
Reporter: Jean-Marc Spaggiari
Assignee: Jean-Marc Spaggiari
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: HBASE-9692-v0-trunk.patch, HBASE-9692-v1-trunk.patch, 
 HBASE-9692-v1-trunk.patch


 TestOrderedBytes should call Arrays.toString(decoded) instead of directly 
 output decoded.
 Today output will be something like:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [B@4a8c1dd9, [B@34fbb7cb, ASCENDING: actual array was null
 We want it to be:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [], [], ASCENDING: actual array was null



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-5487) Generic framework for Master-coordinated tasks


[ 
https://issues.apache.org/jira/browse/HBASE-5487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794318#comment-13794318
 ] 

Jimmy Xiang commented on HBASE-5487:


I see.

 Generic framework for Master-coordinated tasks
 --

 Key: HBASE-5487
 URL: https://issues.apache.org/jira/browse/HBASE-5487
 Project: HBase
  Issue Type: New Feature
  Components: master, regionserver, Zookeeper
Affects Versions: 0.94.0
Reporter: Mubarak Seyed
Priority: Critical
 Attachments: Region management in Master.pdf


 Need a framework to execute master-coordinated tasks in a fault-tolerant 
 manner. 
 Master-coordinated tasks such as online-scheme change and delete-range 
 (deleting region(s) based on start/end key) can make use of this framework.
 The advantages of framework are
 1. Eliminate repeated code in Master, ZooKeeper tracker and Region-server for 
 master-coordinated tasks
 2. Ability to abstract the common functions across Master - ZK and RS - ZK
 3. Easy to plugin new master-coordinated tasks without adding code to core 
 components



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9570) With AccessDeniedException, HBase shell would be better to just display the error message to be user friendly


[ 
https://issues.apache.org/jira/browse/HBASE-9570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794324#comment-13794324
 ] 

Nick Dimiduk commented on HBASE-9570:
-

Forgive my JRuby ignorance but this patch looks like it's unwrapping 
TableNotFound, NoSuchColumnFamily, and TableExists exceptions an extra time. Is 
that intentional?

 With AccessDeniedException, HBase shell would be better to just display the 
 error message to be user friendly
 -

 Key: HBASE-9570
 URL: https://issues.apache.org/jira/browse/HBASE-9570
 Project: HBase
  Issue Type: Improvement
  Components: shell
Reporter: Yang Wang
Priority: Minor
 Attachments: HBASE-9570, HBASE-9570.01.patch, HBASE-9570.02.patch


 When access unauthorized resource like table, AccessDeniedException will be 
 thrown. In HBase shell, the error message with stack trace will be displayed 
 as follows. It would be better to just display the error message avoiding the 
 stack trace to be user friendly. 
 {noformat}
 ERROR: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient 
 permissions for user 'u1' for scanner open on table t1
 at 
 org.apache.hadoop.hbase.security.access.AccessController.preScannerOpen(AccessController.java:1116)
 at 
 org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.preScannerOpen(RegionCoprocessorHost.java:1293)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.scan(HRegionServer.java:3026)
 at 
 org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:26971)
 at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2083)
 at 
 org.apache.hadoop.hbase.ipc.RpcServer$CallRunner.run(RpcServer.java:1820)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.consumerLoop(SimpleRpcScheduler.java:165)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.access$000(SimpleRpcScheduler.java:41)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler$1.run(SimpleRpcScheduler.java:113)
 at java.lang.Thread.run(Thread.java:662)
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-7379) Port '[89-fb] prevent OOM possibility due to per connection responseQueue being unbounded' to trunk


 [ 
https://issues.apache.org/jira/browse/HBASE-7379?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-7379:
-

Resolution: Won't Fix
Status: Resolved  (was: Patch Available)

Resolving as 'wont fix' per Ted comment above that it is no longer needed.

 Port '[89-fb] prevent OOM possibility due to per connection responseQueue 
 being unbounded' to trunk
 ---

 Key: HBASE-7379
 URL: https://issues.apache.org/jira/browse/HBASE-7379
 Project: HBase
  Issue Type: Sub-task
Reporter: Ted Yu
Priority: Minor
 Attachments: 7379-trunk.txt, 7379-trunk-v2.txt


 HBASE-6728 ported '[89-fb] prevent OOM possibility due to per connection 
 responseQueue being unbounded' to 0.94 branch
 This issue is to port it to trunk



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9570) With AccessDeniedException, HBase shell would be better to just display the error message to be user friendly


[ 
https://issues.apache.org/jira/browse/HBASE-9570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794330#comment-13794330
 ] 

Nick Dimiduk commented on HBASE-9570:
-

I take it back, this appears to work as previously.

{noformat}
hbase(main):001:0 create 'foo', 'f'
0 row(s) in 6.5330 seconds

= Hbase::Table - foo
hbase(main):002:0 create 'foo', 'f'

ERROR: Table already exists: foo!
...
{noformat}

Will commit later today unless someone objects.

[~stack] do you want this in 0.96?

 With AccessDeniedException, HBase shell would be better to just display the 
 error message to be user friendly
 -

 Key: HBASE-9570
 URL: https://issues.apache.org/jira/browse/HBASE-9570
 Project: HBase
  Issue Type: Improvement
  Components: shell
Reporter: Yang Wang
Priority: Minor
 Attachments: HBASE-9570, HBASE-9570.01.patch, HBASE-9570.02.patch


 When access unauthorized resource like table, AccessDeniedException will be 
 thrown. In HBase shell, the error message with stack trace will be displayed 
 as follows. It would be better to just display the error message avoiding the 
 stack trace to be user friendly. 
 {noformat}
 ERROR: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient 
 permissions for user 'u1' for scanner open on table t1
 at 
 org.apache.hadoop.hbase.security.access.AccessController.preScannerOpen(AccessController.java:1116)
 at 
 org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.preScannerOpen(RegionCoprocessorHost.java:1293)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.scan(HRegionServer.java:3026)
 at 
 org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:26971)
 at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2083)
 at 
 org.apache.hadoop.hbase.ipc.RpcServer$CallRunner.run(RpcServer.java:1820)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.consumerLoop(SimpleRpcScheduler.java:165)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.access$000(SimpleRpcScheduler.java:41)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler$1.run(SimpleRpcScheduler.java:113)
 at java.lang.Thread.run(Thread.java:662)
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9747) PrefixFilter with OR condition gives wrong results

2013-10-14 Thread Aditya Kishore (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-9747?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794334#comment-13794334
 ] 

Aditya Kishore commented on HBASE-9747:
---

Thanks Julian and Deepa for verifying the patch.

 PrefixFilter with OR condition gives wrong results
 --

 Key: HBASE-9747
 URL: https://issues.apache.org/jira/browse/HBASE-9747
 Project: HBase
  Issue Type: Bug
  Components: Filters
Affects Versions: 0.95.2, 0.94.9
Reporter: Deepa Remesh
Assignee: Aditya Kishore
 Attachments: HBASE-9747_0.94.patch, HBASE-9747.patch


 PrefixFilter when used with a SingleColumnValueFilter with an OR condition 
 gives wrong results. In below example, each filter when evaluated separately 
 gives 1 row each. The OR condition with the two filters gives 3 rows instead 
 of 2. Repro below:
 create 't1', 'f1'
 put 't1','a1','f1:q2','111'
 put 't1','b1','f1:q1','112'
 put 't1','c1','f1:q1','113'
 hbase(main):020:0 scan 't1', {FILTER = PrefixFilter ('b') OR 
 SingleColumnValueFilter('f1', 'q1', =, 'binary:113')}
 ROWCOLUMN+CELL
  a1column=f1:q2, 
 timestamp=1381468905492, value=111
  b1column=f1:q1, 
 timestamp=1381468905518, value=112
  c1column=f1:q1, 
 timestamp=1381468905549, value=113
 3 row(s) in 0.1020 seconds
 hbase(main):021:0 scan 't1', {FILTER = PrefixFilter ('b')}
 ROWCOLUMN+CELL
  b1column=f1:q1, 
 timestamp=1381468905518, value=112
 1 row(s) in 0.0150 seconds
 hbase(main):002:0 scan 't1', {FILTER = SingleColumnValueFilter('f1', 'q1', 
 =, 'binary:113')}
 ROWCOLUMN+CELL
  c1column=f1:q1, 
 timestamp=1381469178679, value=113
 1 row(s) in 0.0140 seconds



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9570) With AccessDeniedException, HBase shell would be better to just display the error message to be user friendly


 [ 
https://issues.apache.org/jira/browse/HBASE-9570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Dimiduk updated HBASE-9570:


Fix Version/s: 0.98.0

 With AccessDeniedException, HBase shell would be better to just display the 
 error message to be user friendly
 -

 Key: HBASE-9570
 URL: https://issues.apache.org/jira/browse/HBASE-9570
 Project: HBase
  Issue Type: Improvement
  Components: shell
Reporter: Yang Wang
Priority: Minor
 Fix For: 0.98.0

 Attachments: HBASE-9570, HBASE-9570.01.patch, HBASE-9570.02.patch


 When access unauthorized resource like table, AccessDeniedException will be 
 thrown. In HBase shell, the error message with stack trace will be displayed 
 as follows. It would be better to just display the error message avoiding the 
 stack trace to be user friendly. 
 {noformat}
 ERROR: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient 
 permissions for user 'u1' for scanner open on table t1
 at 
 org.apache.hadoop.hbase.security.access.AccessController.preScannerOpen(AccessController.java:1116)
 at 
 org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.preScannerOpen(RegionCoprocessorHost.java:1293)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.scan(HRegionServer.java:3026)
 at 
 org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:26971)
 at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2083)
 at 
 org.apache.hadoop.hbase.ipc.RpcServer$CallRunner.run(RpcServer.java:1820)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.consumerLoop(SimpleRpcScheduler.java:165)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.access$000(SimpleRpcScheduler.java:41)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler$1.run(SimpleRpcScheduler.java:113)
 at java.lang.Thread.run(Thread.java:662)
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9570) With AccessDeniedException, HBase shell would be better to just display the error message to be user friendly


[ 
https://issues.apache.org/jira/browse/HBASE-9570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794341#comment-13794341
 ] 

stack commented on HBASE-9570:
--

[~ndimiduk] please.  usability.

 With AccessDeniedException, HBase shell would be better to just display the 
 error message to be user friendly
 -

 Key: HBASE-9570
 URL: https://issues.apache.org/jira/browse/HBASE-9570
 Project: HBase
  Issue Type: Improvement
  Components: shell
Reporter: Yang Wang
Priority: Minor
 Fix For: 0.98.0

 Attachments: HBASE-9570, HBASE-9570.01.patch, HBASE-9570.02.patch


 When access unauthorized resource like table, AccessDeniedException will be 
 thrown. In HBase shell, the error message with stack trace will be displayed 
 as follows. It would be better to just display the error message avoiding the 
 stack trace to be user friendly. 
 {noformat}
 ERROR: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient 
 permissions for user 'u1' for scanner open on table t1
 at 
 org.apache.hadoop.hbase.security.access.AccessController.preScannerOpen(AccessController.java:1116)
 at 
 org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.preScannerOpen(RegionCoprocessorHost.java:1293)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.scan(HRegionServer.java:3026)
 at 
 org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:26971)
 at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2083)
 at 
 org.apache.hadoop.hbase.ipc.RpcServer$CallRunner.run(RpcServer.java:1820)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.consumerLoop(SimpleRpcScheduler.java:165)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.access$000(SimpleRpcScheduler.java:41)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler$1.run(SimpleRpcScheduler.java:113)
 at java.lang.Thread.run(Thread.java:662)
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9758) Log missing rows and their regions if ITBLL fails


[ 
https://issues.apache.org/jira/browse/HBASE-9758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794354#comment-13794354
 ] 

Hadoop QA commented on HBASE-9758:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12608301/trunk-9758.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 3 new 
or modified tests.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

{color:red}-1 site{color}.  The patch appears to cause mvn site goal to 
fail.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7537//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7537//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7537//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7537//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7537//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7537//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7537//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7537//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7537//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7537//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7537//console

This message is automatically generated.

 Log missing rows and their regions if ITBLL fails
 -

 Key: HBASE-9758
 URL: https://issues.apache.org/jira/browse/HBASE-9758
 Project: HBase
  Issue Type: Task
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Attachments: trunk-9758.patch


 Currently, when ITBLL fails, the log only shows how many rows are missing. We 
 have to go to the MR log to find the rows. The row key is some binary string 
 that is hard to map to a region.  It will be helpful to log the missing rows 
 in the ITBLL log and the corresponding regions that hold these rows.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9562) Make HLogPE run against configured FS

2013-10-14 Thread Matteo Bertozzi (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-9562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matteo Bertozzi updated HBASE-9562:
---

Attachment: HBASE-9562-addendum.patch

I'm late on this one, but we've a method FSUtils.setFsDefault() that set both 
(fs.default.name/fs.defaultFS) properties... also to get the root dir we should 
use FSUtils.getRootDir() that returns the qualified path needed by the 
FileSystem class.

(attached a patch that uses the methods in FSUtils)

 Make HLogPE run against configured FS
 -

 Key: HBASE-9562
 URL: https://issues.apache.org/jira/browse/HBASE-9562
 Project: HBase
  Issue Type: Sub-task
  Components: Performance, wal
Reporter: stack
Assignee: stack
 Fix For: 0.96.0

 Attachments: 9562.txt, HBASE-9562-addendum.patch


 HLogPE will not pick up the configured fs from *.xml files; always goes 
 against localfs
 Simple patch:
 {code}
 diff --git 
 a/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
  
 b/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvalu
 index 9c07046..f73258b 100644
 --- 
 a/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
 +++ 
 b/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
 @@ -47,6 +47,7 @@ import org.apache.hadoop.hbase.regionserver.wal.HLog.Entry;
  import org.apache.hadoop.hbase.util.Bytes;
  import org.apache.hadoop.util.Tool;
  import org.apache.hadoop.util.ToolRunner;
 +import org.apache.hadoop.hbase.HConstants;
  /**
   * This class runs performance benchmarks for {@link HLog}.
 @@ -170,8 +171,12 @@ public final class HLogPerformanceEvaluation extends 
 Configured implements Tool
  }
  // Run HLog Performance Evaluation
 +// First set the fs from configs.  Do it for both configs in case we
 +// are on hadoop1
 +getConf().set(fs.default.name, getConf().get(HConstants.HBASE_DIR));
 +getConf().set(fs.defaultFS, getConf().get(HConstants.HBASE_DIR));
  FileSystem fs = FileSystem.get(getConf());
 -LOG.info( + fs);
 +LOG.info(FileSystem:  + fs);
  try {
if (rootRegionDir == null) {
  rootRegionDir = 
 TEST_UTIL.getDataTestDir(HLogPerformanceEvaluation);
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9753) Excessive readpoints checks in MemstoreScanner


 [ 
https://issues.apache.org/jira/browse/HBASE-9753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-9753:
--

Attachment: 9753-v1.txt

Something like this ?

 Excessive readpoints checks in MemstoreScanner
 --

 Key: HBASE-9753
 URL: https://issues.apache.org/jira/browse/HBASE-9753
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Vladimir Rodionov
 Attachments: 9753-v1.txt


 Brought up by [~vrodionov] on the mailing list. See also HBASE-9751.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9570) With AccessDeniedException, HBase shell would be better to just display the error message to be user friendly


 [ 
https://issues.apache.org/jira/browse/HBASE-9570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Dimiduk updated HBASE-9570:


Assignee: Yang Wang

 With AccessDeniedException, HBase shell would be better to just display the 
 error message to be user friendly
 -

 Key: HBASE-9570
 URL: https://issues.apache.org/jira/browse/HBASE-9570
 Project: HBase
  Issue Type: Improvement
  Components: shell
Reporter: Yang Wang
Assignee: Yang Wang
Priority: Minor
 Fix For: 0.98.0

 Attachments: HBASE-9570, HBASE-9570.01.patch, HBASE-9570.02.patch


 When access unauthorized resource like table, AccessDeniedException will be 
 thrown. In HBase shell, the error message with stack trace will be displayed 
 as follows. It would be better to just display the error message avoiding the 
 stack trace to be user friendly. 
 {noformat}
 ERROR: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient 
 permissions for user 'u1' for scanner open on table t1
 at 
 org.apache.hadoop.hbase.security.access.AccessController.preScannerOpen(AccessController.java:1116)
 at 
 org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.preScannerOpen(RegionCoprocessorHost.java:1293)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.scan(HRegionServer.java:3026)
 at 
 org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:26971)
 at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2083)
 at 
 org.apache.hadoop.hbase.ipc.RpcServer$CallRunner.run(RpcServer.java:1820)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.consumerLoop(SimpleRpcScheduler.java:165)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.access$000(SimpleRpcScheduler.java:41)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler$1.run(SimpleRpcScheduler.java:113)
 at java.lang.Thread.run(Thread.java:662)
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9562) Make HLogPE run against configured FS


[ 
https://issues.apache.org/jira/browse/HBASE-9562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794396#comment-13794396
 ] 

stack commented on HBASE-9562:
--

+1 for 0.96 and trunk.

 Make HLogPE run against configured FS
 -

 Key: HBASE-9562
 URL: https://issues.apache.org/jira/browse/HBASE-9562
 Project: HBase
  Issue Type: Sub-task
  Components: Performance, wal
Reporter: stack
Assignee: stack
 Fix For: 0.96.0

 Attachments: 9562.txt, HBASE-9562-addendum.patch


 HLogPE will not pick up the configured fs from *.xml files; always goes 
 against localfs
 Simple patch:
 {code}
 diff --git 
 a/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
  
 b/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvalu
 index 9c07046..f73258b 100644
 --- 
 a/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
 +++ 
 b/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
 @@ -47,6 +47,7 @@ import org.apache.hadoop.hbase.regionserver.wal.HLog.Entry;
  import org.apache.hadoop.hbase.util.Bytes;
  import org.apache.hadoop.util.Tool;
  import org.apache.hadoop.util.ToolRunner;
 +import org.apache.hadoop.hbase.HConstants;
  /**
   * This class runs performance benchmarks for {@link HLog}.
 @@ -170,8 +171,12 @@ public final class HLogPerformanceEvaluation extends 
 Configured implements Tool
  }
  // Run HLog Performance Evaluation
 +// First set the fs from configs.  Do it for both configs in case we
 +// are on hadoop1
 +getConf().set(fs.default.name, getConf().get(HConstants.HBASE_DIR));
 +getConf().set(fs.defaultFS, getConf().get(HConstants.HBASE_DIR));
  FileSystem fs = FileSystem.get(getConf());
 -LOG.info( + fs);
 +LOG.info(FileSystem:  + fs);
  try {
if (rootRegionDir == null) {
  rootRegionDir = 
 TEST_UTIL.getDataTestDir(HLogPerformanceEvaluation);
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9562) Make HLogPE run against configured FS


[ 
https://issues.apache.org/jira/browse/HBASE-9562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794395#comment-13794395
 ] 

stack commented on HBASE-9562:
--

This looks like what I tested already.  +1

Thanks for fixing what I broke [~mbertozzi]

 Make HLogPE run against configured FS
 -

 Key: HBASE-9562
 URL: https://issues.apache.org/jira/browse/HBASE-9562
 Project: HBase
  Issue Type: Sub-task
  Components: Performance, wal
Reporter: stack
Assignee: stack
 Fix For: 0.96.0

 Attachments: 9562.txt, HBASE-9562-addendum.patch


 HLogPE will not pick up the configured fs from *.xml files; always goes 
 against localfs
 Simple patch:
 {code}
 diff --git 
 a/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
  
 b/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvalu
 index 9c07046..f73258b 100644
 --- 
 a/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
 +++ 
 b/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
 @@ -47,6 +47,7 @@ import org.apache.hadoop.hbase.regionserver.wal.HLog.Entry;
  import org.apache.hadoop.hbase.util.Bytes;
  import org.apache.hadoop.util.Tool;
  import org.apache.hadoop.util.ToolRunner;
 +import org.apache.hadoop.hbase.HConstants;
  /**
   * This class runs performance benchmarks for {@link HLog}.
 @@ -170,8 +171,12 @@ public final class HLogPerformanceEvaluation extends 
 Configured implements Tool
  }
  // Run HLog Performance Evaluation
 +// First set the fs from configs.  Do it for both configs in case we
 +// are on hadoop1
 +getConf().set(fs.default.name, getConf().get(HConstants.HBASE_DIR));
 +getConf().set(fs.defaultFS, getConf().get(HConstants.HBASE_DIR));
  FileSystem fs = FileSystem.get(getConf());
 -LOG.info( + fs);
 +LOG.info(FileSystem:  + fs);
  try {
if (rootRegionDir == null) {
  rootRegionDir = 
 TEST_UTIL.getDataTestDir(HLogPerformanceEvaluation);
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9692) TestOrderedBytes doesn't display the arrays correctly


[ 
https://issues.apache.org/jira/browse/HBASE-9692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794413#comment-13794413
 ] 

Hudson commented on HBASE-9692:
---

SUCCESS: Integrated in HBase-TRUNK #4614 (See 
[https://builds.apache.org/job/HBase-TRUNK/4614/])
HBASE-9692 TestOrderedBytes doesn't display the arrays correctly (ndimiduk: rev 
1531992)
* 
/hbase/trunk/hbase-common/src/test/java/org/apache/hadoop/hbase/util/TestOrderedBytes.java


 TestOrderedBytes doesn't display the arrays correctly
 -

 Key: HBASE-9692
 URL: https://issues.apache.org/jira/browse/HBASE-9692
 Project: HBase
  Issue Type: Test
Affects Versions: 0.98.0, 0.96.0
Reporter: Jean-Marc Spaggiari
Assignee: Jean-Marc Spaggiari
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: HBASE-9692-v0-trunk.patch, HBASE-9692-v1-trunk.patch, 
 HBASE-9692-v1-trunk.patch


 TestOrderedBytes should call Arrays.toString(decoded) instead of directly 
 output decoded.
 Today output will be something like:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [B@4a8c1dd9, [B@34fbb7cb, ASCENDING: actual array was null
 We want it to be:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [], [], ASCENDING: actual array was null



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9758) Log missing rows and their regions if ITBLL fails


 [ 
https://issues.apache.org/jira/browse/HBASE-9758?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-9758:
---

   Resolution: Fixed
Fix Version/s: 0.96.1
   0.98.0
 Hadoop Flags: Reviewed
   Status: Resolved  (was: Patch Available)

Integrated into trunk and 0.96.  Thanks Stack for the review.

 Log missing rows and their regions if ITBLL fails
 -

 Key: HBASE-9758
 URL: https://issues.apache.org/jira/browse/HBASE-9758
 Project: HBase
  Issue Type: Task
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: trunk-9758.patch


 Currently, when ITBLL fails, the log only shows how many rows are missing. We 
 have to go to the MR log to find the rows. The row key is some binary string 
 that is hard to map to a region.  It will be helpful to log the missing rows 
 in the ITBLL log and the corresponding regions that hold these rows.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9692) TestOrderedBytes doesn't display the arrays correctly


[ 
https://issues.apache.org/jira/browse/HBASE-9692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794493#comment-13794493
 ] 

Hudson commented on HBASE-9692:
---

SUCCESS: Integrated in hbase-0.96 #138 (See 
[https://builds.apache.org/job/hbase-0.96/138/])
HBASE-9692 TestOrderedBytes doesn't display the arrays correctly (ndimiduk: rev 
1531997)
* 
/hbase/branches/0.96/hbase-common/src/test/java/org/apache/hadoop/hbase/util/TestOrderedBytes.java


 TestOrderedBytes doesn't display the arrays correctly
 -

 Key: HBASE-9692
 URL: https://issues.apache.org/jira/browse/HBASE-9692
 Project: HBase
  Issue Type: Test
Affects Versions: 0.98.0, 0.96.0
Reporter: Jean-Marc Spaggiari
Assignee: Jean-Marc Spaggiari
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: HBASE-9692-v0-trunk.patch, HBASE-9692-v1-trunk.patch, 
 HBASE-9692-v1-trunk.patch


 TestOrderedBytes should call Arrays.toString(decoded) instead of directly 
 output decoded.
 Today output will be something like:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [B@4a8c1dd9, [B@34fbb7cb, ASCENDING: actual array was null
 We want it to be:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [], [], ASCENDING: actual array was null



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9748) Address outstanding comments raised for HBASE-9696


 [ 
https://issues.apache.org/jira/browse/HBASE-9748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-9748:
---

Status: Patch Available  (was: Open)

 Address outstanding comments raised for HBASE-9696
 --

 Key: HBASE-9748
 URL: https://issues.apache.org/jira/browse/HBASE-9748
 Project: HBase
  Issue Type: Bug
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: trunk-9748.patch


 This is a follow-up issue of HBASE-9696.
 There are some outstanding review comments in 
 https://reviews.apache.org/r/14470/ that haven't been addressed. I will 
 address them later in this jira.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9748) Address outstanding comments raised for HBASE-9696


 [ 
https://issues.apache.org/jira/browse/HBASE-9748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-9748:
---

Attachment: trunk-9748.patch

Attached patch addressed outstanding comments raised in reviewing patch for 
HBASE-9696.  The changes are 1, fixed some variable name/method name to avoid 
confusing such as regionName - encodedName; 2, added/fixed some comments; 3, 
changed ZKAssign#deleteNode to take additional parameter (target servername) to 
be certain we are deleting the right znode.

 Address outstanding comments raised for HBASE-9696
 --

 Key: HBASE-9748
 URL: https://issues.apache.org/jira/browse/HBASE-9748
 Project: HBase
  Issue Type: Bug
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: trunk-9748.patch


 This is a follow-up issue of HBASE-9696.
 There are some outstanding review comments in 
 https://reviews.apache.org/r/14470/ that haven't been addressed. I will 
 address them later in this jira.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-7544) Transparent table/CF encryption


 [ 
https://issues.apache.org/jira/browse/HBASE-7544?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell updated HBASE-7544:
--

Attachment: 7544p4.patch
7544p3.patch
7544p2.patch
7544p1.patch

Rebase against latest trunk.

Patch '7544p4.patch' adds an encrypting protobuf WAL, currently missing support 
for dictionary compression but will add that after more testing. Another easy 
planned addition here is selective encryption of only the WALEdits for 
encrypted families.

Added some unit tests, notably one that confirms if hbck is run on the secure 
enclave with access to key material (implicitly using the same configuration as 
for regionservers) it can handle encrypted HFiles.

Also note that an ASL licensed open source accelerated JCE codec for AES in CTR 
mode is available at https://github.com/intel-hadoop/project-diceros . Will be 
used if installed and hbase.crypto.algorithm.aes.provider=DC. This is not 
required for HBASE-7544 but will substantially reduce the latency and CPU cost 
introduced by encryption compared to the default AES codec that ships with the 
Oracle/OpenJDK JRE. 

 Transparent table/CF encryption
 ---

 Key: HBASE-7544
 URL: https://issues.apache.org/jira/browse/HBASE-7544
 Project: HBase
  Issue Type: New Feature
  Components: HFile, io
Reporter: Andrew Purtell
Assignee: Andrew Purtell
 Fix For: 0.98.0

 Attachments: 7544p1.patch, 7544p1.patch, 7544p2.patch, 7544p2.patch, 
 7544p3.patch, 7544p3.patch, 7544p4.patch, historical-7544.patch, 
 historical-7544.pdf, historical-shell.patch


 Introduce transparent encryption of HBase on disk data.
 Depends on a separate contribution of an encryption codec framework to Hadoop 
 core and an AES-NI (native code) codec. This is work done in the context of 
 MAPREDUCE-4491 but I'd gather there will be additional JIRAs for common and 
 HDFS parts of it.
 Requirements:
 - Transparent encryption at the CF or table level
 - Protect against all data leakage from files at rest
 - Two-tier key architecture for consistency with best practices for this 
 feature in the RDBMS world
 - Built-in key management
 - Flexible and non-intrusive key rotation
 - Mechanisms not exposed to or modifiable by users
 - Hardware security module integration (via Java KeyStore)
 - HBCK support for transparently encrypted files (+ plugin architecture for 
 HBCK)
 Additional goals:
 - Shell support for administrative functions
 - Avoid performance impact for the null crypto codec case
 - Play nicely with other changes underway: in HFile, block coding, etc.
 We're aiming for rough parity with Oracle's transparent tablespace encryption 
 feature, described in 
 http://www.oracle.com/technetwork/database/owp-security-advanced-security-11gr-133411.pdf
  as
 {quote}
 “Transparent Data Encryption uses a 2-tier key architecture for flexible and 
 non-intrusive key rotation and least operational and performance impact: Each 
 application table with at least one encrypted column has its own table key, 
 which is applied to all encrypted columns in that table. Equally, each 
 encrypted tablespace has its own tablespace key. Table keys are stored in the 
 data dictionary of the database, while tablespace keys are stored in the 
 header of the tablespace and additionally, the header of each underlying OS 
 file that makes up the tablespace.  Each of these keys is encrypted with the 
 TDE master encryption key, which is stored outside of the database in an 
 external security module: either the Oracle Wallet (a PKCS#12 formatted file 
 that is encrypted using a passphrase supplied either by the designated 
 security administrator or DBA during setup),  or a Hardware Security Module 
 (HSM) device for higher assurance […]”
 {quote}
 Further design details forthcoming in a design document and patch as soon as 
 we have all of the clearances in place.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-3727) MultiHFileOutputFormat


[ 
https://issues.apache.org/jira/browse/HBASE-3727?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794535#comment-13794535
 ] 

Andrew Purtell commented on HBASE-3727:
---

The patch on this issue is about two years stale. This should be redone if 
needed.

 MultiHFileOutputFormat
 --

 Key: HBASE-3727
 URL: https://issues.apache.org/jira/browse/HBASE-3727
 Project: HBase
  Issue Type: New Feature
Reporter: Andrew Purtell
Priority: Minor
 Attachments: MultiHFileOutputFormat.java, MultiHFileOutputFormat.java


 Like MultiTableOutputFormat, but outputting HFiles. Key is tablename as an 
 IBW. Creates sub-writers (code cut and pasted from HFileOutputFormat) on 
 demand that produce HFiles in per-table subdirectories of the configured 
 output path. Does not currently support partitioning for existing tables / 
 incremental update.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9754) Consider eliminating threadlocals from MVCC code


[ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794544#comment-13794544
 ] 

Ted Yu commented on HBASE-9754:
---

w.r.t. the StoreScanner constructors, there're two which don't have Store 
argument. These two constructors are for testing.
The other constructors accept Store argument which has reference to HRegion.

Looks like the approach described above can be simplified.

 Consider eliminating threadlocals from MVCC code
 

 Key: HBASE-9754
 URL: https://issues.apache.org/jira/browse/HBASE-9754
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl

 Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
 Currently we use ThreadLocals to communicate the current readpoint between a 
 RegionScanner and the Store\{File}Scanner's down the stack.
 Since ThreadLocals are not cheap we should consider whether it is possible to 
 pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9758) Log missing rows and their regions if ITBLL fails


[ 
https://issues.apache.org/jira/browse/HBASE-9758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794546#comment-13794546
 ] 

Hudson commented on HBASE-9758:
---

SUCCESS: Integrated in HBase-TRUNK #4615 (See 
[https://builds.apache.org/job/HBase-TRUNK/4615/])
HBASE-9758 Log missing rows and their regions if ITBLL fails (jxiang: rev 
1532048)
* 
/hbase/trunk/hbase-it/src/test/java/org/apache/hadoop/hbase/test/IntegrationTestBigLinkedList.java


 Log missing rows and their regions if ITBLL fails
 -

 Key: HBASE-9758
 URL: https://issues.apache.org/jira/browse/HBASE-9758
 Project: HBase
  Issue Type: Task
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: trunk-9758.patch


 Currently, when ITBLL fails, the log only shows how many rows are missing. We 
 have to go to the MR log to find the rows. The row key is some binary string 
 that is hard to map to a region.  It will be helpful to log the missing rows 
 in the ITBLL log and the corresponding regions that hold these rows.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9689) Support using OperationAttributes through shell script


[ 
https://issues.apache.org/jira/browse/HBASE-9689?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794587#comment-13794587
 ] 

Andrew Purtell commented on HBASE-9689:
---

+1 patch looks ok, would be good to have coverage in TestShell for this

 Support using OperationAttributes through shell script
 --

 Key: HBASE-9689
 URL: https://issues.apache.org/jira/browse/HBASE-9689
 Project: HBase
  Issue Type: Improvement
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan
 Fix For: 0.98.0

 Attachments: HBASE-9689_1.patch, HBASE-9689.patch


 Currently the ruby scripts for Put does not support setting of Operation 
 Attributes through shell.  
 It may be useful in some cases and also for testing.  And that would give a 
 full  fledged support using shell.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9748) Address outstanding comments raised for HBASE-9696


[ 
https://issues.apache.org/jira/browse/HBASE-9748?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794607#comment-13794607
 ] 

Hadoop QA commented on HBASE-9748:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12608354/trunk-9748.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 15 new 
or modified tests.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:red}-1 javadoc{color}.  The javadoc tool appears to have generated 1 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

{color:red}-1 site{color}.  The patch appears to cause mvn site goal to 
fail.

 {color:red}-1 core tests{color}.  The patch failed these unit tests:
   org.apache.hadoop.hbase.regionserver.wal.TestLogRolling

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7538//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7538//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7538//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7538//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7538//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7538//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7538//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7538//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7538//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7538//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7538//console

This message is automatically generated.

 Address outstanding comments raised for HBASE-9696
 --

 Key: HBASE-9748
 URL: https://issues.apache.org/jira/browse/HBASE-9748
 Project: HBase
  Issue Type: Bug
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: trunk-9748.patch


 This is a follow-up issue of HBASE-9696.
 There are some outstanding review comments in 
 https://reviews.apache.org/r/14470/ that haven't been addressed. I will 
 address them later in this jira.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9057) Support Tags in ImportTSV tool


[ 
https://issues.apache.org/jira/browse/HBASE-9057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794608#comment-13794608
 ] 

Andrew Purtell commented on HBASE-9057:
---

+1

{quote}
+  final static String DEFAULT_TAG_SEPERATOR = \r;
{quote}

I can't think of a better way to do that.

 Support Tags in ImportTSV tool
 --

 Key: HBASE-9057
 URL: https://issues.apache.org/jira/browse/HBASE-9057
 Project: HBase
  Issue Type: Sub-task
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan
 Attachments: HBASE-9057.patch


 Once we have provision for Tags in the core, the ImportTSV also should 
 support supplying Tags with the KVs that are to be loaded.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Comment Edited] (HBASE-9057) Support Tags in ImportTSV tool


[ 
https://issues.apache.org/jira/browse/HBASE-9057?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794608#comment-13794608
 ] 

Andrew Purtell edited comment on HBASE-9057 at 10/14/13 11:05 PM:
--

+1 assuming this does not start the unit test flapping. You still seeing that 
exception Ram?

{quote}
+  final static String DEFAULT_TAG_SEPERATOR = \r;
{quote}

I can't think of a better way to do that.


was (Author: apurtell):
+1

{quote}
+  final static String DEFAULT_TAG_SEPERATOR = \r;
{quote}

I can't think of a better way to do that.

 Support Tags in ImportTSV tool
 --

 Key: HBASE-9057
 URL: https://issues.apache.org/jira/browse/HBASE-9057
 Project: HBase
  Issue Type: Sub-task
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan
 Attachments: HBASE-9057.patch


 Once we have provision for Tags in the core, the ImportTSV also should 
 support supplying Tags with the KVs that are to be loaded.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9748) Address outstanding comments raised for HBASE-9696


 [ 
https://issues.apache.org/jira/browse/HBASE-9748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-9748:
---

Status: Open  (was: Patch Available)

 Address outstanding comments raised for HBASE-9696
 --

 Key: HBASE-9748
 URL: https://issues.apache.org/jira/browse/HBASE-9748
 Project: HBase
  Issue Type: Bug
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: trunk-9748.patch


 This is a follow-up issue of HBASE-9696.
 There are some outstanding review comments in 
 https://reviews.apache.org/r/14470/ that haven't been addressed. I will 
 address them later in this jira.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9748) Address outstanding comments raised for HBASE-9696


 [ 
https://issues.apache.org/jira/browse/HBASE-9748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jimmy Xiang updated HBASE-9748:
---

Status: Patch Available  (was: Open)

This test failure seems unrelated, try hadoop-qa again.

 Address outstanding comments raised for HBASE-9696
 --

 Key: HBASE-9748
 URL: https://issues.apache.org/jira/browse/HBASE-9748
 Project: HBase
  Issue Type: Bug
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: trunk-9748.patch


 This is a follow-up issue of HBASE-9696.
 There are some outstanding review comments in 
 https://reviews.apache.org/r/14470/ that haven't been addressed. I will 
 address them later in this jira.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Comment Edited] (HBASE-9754) Consider eliminating threadlocals from MVCC code


[ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794544#comment-13794544
 ] 

Ted Yu edited comment on HBASE-9754 at 10/14/13 11:55 PM:
--

w.r.t. the StoreScanner constructors, there're two which don't have Store 
argument. These two constructors are for testing.
The other constructors accept Store argument which has reference to HRegion.



was (Author: yuzhih...@gmail.com):
w.r.t. the StoreScanner constructors, there're two which don't have Store 
argument. These two constructors are for testing.
The other constructors accept Store argument which has reference to HRegion.

Looks like the approach described above can be simplified.

 Consider eliminating threadlocals from MVCC code
 

 Key: HBASE-9754
 URL: https://issues.apache.org/jira/browse/HBASE-9754
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl

 Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
 Currently we use ThreadLocals to communicate the current readpoint between a 
 RegionScanner and the Store\{File}Scanner's down the stack.
 Since ThreadLocals are not cheap we should consider whether it is possible to 
 pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Assigned] (HBASE-9754) Consider eliminating threadlocals from MVCC code


 [ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu reassigned HBASE-9754:
-

Assignee: Ted Yu

 Consider eliminating threadlocals from MVCC code
 

 Key: HBASE-9754
 URL: https://issues.apache.org/jira/browse/HBASE-9754
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Ted Yu

 Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
 Currently we use ThreadLocals to communicate the current readpoint between a 
 RegionScanner and the Store\{File}Scanner's down the stack.
 Since ThreadLocals are not cheap we should consider whether it is possible to 
 pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-8741) Scope sequenceid to the region rather than regionserver (WAS: Mutations on Regions in recovery mode might have same sequenceIDs)

[
https://issues.apache.org/jira/browse/HBASE-8741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794688#comment-13794688
]

Himanshu Vashishtha commented on HBASE-8741:

Thanks for all the reviews, and sorry about the long hiatus on it. I resumed
the work on it and attaching the patch for the same. It take care of the above
feedbacks.
To me, it is an important step in easing out our sequence Id story (disentangle
it from region server HLog).

In a nut shell, this is the high level idea of this patch:

1) Add a sequence Id attribute to the HRegion class, It is set to
openSequenceId when the region is opened. All append operation would use this
sequenceId at time of appending to WAL.

2) FSHLog:
i) maintain a map of (region : sequenceId) for each WAL. (latestSequenceNums).
This keeps track of latest sequence numbers used by a region when appending to
the wal.
ii) maintain a map of (Path : latestSequenceNums) for each rolled wal file.
This is used when determining whether a wal is eligible for archiving or not.

A wal is eligible for archiving when all its region has flushed - past the
point of their entry in the latestSequenceNums map (as mentioned in i)

When rolling a wal, it checks which of the older wals are eligible for
archiving.

iii) When we run over the maximum number of allowed Wal files, we check the
oldest wal file to determine which regions should be flushed so that it could
be eligible for archiving.

3) Added test cases in TestHLog to test rolling, archiving and finding
memstores to flush to archive a wal.
Refactored some test cases to add sequenceId parameter while appending to HLog.

4) Remove the sequenceId support from HLog (sequenceId and helper methods).

I tested it on jenkins; ran a patched version on a cluster with insert load
while killing region servers and ensuring data is intact. IT tests in progress.
Thanks.

Scope sequenceid to the region rather than regionserver (WAS: Mutations on
Regions in recovery mode might have same sequenceIDs)

Key: HBASE-8741
URL: https://issues.apache.org/jira/browse/HBASE-8741
Project: HBase
Issue Type: Bug
Components: MTTR
Affects Versions: 0.95.1
Reporter: Himanshu Vashishtha
Assignee: Himanshu Vashishtha
Fix For: 0.98.0

Attachments: HBASE-8741-v0.patch, HBASE-8741-v2.patch,
HBASE-8741-v3.patch, HBASE-8741-v4-again.patch, HBASE-8741-v4-again.patch,
HBASE-8741-v4.patch, HBASE-8741-v5-again.patch, HBASE-8741-v5.patch

Currently, when opening a region, we find the maximum sequence ID from all
its HFiles and then set the LogSequenceId of the log (in case the later is at
a small value). This works good in recovered.edits case as we are not writing
to the region until we have replayed all of its previous edits.
With distributed log replay, if we want to enable writes while a region is
under recovery, we need to make sure that the logSequenceId maximum
logSequenceId of the old regionserver. Otherwise, we might have a situation
where new edits have same (or smaller) sequenceIds.
We can store region level information in the WALTrailer, than this scenario
could be avoided by:
a) reading the trailer of the last completed file, i.e., last wal file
which has a trailer and,
b) completely reading the last wal file (this file would not have the
trailer, so it needs to be read completely).
In future, if we switch to multi wal file, we could read the trailer for all
completed WAL files, and reading the remaining incomplete files.

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-8741) Scope sequenceid to the region rather than regionserver (WAS: Mutations on Regions in recovery mode might have same sequenceIDs)


 [ 
https://issues.apache.org/jira/browse/HBASE-8741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Himanshu Vashishtha updated HBASE-8741:
---

Status: Open  (was: Patch Available)

 Scope sequenceid to the region rather than regionserver (WAS: Mutations on 
 Regions in recovery mode might have same sequenceIDs)
 

 Key: HBASE-8741
 URL: https://issues.apache.org/jira/browse/HBASE-8741
 Project: HBase
  Issue Type: Bug
  Components: MTTR
Affects Versions: 0.95.1
Reporter: Himanshu Vashishtha
Assignee: Himanshu Vashishtha
 Fix For: 0.98.0

 Attachments: HBASE-8741-v0.patch, HBASE-8741-v2.patch, 
 HBASE-8741-v3.patch, HBASE-8741-v4-again.patch, HBASE-8741-v4-again.patch, 
 HBASE-8741-v4.patch, HBASE-8741-v5-again.patch, HBASE-8741-v5.patch


 Currently, when opening a region, we find the maximum sequence ID from all 
 its HFiles and then set the LogSequenceId of the log (in case the later is at 
 a small value). This works good in recovered.edits case as we are not writing 
 to the region until we have replayed all of its previous edits. 
 With distributed log replay, if we want to enable writes while a region is 
 under recovery, we need to make sure that the logSequenceId  maximum 
 logSequenceId of the old regionserver. Otherwise, we might have a situation 
 where new edits have same (or smaller) sequenceIds. 
 We can store region level information in the WALTrailer, than this scenario 
 could be avoided by:
 a) reading the trailer of the last completed file, i.e., last wal file 
 which has a trailer and,
 b) completely reading the last wal file (this file would not have the 
 trailer, so it needs to be read completely).
 In future, if we switch to multi wal file, we could read the trailer for all 
 completed WAL files, and reading the remaining incomplete files.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-8741) Scope sequenceid to the region rather than regionserver (WAS: Mutations on Regions in recovery mode might have same sequenceIDs)


 [ 
https://issues.apache.org/jira/browse/HBASE-8741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Himanshu Vashishtha updated HBASE-8741:
---

Status: Patch Available  (was: Open)

 Scope sequenceid to the region rather than regionserver (WAS: Mutations on 
 Regions in recovery mode might have same sequenceIDs)
 

 Key: HBASE-8741
 URL: https://issues.apache.org/jira/browse/HBASE-8741
 Project: HBase
  Issue Type: Bug
  Components: MTTR
Affects Versions: 0.95.1
Reporter: Himanshu Vashishtha
Assignee: Himanshu Vashishtha
 Fix For: 0.98.0

 Attachments: HBASE-8741-trunk-v6.patch, HBASE-8741-v0.patch, 
 HBASE-8741-v2.patch, HBASE-8741-v3.patch, HBASE-8741-v4-again.patch, 
 HBASE-8741-v4-again.patch, HBASE-8741-v4.patch, HBASE-8741-v5-again.patch, 
 HBASE-8741-v5.patch


 Currently, when opening a region, we find the maximum sequence ID from all 
 its HFiles and then set the LogSequenceId of the log (in case the later is at 
 a small value). This works good in recovered.edits case as we are not writing 
 to the region until we have replayed all of its previous edits. 
 With distributed log replay, if we want to enable writes while a region is 
 under recovery, we need to make sure that the logSequenceId  maximum 
 logSequenceId of the old regionserver. Otherwise, we might have a situation 
 where new edits have same (or smaller) sequenceIds. 
 We can store region level information in the WALTrailer, than this scenario 
 could be avoided by:
 a) reading the trailer of the last completed file, i.e., last wal file 
 which has a trailer and,
 b) completely reading the last wal file (this file would not have the 
 trailer, so it needs to be read completely).
 In future, if we switch to multi wal file, we could read the trailer for all 
 completed WAL files, and reading the remaining incomplete files.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-8741) Scope sequenceid to the region rather than regionserver (WAS: Mutations on Regions in recovery mode might have same sequenceIDs)


 [ 
https://issues.apache.org/jira/browse/HBASE-8741?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Himanshu Vashishtha updated HBASE-8741:
---

Attachment: HBASE-8741-trunk-v6.patch

 Scope sequenceid to the region rather than regionserver (WAS: Mutations on 
 Regions in recovery mode might have same sequenceIDs)
 

 Key: HBASE-8741
 URL: https://issues.apache.org/jira/browse/HBASE-8741
 Project: HBase
  Issue Type: Bug
  Components: MTTR
Affects Versions: 0.95.1
Reporter: Himanshu Vashishtha
Assignee: Himanshu Vashishtha
 Fix For: 0.98.0

 Attachments: HBASE-8741-trunk-v6.patch, HBASE-8741-v0.patch, 
 HBASE-8741-v2.patch, HBASE-8741-v3.patch, HBASE-8741-v4-again.patch, 
 HBASE-8741-v4-again.patch, HBASE-8741-v4.patch, HBASE-8741-v5-again.patch, 
 HBASE-8741-v5.patch


 Currently, when opening a region, we find the maximum sequence ID from all 
 its HFiles and then set the LogSequenceId of the log (in case the later is at 
 a small value). This works good in recovered.edits case as we are not writing 
 to the region until we have replayed all of its previous edits. 
 With distributed log replay, if we want to enable writes while a region is 
 under recovery, we need to make sure that the logSequenceId  maximum 
 logSequenceId of the old regionserver. Otherwise, we might have a situation 
 where new edits have same (or smaller) sequenceIds. 
 We can store region level information in the WALTrailer, than this scenario 
 could be avoided by:
 a) reading the trailer of the last completed file, i.e., last wal file 
 which has a trailer and,
 b) completely reading the last wal file (this file would not have the 
 trailer, so it needs to be read completely).
 In future, if we switch to multi wal file, we could read the trailer for all 
 completed WAL files, and reading the remaining incomplete files.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9758) Log missing rows and their regions if ITBLL fails


[ 
https://issues.apache.org/jira/browse/HBASE-9758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794693#comment-13794693
 ] 

Hudson commented on HBASE-9758:
---

SUCCESS: Integrated in hbase-0.96 #139 (See 
[https://builds.apache.org/job/hbase-0.96/139/])
HBASE-9758 Log missing rows and their regions if ITBLL fails (jxiang: rev 
1532051)
* 
/hbase/branches/0.96/hbase-it/src/test/java/org/apache/hadoop/hbase/test/IntegrationTestBigLinkedList.java


 Log missing rows and their regions if ITBLL fails
 -

 Key: HBASE-9758
 URL: https://issues.apache.org/jira/browse/HBASE-9758
 Project: HBase
  Issue Type: Task
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: trunk-9758.patch


 Currently, when ITBLL fails, the log only shows how many rows are missing. We 
 have to go to the MR log to find the rows. The row key is some binary string 
 that is hard to map to a region.  It will be helpful to log the missing rows 
 in the ITBLL log and the corresponding regions that hold these rows.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9562) Make HLogPE run against configured FS


[ 
https://issues.apache.org/jira/browse/HBASE-9562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794694#comment-13794694
 ] 

Hudson commented on HBASE-9562:
---

SUCCESS: Integrated in hbase-0.96 #139 (See 
[https://builds.apache.org/job/hbase-0.96/139/])
HBASE-9562 Make HLogPE run against configured FS - addendum (mbertozzi: rev 
1532087)
* 
/hbase/branches/0.96/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java


 Make HLogPE run against configured FS
 -

 Key: HBASE-9562
 URL: https://issues.apache.org/jira/browse/HBASE-9562
 Project: HBase
  Issue Type: Sub-task
  Components: Performance, wal
Reporter: stack
Assignee: stack
 Fix For: 0.96.0

 Attachments: 9562.txt, HBASE-9562-addendum.patch


 HLogPE will not pick up the configured fs from *.xml files; always goes 
 against localfs
 Simple patch:
 {code}
 diff --git 
 a/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
  
 b/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvalu
 index 9c07046..f73258b 100644
 --- 
 a/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
 +++ 
 b/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
 @@ -47,6 +47,7 @@ import org.apache.hadoop.hbase.regionserver.wal.HLog.Entry;
  import org.apache.hadoop.hbase.util.Bytes;
  import org.apache.hadoop.util.Tool;
  import org.apache.hadoop.util.ToolRunner;
 +import org.apache.hadoop.hbase.HConstants;
  /**
   * This class runs performance benchmarks for {@link HLog}.
 @@ -170,8 +171,12 @@ public final class HLogPerformanceEvaluation extends 
 Configured implements Tool
  }
  // Run HLog Performance Evaluation
 +// First set the fs from configs.  Do it for both configs in case we
 +// are on hadoop1
 +getConf().set(fs.default.name, getConf().get(HConstants.HBASE_DIR));
 +getConf().set(fs.defaultFS, getConf().get(HConstants.HBASE_DIR));
  FileSystem fs = FileSystem.get(getConf());
 -LOG.info( + fs);
 +LOG.info(FileSystem:  + fs);
  try {
if (rootRegionDir == null) {
  rootRegionDir = 
 TEST_UTIL.getDataTestDir(HLogPerformanceEvaluation);
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9562) Make HLogPE run against configured FS


[ 
https://issues.apache.org/jira/browse/HBASE-9562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794720#comment-13794720
 ] 

Hudson commented on HBASE-9562:
---

SUCCESS: Integrated in hbase-0.96-hadoop2 #88 (See 
[https://builds.apache.org/job/hbase-0.96-hadoop2/88/])
HBASE-9562 Make HLogPE run against configured FS - addendum (mbertozzi: rev 
1532087)
* 
/hbase/branches/0.96/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java


 Make HLogPE run against configured FS
 -

 Key: HBASE-9562
 URL: https://issues.apache.org/jira/browse/HBASE-9562
 Project: HBase
  Issue Type: Sub-task
  Components: Performance, wal
Reporter: stack
Assignee: stack
 Fix For: 0.96.0

 Attachments: 9562.txt, HBASE-9562-addendum.patch


 HLogPE will not pick up the configured fs from *.xml files; always goes 
 against localfs
 Simple patch:
 {code}
 diff --git 
 a/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
  
 b/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvalu
 index 9c07046..f73258b 100644
 --- 
 a/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
 +++ 
 b/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
 @@ -47,6 +47,7 @@ import org.apache.hadoop.hbase.regionserver.wal.HLog.Entry;
  import org.apache.hadoop.hbase.util.Bytes;
  import org.apache.hadoop.util.Tool;
  import org.apache.hadoop.util.ToolRunner;
 +import org.apache.hadoop.hbase.HConstants;
  /**
   * This class runs performance benchmarks for {@link HLog}.
 @@ -170,8 +171,12 @@ public final class HLogPerformanceEvaluation extends 
 Configured implements Tool
  }
  // Run HLog Performance Evaluation
 +// First set the fs from configs.  Do it for both configs in case we
 +// are on hadoop1
 +getConf().set(fs.default.name, getConf().get(HConstants.HBASE_DIR));
 +getConf().set(fs.defaultFS, getConf().get(HConstants.HBASE_DIR));
  FileSystem fs = FileSystem.get(getConf());
 -LOG.info( + fs);
 +LOG.info(FileSystem:  + fs);
  try {
if (rootRegionDir == null) {
  rootRegionDir = 
 TEST_UTIL.getDataTestDir(HLogPerformanceEvaluation);
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9758) Log missing rows and their regions if ITBLL fails


[ 
https://issues.apache.org/jira/browse/HBASE-9758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794719#comment-13794719
 ] 

Hudson commented on HBASE-9758:
---

SUCCESS: Integrated in hbase-0.96-hadoop2 #88 (See 
[https://builds.apache.org/job/hbase-0.96-hadoop2/88/])
HBASE-9758 Log missing rows and their regions if ITBLL fails (jxiang: rev 
1532051)
* 
/hbase/branches/0.96/hbase-it/src/test/java/org/apache/hadoop/hbase/test/IntegrationTestBigLinkedList.java


 Log missing rows and their regions if ITBLL fails
 -

 Key: HBASE-9758
 URL: https://issues.apache.org/jira/browse/HBASE-9758
 Project: HBase
  Issue Type: Task
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: trunk-9758.patch


 Currently, when ITBLL fails, the log only shows how many rows are missing. We 
 have to go to the MR log to find the rows. The row key is some binary string 
 that is hard to map to a region.  It will be helpful to log the missing rows 
 in the ITBLL log and the corresponding regions that hold these rows.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9692) TestOrderedBytes doesn't display the arrays correctly


[ 
https://issues.apache.org/jira/browse/HBASE-9692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794718#comment-13794718
 ] 

Hudson commented on HBASE-9692:
---

SUCCESS: Integrated in hbase-0.96-hadoop2 #88 (See 
[https://builds.apache.org/job/hbase-0.96-hadoop2/88/])
HBASE-9692 TestOrderedBytes doesn't display the arrays correctly (ndimiduk: rev 
1531997)
* 
/hbase/branches/0.96/hbase-common/src/test/java/org/apache/hadoop/hbase/util/TestOrderedBytes.java


 TestOrderedBytes doesn't display the arrays correctly
 -

 Key: HBASE-9692
 URL: https://issues.apache.org/jira/browse/HBASE-9692
 Project: HBase
  Issue Type: Test
Affects Versions: 0.98.0, 0.96.0
Reporter: Jean-Marc Spaggiari
Assignee: Jean-Marc Spaggiari
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: HBASE-9692-v0-trunk.patch, HBASE-9692-v1-trunk.patch, 
 HBASE-9692-v1-trunk.patch


 TestOrderedBytes should call Arrays.toString(decoded) instead of directly 
 output decoded.
 Today output will be something like:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [B@4a8c1dd9, [B@34fbb7cb, ASCENDING: actual array was null
 We want it to be:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [], [], ASCENDING: actual array was null



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9758) Log missing rows and their regions if ITBLL fails


[ 
https://issues.apache.org/jira/browse/HBASE-9758?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794725#comment-13794725
 ] 

Hudson commented on HBASE-9758:
---

SUCCESS: Integrated in HBase-TRUNK-on-Hadoop-2.0.0 #793 (See 
[https://builds.apache.org/job/HBase-TRUNK-on-Hadoop-2.0.0/793/])
HBASE-9758 Log missing rows and their regions if ITBLL fails (jxiang: rev 
1532048)
* 
/hbase/trunk/hbase-it/src/test/java/org/apache/hadoop/hbase/test/IntegrationTestBigLinkedList.java


 Log missing rows and their regions if ITBLL fails
 -

 Key: HBASE-9758
 URL: https://issues.apache.org/jira/browse/HBASE-9758
 Project: HBase
  Issue Type: Task
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: trunk-9758.patch


 Currently, when ITBLL fails, the log only shows how many rows are missing. We 
 have to go to the MR log to find the rows. The row key is some binary string 
 that is hard to map to a region.  It will be helpful to log the missing rows 
 in the ITBLL log and the corresponding regions that hold these rows.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9562) Make HLogPE run against configured FS


[ 
https://issues.apache.org/jira/browse/HBASE-9562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794726#comment-13794726
 ] 

Hudson commented on HBASE-9562:
---

SUCCESS: Integrated in HBase-TRUNK-on-Hadoop-2.0.0 #793 (See 
[https://builds.apache.org/job/HBase-TRUNK-on-Hadoop-2.0.0/793/])
HBASE-9562 Make HLogPE run against configured FS - addendum (mbertozzi: rev 
1532086)
* 
/hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java


 Make HLogPE run against configured FS
 -

 Key: HBASE-9562
 URL: https://issues.apache.org/jira/browse/HBASE-9562
 Project: HBase
  Issue Type: Sub-task
  Components: Performance, wal
Reporter: stack
Assignee: stack
 Fix For: 0.96.0

 Attachments: 9562.txt, HBASE-9562-addendum.patch


 HLogPE will not pick up the configured fs from *.xml files; always goes 
 against localfs
 Simple patch:
 {code}
 diff --git 
 a/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
  
 b/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvalu
 index 9c07046..f73258b 100644
 --- 
 a/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
 +++ 
 b/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
 @@ -47,6 +47,7 @@ import org.apache.hadoop.hbase.regionserver.wal.HLog.Entry;
  import org.apache.hadoop.hbase.util.Bytes;
  import org.apache.hadoop.util.Tool;
  import org.apache.hadoop.util.ToolRunner;
 +import org.apache.hadoop.hbase.HConstants;
  /**
   * This class runs performance benchmarks for {@link HLog}.
 @@ -170,8 +171,12 @@ public final class HLogPerformanceEvaluation extends 
 Configured implements Tool
  }
  // Run HLog Performance Evaluation
 +// First set the fs from configs.  Do it for both configs in case we
 +// are on hadoop1
 +getConf().set(fs.default.name, getConf().get(HConstants.HBASE_DIR));
 +getConf().set(fs.defaultFS, getConf().get(HConstants.HBASE_DIR));
  FileSystem fs = FileSystem.get(getConf());
 -LOG.info( + fs);
 +LOG.info(FileSystem:  + fs);
  try {
if (rootRegionDir == null) {
  rootRegionDir = 
 TEST_UTIL.getDataTestDir(HLogPerformanceEvaluation);
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9692) TestOrderedBytes doesn't display the arrays correctly


[ 
https://issues.apache.org/jira/browse/HBASE-9692?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794724#comment-13794724
 ] 

Hudson commented on HBASE-9692:
---

SUCCESS: Integrated in HBase-TRUNK-on-Hadoop-2.0.0 #793 (See 
[https://builds.apache.org/job/HBase-TRUNK-on-Hadoop-2.0.0/793/])
HBASE-9692 TestOrderedBytes doesn't display the arrays correctly (ndimiduk: rev 
1531992)
* 
/hbase/trunk/hbase-common/src/test/java/org/apache/hadoop/hbase/util/TestOrderedBytes.java


 TestOrderedBytes doesn't display the arrays correctly
 -

 Key: HBASE-9692
 URL: https://issues.apache.org/jira/browse/HBASE-9692
 Project: HBase
  Issue Type: Test
Affects Versions: 0.98.0, 0.96.0
Reporter: Jean-Marc Spaggiari
Assignee: Jean-Marc Spaggiari
Priority: Minor
 Fix For: 0.98.0, 0.96.1

 Attachments: HBASE-9692-v0-trunk.patch, HBASE-9692-v1-trunk.patch, 
 HBASE-9692-v1-trunk.patch


 TestOrderedBytes should call Arrays.toString(decoded) instead of directly 
 output decoded.
 Today output will be something like:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [B@4a8c1dd9, [B@34fbb7cb, ASCENDING: actual array was null
 We want it to be:
 java.lang.AssertionError: Encoded representations do not preserve natural 
 order: [], [], ASCENDING: actual array was null



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-8741) Scope sequenceid to the region rather than regionserver (WAS: Mutations on Regions in recovery mode might have same sequenceIDs)

[
https://issues.apache.org/jira/browse/HBASE-8741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794729#comment-13794729
]

Hadoop QA commented on HBASE-8741:
--

{color:red}-1 overall{color}. Here are the results of testing the latest
attachment

http://issues.apache.org/jira/secure/attachment/12608388/HBASE-8741-trunk-v6.patch
against trunk revision .

{color:green}+1 @author{color}. The patch does not contain any @author
tags.

{color:green}+1 tests included{color}. The patch appears to include 36 new
or modified tests.

{color:green}+1 hadoop1.0{color}. The patch compiles against the hadoop
1.0 profile.

{color:green}+1 hadoop2.0{color}. The patch compiles against the hadoop
2.0 profile.

{color:green}+1 javadoc{color}. The javadoc tool did not generate any
warning messages.

{color:green}+1 javac{color}. The applied patch does not increase the
total number of javac compiler warnings.

{color:green}+1 findbugs{color}. The patch does not introduce any new
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}. The applied patch does not increase
the total number of release audit warnings.

{color:green}+1 lineLengths{color}. The patch does not introduce lines
longer than 100

{color:red}-1 site{color}. The patch appears to cause mvn site goal to
fail.

{color:red}-1 core tests{color}. The patch failed these unit tests:
org.apache.hadoop.hbase.io.TestHeapSize

Test results:
https://builds.apache.org/job/PreCommit-HBASE-Build/7539//testReport/
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/7539//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/7539//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/7539//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/7539//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/7539//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/7539//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop1-compat.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/7539//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/7539//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings:
https://builds.apache.org/job/PreCommit-HBASE-Build/7539//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Console output:
https://builds.apache.org/job/PreCommit-HBASE-Build/7539//console

This message is automatically generated.

Scope sequenceid to the region rather than regionserver (WAS: Mutations on
Regions in recovery mode might have same sequenceIDs)

Attachments: HBASE-8741-trunk-v6.patch, HBASE-8741-v0.patch,
HBASE-8741-v2.patch, HBASE-8741-v3.patch, HBASE-8741-v4-again.patch,
HBASE-8741-v4-again.patch, HBASE-8741-v4.patch, HBASE-8741-v5-again.patch,
HBASE-8741-v5.patch

[jira] [Created] (HBASE-9759) IntegrationTestBulkLoad random number collision

2013-10-14 Thread Enis Soztutar (JIRA)

Enis Soztutar created HBASE-9759:


 Summary: IntegrationTestBulkLoad random number collision
 Key: HBASE-9759
 URL: https://issues.apache.org/jira/browse/HBASE-9759
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: Enis Soztutar
Assignee: Enis Soztutar
 Fix For: 0.98.0, 0.96.1


ITBL failed recently in our test harness. Inspecting the failure made me 
believe that the only reason that particular failure might have happened is 
that there is a collision in random longs generated by the test. 

The test creates 50 mappers by default, and each mapper writes a 500K random 
rows starting with row = 0. By default there are 5 iterations.

The check job outputs these counters: 
{code}
2013-10-13 07:48:01,134 Map input records=124999751
2013-10-13 07:48:01,134 Map output records=12499
{code}
The number of input records seems fine because
{code}
124999751 = 1 + 5 * (0.5M - 1) * 50
{code}
5 = num iterations, 0.5M = num rows, 50 = num mappers, and 1 is for row =0 
which every chain writes to. 

Output records should be 125M, however we see one cell missing. Since the map 
input records matches expected number of distinct rows, I suspect that row = 0 
had a collision. 

In one of the generate jobs, we can see that the reducer output count does not 
match the reducer input count. Given that we are using KVSortReducer, this 
confirms that there is a collision in KeyValues received by this task.
{code}
2013-10-13 06:48:12,738 Reduce input records=7500
2013-10-13 06:48:12,738 Reduce output records=7497
{code}
The count is off by 3 because we are writing 3 columns per row. 

My only theory for explaining this is that we had a collision in chainId's or 
one of the chains reused row = 0 as the next row. 

This is similar to HBASE-8700, however, in this the probability is much much 
much lower. 



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9562) Make HLogPE run against configured FS


[ 
https://issues.apache.org/jira/browse/HBASE-9562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794745#comment-13794745
 ] 

Hudson commented on HBASE-9562:
---

FAILURE: Integrated in HBase-TRUNK #4616 (See 
[https://builds.apache.org/job/HBase-TRUNK/4616/])
HBASE-9562 Make HLogPE run against configured FS - addendum (mbertozzi: rev 
1532086)
* 
/hbase/trunk/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java


 Make HLogPE run against configured FS
 -

 Key: HBASE-9562
 URL: https://issues.apache.org/jira/browse/HBASE-9562
 Project: HBase
  Issue Type: Sub-task
  Components: Performance, wal
Reporter: stack
Assignee: stack
 Fix For: 0.96.0

 Attachments: 9562.txt, HBASE-9562-addendum.patch


 HLogPE will not pick up the configured fs from *.xml files; always goes 
 against localfs
 Simple patch:
 {code}
 diff --git 
 a/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
  
 b/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvalu
 index 9c07046..f73258b 100644
 --- 
 a/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
 +++ 
 b/hbase-server/src/test/java/org/apache/hadoop/hbase/regionserver/wal/HLogPerformanceEvaluation.java
 @@ -47,6 +47,7 @@ import org.apache.hadoop.hbase.regionserver.wal.HLog.Entry;
  import org.apache.hadoop.hbase.util.Bytes;
  import org.apache.hadoop.util.Tool;
  import org.apache.hadoop.util.ToolRunner;
 +import org.apache.hadoop.hbase.HConstants;
  /**
   * This class runs performance benchmarks for {@link HLog}.
 @@ -170,8 +171,12 @@ public final class HLogPerformanceEvaluation extends 
 Configured implements Tool
  }
  // Run HLog Performance Evaluation
 +// First set the fs from configs.  Do it for both configs in case we
 +// are on hadoop1
 +getConf().set(fs.default.name, getConf().get(HConstants.HBASE_DIR));
 +getConf().set(fs.defaultFS, getConf().get(HConstants.HBASE_DIR));
  FileSystem fs = FileSystem.get(getConf());
 -LOG.info( + fs);
 +LOG.info(FileSystem:  + fs);
  try {
if (rootRegionDir == null) {
  rootRegionDir = 
 TEST_UTIL.getDataTestDir(HLogPerformanceEvaluation);
 {code}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9754) Consider eliminating threadlocals from MVCC code


 [ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-9754:
--

Attachment: 9754-rp-0.txt

Initial patch.

I ran the following unit tests locally and they passed:
{code}
 1164  mt2 -Dtest=TestMemStore
 1167  mt2 
-Dtest=TestCoprocessorScanPolicy,TestMemStoreChunkPool,TestFSErrorsExposed,TestRegionObserverScannerOpenHook
 1173  mt2 -Dtest=TestHRegion
 1175  mt2 -Dtest=TestJoinedScanners
{code}
API changes are in classes marked @InterfaceAudience.Private

 Consider eliminating threadlocals from MVCC code
 

 Key: HBASE-9754
 URL: https://issues.apache.org/jira/browse/HBASE-9754
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Ted Yu
 Fix For: 0.98.0

 Attachments: 9754-rp-0.txt


 Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
 Currently we use ThreadLocals to communicate the current readpoint between a 
 RegionScanner and the Store\{File}Scanner's down the stack.
 Since ThreadLocals are not cheap we should consider whether it is possible to 
 pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9754) Consider eliminating threadlocals from MVCC code


[ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794755#comment-13794755
 ] 

Ted Yu commented on HBASE-9754:
---

Patch contains patch for HBASE-9753 as well.

 Consider eliminating threadlocals from MVCC code
 

 Key: HBASE-9754
 URL: https://issues.apache.org/jira/browse/HBASE-9754
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Ted Yu
 Fix For: 0.98.0

 Attachments: 9754-rp-0.txt


 Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
 Currently we use ThreadLocals to communicate the current readpoint between a 
 RegionScanner and the Store\{File}Scanner's down the stack.
 Since ThreadLocals are not cheap we should consider whether it is possible to 
 pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9754) Consider eliminating threadlocals from MVCC code


 [ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-9754:
--

Fix Version/s: 0.98.0
   Status: Patch Available  (was: Open)

 Consider eliminating threadlocals from MVCC code
 

 Key: HBASE-9754
 URL: https://issues.apache.org/jira/browse/HBASE-9754
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Ted Yu
 Fix For: 0.98.0

 Attachments: 9754-rp-0.txt


 Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
 Currently we use ThreadLocals to communicate the current readpoint between a 
 RegionScanner and the Store\{File}Scanner's down the stack.
 Since ThreadLocals are not cheap we should consider whether it is possible to 
 pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9754) Consider eliminating threadlocals from MVCC code


[ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794757#comment-13794757
 ] 

Ted Yu commented on HBASE-9754:
---

Here is review request: https://reviews.apache.org/r/14642/

 Consider eliminating threadlocals from MVCC code
 

 Key: HBASE-9754
 URL: https://issues.apache.org/jira/browse/HBASE-9754
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Ted Yu
 Fix For: 0.98.0

 Attachments: 9754-rp-0.txt


 Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
 Currently we use ThreadLocals to communicate the current readpoint between a 
 RegionScanner and the Store\{File}Scanner's down the stack.
 Since ThreadLocals are not cheap we should consider whether it is possible to 
 pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9570) With AccessDeniedException, HBase shell would be better to just display the error message to be user friendly


 [ 
https://issues.apache.org/jira/browse/HBASE-9570?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nick Dimiduk updated HBASE-9570:


Attachment: HBASE-9570-0.96.01.patch

Patch applied to trunk, not so cleanly to 0.96. I'm not sure why. [~wangy] mind 
having a look at this patch for trunk, make sure i didn't screw up something?

 With AccessDeniedException, HBase shell would be better to just display the 
 error message to be user friendly
 -

 Key: HBASE-9570
 URL: https://issues.apache.org/jira/browse/HBASE-9570
 Project: HBase
  Issue Type: Improvement
  Components: shell
Reporter: Yang Wang
Assignee: Yang Wang
Priority: Minor
 Fix For: 0.98.0

 Attachments: HBASE-9570, HBASE-9570.01.patch, HBASE-9570.02.patch, 
 HBASE-9570-0.96.01.patch


 When access unauthorized resource like table, AccessDeniedException will be 
 thrown. In HBase shell, the error message with stack trace will be displayed 
 as follows. It would be better to just display the error message avoiding the 
 stack trace to be user friendly. 
 {noformat}
 ERROR: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient 
 permissions for user 'u1' for scanner open on table t1
 at 
 org.apache.hadoop.hbase.security.access.AccessController.preScannerOpen(AccessController.java:1116)
 at 
 org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.preScannerOpen(RegionCoprocessorHost.java:1293)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.scan(HRegionServer.java:3026)
 at 
 org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:26971)
 at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2083)
 at 
 org.apache.hadoop.hbase.ipc.RpcServer$CallRunner.run(RpcServer.java:1820)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.consumerLoop(SimpleRpcScheduler.java:165)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.access$000(SimpleRpcScheduler.java:41)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler$1.run(SimpleRpcScheduler.java:113)
 at java.lang.Thread.run(Thread.java:662)
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9570) With AccessDeniedException, HBase shell would be better to just display the error message to be user friendly


[ 
https://issues.apache.org/jira/browse/HBASE-9570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794790#comment-13794790
 ] 

Nick Dimiduk commented on HBASE-9570:
-

Err, make that patch for 0.96.

 With AccessDeniedException, HBase shell would be better to just display the 
 error message to be user friendly
 -

 Key: HBASE-9570
 URL: https://issues.apache.org/jira/browse/HBASE-9570
 Project: HBase
  Issue Type: Improvement
  Components: shell
Reporter: Yang Wang
Assignee: Yang Wang
Priority: Minor
 Fix For: 0.98.0

 Attachments: HBASE-9570, HBASE-9570.01.patch, HBASE-9570.02.patch, 
 HBASE-9570-0.96.01.patch


 When access unauthorized resource like table, AccessDeniedException will be 
 thrown. In HBase shell, the error message with stack trace will be displayed 
 as follows. It would be better to just display the error message avoiding the 
 stack trace to be user friendly. 
 {noformat}
 ERROR: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient 
 permissions for user 'u1' for scanner open on table t1
 at 
 org.apache.hadoop.hbase.security.access.AccessController.preScannerOpen(AccessController.java:1116)
 at 
 org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.preScannerOpen(RegionCoprocessorHost.java:1293)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.scan(HRegionServer.java:3026)
 at 
 org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:26971)
 at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2083)
 at 
 org.apache.hadoop.hbase.ipc.RpcServer$CallRunner.run(RpcServer.java:1820)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.consumerLoop(SimpleRpcScheduler.java:165)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.access$000(SimpleRpcScheduler.java:41)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler$1.run(SimpleRpcScheduler.java:113)
 at java.lang.Thread.run(Thread.java:662)
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-5487) Generic framework for Master-coordinated tasks

2013-10-14 Thread Sergey Shelukhin (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-5487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794792#comment-13794792
 ] 

Sergey Shelukhin commented on HBASE-5487:
-

Ok, it's harder than I thought, I don't think I will be done today... but I 
think I have a clear picture now that covers the above feedback, so I am trying 
to cover all the failover scenarios and state conflicts.

 Generic framework for Master-coordinated tasks
 --

 Key: HBASE-5487
 URL: https://issues.apache.org/jira/browse/HBASE-5487
 Project: HBase
  Issue Type: New Feature
  Components: master, regionserver, Zookeeper
Affects Versions: 0.94.0
Reporter: Mubarak Seyed
Priority: Critical
 Attachments: Region management in Master.pdf


 Need a framework to execute master-coordinated tasks in a fault-tolerant 
 manner. 
 Master-coordinated tasks such as online-scheme change and delete-range 
 (deleting region(s) based on start/end key) can make use of this framework.
 The advantages of framework are
 1. Eliminate repeated code in Master, ZooKeeper tracker and Region-server for 
 master-coordinated tasks
 2. Ability to abstract the common functions across Master - ZK and RS - ZK
 3. Easy to plugin new master-coordinated tasks without adding code to core 
 components



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9754) Consider eliminating threadlocals from MVCC code


[ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794818#comment-13794818
 ] 

Hadoop QA commented on HBASE-9754:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  http://issues.apache.org/jira/secure/attachment/12608406/9754-rp-0.txt
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+1 tests included{color}.  The patch appears to include 27 new 
or modified tests.

{color:green}+1 hadoop1.0{color}.  The patch compiles against the hadoop 
1.0 profile.

{color:green}+1 hadoop2.0{color}.  The patch compiles against the hadoop 
2.0 profile.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:red}-1 findbugs{color}.  The patch appears to cause Findbugs 
(version 1.3.9) to fail.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:green}+1 lineLengths{color}.  The patch does not introduce lines 
longer than 100

{color:red}-1 site{color}.  The patch appears to cause mvn site goal to 
fail.

{color:green}+1 core tests{color}.  The patch passed unit tests in .

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7540//testReport/
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7540//console

This message is automatically generated.

 Consider eliminating threadlocals from MVCC code
 

 Key: HBASE-9754
 URL: https://issues.apache.org/jira/browse/HBASE-9754
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Ted Yu
 Fix For: 0.98.0

 Attachments: 9754-rp-0.txt


 Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
 Currently we use ThreadLocals to communicate the current readpoint between a 
 RegionScanner and the Store\{File}Scanner's down the stack.
 Since ThreadLocals are not cheap we should consider whether it is possible to 
 pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9570) With AccessDeniedException, HBase shell would be better to just display the error message to be user friendly


[ 
https://issues.apache.org/jira/browse/HBASE-9570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794820#comment-13794820
 ] 

Hadoop QA commented on HBASE-9570:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  
http://issues.apache.org/jira/secure/attachment/12608415/HBASE-9570-0.96.01.patch
  against trunk revision .

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:red}-1 tests included{color}.  The patch doesn't appear to include 
any new or modified tests.
Please justify why no new tests are needed for this 
patch.
Also please list what manual steps were performed to 
verify this patch.

{color:red}-1 patch{color}.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/7541//console

This message is automatically generated.

 With AccessDeniedException, HBase shell would be better to just display the 
 error message to be user friendly
 -

 Key: HBASE-9570
 URL: https://issues.apache.org/jira/browse/HBASE-9570
 Project: HBase
  Issue Type: Improvement
  Components: shell
Reporter: Yang Wang
Assignee: Yang Wang
Priority: Minor
 Fix For: 0.98.0

 Attachments: HBASE-9570, HBASE-9570.01.patch, HBASE-9570.02.patch, 
 HBASE-9570-0.96.01.patch


 When access unauthorized resource like table, AccessDeniedException will be 
 thrown. In HBase shell, the error message with stack trace will be displayed 
 as follows. It would be better to just display the error message avoiding the 
 stack trace to be user friendly. 
 {noformat}
 ERROR: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient 
 permissions for user 'u1' for scanner open on table t1
 at 
 org.apache.hadoop.hbase.security.access.AccessController.preScannerOpen(AccessController.java:1116)
 at 
 org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.preScannerOpen(RegionCoprocessorHost.java:1293)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.scan(HRegionServer.java:3026)
 at 
 org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:26971)
 at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2083)
 at 
 org.apache.hadoop.hbase.ipc.RpcServer$CallRunner.run(RpcServer.java:1820)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.consumerLoop(SimpleRpcScheduler.java:165)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.access$000(SimpleRpcScheduler.java:41)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler$1.run(SimpleRpcScheduler.java:113)
 at java.lang.Thread.run(Thread.java:662)
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9759) IntegrationTestBulkLoad random number collision

2013-10-14 Thread Enis Soztutar (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-9759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Enis Soztutar updated HBASE-9759:
-

Attachment: hbase-9759_v1.patch

Here is a patch for eliminating chain collision probability. We can apply this
patch to see whether the theory is correct and continue to monitor the test.

Tested the patch on a cluster.

IntegrationTestBulkLoad random number collision
---

Key: HBASE-9759
URL: https://issues.apache.org/jira/browse/HBASE-9759
Project: HBase
Issue Type: Bug
Components: test
Reporter: Enis Soztutar
Assignee: Enis Soztutar
Fix For: 0.98.0, 0.96.1

Attachments: hbase-9759_v1.patch

ITBL failed recently in our test harness. Inspecting the failure made me
believe that the only reason that particular failure might have happened is
that there is a collision in random longs generated by the test.
The test creates 50 mappers by default, and each mapper writes a 500K random
rows starting with row = 0. By default there are 5 iterations.
The check job outputs these counters:
{code}
2013-10-13 07:48:01,134 Map input records=124999751
2013-10-13 07:48:01,134 Map output records=12499
{code}
The number of input records seems fine because
{code}
124999751 = 1 + 5 * (0.5M - 1) * 50
{code}
5 = num iterations, 0.5M = num rows, 50 = num mappers, and 1 is for row =0
which every chain writes to.
Output records should be 125M, however we see one cell missing. Since the map
input records matches expected number of distinct rows, I suspect that row =
0 had a collision.
In one of the generate jobs, we can see that the reducer output count does
not match the reducer input count. Given that we are using KVSortReducer,
this confirms that there is a collision in KeyValues received by this task.
{code}
2013-10-13 06:48:12,738 Reduce input records=7500
2013-10-13 06:48:12,738 Reduce output records=7497
{code}
The count is off by 3 because we are writing 3 columns per row.
My only theory for explaining this is that we had a collision in chainId's or
one of the chains reused row = 0 as the next row.
This is similar to HBASE-8700, however, in this the probability is much much
much lower.

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9759) IntegrationTestBulkLoad random number collision

2013-10-14 Thread Enis Soztutar (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-9759?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Enis Soztutar updated HBASE-9759:
-

Status: Patch Available  (was: Open)

 IntegrationTestBulkLoad random number collision
 ---

 Key: HBASE-9759
 URL: https://issues.apache.org/jira/browse/HBASE-9759
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: Enis Soztutar
Assignee: Enis Soztutar
 Fix For: 0.98.0, 0.96.1

 Attachments: hbase-9759_v1.patch


 ITBL failed recently in our test harness. Inspecting the failure made me 
 believe that the only reason that particular failure might have happened is 
 that there is a collision in random longs generated by the test. 
 The test creates 50 mappers by default, and each mapper writes a 500K random 
 rows starting with row = 0. By default there are 5 iterations.
 The check job outputs these counters: 
 {code}
 2013-10-13 07:48:01,134 Map input records=124999751
 2013-10-13 07:48:01,134 Map output records=12499
 {code}
 The number of input records seems fine because
 {code}
 124999751 = 1 + 5 * (0.5M - 1) * 50
 {code}
 5 = num iterations, 0.5M = num rows, 50 = num mappers, and 1 is for row =0 
 which every chain writes to. 
 Output records should be 125M, however we see one cell missing. Since the map 
 input records matches expected number of distinct rows, I suspect that row = 
 0 had a collision. 
 In one of the generate jobs, we can see that the reducer output count does 
 not match the reducer input count. Given that we are using KVSortReducer, 
 this confirms that there is a collision in KeyValues received by this task.
 {code}
 2013-10-13 06:48:12,738 Reduce input records=7500
 2013-10-13 06:48:12,738 Reduce output records=7497
 {code}
 The count is off by 3 because we are writing 3 columns per row. 
 My only theory for explaining this is that we had a collision in chainId's or 
 one of the chains reused row = 0 as the next row. 
 This is similar to HBASE-8700, however, in this the probability is much much 
 much lower. 



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9754) Consider eliminating threadlocals from MVCC code


[ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794827#comment-13794827
 ] 

Ted Yu commented on HBASE-9754:
---

I saw the same number of findbugs warnings in:
https://builds.apache.org/job/PreCommit-HBASE-Build/7538/console
which was for HBASE-9748

I think the cause was the following:
{code}
[INFO] Compiling 44 source files to 
/home/jenkins/jenkins-slave/workspace/PreCommit-HBASE-Build/trunk/hbase-thrift/target/classes
#
# There is insufficient memory for the Java Runtime Environment to continue.
# Native memory allocation (malloc) failed to allocate 90184 bytes for 
Arena::Amalloc
# An error report file with more information is saved as:
# 
/home/jenkins/jenkins-slave/workspace/PreCommit-HBASE-Build/trunk/hs_err_pid8382.log
/home/jenkins/jenkins-slave/workspace/PreCommit-HBASE-Build/trunk/dev-support/test-patch.sh:
 line 561:  8382 Aborted $MVN clean package findbugs:findbugs 
-D${PROJECT_NAME}PatchProcess -DskipTests  /dev/null
{code}

 Consider eliminating threadlocals from MVCC code
 

 Key: HBASE-9754
 URL: https://issues.apache.org/jira/browse/HBASE-9754
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Ted Yu
 Fix For: 0.98.0

 Attachments: 9754-rp-0.txt


 Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
 Currently we use ThreadLocals to communicate the current readpoint between a 
 RegionScanner and the Store\{File}Scanner's down the stack.
 Since ThreadLocals are not cheap we should consider whether it is possible to 
 pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9570) With AccessDeniedException, HBase shell would be better to just display the error message to be user friendly


[ 
https://issues.apache.org/jira/browse/HBASE-9570?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794849#comment-13794849
 ] 

Hudson commented on HBASE-9570:
---

SUCCESS: Integrated in HBase-TRUNK #4617 (See 
[https://builds.apache.org/job/HBase-TRUNK/4617/])
HBASE-9570 With AccessDeniedException, HBase shell would be better to just 
display the error message to be user friendly (Yang Wang) (ndimiduk: rev 
1532163)
* /hbase/trunk/hbase-shell/src/main/ruby/shell/commands.rb


 With AccessDeniedException, HBase shell would be better to just display the 
 error message to be user friendly
 -

 Key: HBASE-9570
 URL: https://issues.apache.org/jira/browse/HBASE-9570
 Project: HBase
  Issue Type: Improvement
  Components: shell
Reporter: Yang Wang
Assignee: Yang Wang
Priority: Minor
 Fix For: 0.98.0

 Attachments: HBASE-9570, HBASE-9570.01.patch, HBASE-9570.02.patch, 
 HBASE-9570-0.96.01.patch


 When access unauthorized resource like table, AccessDeniedException will be 
 thrown. In HBase shell, the error message with stack trace will be displayed 
 as follows. It would be better to just display the error message avoiding the 
 stack trace to be user friendly. 
 {noformat}
 ERROR: org.apache.hadoop.hbase.security.AccessDeniedException: Insufficient 
 permissions for user 'u1' for scanner open on table t1
 at 
 org.apache.hadoop.hbase.security.access.AccessController.preScannerOpen(AccessController.java:1116)
 at 
 org.apache.hadoop.hbase.regionserver.RegionCoprocessorHost.preScannerOpen(RegionCoprocessorHost.java:1293)
 at 
 org.apache.hadoop.hbase.regionserver.HRegionServer.scan(HRegionServer.java:3026)
 at 
 org.apache.hadoop.hbase.protobuf.generated.ClientProtos$ClientService$2.callBlockingMethod(ClientProtos.java:26971)
 at org.apache.hadoop.hbase.ipc.RpcServer.call(RpcServer.java:2083)
 at 
 org.apache.hadoop.hbase.ipc.RpcServer$CallRunner.run(RpcServer.java:1820)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.consumerLoop(SimpleRpcScheduler.java:165)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler.access$000(SimpleRpcScheduler.java:41)
 at 
 org.apache.hadoop.hbase.ipc.SimpleRpcScheduler$1.run(SimpleRpcScheduler.java:113)
 at java.lang.Thread.run(Thread.java:662)
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9753) Excessive readpoints checks in MemstoreScanner

2013-10-14 Thread Lars Hofhansl (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-9753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794850#comment-13794850
 ] 

Lars Hofhansl commented on HBASE-9753:
--

That's it? Nice. +1, Let's definitely do this (0.94+).


 Excessive readpoints checks in MemstoreScanner
 --

 Key: HBASE-9753
 URL: https://issues.apache.org/jira/browse/HBASE-9753
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Vladimir Rodionov
 Attachments: 9753-v1.txt


 Brought up by [~vrodionov] on the mailing list. See also HBASE-9751.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9137) Add Tag dictionary in WAL compression

2013-10-14 Thread Anoop Sam John (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Anoop Sam John updated HBASE-9137:
--

  Resolution: Fixed
Release Note: Tag compression will work when WAL compression is enabled. 
Set hbase.regionserver.wal.tags.enablecompression false to disable tag 
compression in WAL. (This property defaults to true)
Hadoop Flags: Reviewed
  Status: Resolved  (was: Patch Available)

Committed to Trunk. Thanks for the reviews

 Add Tag dictionary in WAL compression
 -

 Key: HBASE-9137
 URL: https://issues.apache.org/jira/browse/HBASE-9137
 Project: HBase
  Issue Type: Sub-task
Affects Versions: 0.98.0
Reporter: ramkrishna.s.vasudevan
Assignee: Anoop Sam John
 Fix For: 0.98.0

 Attachments: HBASE-9137.patch, HBASE-9137_V2.patch, 
 HBASE-9137_V3.patch


 We can add tag dictionary like we have one for rowdictionary, 
 familydictionary.  But this has to be done after stabilizing HBASE-7391.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9754) Consider eliminating threadlocals from MVCC code

2013-10-14 Thread Lars Hofhansl (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794855#comment-13794855
 ] 

Lars Hofhansl commented on HBASE-9754:
--

Did a quick read through. Looks good. Need to look in more detail.
Using 0 as readPt when unknown is interesting.

Could we use the readPt to replace the hasMVCC stuff I just added in 
HBASE-9751? I.e. pass MAX_LONG as readPt when we would have passed 
hasMVCC=true, in which case we know we do not need to check the memstoreTs...?

I'd be in favor of committing HBASE-9753 first as that is simple, and 
scrutinize this one a bit more.


 Consider eliminating threadlocals from MVCC code
 

 Key: HBASE-9754
 URL: https://issues.apache.org/jira/browse/HBASE-9754
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Ted Yu
 Fix For: 0.98.0

 Attachments: 9754-rp-0.txt


 Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
 Currently we use ThreadLocals to communicate the current readpoint between a 
 RegionScanner and the Store\{File}Scanner's down the stack.
 Since ThreadLocals are not cheap we should consider whether it is possible to 
 pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9759) IntegrationTestBulkLoad random number collision

[
https://issues.apache.org/jira/browse/HBASE-9759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794859#comment-13794859
]

stack commented on HBASE-9759:
--

+1 on trying the patch. How does it provent collision (I did not review
closely).

If you do a select on row 0, does it have more versions than other rows.

What is to prevent our clashing randomly on another row? Because our randoms
generation is within a fixed range per iteration?

IntegrationTestBulkLoad random number collision
---

Key: HBASE-9759
URL: https://issues.apache.org/jira/browse/HBASE-9759
Project: HBase
Issue Type: Bug
Components: test
Reporter: Enis Soztutar
Assignee: Enis Soztutar
Fix For: 0.98.0, 0.96.1

Attachments: hbase-9759_v1.patch

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9759) IntegrationTestBulkLoad random number collision


[ 
https://issues.apache.org/jira/browse/HBASE-9759?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794860#comment-13794860
 ] 

stack commented on HBASE-9759:
--

Why this

-s.setBatch(100);
+s.setBatch(1000);



 IntegrationTestBulkLoad random number collision
 ---

 Key: HBASE-9759
 URL: https://issues.apache.org/jira/browse/HBASE-9759
 Project: HBase
  Issue Type: Bug
  Components: test
Reporter: Enis Soztutar
Assignee: Enis Soztutar
 Fix For: 0.98.0, 0.96.1

 Attachments: hbase-9759_v1.patch


 ITBL failed recently in our test harness. Inspecting the failure made me 
 believe that the only reason that particular failure might have happened is 
 that there is a collision in random longs generated by the test. 
 The test creates 50 mappers by default, and each mapper writes a 500K random 
 rows starting with row = 0. By default there are 5 iterations.
 The check job outputs these counters: 
 {code}
 2013-10-13 07:48:01,134 Map input records=124999751
 2013-10-13 07:48:01,134 Map output records=12499
 {code}
 The number of input records seems fine because
 {code}
 124999751 = 1 + 5 * (0.5M - 1) * 50
 {code}
 5 = num iterations, 0.5M = num rows, 50 = num mappers, and 1 is for row =0 
 which every chain writes to. 
 Output records should be 125M, however we see one cell missing. Since the map 
 input records matches expected number of distinct rows, I suspect that row = 
 0 had a collision. 
 In one of the generate jobs, we can see that the reducer output count does 
 not match the reducer input count. Given that we are using KVSortReducer, 
 this confirms that there is a collision in KeyValues received by this task.
 {code}
 2013-10-13 06:48:12,738 Reduce input records=7500
 2013-10-13 06:48:12,738 Reduce output records=7497
 {code}
 The count is off by 3 because we are writing 3 columns per row. 
 My only theory for explaining this is that we had a collision in chainId's or 
 one of the chains reused row = 0 as the next row. 
 This is similar to HBASE-8700, however, in this the probability is much much 
 much lower. 



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9754) Consider eliminating threadlocals from MVCC code


[ 
https://issues.apache.org/jira/browse/HBASE-9754?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794863#comment-13794863
 ] 

Ted Yu commented on HBASE-9754:
---

bq. scrutinize this one a bit more.

Take your time.
Depending on how long the review takes, we can decide which one to commit 
first. HBASE-9753 is a subset of this.

 Consider eliminating threadlocals from MVCC code
 

 Key: HBASE-9754
 URL: https://issues.apache.org/jira/browse/HBASE-9754
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Ted Yu
 Fix For: 0.98.0

 Attachments: 9754-rp-0.txt


 Brought up by [~vrodionov] and [~yuzhih...@gmail.com].
 Currently we use ThreadLocals to communicate the current readpoint between a 
 RegionScanner and the Store\{File}Scanner's down the stack.
 Since ThreadLocals are not cheap we should consider whether it is possible to 
 pass the readpoint through the call stack instead.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Updated] (HBASE-9753) Excessive readpoints checks in MemstoreScanner


 [ 
https://issues.apache.org/jira/browse/HBASE-9753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ted Yu updated HBASE-9753:
--

Status: Patch Available  (was: Open)

 Excessive readpoints checks in MemstoreScanner
 --

 Key: HBASE-9753
 URL: https://issues.apache.org/jira/browse/HBASE-9753
 Project: HBase
  Issue Type: Bug
Reporter: Lars Hofhansl
Assignee: Vladimir Rodionov
 Attachments: 9753-v1.txt


 Brought up by [~vrodionov] on the mailing list. See also HBASE-9751.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-5487) Generic framework for Master-coordinated tasks

2013-10-14 Thread Eric Newton (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-5487?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794862#comment-13794862
]

Eric Newton commented on HBASE-5487:

Accumulo does manage tablet (region) assignment tracking through the metadata
table, and further, uses a distributed state machine to scale up a little
beyond a single master node. I have been meaning to write it up, but I've not
had a chance.

I've not kept up with every HBase improvement, so I don't know if it is
pertinent... the accumulo metadata table is typically spread out over 50 - 100%
of the available tablet (region) servers.

Still, the metadata table, and especially the root table(t), is subject to
hot-spotting on large map/reduce jobs where hundreds (or thousands) of clients
are learning tablet locations at the same time. Block caching is important,
but at some point massive numbers of simultaneous RPC requests to a single node
cause delays, or even timeouts and failures.

But using accumulo to store accumulo state has scaled well.

Accumulo has 2 frameworks for master tasks:

* master general state processing: a table should be online, assignments are
recorded and servers repeatedly informed
* FATE processing, where multi-stage operations are saved, executed and
progress is re-recorded

The first is general maintenance: keeping the system running. Tablets are
assigned, unassigned and in-general balanced.

The second allows for temporal deviance: tablets are taken offline for a merge,
for example. The step-by-step allocation of resources and state are walked,
each step recording progress in zookeeper.

Generic framework for Master-coordinated tasks
--

Key: HBASE-5487
URL: https://issues.apache.org/jira/browse/HBASE-5487
Project: HBase
Issue Type: New Feature
Components: master, regionserver, Zookeeper
Affects Versions: 0.94.0
Reporter: Mubarak Seyed
Priority: Critical
Attachments: Region management in Master.pdf

Need a framework to execute master-coordinated tasks in a fault-tolerant
manner.
Master-coordinated tasks such as online-scheme change and delete-range
(deleting region(s) based on start/end key) can make use of this framework.
The advantages of framework are
1. Eliminate repeated code in Master, ZooKeeper tracker and Region-server for
master-coordinated tasks
2. Ability to abstract the common functions across Master - ZK and RS - ZK
3. Easy to plugin new master-coordinated tasks without adding code to core
components

--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-8741) Scope sequenceid to the region rather than regionserver (WAS: Mutations on Regions in recovery mode might have same sequenceIDs)


[ 
https://issues.apache.org/jira/browse/HBASE-8741?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13794866#comment-13794866
 ] 

Himanshu Vashishtha commented on HBASE-8741:


I remember this, heapsize failing on apache pre-commit m/c. Let me take a
look at it.





 Scope sequenceid to the region rather than regionserver (WAS: Mutations on 
 Regions in recovery mode might have same sequenceIDs)
 

 Key: HBASE-8741
 URL: https://issues.apache.org/jira/browse/HBASE-8741
 Project: HBase
  Issue Type: Bug
  Components: MTTR
Affects Versions: 0.95.1
Reporter: Himanshu Vashishtha
Assignee: Himanshu Vashishtha
 Fix For: 0.98.0

 Attachments: HBASE-8741-trunk-v6.patch, HBASE-8741-v0.patch, 
 HBASE-8741-v2.patch, HBASE-8741-v3.patch, HBASE-8741-v4-again.patch, 
 HBASE-8741-v4-again.patch, HBASE-8741-v4.patch, HBASE-8741-v5-again.patch, 
 HBASE-8741-v5.patch


 Currently, when opening a region, we find the maximum sequence ID from all 
 its HFiles and then set the LogSequenceId of the log (in case the later is at 
 a small value). This works good in recovered.edits case as we are not writing 
 to the region until we have replayed all of its previous edits. 
 With distributed log replay, if we want to enable writes while a region is 
 under recovery, we need to make sure that the logSequenceId  maximum 
 logSequenceId of the old regionserver. Otherwise, we might have a situation 
 where new edits have same (or smaller) sequenceIds. 
 We can store region level information in the WALTrailer, than this scenario 
 could be avoided by:
 a) reading the trailer of the last completed file, i.e., last wal file 
 which has a trailer and,
 b) completely reading the last wal file (this file would not have the 
 trailer, so it needs to be read completely).
 In future, if we switch to multi wal file, we could read the trailer for all 
 completed WAL files, and reading the remaining incomplete files.



--
This message was sent by Atlassian JIRA
(v6.1#6144)

[jira] [Commented] (HBASE-9754) Consider eliminating threadlocals from MVCC code