[jira] [Commented] (HBASE-11279) Block cache could be disabled by mistake

2014-06-01 Thread Nick Dimiduk (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11279?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14014937#comment-14014937
 ] 

Nick Dimiduk commented on HBASE-11279:
--

Strange. Do you know what the value of cacheConfig was?

 Block cache could be disabled by mistake
 

 Key: HBASE-11279
 URL: https://issues.apache.org/jira/browse/HBASE-11279
 Project: HBase
  Issue Type: Bug
Reporter: Jimmy Xiang
Assignee: Jimmy Xiang

 There is a weird test failure:
 {noformat}
 java.lang.AssertionError
   at org.junit.Assert.fail(Assert.java:86)
   at org.junit.Assert.assertTrue(Assert.java:41)
   at org.junit.Assert.assertTrue(Assert.java:52)
   at 
 org.apache.hadoop.hbase.io.hfile.TestPrefetch.readStoreFile(TestPrefetch.java:96)
   at 
 org.apache.hadoop.hbase.io.hfile.TestPrefetch.testPrefetch(TestPrefetch.java:66)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
   at java.lang.reflect.Method.invoke(Method.java:606)
   at 
 org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:47)
   at 
 org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
   at 
 org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:44)
   at 
 org.junit.internal.runners.statements.InvokeMethod.evaluate(InvokeMethod.java:17)
   at 
 org.junit.internal.runners.statements.FailOnTimeout$StatementThread.run(FailOnTimeout.java:74)
 {noformat}
 Looked into it, and found it could be because CacheConfig tries to compare a 
 float with a long. Probably we should do this instead:
 {noformat}
 -if (cachePercentage == 0L) {
 +if (cachePercentage = 0.0001f) {
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11277) RPCServer threads can wedge under high load

2014-06-01 Thread Lars Hofhansl (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11277?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14015018#comment-14015018
 ] 

Lars Hofhansl commented on HBASE-11277:
---

Still delay on apache email?
[~apurtell], agree with what you said.

 RPCServer threads can wedge under high load
 ---

 Key: HBASE-11277
 URL: https://issues.apache.org/jira/browse/HBASE-11277
 Project: HBase
  Issue Type: Bug
Affects Versions: 0.96.2, 0.98.3
Reporter: Andrew Purtell
Assignee: Andrew Purtell
Priority: Critical
 Fix For: 0.99.0, 0.96.3, 0.98.3

 Attachments: HBASE-11277.patch


 This is with 0.98.0 in an insecure setup with 7u55 and 7u60. Under high load, 
 RPCServer threads can wedge, fail to make progess, and consume 100% CPU time 
 on a core indefinitely. 
 Dumping threads, all threads are in BLOCKED or IN_NATIVE state. The IN_NATIVE 
 threads are mostly in EPollArrayWrapper.epollWait or 
 FileDispatcherImpl.read0. The number of threads found in 
 FileDispatcherImpl.read0 correspond to the number of runaway threads expected 
 based on looking at 'top' output. These look like:
 {noformat}
 Thread 64758: (state = IN_NATIVE)
  - sun.nio.ch.FileDispatcherImpl.read0(java.io.FileDescriptor, long, int) 
 @bci=0 (Compiled frame; information may be imprecise)
  - sun.nio.ch.SocketDispatcher.read(java.io.FileDescriptor, long, int) 
 @bci=4, line=39 (Compiled frame)
  - sun.nio.ch.IOUtil.readIntoNativeBuffer(java.io.FileDescriptor, 
 java.nio.ByteBuffer, long, sun.nio.ch.NativeDispatcher) @bci=114, line=223 
 (Compil
 ed frame)
  - sun.nio.ch.IOUtil.read(java.io.FileDescriptor, java.nio.ByteBuffer, long, 
 sun.nio.ch.NativeDispatcher) @bci=48, line=197 (Compiled frame)
  - sun.nio.ch.SocketChannelImpl.read(java.nio.ByteBuffer) @bci=234, line=379 
 (Compiled frame)
  - 
 org.apache.hadoop.hbase.ipc.RpcServer.channelRead(java.nio.channels.ReadableByteChannel,
  java.nio.ByteBuffer) @bci=12, line=2224 (Compiled frame)
  - org.apache.hadoop.hbase.ipc.RpcServer$Connection.readAndProcess() 
 @bci=509, line=1488 (Compiled frame)
  - 
 org.apache.hadoop.hbase.ipc.RpcServer$Listener.doRead(java.nio.channels.SelectionKey)
  @bci=23, line=790 (Compiled frame)
  - org.apache.hadoop.hbase.ipc.RpcServer$Listener$Reader.doRunLoop() @bci=97, 
 line=581 (Compiled frame)
  - org.apache.hadoop.hbase.ipc.RpcServer$Listener$Reader.run() @bci=1, 
 line=556 (Interpreted frame)
  - 
 java.util.concurrent.ThreadPoolExecutor.runWorker(java.util.concurrent.ThreadPoolExecutor$Worker)
  @bci=95, line=1145 (Interpreted frame)
  - java.util.concurrent.ThreadPoolExecutor$Worker.run() @bci=5, line=615 
 (Interpreted frame)
  - java.lang.Thread.run() @bci=11, line=745 (Interpreted frame)
 {noformat}



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-6701) Revisit thrust of paragraph on splitting

2014-06-01 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-6701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-6701:
---

Attachment: (was: HBASE-6701-rebased.patch)

 Revisit thrust of paragraph on splitting
 

 Key: HBASE-6701
 URL: https://issues.apache.org/jira/browse/HBASE-6701
 Project: HBase
  Issue Type: Bug
  Components: documentation
Reporter: stack
Assignee: Misty Stanley-Jones
 Attachments: HBASE-6701.patch


 See the thread 'md5 hash key and splits' for the confusion our paragraph on 
 splitting seems to bring on (as well as good input on when manual splitting 
 might be favored).  The user is under the impression that he needs to 
 manually split though his keys have md5 salt.  The paragraph needs to make 
 sure it does not bring on such confusion as it would seem to in this case.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-6701) Revisit thrust of paragraph on splitting

2014-06-01 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-6701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-6701:
---

Attachment: HBASE-6701-rebased.patch

 Revisit thrust of paragraph on splitting
 

 Key: HBASE-6701
 URL: https://issues.apache.org/jira/browse/HBASE-6701
 Project: HBase
  Issue Type: Bug
  Components: documentation
Reporter: stack
Assignee: Misty Stanley-Jones
 Attachments: HBASE-6701.patch


 See the thread 'md5 hash key and splits' for the confusion our paragraph on 
 splitting seems to bring on (as well as good input on when manual splitting 
 might be favored).  The user is under the impression that he needs to 
 manually split though his keys have md5 salt.  The paragraph needs to make 
 sure it does not bring on such confusion as it would seem to in this case.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-6701) Revisit thrust of paragraph on splitting

2014-06-01 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-6701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-6701:
---

Attachment: HBASE-6701-rebased.patch

Rebased -- this patch seems to contain some whitespace-related changes. Sorry 
about that.

 Revisit thrust of paragraph on splitting
 

 Key: HBASE-6701
 URL: https://issues.apache.org/jira/browse/HBASE-6701
 Project: HBase
  Issue Type: Bug
  Components: documentation
Reporter: stack
Assignee: Misty Stanley-Jones
 Attachments: HBASE-6701-rebased.patch, HBASE-6701.patch


 See the thread 'md5 hash key and splits' for the confusion our paragraph on 
 splitting seems to bring on (as well as good input on when manual splitting 
 might be favored).  The user is under the impression that he needs to 
 manually split though his keys have md5 salt.  The paragraph needs to make 
 sure it does not bring on such confusion as it would seem to in this case.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-6701) Revisit thrust of paragraph on splitting

2014-06-01 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-6701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-6701:
---

Attachment: (was: HBASE-6701-rebased.patch)

 Revisit thrust of paragraph on splitting
 

 Key: HBASE-6701
 URL: https://issues.apache.org/jira/browse/HBASE-6701
 Project: HBase
  Issue Type: Bug
  Components: documentation
Reporter: stack
Assignee: Misty Stanley-Jones
 Attachments: HBASE-6701.patch


 See the thread 'md5 hash key and splits' for the confusion our paragraph on 
 splitting seems to bring on (as well as good input on when manual splitting 
 might be favored).  The user is under the impression that he needs to 
 manually split though his keys have md5 salt.  The paragraph needs to make 
 sure it does not bring on such confusion as it would seem to in this case.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11278) Port HBASE-11201 'Enable global procedure members to return values to procedure master' to 0.98

2014-06-01 Thread Andrew Purtell (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14015135#comment-14015135
 ] 

Andrew Purtell commented on HBASE-11278:


Who is requesting the backport? Is there a use case?

 Port HBASE-11201 'Enable global procedure members to return values to 
 procedure master' to 0.98
 ---

 Key: HBASE-11278
 URL: https://issues.apache.org/jira/browse/HBASE-11278
 Project: HBase
  Issue Type: Task
Reporter: Ted Yu

 HBASE-11201 adds the capability for procedure members to return data back to 
 the master.
 This issue is to backport the feature to 0.98



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-6701) Revisit thrust of paragraph on splitting

2014-06-01 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-6701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-6701:
---

Attachment: HBASE-6701-rebased.patch

 Revisit thrust of paragraph on splitting
 

 Key: HBASE-6701
 URL: https://issues.apache.org/jira/browse/HBASE-6701
 Project: HBase
  Issue Type: Bug
  Components: documentation
Reporter: stack
Assignee: Misty Stanley-Jones
 Attachments: HBASE-6701-rebased.patch, HBASE-6701.patch


 See the thread 'md5 hash key and splits' for the confusion our paragraph on 
 splitting seems to bring on (as well as good input on when manual splitting 
 might be favored).  The user is under the impression that he needs to 
 manually split though his keys have md5 salt.  The paragraph needs to make 
 sure it does not bring on such confusion as it would seem to in this case.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-6701) Revisit thrust of paragraph on splitting

2014-06-01 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14015140#comment-14015140
 ] 

Hadoop QA commented on HBASE-6701:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  
http://issues.apache.org/jira/secure/attachment/12647844/HBASE-6701-rebased.patch
  against trunk revision .
  ATTACHMENT ID: 12647844

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+0 tests included{color}.  The patch appears to be a 
documentation patch that doesn't require tests.

{color:red}-1 patch{color}.  The patch command could not apply the patch.

Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9659//console

This message is automatically generated.

 Revisit thrust of paragraph on splitting
 

 Key: HBASE-6701
 URL: https://issues.apache.org/jira/browse/HBASE-6701
 Project: HBase
  Issue Type: Bug
  Components: documentation
Reporter: stack
Assignee: Misty Stanley-Jones
 Attachments: HBASE-6701-rebased.patch, HBASE-6701.patch


 See the thread 'md5 hash key and splits' for the confusion our paragraph on 
 splitting seems to bring on (as well as good input on when manual splitting 
 might be favored).  The user is under the impression that he needs to 
 manually split though his keys have md5 salt.  The paragraph needs to make 
 sure it does not bring on such confusion as it would seem to in this case.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Assigned] (HBASE-11204) Document bandwidth consumption limit feature for ExportSnapshot

2014-06-01 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones reassigned HBASE-11204:
---

Assignee: Misty Stanley-Jones

 Document bandwidth consumption limit feature for ExportSnapshot
 ---

 Key: HBASE-11204
 URL: https://issues.apache.org/jira/browse/HBASE-11204
 Project: HBase
  Issue Type: Task
  Components: snapshots
Reporter: Ted Yu
Assignee: Misty Stanley-Jones
Priority: Minor

 http://hbase.apache.org/book.html#ops.snapshots.export should document 
 bandwidth consumption limit feature which is implemented by HBASE-11083 and 
 HBASE-11090



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-6701) Revisit thrust of paragraph on splitting

2014-06-01 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-6701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-6701:
---

Attachment: (was: HBASE-6701-rebased.patch)

 Revisit thrust of paragraph on splitting
 

 Key: HBASE-6701
 URL: https://issues.apache.org/jira/browse/HBASE-6701
 Project: HBase
  Issue Type: Bug
  Components: documentation
Reporter: stack
Assignee: Misty Stanley-Jones
 Attachments: HBASE-6701.patch


 See the thread 'md5 hash key and splits' for the confusion our paragraph on 
 splitting seems to bring on (as well as good input on when manual splitting 
 might be favored).  The user is under the impression that he needs to 
 manually split though his keys have md5 salt.  The paragraph needs to make 
 sure it does not bring on such confusion as it would seem to in this case.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-6701) Revisit thrust of paragraph on splitting

2014-06-01 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-6701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-6701:
---

Attachment: HBASE-6701-rebased.patch

Had to use --no-prefix to get the patch to work.

 Revisit thrust of paragraph on splitting
 

 Key: HBASE-6701
 URL: https://issues.apache.org/jira/browse/HBASE-6701
 Project: HBase
  Issue Type: Bug
  Components: documentation
Reporter: stack
Assignee: Misty Stanley-Jones
 Attachments: HBASE-6701-rebased.patch, HBASE-6701.patch


 See the thread 'md5 hash key and splits' for the confusion our paragraph on 
 splitting seems to bring on (as well as good input on when manual splitting 
 might be favored).  The user is under the impression that he needs to 
 manually split though his keys have md5 salt.  The paragraph needs to make 
 sure it does not bring on such confusion as it would seem to in this case.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Created] (HBASE-11280) Document distributed log replay

2014-06-01 Thread Misty Stanley-Jones (JIRA)
Misty Stanley-Jones created HBASE-11280:
---

 Summary: Document distributed log replay
 Key: HBASE-11280
 URL: https://issues.apache.org/jira/browse/HBASE-11280
 Project: HBase
  Issue Type: Sub-task
  Components: MTTR, wal
Reporter: Misty Stanley-Jones
Assignee: stack
 Fix For: 0.99.0


Enable 'distributed log replay' by default.  Depends on hfilev3 being enabled.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Assigned] (HBASE-11280) Document distributed log replay

2014-06-01 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones reassigned HBASE-11280:
---

Assignee: Misty Stanley-Jones  (was: stack)

 Document distributed log replay
 ---

 Key: HBASE-11280
 URL: https://issues.apache.org/jira/browse/HBASE-11280
 Project: HBase
  Issue Type: Sub-task
  Components: documentation, MTTR, wal
Reporter: Misty Stanley-Jones
Assignee: Misty Stanley-Jones
 Fix For: 0.99.0


 Enable 'distributed log replay' by default.  Depends on hfilev3 being enabled.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11280) Document distributed log replay

2014-06-01 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-11280:


Component/s: documentation

 Document distributed log replay
 ---

 Key: HBASE-11280
 URL: https://issues.apache.org/jira/browse/HBASE-11280
 Project: HBase
  Issue Type: Sub-task
  Components: documentation, MTTR, wal
Reporter: Misty Stanley-Jones
Assignee: stack
 Fix For: 0.99.0


 Enable 'distributed log replay' by default.  Depends on hfilev3 being enabled.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11238) Add info about SlabCache and BucketCache to Ref Guide

2014-06-01 Thread Dima Spivak (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11238?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14015169#comment-14015169
 ] 

Dima Spivak commented on HBASE-11238:
-

Just as a thought: it might be worth noting on which machines particular files 
need to be modified (e.g. those running RegionServers) for things like this. 
Playing around with setting up BucketCache earlier today, I inadvertently 
modified hbase-env.sh on my gateway before understanding why my RS' couldn't 
restart without throwing OutOfMemoryError.

 Add info about SlabCache and BucketCache to Ref Guide
 -

 Key: HBASE-11238
 URL: https://issues.apache.org/jira/browse/HBASE-11238
 Project: HBase
  Issue Type: Bug
  Components: documentation
Affects Versions: 0.98.2
Reporter: Misty Stanley-Jones
Assignee: Misty Stanley-Jones
 Attachments: HBASE-11238-2.patch, HBASE-11238-3.patch, 
 HBASE-11238.patch


 Upstream issues: HBASE-11171 and HBASE-11098. Could back port some of what is 
 in these issues, the package-info.java class for instance.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-6701) Revisit thrust of paragraph on splitting

2014-06-01 Thread Hadoop QA (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-6701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14015171#comment-14015171
 ] 

Hadoop QA commented on HBASE-6701:
--

{color:red}-1 overall{color}.  Here are the results of testing the latest 
attachment 
  
http://issues.apache.org/jira/secure/attachment/12647848/HBASE-6701-rebased.patch
  against trunk revision .
  ATTACHMENT ID: 12647848

{color:green}+1 @author{color}.  The patch does not contain any @author 
tags.

{color:green}+0 tests included{color}.  The patch appears to be a 
documentation patch that doesn't require tests.

{color:green}+1 javac{color}.  The applied patch does not increase the 
total number of javac compiler warnings.

{color:green}+1 javadoc{color}.  The javadoc tool did not generate any 
warning messages.

{color:green}+1 findbugs{color}.  The patch does not introduce any new 
Findbugs (version 1.3.9) warnings.

{color:green}+1 release audit{color}.  The applied patch does not increase 
the total number of release audit warnings.

{color:red}-1 lineLengths{color}.  The patch introduces the following lines 
longer than 100:
+   paraThe balancer is a periodic operation which is run on the 
master to redistribute regions on the cluster.  It is configured via
+   paraSee xref linkend=master.processes.loadbalancer / for more 
information on the LoadBalancer.
+   paraDo not turn off block cache (You'd do it by setting 
varnamehbase.block.cache.size/varname to zero).
+   Currently we do not do well if you do this because the regionserver 
will spend all its time loading hfile
+   indices over and over again.  If your working set it such that 
block cache does you no good, at least
+   size the block cache such that hfile indices will stay up in the 
cache (you can get a rough idea
+   on the size you need by surveying regionserver UIs; you'll see 
index block size accounted near the
+  titlelink 
xlink:href=http://en.wikipedia.org/wiki/Nagle's_algorithmNagle's/link or 
the small package problem/title
+  link 
xlink:href=http://search-hadoop.com/m/pduLg2fydtE/Inconsistent+scan+performance+with+caching+set+amp;subj=Re+Inconsistent+scan+performance+with+caching+set+to+1;Inconsistent
 scan performance with caching set to 1/link
+  see the graphs on the tail of link 
xlink:href=https://issues.apache.org/jira/browse/HBASE-7008;HBASE-7008 Set 
scanner caching to a better default/link

  {color:green}+1 site{color}.  The mvn site goal succeeds with this patch.

 {color:red}-1 core tests{color}.  The patch failed these unit tests:
   org.apache.hadoop.hbase.client.TestMultiParallel

Test results: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9660//testReport/
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9660//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-examples.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9660//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9660//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-prefix-tree.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9660//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-thrift.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9660//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-server.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9660//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-client.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9660//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-hadoop2-compat.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9660//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-common.html
Findbugs warnings: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9660//artifact/trunk/patchprocess/newPatchFindbugsWarningshbase-protocol.html
Console output: 
https://builds.apache.org/job/PreCommit-HBASE-Build/9660//console

This message is automatically generated.

 Revisit thrust of paragraph on splitting
 

 Key: HBASE-6701
 URL: https://issues.apache.org/jira/browse/HBASE-6701
 Project: HBase
  Issue Type: Bug
  Components: documentation
Reporter: stack
Assignee: Misty Stanley-Jones
 Attachments: HBASE-6701-rebased.patch, HBASE-6701.patch


 See the thread 'md5 hash key and splits' for the confusion our paragraph on 
 splitting seems to bring on (as well as good input on when manual splitting 
 might be favored).  The user is under the impression that he needs 

[jira] [Updated] (HBASE-11238) Add info about SlabCache and BucketCache to Ref Guide

2014-06-01 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-11238:


Attachment: HBASE-11238-4.patch

Rebased again and added Dima's info about where the configuration should 
happen. I also did a bit more spot-refactoring but didn't change any content.

 Add info about SlabCache and BucketCache to Ref Guide
 -

 Key: HBASE-11238
 URL: https://issues.apache.org/jira/browse/HBASE-11238
 Project: HBase
  Issue Type: Bug
  Components: documentation
Affects Versions: 0.98.2
Reporter: Misty Stanley-Jones
Assignee: Misty Stanley-Jones
 Attachments: HBASE-11238-2.patch, HBASE-11238-3.patch, 
 HBASE-11238-4.patch, HBASE-11238.patch


 Upstream issues: HBASE-11171 and HBASE-11098. Could back port some of what is 
 in these issues, the package-info.java class for instance.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-11280) Document distributed log replay

2014-06-01 Thread Misty Stanley-Jones (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-11280?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Misty Stanley-Jones updated HBASE-11280:


Release Note:   (was: Enable Distributed Log Replay (DLR) as default 
WAL/hlog splitting mechanism. Replaces Distributed Log Splitting (DLS) system.  
Improves MTTR (the more regions a server is carrying, the better the 
improvement.  Also, we can start taking writes during WAL/hlog recovery).  
Requires HFileV3 (On by default in 0.99/1.0).  See HBASE-7006 to learn more 
about DLR.)

 Document distributed log replay
 ---

 Key: HBASE-11280
 URL: https://issues.apache.org/jira/browse/HBASE-11280
 Project: HBase
  Issue Type: Sub-task
  Components: documentation, MTTR, wal
Reporter: Misty Stanley-Jones
Assignee: Misty Stanley-Jones
 Fix For: 0.99.0


 Enable 'distributed log replay' by default.  Depends on hfilev3 being enabled.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11280) Document distributed log replay

2014-06-01 Thread Misty Stanley-Jones (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11280?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14015173#comment-14015173
 ] 

Misty Stanley-Jones commented on HBASE-11280:
-

http://blog.cloudera.com/blog/2012/07/hbase-log-splitting/

 Document distributed log replay
 ---

 Key: HBASE-11280
 URL: https://issues.apache.org/jira/browse/HBASE-11280
 Project: HBase
  Issue Type: Sub-task
  Components: documentation, MTTR, wal
Reporter: Misty Stanley-Jones
Assignee: Misty Stanley-Jones
 Fix For: 0.99.0


 Enable 'distributed log replay' by default.  Depends on hfilev3 being enabled.



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-10935) support snapshot policy where flush memstore can be skipped to prevent production cluster freeze

2014-06-01 Thread Tianying Chang (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-10935?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14015176#comment-14015176
 ] 

Tianying Chang commented on HBASE-10935:


[~mbertozzi] Sure! Thanks!

 support snapshot policy where flush memstore can be skipped to prevent 
 production cluster freeze
 

 Key: HBASE-10935
 URL: https://issues.apache.org/jira/browse/HBASE-10935
 Project: HBase
  Issue Type: New Feature
  Components: shell, snapshots
Affects Versions: 0.94.7, 0.94.18
Reporter: Tianying Chang
Assignee: Tianying Chang
Priority: Minor
 Fix For: 0.94.7, 0.99.0, 0.94.20

 Attachments: HBASE-10935-0.94-v1.patch, HBASE-10935-0.98-v1.patch, 
 HBASE-10935-trunk-v1.patch, hbase-10935-94.patch, hbase-10935-trunk.patch


 We are using snapshot feature to do HBase disaster recovery. We will do 
 snapshot in our production cluster periodically. The current flush snapshot 
 policy require all regions of the table to coordinate to prevent write and do 
 flush at the same time. Since we use WALPlayer to complete the data that is 
 not in the snapshot HFile, we don't need the snapshot to do coordinated 
 flush. The snapshot just recored all the HFile that are already there. 
 I added the parameter in the HBase shell. So people can choose to use the 
 NoFlush snapshot when they need, like below. Otherwise, the default flush 
 snpahot support is not impacted. 
 snaphot 'TestTable', 'TestSnapshot', 'skipFlush'



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-11278) Port HBASE-11201 'Enable global procedure members to return values to procedure master' to 0.98

2014-06-01 Thread Jerry He (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-11278?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14015183#comment-14015183
 ] 

Jerry He commented on HBASE-11278:
--

Hi, Andrew, Ted

I plan to use HBASE-11201 for HBASE-11148.
If  there is a need for HBASE-11148 to go into 0.98 eventually, there will be a 
need for this backport.

 Port HBASE-11201 'Enable global procedure members to return values to 
 procedure master' to 0.98
 ---

 Key: HBASE-11278
 URL: https://issues.apache.org/jira/browse/HBASE-11278
 Project: HBase
  Issue Type: Task
Reporter: Ted Yu

 HBASE-11201 adds the capability for procedure members to return data back to 
 the master.
 This issue is to backport the feature to 0.98



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Updated] (HBASE-8073) HFileOutputFormat support for offline operation

2014-06-01 Thread Jerry He (JIRA)

 [ 
https://issues.apache.org/jira/browse/HBASE-8073?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jerry He updated HBASE-8073:


Attachment: HBASE-8073-trunk-v1.patch

 HFileOutputFormat support for offline operation
 ---

 Key: HBASE-8073
 URL: https://issues.apache.org/jira/browse/HBASE-8073
 Project: HBase
  Issue Type: Sub-task
  Components: mapreduce
Reporter: Nick Dimiduk
 Fix For: 0.99.0

 Attachments: HBASE-8073-trunk-v0.patch, HBASE-8073-trunk-v1.patch


 When using HFileOutputFormat to generate HFiles, it inspects the region 
 topology of the target table. The split points from that table are used to 
 guide the TotalOrderPartitioner. If the target table does not exist, it is 
 first created. This imposes an unnecessary dependence on an online HBase and 
 existing table.
 If the table exists, it can be used. However, the job can be smarter. For 
 example, if there's far more data going into the HFiles than the table 
 currently contains, the table regions aren't very useful for data split 
 points. Instead, the input data can be sampled to produce split points more 
 meaningful to the dataset. LoadIncrementalHFiles is already capable of 
 handling divergence between HFile boundaries and table regions, so this 
 should not pose any additional burdon at load time.
 The proper method of sampling the data likely requires a custom input format 
 and an additional map-reduce job perform the sampling. See a relevant 
 implementation: 
 https://github.com/alexholmes/hadoop-book/blob/master/src/main/java/com/manning/hip/ch4/sampler/ReservoirSamplerInputFormat.java



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HBASE-8073) HFileOutputFormat support for offline operation

2014-06-01 Thread Jerry He (JIRA)

[ 
https://issues.apache.org/jira/browse/HBASE-8073?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14015185#comment-14015185
 ] 

Jerry He commented on HBASE-8073:
-

Attached v1. Fixed the test case errors: need to mock a HTableDescriptor in the 
two test cases.

 HFileOutputFormat support for offline operation
 ---

 Key: HBASE-8073
 URL: https://issues.apache.org/jira/browse/HBASE-8073
 Project: HBase
  Issue Type: Sub-task
  Components: mapreduce
Reporter: Nick Dimiduk
 Fix For: 0.99.0

 Attachments: HBASE-8073-trunk-v0.patch, HBASE-8073-trunk-v1.patch


 When using HFileOutputFormat to generate HFiles, it inspects the region 
 topology of the target table. The split points from that table are used to 
 guide the TotalOrderPartitioner. If the target table does not exist, it is 
 first created. This imposes an unnecessary dependence on an online HBase and 
 existing table.
 If the table exists, it can be used. However, the job can be smarter. For 
 example, if there's far more data going into the HFiles than the table 
 currently contains, the table regions aren't very useful for data split 
 points. Instead, the input data can be sampled to produce split points more 
 meaningful to the dataset. LoadIncrementalHFiles is already capable of 
 handling divergence between HFile boundaries and table regions, so this 
 should not pose any additional burdon at load time.
 The proper method of sampling the data likely requires a custom input format 
 and an additional map-reduce job perform the sampling. See a relevant 
 implementation: 
 https://github.com/alexholmes/hadoop-book/blob/master/src/main/java/com/manning/hip/ch4/sampler/ReservoirSamplerInputFormat.java



--
This message was sent by Atlassian JIRA
(v6.2#6252)