[jira] [Commented] (HBASE-4120) isolation and allocation
[ https://issues.apache.org/jira/browse/HBASE-4120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13072700#comment-13072700 ] Liu Jia commented on HBASE-4120:

@Jeff Thanks, Jeff. Optimizing customer-facing tables while controlling the resources of batch-processing tables is a good use case for table priority. At Taobao, though, groups are actually used more often than table priority; without load or competition for resources, table priority stays quiet, while groups have a user portal and are friendly to users who are not very familiar with HBase. The department that runs the shared HBase cluster is the Taobao data platform, and many other departments that are not very familiar with HBase want to use it. Most of them would like to test performance (mostly throughput per region server) with their own data and methods first, so the isolation part is very useful for them. If HBase wants to be a basic component of the data center, like Hadoop, a convenient and flexible way to isolate different projects is important. Jeff, is the complexity of HBase you mentioned significantly related to the portal and the many JSP pages? What if we added a shell tool to replace them? The actual implementation of groups depends only on region assignment and movement.

isolation and allocation Key: HBASE-4120 URL: https://issues.apache.org/jira/browse/HBASE-4120 Project: HBase Issue Type: New Feature Components: master, regionserver Affects Versions: 0.90.2 Reporter: Liu Jia Attachments: Design_document_for_HBase_isolation_and_allocation.pdf, Design_document_for_HBase_isolation_and_allocation_Revised.pdf, HBase_isolation_and_allocation_user_guide.pdf, Performance_of_Table_priority.pdf, System Structure.jpg

The HBase isolation and allocation tool is designed to help users manage cluster resources among different applications and tables.
When we have a large-scale HBase cluster with many applications running on it, there will be many problems. At Taobao there is a cluster on which many departments test the performance of their HBase-based applications. On that 12-server cluster, only one application could run exclusively at a time, and the other applications had to wait until the previous test finished. After we added the allocation-management function, applications can share the cluster and run concurrently. If a test engineer wants to make sure there is no interference, he or she can move the other tables out of the group. Within a group we use table priority to allocate resources: when the system is busy, we make sure high-priority tables are not affected by lower-priority tables. Different groups can have different region server configurations; groups optimized for reading can have a large block cache, and groups optimized for writing can have a large memstore. Tables and region servers can be moved easily between groups, and after a configuration change a group can be restarted alone instead of restarting the whole cluster. git entry : https://github.com/ICT-Ope/HBase_allocation . We hope our work is helpful. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-4120) isolation and allocation
[ https://issues.apache.org/jira/browse/HBASE-4120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13072701#comment-13072701 ] Liu Jia commented on HBASE-4120:

@stack Hi stack, when designing the table priority part, we thought a good design should meet the following four requirements:
1. Be efficient; do not hurt system throughput.
2. When the system is not busy, tables of any priority, high or low, can make full use of system resources.
3. When the system is busy, a high-priority table gets more resources than a low-priority one.
4. When the system is busy, low-priority tables still function normally.
Before carrying out this work we considered managing the block cache or the memstore to achieve these goals, but that is more complex than managing the RPC queue, and it is not a direct approach to the problem; keeping it simple is the principle we must follow. Due to the LRU replacement strategy of the block cache, the more RPC requests a table has processed, the more block cache it occupies, so we made this implementation decision. Stack, would you please send me the issue number of the previous discussion of this feature? Others' opinions can help us find weak points and neglected problems in our design.
[jira] [Commented] (HBASE-4120) isolation and allocation
[ https://issues.apache.org/jira/browse/HBASE-4120?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13072702#comment-13072702 ] Liu Jia commented on HBASE-4120:

@Todd Hi Todd, I think providing HBase as a service faces the same problems of isolation and prioritization. This is mentioned by Jeffrey Dean in one of his presentations, but it only comes up as the following: "BigTable as a service? Interesting issues of resource fairness, performance isolation, prioritization, etc. across different clients" -- without any further discussion. Is there any more detailed information? The corresponding part of BigTable would be very interesting and valuable for refining our design. Maybe our group function could integrate with the user-management part of HBase in the future.
[jira] [Updated] (HBASE-4144) RS does not abort if the initialization of RS fails
[ https://issues.apache.org/jira/browse/HBASE-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated HBASE-4144: -- Summary: RS does not abort if the initialization of RS fails (was: RS doesnot abort if the initialization of RS fails)

RS does not abort if the initialization of RS fails Key: HBASE-4144 URL: https://issues.apache.org/jira/browse/HBASE-4144 Project: HBase Issue Type: Bug Components: regionserver Affects Versions: 0.90.3 Reporter: ramkrishna.s.vasudevan Assignee: ramkrishna.s.vasudevan Priority: Minor Attachments: HBASE-4144_0.90.patch, HBASE-4144_trunk.patch

If any exception occurs during initialization of the RS, the RS does not get aborted; only the RPC server gets stopped.

{noformat}
private void preRegistrationInitialization() throws IOException, InterruptedException {
  try {
    initializeZooKeeper();
    initializeThreads();
    int nbBlocks = conf.getInt("hbase.regionserver.nbreservationblocks", 4);
    for (int i = 0; i < nbBlocks; i++) {
      reservedSpace.add(new byte[HConstants.DEFAULT_SIZE_RESERVATION_BLOCK]);
    }
  } catch (Throwable t) {
    // Call stop if error or process will stick around for ever since server
    // puts up non-daemon threads.
    LOG.error("Stopping HRS because failed initialize", t);
    this.rpcServer.stop();
  }
}
{noformat}

So if any exception occurs during initialization, the RPC server gets stopped but the RS process keeps running, even though the log says it is stopping the HRegionServer. In the code below, the catch block will only be reached if stopping the RPC server itself fails; in all other cases the initialization failure is not handled.

{noformat}
try {
  // Do pre-registration initializations; zookeeper, lease threads, etc.
  preRegistrationInitialization();
} catch (Exception e) {
  abort("Fatal exception during initialization", e);
}
{noformat}
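The failure mode described above can be sketched outside of HBase. The following is an illustrative, self-contained simulation (none of these names are the real HRegionServer members) of the pattern the fix takes: stop the RPC server on failure, then rethrow so the outer catch can actually abort the process instead of silently swallowing the error.

```java
// Simulation of the HBASE-4144 fix pattern; all names here are invented.
public class InitAbortSketch {
    static boolean rpcStopped = false;  // stands in for rpcServer.stop() having run
    static boolean aborted = false;     // stands in for abort(...) having run

    static void preRegistrationInitialization() throws Exception {
        try {
            // Simulated initialization failure (e.g. ZooKeeper setup throwing).
            throw new IllegalStateException("simulated init failure");
        } catch (Throwable t) {
            rpcStopped = true;  // clean up local resources first
            // The key change: rethrow instead of swallowing, so the caller aborts.
            throw new RuntimeException("Failed initialization", t);
        }
    }

    static void startRegionServer() {
        try {
            preRegistrationInitialization();
        } catch (Exception e) {
            aborted = true;     // abort("Fatal exception during initialization", e)
        }
    }

    public static void main(String[] args) {
        startRegionServer();
        System.out.println("rpcStopped=" + rpcStopped + " aborted=" + aborted);
    }
}
```

With the rethrow in place, both cleanup and abort run; without it, only `rpcStopped` would become true, which is exactly the reported bug.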
[jira] [Commented] (HBASE-4144) RS does not abort if the initialization of RS fails
[ https://issues.apache.org/jira/browse/HBASE-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13072708#comment-13072708 ] Ted Yu commented on HBASE-4144: --- Applied to branch and TRUNK. Thanks for the patch Ramkrishna.
[jira] [Commented] (HBASE-4138) If zookeeper.znode.parent is not specifed explicitly in Client code then HTable object loops continuously waiting for the root region by using /hbase as the base node.
[ https://issues.apache.org/jira/browse/HBASE-4138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13072711#comment-13072711 ] Ted Yu commented on HBASE-4138: --- I don't think the failure of TestSplitLogManager was related to this JIRA either. TestSplitLogManager hung in build 2061 on Jenkins.

If zookeeper.znode.parent is not specified explicitly in client code then the HTable object loops continuously waiting for the root region by using /hbase as the base node. --- Key: HBASE-4138 URL: https://issues.apache.org/jira/browse/HBASE-4138 Project: HBase Issue Type: Bug Components: client Affects Versions: 0.90.3 Reporter: ramkrishna.s.vasudevan Assignee: ramkrishna.s.vasudevan Fix For: 0.92.0 Attachments: HBASE-4138_trunk_1.patch, HBASE-4138_trunk_2.patch, HBASE-4138_trunk_3.patch

Change the zookeeper.znode.parent property (the default is /hbase), do not reflect this change in the client code, and then use the HTable object: the HTable is not able to find the root region and keeps looping continuously.
Find the stack trace:
Object.wait(long) line: not available [native method]
RootRegionTracker(ZooKeeperNodeTracker).blockUntilAvailable(long) line: 122
RootRegionTracker.waitRootRegionLocation(long) line: 73
HConnectionManager$HConnectionImplementation.locateRegion(byte[], byte[], boolean) line: 578
HConnectionManager$HConnectionImplementation.locateRegion(byte[], byte[]) line: 558
HConnectionManager$HConnectionImplementation.locateRegionInMeta(byte[], byte[], byte[], boolean, Object) line: 687
HConnectionManager$HConnectionImplementation.locateRegion(byte[], byte[], boolean) line: 589
HConnectionManager$HConnectionImplementation.locateRegion(byte[], byte[]) line: 558
HConnectionManager$HConnectionImplementation.locateRegionInMeta(byte[], byte[], byte[], boolean, Object) line: 687
HConnectionManager$HConnectionImplementation.locateRegion(byte[], byte[], boolean) line: 593
HConnectionManager$HConnectionImplementation.locateRegion(byte[], byte[]) line: 558
HTable.init(Configuration, byte[]) line: 171
HTable.init(Configuration, String) line: 145
HBaseTest.test() line: 45
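On the client side, the changed parent znode has to be carried in the client's configuration as well; a minimal hbase-site.xml fragment for this might look like the following (the /hbase-custom value is purely illustrative, not taken from this issue):

```xml
<!-- Client-side hbase-site.xml: the value must match the cluster's setting.
     "/hbase-custom" is an illustrative example only. -->
<property>
  <name>zookeeper.znode.parent</name>
  <value>/hbase-custom</value>
</property>
```

If the client does not see this property, it falls back to the default /hbase base node, which is the looping behavior described above.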
[jira] [Commented] (HBASE-4027) Enable direct byte buffers LruBlockCache
[ https://issues.apache.org/jira/browse/HBASE-4027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13072716#comment-13072716 ] Li Pi commented on HBASE-4027: -- The bug actually broke an existing test case. I'm not sure why I didn't notice; I did after trying to rebuild it this morning.

Enable direct byte buffers LruBlockCache Key: HBASE-4027 URL: https://issues.apache.org/jira/browse/HBASE-4027 Project: HBase Issue Type: Improvement Reporter: Jason Rutherglen Assignee: Li Pi Priority: Minor Attachments: 4027-v5.diff, HBase-4027.pdf, hbase-4027v6.diff, slabcachepatch.diff, slabcachepatchv2.diff, slabcachepatchv3.1.diff, slabcachepatchv3.2.diff, slabcachepatchv3.diff, slabcachepatchv4.5.diff, slabcachepatchv4.diff

Java offers the creation of direct byte buffers, which are allocated outside of the heap. They need to be manually freed, which can be accomplished using an undocumented {{clean}} method. The feature will be optional. After implementing it, we can benchmark for differences in speed and garbage-collection behavior.
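For reference, a minimal stand-alone example of the off-heap allocation discussed here. The manual freeing via the internal cleaner hook is deliberately omitted: it is JVM-internal and version-dependent, which is exactly why the issue calls it undocumented.

```java
import java.nio.ByteBuffer;

public class DirectBufferDemo {
    public static void main(String[] args) {
        // Allocated outside the Java heap; its memory is not moved by GC
        // and is only reclaimed when the buffer object itself is collected.
        ByteBuffer slab = ByteBuffer.allocateDirect(1024 * 1024);
        System.out.println("direct=" + slab.isDirect()
                + " capacity=" + slab.capacity());
        // Deterministic freeing would use the internal "cleaner" hook on the
        // direct buffer; that API is JVM-internal, so it is omitted here.
    }
}
```

This is why a slab cache built on direct buffers must manage buffer lifetimes itself rather than relying on normal garbage collection.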
[jira] [Commented] (HBASE-4144) RS does not abort if the initialization of RS fails
[ https://issues.apache.org/jira/browse/HBASE-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13072744#comment-13072744 ] Hudson commented on HBASE-4144: --- Integrated in HBase-TRUNK #2062 (See [https://builds.apache.org/job/HBase-TRUNK/2062/]) HBASE-4144 RS does not abort if the initialization of RS fails (ramkrishna.s.vasudevan) tedyu : Files :
* /hbase/trunk/CHANGES.txt
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/regionserver/HRegionServer.java
[jira] [Commented] (HBASE-3065) Retry all 'retryable' zk operations; e.g. connection loss
[ https://issues.apache.org/jira/browse/HBASE-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13072769#comment-13072769 ] ramkrishna.s.vasudevan commented on HBASE-3065: --- Reason for the TestSplitLogManager failures === Please find the analysis:
- The RecoverableZooKeeper encodes the node name while creating it (this was already pointed out by Stack).
- The RecoverableZooKeeper, while writing the data, adds some metadata to it:
{noformat}
byte[] newData = appendMetaData(data);
{noformat}
While executing some of the test cases, SplitLogManager.getDataSetWatchSuccess() gets invoked. Here we do some state comparisons like:
{noformat}
if (TaskState.TASK_UNASSIGNED.equals(data)) {
  LOG.debug("task not yet acquired " + path + " ver = " + version);
  handleUnassignedTask(path);
} else if (TaskState.TASK_OWNED.equals(data)) {
  registerHeartbeat(path, version, TaskState.TASK_OWNED.getWriterName(data));
} else if (TaskState.TASK_RESIGNED.equals(data)) {
  LOG.info("task " + path + " entered state " + new String(data));
  resubmit(path, true);
}
{noformat}
Here the data variable has the metadata appended during the write, whereas the TaskState is without metadata, so any comparison we make fails. One more observation: the 'testOrphanTaskAcquisition()' test case needs some wait mechanism before proceeding, because the GetDataAsyncCallback call is asynchronous. The javadoc of RecoverableZooKeeper itself says that creating a node should be handled carefully. I have not yet covered all the failures, but this is the basic reason; even the hang of the 'testTaskDone()' test case is due to the same problem, I feel. I am not fully aware of the split-log feature with ZooKeeper; I will try to provide an addendum for this. Retry all 'retryable' zk operations; e.g.
connection loss - Key: HBASE-3065 URL: https://issues.apache.org/jira/browse/HBASE-3065 Project: HBase Issue Type: Bug Reporter: stack Assignee: Liyin Tang Priority: Critical Fix For: 0.92.0 Attachments: 3065-v3.txt, 3065-v4.txt, HBase-3065[r1088475]_1.patch, hbase3065_2.patch

The 'new' master refactored our zk code, tidying up all zk accesses and corralling them behind nice zk utility classes. One improvement was letting out all KeeperExceptions and letting the client deal with them. That's good generally, because in the old days we'd suppress important zk state changes. But there is at least one class of KeeperException the new zk utility could handle for the application: the retryable ones. The one that comes to mind is connection loss. On connection loss we should retry the just-failed operation. Usually the retry will just work; at worst, on reconnect, we'll pick up the expired-session event. Adding in this change shouldn't be too bad, given the refactor corralled all zk access into one or two classes only. One thing to consider, though, is how long we should retry. We could retry on a timer, or we could retry forever as long as a Stoppable interface is passed, so that if another thread has stopped or aborted the hosting service, we'll notice and give up trying. Doing the latter is probably better than some kind of timeout. HBASE-3062 adds a timed retry on the first zk operation; this issue is about generalizing what is over there across all zk access.
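The comparison failure described in the analysis above can be reproduced in isolation. This is a stand-alone illustration (the fixed-size header is an assumption for demonstration, not RecoverableZooKeeper's actual metadata layout) of why equality against the raw state bytes fails once metadata is prepended, and how stripping it first restores the comparison.

```java
import java.util.Arrays;

public class MetadataMismatchDemo {
    // Hypothetical stand-in for RecoverableZooKeeper.appendMetaData():
    // prepend a fixed-size header before the payload.
    static final int META_LEN = 8;

    static byte[] appendMetaData(byte[] data) {
        byte[] out = new byte[META_LEN + data.length];
        System.arraycopy(data, 0, out, META_LEN, data.length);
        return out;
    }

    // Hypothetical fix: strip the header before comparing against task states.
    static byte[] removeMetaData(byte[] data) {
        return Arrays.copyOfRange(data, META_LEN, data.length);
    }

    public static void main(String[] args) {
        byte[] unassigned = "unassigned".getBytes();
        byte[] stored = appendMetaData(unassigned);

        // Comparing against the raw state fails because of the prepended metadata...
        System.out.println(Arrays.equals(unassigned, stored));
        // ...and succeeds once the metadata is stripped first.
        System.out.println(Arrays.equals(unassigned, removeMetaData(stored)));
    }
}
```

In other words, any `TaskState.*.equals(data)` check on the raw znode bytes will fail until the read path removes whatever the write path appended.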
[jira] [Commented] (HBASE-3065) Retry all 'retryable' zk operations; e.g. connection loss
[ https://issues.apache.org/jira/browse/HBASE-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13072770#comment-13072770 ] ramkrishna.s.vasudevan commented on HBASE-3065: --- The above analysis is for the testcase 'testOrphanTaskAcquisition()'.
[jira] [Commented] (HBASE-4143) HTable.doPut(List) should check the writebuffer length every so often
[ https://issues.apache.org/jira/browse/HBASE-4143?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13072771#comment-13072771 ] Doug Meil commented on HBASE-4143: -- re: "This effectively disables the ability to do batching." There is already a client method called 'batch'. I think that should be encouraged as the preferred batch method if callers want a "do exactly what I say" approach; otherwise, put(Put) and put(List) should obey the writeBuffer rules. I'm cool with the patch though.

HTable.doPut(List) should check the writebuffer length every so often - Key: HBASE-4143 URL: https://issues.apache.org/jira/browse/HBASE-4143 Project: HBase Issue Type: Improvement Reporter: Doug Meil Assignee: Doug Meil Priority: Minor Attachments: HBASE-4143_update.patch, client_HBASE_4143.patch

This came up in a dist-list conversation between Andy P., Ted Yu, and myself. Andy noted that extremely large lists passed into put(List) can cause issues. Ted suggested having doPut check the write-buffer length every so often (every 5-10 records?) so the flush doesn't happen only at the end, and I think that's a good idea.

{noformat}
public void put(final List<Put> puts) throws IOException {
  doPut(puts);
}

private void doPut(final List<Put> puts) throws IOException {
  for (Put put : puts) {
    validatePut(put);
    writeBuffer.add(put);
    currentWriteBufferSize += put.heapSize();
  }
  if (autoFlush || currentWriteBufferSize > writeBufferSize) {
    flushCommits();
  }
}
{noformat}

Once this change is made, remove the comment in HBASE-4142 about large lists being a performance problem.
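Ted's suggestion can be sketched as follows. This is an illustrative, self-contained simulation, not the HTable implementation: the check interval, buffer size, and class names are made up, and puts are modeled as heap sizes only.

```java
import java.util.List;

// Sketch of checking the write buffer every few puts instead of only once
// at the end of the list; all names and constants here are illustrative.
public class PeriodicFlushSketch {
    static final int CHECK_INTERVAL = 10;        // check every 10 puts (assumed)
    static final long WRITE_BUFFER_SIZE = 1000;  // bytes, illustrative

    long currentWriteBufferSize = 0;
    int flushCount = 0;

    void doPut(List<Long> putSizes) {
        int n = 0;
        for (long size : putSizes) {
            currentWriteBufferSize += size;
            // Check periodically, not just after the whole list is buffered,
            // so a huge list cannot grow the buffer unboundedly.
            if (++n % CHECK_INTERVAL == 0 && currentWriteBufferSize > WRITE_BUFFER_SIZE) {
                flushCommits();
            }
        }
        if (currentWriteBufferSize > WRITE_BUFFER_SIZE) {
            flushCommits();  // original end-of-list check stays in place
        }
    }

    void flushCommits() {
        flushCount++;
        currentWriteBufferSize = 0;
    }

    public static void main(String[] args) {
        PeriodicFlushSketch t = new PeriodicFlushSketch();
        // 100 puts of 200 bytes each: flushes now happen during the loop,
        // not in one giant flush at the end.
        t.doPut(java.util.Collections.nCopies(100, 200L));
        System.out.println("flushes=" + t.flushCount);
    }
}
```

The design point is the bound on memory: with the periodic check, the buffer can exceed the limit only by at most one interval's worth of puts.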
[jira] [Created] (HBASE-4147) StoreFile query usage report
StoreFile query usage report Key: HBASE-4147 URL: https://issues.apache.org/jira/browse/HBASE-4147 Project: HBase Issue Type: Improvement Reporter: Doug Meil Priority: Minor

Detailed information on what HBase is doing in terms of reads is hard to come by. What would be useful is a periodic StoreFile query report. Specifically, this could run on a configured interval (e.g., every 30 or 60 seconds) and dump its output to the log files. It would list all StoreFiles accessed during the reporting period (and from the Path we would also know region, CF, and table), the number of times each StoreFile was accessed, the size of the StoreFile, and the total time (ms) spent processing that StoreFile. Even this level of summary would be useful to detect which tables/CFs are being accessed the most, and including the StoreFile would provide insight into relative uncompaction (i.e., lots of StoreFiles). I think the log output, as opposed to a UI, is an important facet of this. I'm assuming that users will slice and dice this data on their own, so I think we should skip any kind of admin view for now (i.e., new JSPs, new APIs to expose this data); just getting this to a log file would be a big improvement. Will this have a non-zero performance impact? Yes. Hopefully small, but yes it will. However, flying a plane without any instrumentation isn't fun. :-)
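The proposed report could be sketched roughly as below. Everything here (class, field, and method names) is hypothetical and only meant to show the shape of the bookkeeping: count accesses and time per StoreFile path, dump a summary once per interval, then reset for the next window.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical sketch of the per-interval StoreFile access report.
public class StoreFileReportSketch {
    // path -> {access count, total processing time in ms}
    static final Map<String, long[]> stats = new ConcurrentHashMap<>();

    static void recordAccess(String path, long millis) {
        stats.compute(path, (k, v) -> {
            if (v == null) v = new long[2];
            v[0]++;           // times this StoreFile was accessed
            v[1] += millis;   // total time spent processing it
            return v;
        });
    }

    // Called on the configured interval; output goes to the log in practice.
    static String report() {
        StringBuilder sb = new StringBuilder("StoreFile access report:\n");
        stats.forEach((path, v) ->
            sb.append(path).append(" accesses=").append(v[0])
              .append(" totalMs=").append(v[1]).append('\n'));
        stats.clear();        // start a fresh reporting window
        return sb.toString();
    }

    public static void main(String[] args) {
        recordAccess("/hbase/table1/region-a/cf1/file1", 12);
        recordAccess("/hbase/table1/region-a/cf1/file1", 8);
        recordAccess("/hbase/table2/region-b/cf1/file2", 5);
        System.out.print(report());
    }
}
```

Since the path encodes table, region, and CF, users can aggregate along any of those axes from the raw log lines, which matches the "no admin view, just log output" intent.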
[jira] [Updated] (HBASE-4089) blockCache contents report
[ https://issues.apache.org/jira/browse/HBASE-4089?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Doug Meil updated HBASE-4089: - Description: Summarized block-cache report for a RegionServer would be helpful. For example ...
table1
  cf1 100 blocks, totalBytes=y, averageTimeInCache= hours
  cf2 200 blocks, totalBytes=z, averageTimeInCache= hours
table2
  cf1 75 blocks, totalBytes=y, averageTimeInCache= hours
  cf2 150 blocks, totalBytes=z, averageTimeInCache= hours
... etc. The current metrics list blockCacheSize and blockCacheFree, but there is no way to know what's in there. Any single block isn't really important, but the patterns of what CF/Table they came from, how big they are, and how long (on average) they've been in the cache, are important. No such interface exists in HRegionInterface. But I think it would be helpful from an operational perspective. Updated (7-29): Removing suggestion for UI. I would be happy just to get this report on a configured interval dumped to a log file.
was: A UI that would display a block-cache report for a RegionServer would be helpful. For example ...
table1
  cf1 100 blocks, totalBytes=y, averageTimeInCache= hours
  cf2 200 blocks, totalBytes=z, averageTimeInCache= hours
table2
  cf1 75 blocks, totalBytes=y, averageTimeInCache= hours
  cf2 150 blocks, totalBytes=z, averageTimeInCache= hours
... etc. The current metrics list blockCacheSize and blockCacheFree, but there is no way to know what's in there. Any single block isn't really important, but the patterns of what CF/Table they came from, how big they are, and how long (on average) they've been in the cache, are important. No such interface exists in HRegionInterface, so this is not just a UI request but also an API change. But I think it would be helpful from an operational perspective.
blockCache contents report -- Key: HBASE-4089 URL: https://issues.apache.org/jira/browse/HBASE-4089 Project: HBase Issue Type: New Feature Reporter: Doug Meil
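A rough sketch of how the summary might be computed; all names here are hypothetical, and cached blocks are modeled as simple {table, cf, size, age} rows rather than real block-cache entries.

```java
import java.util.Map;
import java.util.TreeMap;

// Hypothetical sketch of the per-table/CF block-cache summary report.
public class BlockCacheSummarySketch {
    static final class Entry { long blocks, bytes, ageMsSum; }

    // Each row stands in for one cached block: {table, cf, sizeBytes, ageMs}.
    static Map<String, Entry> summarize(String[][] cachedBlocks) {
        Map<String, Entry> out = new TreeMap<>();
        for (String[] b : cachedBlocks) {
            Entry e = out.computeIfAbsent(b[0] + "/" + b[1], k -> new Entry());
            e.blocks++;
            e.bytes += Long.parseLong(b[2]);
            e.ageMsSum += Long.parseLong(b[3]);
        }
        return out;
    }

    public static void main(String[] args) {
        Map<String, Entry> summary = summarize(new String[][] {
            {"table1", "cf1", "65536", "3600000"},
            {"table1", "cf1", "65536", "1800000"},
            {"table2", "cf1", "65536", "600000"},
        });
        summary.forEach((key, e) -> System.out.println(key
            + " " + e.blocks + " blocks, totalBytes=" + e.bytes
            + ", avgTimeInCacheMs=" + (e.ageMsSum / e.blocks)));
    }
}
```

The aggregation itself is cheap; the operational cost would be walking the cache's entries on each reporting interval, which is the same trade-off noted for the StoreFile report in HBASE-4147.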
[jira] [Commented] (HBASE-3065) Retry all 'retryable' zk operations; e.g. connection loss
[ https://issues.apache.org/jira/browse/HBASE-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13072779#comment-13072779 ] ramkrishna.s.vasudevan commented on HBASE-3065: --- I will upload the patch; I have found the exact fix.
[jira] [Updated] (HBASE-3065) Retry all 'retryable' zk operations; e.g. connection loss
[ https://issues.apache.org/jira/browse/HBASE-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ramkrishna.s.vasudevan updated HBASE-3065:
--
Attachment: HBASE-3065-addendum.patch
[jira] [Commented] (HBASE-4147) StoreFile query usage report
[ https://issues.apache.org/jira/browse/HBASE-4147?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072807#comment-13072807 ] Doug Meil commented on HBASE-4147:
--
I think that instrumenting StoreFileScanner by gathering the time spent in all the 'next' and 'seek' calls would do it. Then, on 'close', it would publish the detailed record to some internal service that would gather up all these detail records and periodically dump the summary. I'm doing some hand-waving here because we don't want to introduce concurrency issues in the publishing process (e.g., publishing to something that is synchronized will effectively single-thread StoreFileScanners, which would be a non-starter), but based on my understanding of the code it seems like this would be a fairly targeted change. Thoughts?

StoreFile query usage report
Key: HBASE-4147
URL: https://issues.apache.org/jira/browse/HBASE-4147
Project: HBase
Issue Type: Improvement
Reporter: Doug Meil
Priority: Minor

Detailed information on what HBase is doing in terms of reads is hard to come by. What would be useful is a periodic StoreFile query report. Specifically, this could run on a configured interval (e.g., every 30 or 60 seconds) and dump its output to the log files. It would list all StoreFiles accessed during the reporting period (and from the Path we would also know the region, CF, and table), the number of times each StoreFile was accessed, the size of the StoreFile, and the total time (ms) spent processing that StoreFile. Even this level of summary would be useful to detect which tables and CFs are being accessed the most, and including the StoreFile would provide insight into relative lack of compaction (i.e., lots of StoreFiles). I think log output, as opposed to a UI, is an important facet of this. I'm assuming that users will slice and dice this data on their own, so I think we should skip any kind of admin view for now (i.e., new JSPs, new APIs to expose this data).
Just getting this to a log file would be a big improvement. Will this have a non-zero performance impact? Yes. Hopefully small, but yes it will. However, flying a plane without any instrumentation isn't fun. :-)
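The "publish without synchronizing" concern above can be addressed with lock-free accumulators. A minimal sketch, assuming a hypothetical `ScanStats` service (the class and method names are illustrative, not HBase API): `ConcurrentHashMap` plus `LongAdder` let many scanners publish concurrently without serializing on a lock.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;
import java.util.concurrent.atomic.LongAdder;

// Hypothetical accumulator a StoreFileScanner could publish to on close()
// without a global lock: LongAdder increments never block each other.
final class ScanStats {
    static final class Stat {
        final LongAdder accesses = new LongAdder();
        final LongAdder nanos = new LongAdder();
    }

    private final ConcurrentHashMap<String, Stat> byFile = new ConcurrentHashMap<>();

    // Called once per scanner close with the accumulated next/seek time.
    void publish(String storeFilePath, long elapsedNanos) {
        Stat s = byFile.computeIfAbsent(storeFilePath, k -> new Stat());
        s.accesses.increment();
        s.nanos.add(elapsedNanos);
    }

    long accesses(String storeFilePath) {
        Stat s = byFile.get(storeFilePath);
        return s == null ? 0 : s.accesses.sum();
    }

    // A periodic reporter thread would take a snapshot, log it, and reset.
    Map<String, Stat> snapshot() {
        return new ConcurrentHashMap<>(byFile);
    }
}
```

The periodic dump can then iterate the snapshot on the configured interval and write one log line per StoreFile, leaving the scan path itself free of synchronization.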
[jira] [Commented] (HBASE-4089) blockCache contents report
[ https://issues.apache.org/jira/browse/HBASE-4089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072809#comment-13072809 ] Doug Meil commented on HBASE-4089:
--
Regarding dumping the summary report to the log, I think exposing a public 'printSummary' (logSummary?) method on LruBlockCache would do it. Another thread can take care of scheduling how often the block cache summary should be run.
[jira] [Commented] (HBASE-4089) blockCache contents report
[ https://issues.apache.org/jira/browse/HBASE-4089?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072812#comment-13072812 ] Doug Meil commented on HBASE-4089:
--
If this approach is acceptable, we should probably add it to the BlockCache interface, since that is how the block cache is accessed.
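A summary method of the kind discussed would essentially group the cached blocks by table and column family. A rough sketch under assumed types (`CachedBlockInfo` and the grouping here are illustrative, not the LruBlockCache internals):

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

// Illustrative: fold per-block info into the per-table/per-CF report lines
// from the issue description (block count, total bytes, average time in cache).
final class BlockCacheReport {
    static final class CachedBlockInfo {
        final String table, cf;
        final long sizeBytes, cachedAtMs;
        CachedBlockInfo(String table, String cf, long sizeBytes, long cachedAtMs) {
            this.table = table; this.cf = cf;
            this.sizeBytes = sizeBytes; this.cachedAtMs = cachedAtMs;
        }
    }

    static List<String> summarize(List<CachedBlockInfo> blocks, long nowMs) {
        Map<String, long[]> agg = new TreeMap<>(); // key -> {count, bytes, totalAgeMs}
        for (CachedBlockInfo b : blocks) {
            long[] a = agg.computeIfAbsent(b.table + " " + b.cf, k -> new long[3]);
            a[0]++; a[1] += b.sizeBytes; a[2] += nowMs - b.cachedAtMs;
        }
        List<String> lines = new ArrayList<>();
        for (Map.Entry<String, long[]> e : agg.entrySet()) {
            long[] a = e.getValue();
            lines.add(String.format("%s %d blocks, totalBytes=%d, averageTimeInCacheMs=%d",
                    e.getKey(), a[0], a[1], a[2] / a[0]));
        }
        return lines;
    }
}
```

A scheduler thread could call such a method on the configured interval and write each line to the log, matching the report format in the description.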
[jira] [Commented] (HBASE-3065) Retry all 'retryable' zk operations; e.g. connection loss
[ https://issues.apache.org/jira/browse/HBASE-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072813#comment-13072813 ] Ted Yu commented on HBASE-3065:
---
With the addendum, TestSplitLogManager passed and I got one fewer test failure from TestDistributedLogSplitting:
{code}
Failed tests:
  testThreeRSAbort(org.apache.hadoop.hbase.master.TestDistributedLogSplitting)
  testWorkerAbort(org.apache.hadoop.hbase.master.TestDistributedLogSplitting)
{code}
Applied the addendum to TRUNK. Thanks for the analysis, Ramkrishna.
[jira] [Updated] (HBASE-4144) RS does not abort if the initialization of RS fails
[ https://issues.apache.org/jira/browse/HBASE-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated HBASE-4144:
--
Resolution: Fixed
Hadoop Flags: [Reviewed]
Status: Resolved (was: Patch Available)

RS does not abort if the initialization of RS fails
Key: HBASE-4144
URL: https://issues.apache.org/jira/browse/HBASE-4144
Project: HBase
Issue Type: Bug
Components: regionserver
Affects Versions: 0.90.3
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan
Priority: Minor
Attachments: HBASE-4144_0.90.patch, HBASE-4144_trunk.patch

If any exception occurs during initialization of the RS, the RS does not get aborted; only the RPC server gets stopped.
{code}
private void preRegistrationInitialization() throws IOException, InterruptedException {
  try {
    initializeZooKeeper();
    initializeThreads();
    int nbBlocks = conf.getInt("hbase.regionserver.nbreservationblocks", 4);
    for (int i = 0; i < nbBlocks; i++) {
      reservedSpace.add(new byte[HConstants.DEFAULT_SIZE_RESERVATION_BLOCK]);
    }
  } catch (Throwable t) {
    // Call stop if error or process will stick around for ever since server
    // puts up non-daemon threads.
    LOG.error("Stopping HRS because failed initialize", t);
    this.rpcServer.stop();
  }
}
{code}
So if any exception occurs during initialization, the RPC server gets stopped but the RS process is still running, even though the log says it is stopping the HRegionServer. In the code below, the catch block will only be executed if stopping the RPC server itself fails; in all other cases an initialization failure is not handled at all.
{code}
try {
  // Do pre-registration initializations; zookeeper, lease threads, etc.
  preRegistrationInitialization();
} catch (Exception e) {
  abort("Fatal exception during initialization", e);
}
{code}
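The shape of the fix can be illustrated with a toy model: stop the RPC server as before, but re-throw so the caller's abort() path actually runs. All names here are simplified stand-ins, not the actual HBASE-4144 patch.

```java
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

// Toy model of the bug and fix: swallowing the Throwable leaves the process
// alive; re-throwing after stopping the RPC server lets the caller abort.
final class RsInitSketch {
    final List<String> log = new ArrayList<>();

    void preRegistrationInitialization() throws IOException {
        try {
            throw new IOException("initializeZooKeeper failed"); // simulated init failure
        } catch (Throwable t) {
            log.add("rpc server stopped");           // stop RPC as the original code does
            throw new IOException("init failed", t); // changed: propagate instead of swallowing
        }
    }

    void start() {
        try {
            preRegistrationInitialization();
        } catch (Exception e) {
            log.add("abort: Fatal exception during initialization"); // now actually reached
        }
    }
}
```

With the original swallow-everything catch, only the first log entry would appear and the process would linger; propagating the exception makes the outer abort path fire.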
[jira] [Commented] (HBASE-4138) If zookeeper.znode.parent is not specifed explicitly in Client code then HTable object loops continuously waiting for the root region by using /hbase as the base node.
[ https://issues.apache.org/jira/browse/HBASE-4138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072826#comment-13072826 ] Ted Yu commented on HBASE-4138:
---
Integrated to TRUNK. Thanks for the patch, Ramkrishna.

If zookeeper.znode.parent is not specifed explicitly in Client code then HTable object loops continuously waiting for the root region by using /hbase as the base node.
---
Key: HBASE-4138
URL: https://issues.apache.org/jira/browse/HBASE-4138
Project: HBase
Issue Type: Bug
Components: client
Affects Versions: 0.90.3
Reporter: ramkrishna.s.vasudevan
Assignee: ramkrishna.s.vasudevan
Fix For: 0.92.0
Attachments: HBASE-4138_trunk_1.patch, HBASE-4138_trunk_2.patch, HBASE-4138_trunk_3.patch

Change the zookeeper.znode.parent property (the default is /hbase), but do not specify this change in the client code, and then use an HTable object. The HTable is not able to find the root region and keeps looping continuously. The stack trace:
{code}
Object.wait(long) line: not available [native method]
RootRegionTracker(ZooKeeperNodeTracker).blockUntilAvailable(long) line: 122
RootRegionTracker.waitRootRegionLocation(long) line: 73
HConnectionManager$HConnectionImplementation.locateRegion(byte[], byte[], boolean) line: 578
HConnectionManager$HConnectionImplementation.locateRegion(byte[], byte[]) line: 558
HConnectionManager$HConnectionImplementation.locateRegionInMeta(byte[], byte[], byte[], boolean, Object) line: 687
HConnectionManager$HConnectionImplementation.locateRegion(byte[], byte[], boolean) line: 589
HConnectionManager$HConnectionImplementation.locateRegion(byte[], byte[]) line: 558
HConnectionManager$HConnectionImplementation.locateRegionInMeta(byte[], byte[], byte[], boolean, Object) line: 687
HConnectionManager$HConnectionImplementation.locateRegion(byte[], byte[], boolean) line: 593
HConnectionManager$HConnectionImplementation.locateRegion(byte[], byte[]) line: 558
HTable.init(Configuration, byte[]) line: 171
HTable.init(Configuration, String) line: 145
HBaseTest.test() line: 45
{code}
[jira] [Commented] (HBASE-3065) Retry all 'retryable' zk operations; e.g. connection loss
[ https://issues.apache.org/jira/browse/HBASE-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072829#comment-13072829 ] ramkrishna.s.vasudevan commented on HBASE-3065:
---
@Ted
The test case testWorkerAbort() is working fine. For testThreeRSAbort I am getting 'java.lang.AssertionError: expected:4000 but was:3400'. Is this the error that we get?
[jira] [Reopened] (HBASE-3065) Retry all 'retryable' zk operations; e.g. connection loss
[ https://issues.apache.org/jira/browse/HBASE-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu reopened HBASE-3065:
---
There is still more to fix.
[jira] [Commented] (HBASE-4138) If zookeeper.znode.parent is not specifed explicitly in Client code then HTable object loops continuously waiting for the root region by using /hbase as the base node.
[ https://issues.apache.org/jira/browse/HBASE-4138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072882#comment-13072882 ] Hudson commented on HBASE-4138:
---
Integrated in HBase-TRUNK #2063 (See [https://builds.apache.org/job/HBase-TRUNK/2063/])
HBASE-4138 If zookeeper.znode.parent is not specifed explicitly in Client code then HTable object loops continuously waiting for the root region by using /hbase as the base node. (ramkrishna.s.vasudevan)
tedyu :
Files :
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/client/HConnectionManager.java
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/zookeeper/ZooKeeperWatcher.java
* /hbase/trunk/src/test/java/org/apache/hadoop/hbase/catalog/TestCatalogTracker.java
* /hbase/trunk/CHANGES.txt
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/master/HMaster.java
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/zookeeper/ZooKeeperNodeTracker.java
* /hbase/trunk/src/test/java/org/apache/hadoop/hbase/master/TestRestartCluster.java
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/regionserver/HRegionServer.java
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/zookeeper/RootRegionTracker.java
* /hbase/trunk/src/test/java/org/apache/hadoop/hbase/zookeeper/TestZKTable.java
* /hbase/trunk/src/test/java/org/apache/hadoop/hbase/replication/TestReplication.java
* /hbase/trunk/src/test/java/org/apache/hadoop/hbase/regionserver/handler/TestOpenRegionHandler.java
[jira] [Commented] (HBASE-3065) Retry all 'retryable' zk operations; e.g. connection loss
[ https://issues.apache.org/jira/browse/HBASE-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072881#comment-13072881 ] Hudson commented on HBASE-3065:
---
Integrated in HBase-TRUNK #2063 (See [https://builds.apache.org/job/HBase-TRUNK/2063/])
HBASE-3065 Addendum that removes metadata in getDataSetWatchSuccess() (Ramkrishna)
tedyu :
Files :
* /hbase/trunk/src/main/java/org/apache/hadoop/hbase/master/SplitLogManager.java
[jira] [Commented] (HBASE-3065) Retry all 'retryable' zk operations; e.g. connection loss
[ https://issues.apache.org/jira/browse/HBASE-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072886#comment-13072886 ] stack commented on HBASE-3065:
--
Excellent, Ram. I'd figured the first part of your patch last night -- the metadata prefix -- but not the second. Thank you. Let me build on your patch for the rest of the fix. So, yes, distributed log splitting does heavy interactions with zk. It went in after the original 3065 patch was done. The URL encoding that distributed splitting's zk'ing does is clashing with the suffixes this patch adds to file names -- we need to find the places where we want to use the znode name and make sure we URL-decode. The second issue is the one you note above, where this patch adds metadata at the front of the data and we need to strip it when reading in a few places. Let me see how far I get today (I'm gone for a week starting this evening...)
[jira] [Updated] (HBASE-3581) hbase rpc should send size of response
[ https://issues.apache.org/jira/browse/HBASE-3581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated HBASE-3581:
--
Fix Version/s: (was: 0.92.0) 0.94.0
Moving to 0.94 for now.

hbase rpc should send size of response
--
Key: HBASE-3581
URL: https://issues.apache.org/jira/browse/HBASE-3581
Project: HBase
Issue Type: Improvement
Reporter: ryan rawson
Assignee: ryan rawson
Priority: Critical
Fix For: 0.94.0
Attachments: HBASE-rpc-response.txt

The RPC reply from server to client does not include the size of the payload; it is framed like so:
{code}
i32 callId
byte errorFlag
byte[] data
{code}
The data segment contains enough info about how big the response is that it can be decoded by a Writable reader. This makes it difficult to write buffering clients, which might read the entire 'data' and then pass it to a decoder. While less memory efficient, if you want to easily write block-read clients (e.g., nio) it would be necessary to send the size along, so that the client could snarf the response into a local buffer. The new proposal is:
{code}
i32 callId
i32 size
byte errorFlag
byte[] data
{code}
the size being sizeof(data) + sizeof(errorFlag).
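The proposed frame is straightforward to encode and decode with fixed-offset reads. A sketch of the proposed layout (illustrative only, not the HBASE-3581 patch):

```java
import java.nio.ByteBuffer;

// Encode/decode the proposed reply frame: i32 callId, i32 size, byte errorFlag,
// byte[] data, where size = sizeof(data) + sizeof(errorFlag).
final class RpcFrame {
    static ByteBuffer encode(int callId, boolean error, byte[] data) {
        ByteBuffer b = ByteBuffer.allocate(4 + 4 + 1 + data.length);
        b.putInt(callId);
        b.putInt(data.length + 1);      // up-front size lets an nio client buffer the whole reply
        b.put((byte) (error ? 1 : 0));
        b.put(data);
        b.flip();
        return b;
    }

    static byte[] decodeData(ByteBuffer b) {
        b.getInt();                     // callId
        int size = b.getInt();          // exactly this many bytes follow
        b.get();                        // errorFlag
        byte[] data = new byte[size - 1];
        b.get(data);
        return data;
    }
}
```

With the size up front, a client can read the 9-byte header, allocate a buffer of exactly `size` bytes, and hand the decoded payload off without ever needing the Writable reader to discover the length.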
[jira] [Updated] (HBASE-3065) Retry all 'retryable' zk operations; e.g. connection loss
[ https://issues.apache.org/jira/browse/HBASE-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HBASE-3065:
-
Priority: Blocker (was: Critical)
[jira] [Updated] (HBASE-4015) Refactor the TimeoutMonitor to make it less racy
[ https://issues.apache.org/jira/browse/HBASE-4015?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HBASE-4015:
-
Priority: Blocker (was: Critical)

Refactor the TimeoutMonitor to make it less racy
Key: HBASE-4015
URL: https://issues.apache.org/jira/browse/HBASE-4015
Project: HBase
Issue Type: Sub-task
Affects Versions: 0.90.3
Reporter: Jean-Daniel Cryans
Priority: Blocker
Fix For: 0.92.0

The current implementation of the TimeoutMonitor acts like a race-condition generator, mostly making things worse rather than better. It does its own thing for a while without caring what's happening in the rest of the master. The first thing that needs to happen is that the regions should not be processed in one big batch, because that can sometimes take minutes (meanwhile a region that timed out opening might have opened; it will then be reassigned by the TimeoutMonitor, generating the never-ending PENDING_OPEN situation). Those operations should also be done more atomically, although I'm not sure how to do that in a scalable way in this case.
[jira] [Updated] (HBASE-1621) merge tool should work on online cluster, but disabled table
[ https://issues.apache.org/jira/browse/HBASE-1621?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HBASE-1621:
-
Priority: Blocker (was: Critical)
Need this so we can do improvements to hbck --fix (the improvements to hbck --fix can happen out of band with the 0.92 release, but this needs to be in place).

merge tool should work on online cluster, but disabled table
Key: HBASE-1621
URL: https://issues.apache.org/jira/browse/HBASE-1621
Project: HBase
Issue Type: Bug
Reporter: ryan rawson
Assignee: stack
Priority: Blocker
Fix For: 0.92.0
Attachments: 1621-trunk.txt, HBASE-1621-v2.patch, HBASE-1621.patch, hbase-onlinemerge.patch

Taking down the entire cluster to merge 2 regions is a pain; I don't see why the table or regions specifically couldn't be taken offline, then merged, then brought back up. This might need a new API to the regionservers so they can take direction from not just the master.
[jira] [Commented] (HBASE-3065) Retry all 'retryable' zk operations; e.g. connection loss
[ https://issues.apache.org/jira/browse/HBASE-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072906#comment-13072906 ] Ted Yu commented on HBASE-3065: --- I guess Prakash and Liyin would be able to accommodate this JIRA for distributed log splitting since they work together. Retry all 'retryable' zk operations; e.g. connection loss - Key: HBASE-3065 URL: https://issues.apache.org/jira/browse/HBASE-3065 Project: HBase Issue Type: Bug Reporter: stack Assignee: Liyin Tang Priority: Blocker Fix For: 0.92.0 Attachments: 3065-v3.txt, 3065-v4.txt, HBASE-3065-addendum.patch, HBase-3065[r1088475]_1.patch, hbase3065_2.patch The 'new' master refactored our zk code, tidying up all zk accesses and corralling them behind nice zk utility classes. One improvement was letting out all KeeperExceptions and letting the client deal with them. That's good generally because in the old days we'd suppress important zk state changes. But there is at least one case the new zk utility could handle for the application, and that's the class of retryable KeeperExceptions. The one that comes to mind is connection loss. On connection loss we should retry the just-failed operation. Usually the retry will just work. At worst, on reconnect, we'll pick up the expired session event. Adding in this change shouldn't be too bad given the refactor corralled all zk access into one or two classes only. One thing to consider though is how much we should retry. We could retry on a timer, or we could retry forever as long as the Stoppable interface is passed in, so that if another thread has stopped or aborted the hosting service, we'll notice and give up trying. Doing the latter is probably better than some kind of timeout. HBASE-3062 adds a timed retry on the first zk operation. This issue is about generalizing what is over there across all zk access. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
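The "retry forever, gated on Stoppable" idea above can be sketched as follows. This is an assumption-laden illustration, not the HBase patch: `Stoppable`, `ConnectionLossException`, and `retryOnConnectionLoss` are stand-in names for the real `RecoverableZooKeeper` machinery.

```java
import java.util.concurrent.Callable;

// Illustrative sketch: retry a ZooKeeper operation on connection loss until
// it succeeds or the hosting service stops, instead of a fixed timeout.
public class ZkRetrySketch {
    public interface Stoppable {
        boolean isStopped();
    }

    // Stand-in for org.apache.zookeeper.KeeperException.ConnectionLossException.
    public static class ConnectionLossException extends Exception {}

    public static <T> T retryOnConnectionLoss(Callable<T> op, Stoppable stopper,
                                              long sleepMs) throws Exception {
        while (!stopper.isStopped()) {
            try {
                return op.call();        // usually the retry will just work
            } catch (ConnectionLossException e) {
                Thread.sleep(sleepMs);   // back off, then retry; at worst we
                                         // pick up the expired-session event
            }
        }
        throw new InterruptedException("service stopped before zk op succeeded");
    }
}
```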
[jira] [Updated] (HBASE-3581) hbase rpc should send size of response
[ https://issues.apache.org/jira/browse/HBASE-3581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HBASE-3581: - Fix Version/s: (was: 0.94.0) 0.92.0 hbase rpc should send size of response -- Key: HBASE-3581 URL: https://issues.apache.org/jira/browse/HBASE-3581 Project: HBase Issue Type: Improvement Reporter: ryan rawson Assignee: ryan rawson Priority: Critical Fix For: 0.92.0 Attachments: HBASE-rpc-response.txt The RPC reply from Server-Client does not include the size of the payload, it is framed like so: i32 callId byte errorFlag byte[] data The data segment would contain enough info about how big the response is so that it could be decoded by a writable reader. This makes it difficult to write buffering clients, who might read the entire 'data' then pass it to a decoder. While less memory efficient, if you want to easily write block read clients (eg: nio) it would be necessary to send the size along so that the client could snarf into a local buf. The new proposal is: i32 callId i32 size byte errorFlag byte[] data the size being sizeof(data) + sizeof(errorFlag). -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
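The proposed reply framing — i32 callId, i32 size, byte errorFlag, byte[] data, with size = sizeof(data) + sizeof(errorFlag) — can be encoded as below. This is an illustrative sketch of the frame layout from the ticket, not the actual HBase wire code; `RpcFrameSketch` is a hypothetical name.

```java
import java.nio.ByteBuffer;

// Sketch of the proposed frame: the up-front size field lets a buffering
// (e.g. NIO) client allocate a local buffer and read the whole payload
// before handing it to a Writable decoder.
public class RpcFrameSketch {
    public static ByteBuffer encode(int callId, byte errorFlag, byte[] data) {
        int size = data.length + 1;                // sizeof(data) + sizeof(errorFlag)
        ByteBuffer buf = ByteBuffer.allocate(4 + 4 + size);
        buf.putInt(callId);
        buf.putInt(size);
        buf.put(errorFlag);
        buf.put(data);
        buf.flip();                                 // ready for reading/sending
        return buf;
    }
}
```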
[jira] [Commented] (HBASE-3581) hbase rpc should send size of response
[ https://issues.apache.org/jira/browse/HBASE-3581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072916#comment-13072916 ] stack commented on HBASE-3581: -- ugh we should do this for tsdb and asynchbase I'm going to pull it in again (sorry Ted)... we can punt later if it don't make cut. hbase rpc should send size of response -- Key: HBASE-3581 URL: https://issues.apache.org/jira/browse/HBASE-3581 Project: HBase Issue Type: Improvement Reporter: ryan rawson Assignee: ryan rawson Priority: Critical Fix For: 0.92.0 Attachments: HBASE-rpc-response.txt The RPC reply from Server-Client does not include the size of the payload, it is framed like so: i32 callId byte errorFlag byte[] data The data segment would contain enough info about how big the response is so that it could be decoded by a writable reader. This makes it difficult to write buffering clients, who might read the entire 'data' then pass it to a decoder. While less memory efficient, if you want to easily write block read clients (eg: nio) it would be necessary to send the size along so that the client could snarf into a local buf. The new proposal is: i32 callId i32 size byte errorFlag byte[] data the size being sizeof(data) + sizeof(errorFlag). -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-4027) Enable direct byte buffers LruBlockCache
[ https://issues.apache.org/jira/browse/HBASE-4027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072940#comment-13072940 ] jirapos...@reviews.apache.org commented on HBASE-4027: -- --- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/1214/#review1211 --- conf/hbase-env.sh https://reviews.apache.org/r/1214/#comment2675 Is MaxDirectMemorySize determinable on the running jvm? Could we make the offheapcachesize config as a percentage of the direct memory size like we have for memstore/blockcache today? (default of 0.95 or something would make it so it never really has to be set for most cases... and i'm not sure what exactly a bit above the off heap cache size is) src/main/java/org/apache/hadoop/hbase/io/hfile/BlockCacheTestUtils.java https://reviews.apache.org/r/1214/#comment2676 2011 src/main/java/org/apache/hadoop/hbase/io/hfile/BlockCacheTestUtils.java https://reviews.apache.org/r/1214/#comment2677 whitespace src/main/java/org/apache/hadoop/hbase/io/hfile/CacheStats.java https://reviews.apache.org/r/1214/#comment2678 license src/main/java/org/apache/hadoop/hbase/io/hfile/CacheStats.java https://reviews.apache.org/r/1214/#comment2679 class comment src/main/java/org/apache/hadoop/hbase/io/hfile/DoubleBlockCache.java https://reviews.apache.org/r/1214/#comment2680 whitespace here and throughout this file src/main/java/org/apache/hadoop/hbase/io/hfile/DoubleBlockCache.java https://reviews.apache.org/r/1214/#comment2681 Would it make sense to have DoubleBlockCache be more generic? Does it need to be fixed with these two types or could it take two BlockCache's and they are executed in the order they are given in (just need to be clear in doc). 
If this was generic, it could be reused for various multi-level caches (like an underlying cache with compressed blocks and one above it with uncompressed blocks) src/main/java/org/apache/hadoop/hbase/io/hfile/DoubleBlockCache.java https://reviews.apache.org/r/1214/#comment2682 longer than 80 chars src/main/java/org/apache/hadoop/hbase/io/hfile/DoubleBlockCache.java https://reviews.apache.org/r/1214/#comment2683 This seems like a behavior that we may not always want. If we made this class generic, could we have some kind of policy we initiate it with? (like default cache in level one, if accessed in level one, cache in level two, etc?) we're going to always be double-storing anything so that the offHeap true capacity is (totalOffHeap - totalOnHeap). in some cases, we might want to cache on heap first and then if evicted we cache off heap, or maybe we want it to work more like the existing LRU (first read goes into off heap, second read upgrades it to the on heap cache and removes from the off heap) src/main/java/org/apache/hadoop/hbase/io/hfile/DoubleBlockCache.java https://reviews.apache.org/r/1214/#comment2684 this is going to make for some weird stats? seems like we may need to actually expose the stats of each underlying cache rather than both? (or both and separate). it's going to be difficult to understand what's happening when the hit and eviction stats cover both. src/main/java/org/apache/hadoop/hbase/io/hfile/slab/SingleSizeCache.java https://reviews.apache.org/r/1214/#comment2685 huh? src/main/java/org/apache/hadoop/hbase/io/hfile/slab/SingleSizeCache.java https://reviews.apache.org/r/1214/#comment2686 line 80 chars src/main/java/org/apache/hadoop/hbase/io/hfile/slab/Slab.java https://reviews.apache.org/r/1214/#comment2687 getTotalNumBlocks() and getRemainingNumBlocks() or something? 
I find the method names a little unclear (or just add some javadoc) src/main/java/org/apache/hadoop/hbase/io/hfile/slab/Slab.java https://reviews.apache.org/r/1214/#comment2688 javadoc on these src/main/java/org/apache/hadoop/hbase/io/hfile/slab/SlabCache.java https://reviews.apache.org/r/1214/#comment2689 I'm not totally clear on why the SlabCache contains a bunch of SingleSizeCaches. Why do you need to layer BlockCaches on top of BlockCaches? You'll have one slab per size rather than one cache per size? Can you not pass the right evictor callback in so it goes back to the right slab? src/main/java/org/apache/hadoop/hbase/io/hfile/slab/SlabCache.java https://reviews.apache.org/r/1214/#comment2690 Why these ratios? At the least, this should all be configurable (even if just in code and undocumented). Do we need to always pre-allocate everything and determine the block/slab sizes and all that? The design seems inflexible because it's all determined during construction rather than being adaptive. I'm okay with the first
[jira] [Commented] (HBASE-3581) hbase rpc should send size of response
[ https://issues.apache.org/jira/browse/HBASE-3581?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072951#comment-13072951 ] ryan rawson commented on HBASE-3581: Let's just commit the patch as is. Optimizing around users' deployment complexities isn't our job, and at the same time it's preventing an extremely useful patch from going in. Take the flag day, and in the future everything else will be easier. (Until there is another flag day, but you want a job, don't you?) hbase rpc should send size of response -- Key: HBASE-3581 URL: https://issues.apache.org/jira/browse/HBASE-3581 Project: HBase Issue Type: Improvement Reporter: ryan rawson Assignee: ryan rawson Priority: Critical Fix For: 0.92.0 Attachments: HBASE-rpc-response.txt The RPC reply from Server-Client does not include the size of the payload, it is framed like so: i32 callId byte errorFlag byte[] data The data segment would contain enough info about how big the response is so that it could be decoded by a writable reader. This makes it difficult to write buffering clients, who might read the entire 'data' then pass it to a decoder. While less memory efficient, if you want to easily write block read clients (eg: nio) it would be necessary to send the size along so that the client could snarf into a local buf. The new proposal is: i32 callId i32 size byte errorFlag byte[] data the size being sizeof(data) + sizeof(errorFlag). -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-3065) Retry all 'retryable' zk operations; e.g. connection loss
[ https://issues.apache.org/jira/browse/HBASE-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072952#comment-13072952 ] stack commented on HBASE-3065: -- @Ram We could add methods to RecoverableZookeeper that wrap the callbacks to strip the metadata prefix; maybe we should do that sometime. The addendum will do for now. Retry all 'retryable' zk operations; e.g. connection loss - Key: HBASE-3065 URL: https://issues.apache.org/jira/browse/HBASE-3065 Project: HBase Issue Type: Bug Reporter: stack Assignee: Liyin Tang Priority: Blocker Fix For: 0.92.0 Attachments: 3065-v3.txt, 3065-v4.txt, HBASE-3065-addendum.patch, HBase-3065[r1088475]_1.patch, hbase3065_2.patch The 'new' master refactored our zk code, tidying up all zk accesses and corralling them behind nice zk utility classes. One improvement was letting out all KeeperExceptions and letting the client deal with them. That's good generally because in the old days we'd suppress important zk state changes. But there is at least one case the new zk utility could handle for the application, and that's the class of retryable KeeperExceptions. The one that comes to mind is connection loss. On connection loss we should retry the just-failed operation. Usually the retry will just work. At worst, on reconnect, we'll pick up the expired session event. Adding in this change shouldn't be too bad given the refactor corralled all zk access into one or two classes only. One thing to consider though is how much we should retry. We could retry on a timer, or we could retry forever as long as the Stoppable interface is passed in, so that if another thread has stopped or aborted the hosting service, we'll notice and give up trying. Doing the latter is probably better than some kind of timeout. HBASE-3062 adds a timed retry on the first zk operation. This issue is about generalizing what is over there across all zk access. -- This message is automatically generated by JIRA. 
For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-4032) HBASE-451 improperly breaks public API HRegionInfo#getTableDesc
[ https://issues.apache.org/jira/browse/HBASE-4032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] stack updated HBASE-4032: - Attachment: 4032-v2.txt Fix setTableDesc to do as it used to. Added a unit test to prove that these new methods work as we'd expect. HBASE-451 improperly breaks public API HRegionInfo#getTableDesc --- Key: HBASE-4032 URL: https://issues.apache.org/jira/browse/HBASE-4032 Project: HBase Issue Type: Bug Reporter: Andrew Purtell Assignee: stack Priority: Blocker Fix For: 0.92.0 Attachments: 4032-v2.txt, 4032.txt After HBASE-451, HRegionInfo#getTableDesc has been modified to always return {{null}}. One immediate effect is broken unit tests. That aside, it is not in the spirit of deprecation to actually break the method until after the deprecation cycle, it's a bug. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-4032) HBASE-451 improperly breaks public API HRegionInfo#getTableDesc
[ https://issues.apache.org/jira/browse/HBASE-4032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072993#comment-13072993 ] stack commented on HBASE-4032: -- Need a review please (and someone to commit if +1) HBASE-451 improperly breaks public API HRegionInfo#getTableDesc --- Key: HBASE-4032 URL: https://issues.apache.org/jira/browse/HBASE-4032 Project: HBase Issue Type: Bug Reporter: Andrew Purtell Assignee: stack Priority: Blocker Fix For: 0.92.0 Attachments: 4032-v2.txt, 4032.txt After HBASE-451, HRegionInfo#getTableDesc has been modified to always return {{null}}. One immediate effect is broken unit tests. That aside, it is not in the spirit of deprecation to actually break the method until after the deprecation cycle, it's a bug. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-3446) ProcessServerShutdown fails if META moves, orphaning lots of regions
[ https://issues.apache.org/jira/browse/HBASE-3446?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13072994#comment-13072994 ] stack commented on HBASE-3446: -- So, I need to update the last patch here and work on the failures seen. I want to write a test too to prove that we have retries after this patch goes in. ProcessServerShutdown fails if META moves, orphaning lots of regions Key: HBASE-3446 URL: https://issues.apache.org/jira/browse/HBASE-3446 Project: HBase Issue Type: Bug Affects Versions: 0.90.0 Reporter: Todd Lipcon Assignee: stack Priority: Blocker Fix For: 0.92.0 Attachments: 3446-v11.txt, 3446-v12.txt, 3446-v2.txt, 3446-v3.txt, 3446-v4.txt, 3446-v7.txt, 3446-v9.txt, 3446.txt I ran a rolling restart on a 5 node cluster with lots of regions, and afterwards had LOTS of regions left orphaned. The issue appears to be that ProcessServerShutdown failed because the server hosting META was restarted around the same time as another server was being processed -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Created] (HBASE-4148) HFileOutputFormat doesn't fill in TIMERANGE_KEY metadata
HFileOutputFormat doesn't fill in TIMERANGE_KEY metadata Key: HBASE-4148 URL: https://issues.apache.org/jira/browse/HBASE-4148 Project: HBase Issue Type: Bug Components: mapreduce Affects Versions: 0.90.3 Reporter: Todd Lipcon Fix For: 0.90.5 When HFiles are flushed through the normal path, they include an attribute TIMERANGE_KEY which can be used to cull HFiles when performing a time-restricted scan. Files produced by HFileOutputFormat are currently missing this metadata. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
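The culling the description refers to boils down to an interval-overlap test: a file whose time range does not intersect the scan's time range can be skipped without reading its blocks. A minimal sketch of that check, with hypothetical names (not the HBase `TimeRange` API):

```java
// Illustrative only: a store file carrying [fileMin, fileMax] from its
// TIMERANGE metadata can be culled when the ranges don't intersect.
public class TimeRangeCullSketch {
    public static boolean overlaps(long fileMin, long fileMax,
                                   long scanMin, long scanMax) {
        return fileMax >= scanMin && fileMin <= scanMax;
    }
}
```

Files written by HFileOutputFormat lack the metadata entirely, so a reader has to assume they overlap every scan, defeating this optimization.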
[jira] [Assigned] (HBASE-4148) HFileOutputFormat doesn't fill in TIMERANGE_KEY metadata
[ https://issues.apache.org/jira/browse/HBASE-4148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Lipcon reassigned HBASE-4148: -- Assignee: Jonathan Hsieh HFileOutputFormat doesn't fill in TIMERANGE_KEY metadata Key: HBASE-4148 URL: https://issues.apache.org/jira/browse/HBASE-4148 Project: HBase Issue Type: Bug Components: mapreduce Affects Versions: 0.90.3 Reporter: Todd Lipcon Assignee: Jonathan Hsieh Fix For: 0.90.5 When HFiles are flushed through the normal path, they include an attribute TIMERANGE_KEY which can be used to cull HFiles when performing a time-restricted scan. Files produced by HFileOutputFormat are currently missing this metadata. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-4032) HBASE-451 improperly breaks public API HRegionInfo#getTableDesc
[ https://issues.apache.org/jira/browse/HBASE-4032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13073003#comment-13073003 ] Ted Yu commented on HBASE-4032: --- There is already hbase/regionserver/TestHRegionInfo.java Shall we name the test as hbase/regionserver/TestTableDescOfHRegionInfo.java ? HBASE-451 improperly breaks public API HRegionInfo#getTableDesc --- Key: HBASE-4032 URL: https://issues.apache.org/jira/browse/HBASE-4032 Project: HBase Issue Type: Bug Reporter: Andrew Purtell Assignee: stack Priority: Blocker Fix For: 0.92.0 Attachments: 4032-v2.txt, 4032.txt After HBASE-451, HRegionInfo#getTableDesc has been modified to always return {{null}}. One immediate effect is broken unit tests. That aside, it is not in the spirit of deprecation to actually break the method until after the deprecation cycle, it's a bug. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-4032) HBASE-451 improperly breaks public API HRegionInfo#getTableDesc
[ https://issues.apache.org/jira/browse/HBASE-4032?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13073011#comment-13073011 ] stack commented on HBASE-4032: -- Thats my bad. Copy the method from o.a.h.h.THRI into o.a.h.h.r.THRI? HBASE-451 improperly breaks public API HRegionInfo#getTableDesc --- Key: HBASE-4032 URL: https://issues.apache.org/jira/browse/HBASE-4032 Project: HBase Issue Type: Bug Reporter: Andrew Purtell Assignee: stack Priority: Blocker Fix For: 0.92.0 Attachments: 4032-v2.txt, 4032.txt After HBASE-451, HRegionInfo#getTableDesc has been modified to always return {{null}}. One immediate effect is broken unit tests. That aside, it is not in the spirit of deprecation to actually break the method until after the deprecation cycle, it's a bug. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-4032) HBASE-451 improperly breaks public API HRegionInfo#getTableDesc
[ https://issues.apache.org/jira/browse/HBASE-4032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated HBASE-4032: -- Attachment: 4032-v3.txt Performed minor formatting. The new test is added to TestHRegionInfo and it passes. HBASE-451 improperly breaks public API HRegionInfo#getTableDesc --- Key: HBASE-4032 URL: https://issues.apache.org/jira/browse/HBASE-4032 Project: HBase Issue Type: Bug Reporter: Andrew Purtell Assignee: stack Priority: Blocker Fix For: 0.92.0 Attachments: 4032-v2.txt, 4032-v3.txt, 4032.txt After HBASE-451, HRegionInfo#getTableDesc has been modified to always return {{null}}. One immediate effect is broken unit tests. That aside, it is not in the spirit of deprecation to actually break the method until after the deprecation cycle, it's a bug. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-4032) HBASE-451 improperly breaks public API HRegionInfo#getTableDesc
[ https://issues.apache.org/jira/browse/HBASE-4032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated HBASE-4032: -- Attachment: 4032-v3.txt HBASE-451 improperly breaks public API HRegionInfo#getTableDesc --- Key: HBASE-4032 URL: https://issues.apache.org/jira/browse/HBASE-4032 Project: HBase Issue Type: Bug Reporter: Andrew Purtell Assignee: stack Priority: Blocker Fix For: 0.92.0 Attachments: 4032-v2.txt, 4032-v3.txt, 4032.txt After HBASE-451, HRegionInfo#getTableDesc has been modified to always return {{null}}. One immediate effect is broken unit tests. That aside, it is not in the spirit of deprecation to actually break the method until after the deprecation cycle, it's a bug. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-4032) HBASE-451 improperly breaks public API HRegionInfo#getTableDesc
[ https://issues.apache.org/jira/browse/HBASE-4032?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ted Yu updated HBASE-4032: -- Attachment: (was: 4032-v3.txt) HBASE-451 improperly breaks public API HRegionInfo#getTableDesc --- Key: HBASE-4032 URL: https://issues.apache.org/jira/browse/HBASE-4032 Project: HBase Issue Type: Bug Reporter: Andrew Purtell Assignee: stack Priority: Blocker Fix For: 0.92.0 Attachments: 4032-v2.txt, 4032-v3.txt, 4032.txt After HBASE-451, HRegionInfo#getTableDesc has been modified to always return {{null}}. One immediate effect is broken unit tests. That aside, it is not in the spirit of deprecation to actually break the method until after the deprecation cycle, it's a bug. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Updated] (HBASE-4003) Cleanup Calls Conservatively On Timeout
[ https://issues.apache.org/jira/browse/HBASE-4003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karthick Sankarachary updated HBASE-4003: - Attachment: HBASE-4003-V2.patch Cleanup Calls Conservatively On Timeout --- Key: HBASE-4003 URL: https://issues.apache.org/jira/browse/HBASE-4003 Project: HBase Issue Type: Bug Components: client Affects Versions: 0.90.3 Reporter: Karthick Sankarachary Assignee: Karthick Sankarachary Fix For: 0.92.0 Attachments: HBASE-4003-V2.patch, HBASE-4003.patch In the event of a socket timeout, the {{HBaseClient}} iterates over the outstanding calls (on that socket), and notifies them that a {{SocketTimeoutException}} has occurred. Ideally, we should clean up just those calls that have been outstanding for longer than the specified socket timeout. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-4003) Cleanup Calls Conservatively On Timeout
[ https://issues.apache.org/jira/browse/HBASE-4003?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13073081#comment-13073081 ] Karthick Sankarachary commented on HBASE-4003: -- Please review the second version of the patch, which passes most of the tests, including the one that led to the revert of the initial version. The list of failed test cases is shown below, which I'm currently looking into: # org.apache.hadoop.hbase.master.TestDistributedLogSplitting # org.apache.hadoop.hbase.regionserver.TestSplitLogWorker # org.apache.hadoop.hbase.TestFullLogReconstruction # org.apache.hadoop.hbase.client.TestMultiParallel # org.apache.hadoop.hbase.master.TestMasterFailover # org.apache.hadoop.hbase.client.TestScannerTimeout In short, the key changes in the second version are outlined below: # Ensure that the exception bubbles up in {#cleanupCalls} if RPC timeout is 0, which will be the case in the event the connection is closed. # Order the {Connection#calls} map by the call id, so that we can iterate over it in insertion order. # Invoke the {#cleanupCalls} method even if the {#receiveResponse} method doesn't error out. In theory, even if the socket doesn't time out, it's possible that some of the older calls may have timed out. # Reset the socket timeout in {#cleanupCall} to the shortest time that any call is willing to wait. This is essentially the RPC timeout minus the time the oldest call has already spent waiting. Please let me know if you've any questions. 
Cleanup Calls Conservatively On Timeout --- Key: HBASE-4003 URL: https://issues.apache.org/jira/browse/HBASE-4003 Project: HBase Issue Type: Bug Components: client Affects Versions: 0.90.3 Reporter: Karthick Sankarachary Assignee: Karthick Sankarachary Fix For: 0.92.0 Attachments: HBASE-4003-V2.patch, HBASE-4003.patch In the event of a socket timeout, the {{HBaseClient}} iterates over the outstanding calls (on that socket), and notifies them that a {{SocketTimeoutException}} has occurred. Ideally, we should be cleanup up just those calls that have been outstanding for longer than the specified socket timeout. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
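The socket-timeout reset described in the comment's point 4 is simple arithmetic: with calls iterated in insertion (call-id) order, the next timeout is the RPC timeout minus how long the oldest outstanding call has already waited, floored at zero. A hypothetical helper illustrating the math (not the patch itself):

```java
// Sketch: compute the next socket timeout so the oldest outstanding call
// waits no longer than the configured RPC timeout in total.
public class CallTimeoutSketch {
    public static long nextSocketTimeout(long rpcTimeoutMs,
                                         long oldestCallStartMs,
                                         long nowMs) {
        long alreadyWaited = nowMs - oldestCallStartMs;
        return Math.max(0, rpcTimeoutMs - alreadyWaited);
    }
}
```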
[jira] [Commented] (HBASE-3810) Registering a Coprocessor at HTableDescriptor should be less strict
[ https://issues.apache.org/jira/browse/HBASE-3810?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13073082#comment-13073082 ] jirapos...@reviews.apache.org commented on HBASE-3810: -- --- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/1051/ --- (Updated 2011-07-30 00:22:53.438496) Review request for hbase. Changes --- Addressed Gary's comments: - Fixed constant naming issues. - Pass conf instance to loadTableCoprocessor() Summary --- Registering a Coprocessor at HTableDescriptor should be less strict - fix regex to account for whitespace around : separators - make path portion optional – we already skip the path handling if the class can be loaded by the classloader - make priority optional and default to USER At revision 3, added HTableDescriptor.addCoprocessor() for loading a table-level coprocessor. This addresses bug HBase-3810. https://issues.apache.org/jira/browse/HBase-3810 Diffs (updated) - src/main/java/org/apache/hadoop/hbase/HTableDescriptor.java 0641f52 src/main/java/org/apache/hadoop/hbase/regionserver/RegionCoprocessorHost.java c2b3558 src/test/java/org/apache/hadoop/hbase/coprocessor/TestClassLoading.java e0bde92 Diff: https://reviews.apache.org/r/1051/diff Testing --- Tests passed locally. Thanks, Mingjie Registering a Coprocessor at HTableDescriptor should be less strict --- Key: HBASE-3810 URL: https://issues.apache.org/jira/browse/HBASE-3810 Project: HBase Issue Type: Improvement Components: coprocessors Affects Versions: 0.92.0 Environment: all Reporter: Joerg Schad Assignee: Mingjie Lai Priority: Minor Fix For: 0.92.0 Original Estimate: 2h Remaining Estimate: 2h Registering a Coprocessor in the following way will fail, as the Coprocessor$1 key is case sensitive (COPROCESSOR$1 works fine instead). Removing this restriction would improve usability. 
HTableDescriptor desc = new HTableDescriptor(tName); desc.setValue("Coprocessor$1", path.toString() + ":" + full_class_name + ":" + Coprocessor.Priority.USER); -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
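The lenient parsing the review request describes — whitespace tolerated around the `:` separators, optional path, priority defaulting to USER — can be sketched with a simple split-and-trim. This is purely illustrative and not the patch's actual regex; in particular, the two-part form is assumed here to mean class:priority, which is one way to resolve the ambiguity.

```java
// Hypothetical spec parser: "path : class : priority", with path and
// priority optional. Returns {path, className, priority}.
public class CoprocessorSpecSketch {
    public static String[] parse(String spec) {
        String[] parts = spec.split(":");
        String path = parts.length == 3 ? parts[0].trim() : "";
        String className = (parts.length == 3 ? parts[1] : parts[0]).trim();
        // Default the priority to USER when the spec omits it.
        String priority = parts.length >= 2 ? parts[parts.length - 1].trim()
                                            : "USER";
        return new String[] { path, className, priority };
    }
}
```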
[jira] [Updated] (HBASE-4003) Cleanup Calls Conservatively On Timeout
[ https://issues.apache.org/jira/browse/HBASE-4003?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Karthick Sankarachary updated HBASE-4003: - Status: Patch Available (was: Reopened) Cleanup Calls Conservatively On Timeout --- Key: HBASE-4003 URL: https://issues.apache.org/jira/browse/HBASE-4003 Project: HBase Issue Type: Bug Components: client Affects Versions: 0.90.3 Reporter: Karthick Sankarachary Assignee: Karthick Sankarachary Fix For: 0.92.0 Attachments: HBASE-4003-V2.patch, HBASE-4003.patch In the event of a socket timeout, the {{HBaseClient}} iterates over the outstanding calls (on that socket), and notifies them that a {{SocketTimeoutException}} has occurred. Ideally, we should be cleanup up just those calls that have been outstanding for longer than the specified socket timeout. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (HBASE-4027) Enable direct byte buffers LruBlockCache
[ https://issues.apache.org/jira/browse/HBASE-4027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13073087#comment-13073087 ] jirapos...@reviews.apache.org commented on HBASE-4027: -- bq. On 2011-07-29 05:14:48, Ted Yu wrote: bq. conf/hbase-env.sh, line 42 bq. https://reviews.apache.org/r/1214/diff/2/?file=28356#file28356line42 bq. bq. Extra whitespace is obvious on review board. Fixed. bq. On 2011-07-29 05:14:48, Ted Yu wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/BlockCacheTestUtils.java, line 2 bq. https://reviews.apache.org/r/1214/diff/2/?file=28358#file28358line2 bq. bq. Year of copyright. Fixed. bq. On 2011-07-29 05:14:48, Ted Yu wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/BlockCacheTestUtils.java, line 48 bq. https://reviews.apache.org/r/1214/diff/2/?file=28358#file28358line48 bq. bq. Long line should wrap. Fixed. bq. On 2011-07-29 05:14:48, Ted Yu wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/BlockCacheTestUtils.java, line 64 bq. https://reviews.apache.org/r/1214/diff/2/?file=28358#file28358line64 bq. bq. Indentation should be corrected for the if block. Fixed bq. On 2011-07-29 05:14:48, Ted Yu wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/CacheStats.java, line 2 bq. https://reviews.apache.org/r/1214/diff/2/?file=28359#file28359line2 bq. bq. Copyright missing. Fixed bq. On 2011-07-29 05:14:48, Ted Yu wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/CacheStats.java, line 25 bq. https://reviews.apache.org/r/1214/diff/2/?file=28359#file28359line25 bq. bq. This implies that an eviction may involve fewer than one block. bq. If so, this count can be named evictedBlockCount. Fixed. bq. On 2011-07-29 05:14:48, Ted Yu wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/DoubleBlockCache.java, line 100 bq. https://reviews.apache.org/r/1214/diff/2/?file=28360#file28360line100 bq. bq. Should we add hitCount for offHeapCache ? bq. 
This can be addressed in a follow-on JIRA. I think the offHeapCache should have a hitcount, but we can deal with it later. Right now, it has the full CacheStats metrics. bq. On 2011-07-29 05:14:48, Ted Yu wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/DoubleBlockCache.java, line 115 bq. https://reviews.apache.org/r/1214/diff/2/?file=28360#file28360line115 bq. bq. In a follow-on JIRA, we can distinguish between onHeap eviction and offHeap eviction counts. Agreed. I may be redoing this substantially in a follow on Jira. bq. On 2011-07-29 05:14:48, Ted Yu wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/DoubleBlockCache.java, line 145 bq. https://reviews.apache.org/r/1214/diff/2/?file=28360#file28360line145 bq. bq. Is this a typo ? Yup, and fixed. - Li --- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/1214/#review1220 --- On 2011-07-28 23:02:50, Li Pi wrote: bq. bq. --- bq. This is an automatically generated e-mail. To reply, visit: bq. https://reviews.apache.org/r/1214/ bq. --- bq. bq. (Updated 2011-07-28 23:02:50) bq. bq. bq. Review request for hbase, Todd Lipcon, Ted Yu, Michael Stack, Jonathan Gray, and Li Pi. bq. bq. bq. Summary bq. --- bq. bq. Review request - I apparently can't edit tlipcon's earlier posting of my diff, so creating a new one. bq. bq. bq. This addresses bug HBase-4027. bq. https://issues.apache.org/jira/browse/HBase-4027 bq. bq. bq. Diffs bq. - bq. 
[jira] [Commented] (HBASE-4027) Enable direct byte buffers LruBlockCache
[ https://issues.apache.org/jira/browse/HBASE-4027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13073090#comment-13073090 ] jirapos...@reviews.apache.org commented on HBASE-4027: -- --- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/1214/ --- (Updated 2011-07-30 00:39:48.920025) Review request for hbase, Todd Lipcon, Ted Yu, Michael Stack, Jonathan Gray, and Li Pi. Changes --- Addressed TedYu/JGray's comments. Summary --- Review request - I apparently can't edit tlipcon's earlier posting of my diff, so creating a new one. This addresses bug HBase-4027. https://issues.apache.org/jira/browse/HBase-4027 Diffs (updated) - conf/hbase-env.sh 2d55d27 src/main/java/org/apache/hadoop/hbase/io/hfile/BlockCache.java 509121d src/main/java/org/apache/hadoop/hbase/io/hfile/BlockCacheTestUtils.java PRE-CREATION src/main/java/org/apache/hadoop/hbase/io/hfile/CacheStats.java PRE-CREATION src/main/java/org/apache/hadoop/hbase/io/hfile/DoubleBlockCache.java PRE-CREATION src/main/java/org/apache/hadoop/hbase/io/hfile/LruBlockCache.java ecab7ca src/main/java/org/apache/hadoop/hbase/io/hfile/SimpleBlockCache.java 150f54f src/main/java/org/apache/hadoop/hbase/io/hfile/slab/SingleSizeCache.java PRE-CREATION src/main/java/org/apache/hadoop/hbase/io/hfile/slab/Slab.java PRE-CREATION src/main/java/org/apache/hadoop/hbase/io/hfile/slab/SlabCache.java PRE-CREATION src/main/java/org/apache/hadoop/hbase/io/hfile/slab/SlabItemEvictionWatcher.java PRE-CREATION src/main/java/org/apache/hadoop/hbase/regionserver/HRegionServer.java 1d5e3fa src/main/java/org/apache/hadoop/hbase/regionserver/StoreFile.java b600020 src/test/java/org/apache/hadoop/hbase/io/hfile/slab/TestSingleSlabCache.java PRE-CREATION src/test/java/org/apache/hadoop/hbase/io/hfile/slab/TestSlab.java PRE-CREATION src/test/java/org/apache/hadoop/hbase/io/hfile/slab/TestSlabCache.java PRE-CREATION Diff: https://reviews.apache.org/r/1214/diff Testing --- 
Ran benchmarks against it in HBase standalone mode. Wrote test cases for all classes; multithreaded test cases exist for the cache. Thanks, Li Enable direct byte buffers LruBlockCache Key: HBASE-4027 URL: https://issues.apache.org/jira/browse/HBASE-4027 Project: HBase Issue Type: Improvement Reporter: Jason Rutherglen Assignee: Li Pi Priority: Minor Attachments: 4027-v5.diff, HBase-4027.pdf, hbase-4027v6.diff, slabcachepatch.diff, slabcachepatchv2.diff, slabcachepatchv3.1.diff, slabcachepatchv3.2.diff, slabcachepatchv3.diff, slabcachepatchv4.5.diff, slabcachepatchv4.diff Java offers the creation of direct byte buffers which are allocated outside of the heap. They need to be manually free'd, which can be accomplished using an undocumented {{clean}} method. The feature will be optional. After implementing, we can benchmark for differences in speed and garbage collection behavior. -- This message is automatically generated by JIRA. For more information on JIRA, see: http://www.atlassian.com/software/jira
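The undocumented {{clean}} trick the description refers to can be sketched as below. This is a best-effort illustration, not HBase code: the reflective access targets internal JDK details ({{cleaner()}} on the direct buffer implementation class), so it is JDK-version dependent and may silently fall back to GC-based reclamation.

```java
import java.lang.reflect.Method;
import java.nio.ByteBuffer;

public class DirectBufferDemo {
    // Allocate off-heap memory; the buffer's storage is not on the GC heap.
    static ByteBuffer allocate(int size) {
        return ByteBuffer.allocateDirect(size);
    }

    // Best-effort manual free via the undocumented cleaner. On JDKs where the
    // reflective access is blocked, we simply let the GC reclaim the buffer.
    static void free(ByteBuffer buf) {
        if (!buf.isDirect()) return;
        try {
            Method cleanerMethod = buf.getClass().getMethod("cleaner");
            cleanerMethod.setAccessible(true);
            Object cleaner = cleanerMethod.invoke(buf);
            Method clean = cleaner.getClass().getMethod("clean");
            clean.setAccessible(true);
            clean.invoke(cleaner);
        } catch (Exception e) {
            // Reflection refused (e.g. module restrictions): fall back to GC.
        }
    }

    public static void main(String[] args) {
        ByteBuffer buf = allocate(64);
        buf.putInt(0, 42);
        System.out.println(buf.getInt(0)); // prints 42
        free(buf); // buf must not be used after this point
    }
}
```

After {{free()}}, the buffer must never be touched again; a use-after-free here crashes the JVM rather than throwing an exception, which is why the feature is proposed as optional.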
[jira] [Updated] (HBASE-4027) Enable direct byte buffers LruBlockCache
[ https://issues.apache.org/jira/browse/HBASE-4027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Pi updated HBASE-4027: - Attachment: 4027v7.diff Addressed some of Ted Yu/JGray's comments on Review Board.
[jira] [Updated] (HBASE-4027) Enable direct byte buffers LruBlockCache
[ https://issues.apache.org/jira/browse/HBASE-4027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Li Pi updated HBASE-4027: - Attachment: HBase-4027 (1).pdf New updated benchmark results, tested with a 1 GB dataset as per Todd's suggestion. The FS cache does better, but still can't match the off-heap cache.
[jira] [Commented] (HBASE-4027) Enable direct byte buffers LruBlockCache
[ https://issues.apache.org/jira/browse/HBASE-4027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13073097#comment-13073097 ] jirapos...@reviews.apache.org commented on HBASE-4027: -- bq. On 2011-07-29 17:41:50, Jonathan Gray wrote: bq. conf/hbase-env.sh, lines 44-45 bq. https://reviews.apache.org/r/1214/diff/1/?file=28125#file28125line44 bq. bq. Is MaxDirectMemorySize determinable on the running jvm? Could we make the offheapcachesize config as a percentage of the direct memory size like we have for memstore/blockcache today? (default of 0.95 or something would make it so it never really has to be set for most cases... and i'm not sure what exactly a bit above the off heap cache size is) I haven't figured out a way. Just asked StackOverflow - hopefully they'll have an answer. bq. On 2011-07-29 17:41:50, Jonathan Gray wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/BlockCacheTestUtils.java, line 2 bq. https://reviews.apache.org/r/1214/diff/1/?file=28127#file28127line2 bq. bq. 2011 Fixed. bq. On 2011-07-29 17:41:50, Jonathan Gray wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/CacheStats.java, line 1 bq. https://reviews.apache.org/r/1214/diff/1/?file=28128#file28128line1 bq. bq. license Fixed. bq. On 2011-07-29 17:41:50, Jonathan Gray wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/BlockCacheTestUtils.java, line 41 bq. https://reviews.apache.org/r/1214/diff/1/?file=28127#file28127line41 bq. bq. whitespace Fixed. bq. On 2011-07-29 17:41:50, Jonathan Gray wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/CacheStats.java, line 5 bq. https://reviews.apache.org/r/1214/diff/1/?file=28128#file28128line5 bq. bq. class comment Added bq. On 2011-07-29 17:41:50, Jonathan Gray wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/DoubleBlockCache.java, line 39 bq. https://reviews.apache.org/r/1214/diff/1/?file=28129#file28129line39 bq. bq. 
Would it make sense to have DoubleBlockCache be more generic? Does it need to be fixed with these two types or could it take two BlockCache's and they are executed in the order they are given in (just need to be clear in doc). bq. bq. If this was generic, it could be reused for various multi-level caches (like an underlying cache with compressed blocks and one above it with uncompressed blocks) It can be made more generic easily. I just haven't done it. Will do though - just create a constructor that takes two BlockCaches? bq. On 2011-07-29 17:41:50, Jonathan Gray wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/DoubleBlockCache.java, line 66 bq. https://reviews.apache.org/r/1214/diff/1/?file=28129#file28129line66 bq. bq. longer than 80 chars fixed. bq. On 2011-07-29 17:41:50, Jonathan Gray wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/DoubleBlockCache.java, lines 77-79 bq. https://reviews.apache.org/r/1214/diff/1/?file=28129#file28129line77 bq. bq. This seems like a behavior that we may not always want. bq. bq. If we made this class generic, could we have some kind of policy we initiate it with? (like default cache in level one, if accessed in level one, cache in level two, etc?) bq. bq. we're going to always be double-storing anything so that the offHeap true capacity is (totalOffHeap - totalOnHeap). in some cases, we might want to cache on heap first and then if evicted we cache off heap, or maybe we want it to work more like the existing LRU (first read goes into off heap, second read upgrades it to the on heap cache and removes from the off heap) I was thinking of sending all initial caches to the off heap cache, giving it an inbuilt scan resistance, but LRUBlockCache already does partitioning to deal with scans. bq. On 2011-07-29 17:41:50, Jonathan Gray wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/DoubleBlockCache.java, line 115 bq. https://reviews.apache.org/r/1214/diff/1/?file=28129#file28129line115 bq. bq. 
this is going to make for some weird stats? seems like we may need to actually expose the stats of each underlying cache rather than both? (or both and separate). it's going to be difficult to understand what's happening when the hit and eviction stats cover both. The idea was the combined evicted stats of both, but yeah, CacheStats are implemented for both LruBlockCache and SlabCache, so exposing both might be a good idea. bq. On 2011-07-29 17:41:50, Jonathan Gray wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/slab/SingleSizeCache.java, line 74 bq. https://reviews.apache.org/r/1214/diff/1/?file=28132#file28132line74 bq. bq. huh? Just a typo. Fixed. Don't know why that line is there.
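The generic multi-level cache idea discussed above can be sketched roughly as follows. This is a hypothetical illustration using plain maps, not the actual DoubleBlockCache/BlockCache API; the cache-off-heap-first and promote-on-L2-hit policies are the ones debated in the thread, not necessarily what the patch implements.

```java
import java.util.HashMap;
import java.util.Map;

/** Minimal sketch of a two-level cache where an L2 hit promotes the block
 *  into L1. In the real design, L1 would be an LruBlockCache and L2 a
 *  SlabCache; a generic version could take any two BlockCaches. */
class TwoLevelCache<K, V> {
    final Map<K, V> onHeap = new HashMap<>();   // stands in for LruBlockCache
    final Map<K, V> offHeap = new HashMap<>();  // stands in for SlabCache

    void cacheBlock(K key, V block) {
        // Policy from the review thread: cache new blocks off-heap first,
        // so a single large scan cannot wipe out the on-heap level.
        offHeap.put(key, block);
    }

    V getBlock(K key) {
        V block = onHeap.get(key);
        if (block != null) return block;            // L1 hit
        block = offHeap.get(key);
        if (block != null) onHeap.put(key, block);  // L2 hit: promote to L1
        return block;                               // null on a miss
    }
}
```

Keeping per-level stats (rather than one merged CacheStats) falls out naturally here: each level can count its own hits and evictions, which is the separate-exposure idea raised above.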
[jira] [Commented] (HBASE-4027) Enable direct byte buffers LruBlockCache
[ https://issues.apache.org/jira/browse/HBASE-4027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13073099#comment-13073099 ] jirapos...@reviews.apache.org commented on HBASE-4027: -- bq. On 2011-07-29 05:14:48, Ted Yu wrote: bq. src/main/java/org/apache/hadoop/hbase/io/hfile/DoubleBlockCache.java, line 100 bq. https://reviews.apache.org/r/1214/diff/2/?file=28360#file28360line100 bq. bq. Should we add hitCount for offHeapCache ? bq. This can be addressed in a follow-on JIRA. bq. bq. Li Pi wrote: bq. I think the offHeapCache should have a hitcount, but we can deal with it later. Right now, it has the full CacheStats metrics. What I meant to say is that it does have a hit count. It just isn't displayed anywhere. - Li --- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/1214/#review1220 ---
[jira] [Commented] (HBASE-4027) Enable direct byte buffers LruBlockCache
[ https://issues.apache.org/jira/browse/HBASE-4027?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13073113#comment-13073113 ] jirapos...@reviews.apache.org commented on HBASE-4027: -- --- This is an automatically generated e-mail. To reply, visit: https://reviews.apache.org/r/1214/#review1238 --- conf/hbase-env.sh https://reviews.apache.org/r/1214/#comment2812 Still some whitespaces. src/main/java/org/apache/hadoop/hbase/io/hfile/slab/Slab.java https://reviews.apache.org/r/1214/#comment2813 Doesn't match actual name of class. src/main/java/org/apache/hadoop/hbase/io/hfile/slab/SlabCache.java https://reviews.apache.org/r/1214/#comment2814 I don't see SkipList being used in this class. src/test/java/org/apache/hadoop/hbase/io/hfile/slab/TestSlabCache.java https://reviews.apache.org/r/1214/#comment2815 This doesn't match the actual test. - Ted
[jira] [Created] (HBASE-4149) Javadoc for Result.getRow is confusing to new users.
Javadoc for Result.getRow is confusing to new users. Key: HBASE-4149 URL: https://issues.apache.org/jira/browse/HBASE-4149 Project: HBase Issue Type: Bug Reporter: Elliott Clark Priority: Trivial org.apache.hadoop.hbase.client.Result#getRow is confusing to new users. The documentation could be read to mean the raw data of the row. In addition, it is written with improper grammar.
[jira] [Updated] (HBASE-4149) Javadoc for Result.getRow is confusing to new users.
[ https://issues.apache.org/jira/browse/HBASE-4149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Elliott Clark updated HBASE-4149: - Attachment: HBASE-4149-0.patch Small doc change included.
[jira] [Updated] (HBASE-4149) Javadoc for Result.getRow is confusing to new users.
[ https://issues.apache.org/jira/browse/HBASE-4149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Elliott Clark updated HBASE-4149: - Status: Patch Available (was: Open)
[jira] [Updated] (HBASE-3845) data loss because lastSeqWritten can miss memstore edits
[ https://issues.apache.org/jira/browse/HBASE-3845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Anirudh Todi updated HBASE-3845: Attachment: HBASE-3845-fix-TestResettingCounters-test.txt Hi folks - I have been working with Prakash. The patch I have submitted should fix the issue with TestResettingCounters failing. data loss because lastSeqWritten can miss memstore edits Key: HBASE-3845 URL: https://issues.apache.org/jira/browse/HBASE-3845 Project: HBase Issue Type: Bug Affects Versions: 0.90.3 Reporter: Prakash Khemani Assignee: ramkrishna.s.vasudevan Priority: Critical Fix For: 0.90.5 Attachments: 0001-HBASE-3845-data-loss-because-lastSeqWritten-can-miss.patch, HBASE-3845-fix-TestResettingCounters-test.txt, HBASE-3845_1.patch, HBASE-3845_2.patch, HBASE-3845_4.patch, HBASE-3845_5.patch, HBASE-3845_6.patch, HBASE-3845__trunk.patch, HBASE-3845_trunk_2.patch, HBASE-3845_trunk_3.patch (I don't have a test case to prove this yet, but I have run it by Dhruba and Kannan internally and wanted to put this up for some feedback.) In this discussion let us assume that the region has only one column family; that way I can use region/memstore interchangeably. After a memstore flush it is possible for lastSeqWritten to have a log-sequence-id for a region that is not the earliest log-sequence-id for that region's memstore. HLog.append() does a putIfAbsent into lastSeqWritten. This is to ensure that we only keep track of the earliest log-sequence-number that is present in the memstore. Every time the memstore is flushed we remove the region's entry from lastSeqWritten and wait for the next append to populate this entry again. This is where the problem happens. Step 1: flusher.prepare() snapshots the memstore under HRegion.updatesLock.writeLock(). Step 2: as soon as the updatesLock.writeLock() is released, new entries will be added into the memstore. Step 3: wal.completeCacheFlush() is called. This method removes the region's entry from lastSeqWritten.
Step 4: the next append will create a new entry for the region in lastSeqWritten, but this will be the log seq id of the current append. All the edits that were added in Step 2 are missing. == As a temporary measure, instead of removing the region's entry in Step 3, I will replace it with the log-seq-id of the region-flush-event.
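The bookkeeping race described in the four steps above can be sketched with a toy tracker. The names and the flush-sequence-id argument are illustrative stand-ins, not the actual HLog code; the sketch only models the map updates, not the WAL itself.

```java
import java.util.concurrent.ConcurrentHashMap;

// Toy model of the lastSeqWritten map described above. append() records
// only the earliest log-sequence-id still present in the memstore, via
// putIfAbsent: later appends for the same region are no-ops.
class SeqTracker {
    final ConcurrentHashMap<String, Long> lastSeqWritten = new ConcurrentHashMap<>();

    void append(String region, long seqId) {
        lastSeqWritten.putIfAbsent(region, seqId);
    }

    // Step 3 as described: removing the entry opens the window in which
    // the edits appended during Step 2 are no longer accounted for.
    void completeCacheFlushBuggy(String region) {
        lastSeqWritten.remove(region);
    }

    // The temporary measure proposed above: replace the entry with the
    // log-seq-id of the region-flush-event instead of removing it.
    void completeCacheFlushFixed(String region, long flushSeqId) {
        lastSeqWritten.put(region, flushSeqId);
    }
}
```

With the buggy path, an edit appended in Step 2 (say seq id 5) leaves no trace once the entry is removed and the next append re-creates it with a later id; the fixed path keeps an entry that lower-bounds everything still in the memstore.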