date:20110506

[jira] [Updated] (HBASE-3863) Fix failing TestHBaseFsck.testHBaseFsckMeta unit test

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-3863:
-

Attachment: 3863.txt

Fix.

 Fix failing TestHBaseFsck.testHBaseFsckMeta unit test
 -

 Key: HBASE-3863
 URL: https://issues.apache.org/jira/browse/HBASE-3863
 Project: HBase
  Issue Type: Bug
Reporter: stack
 Fix For: 0.92.0

 Attachments: 3863.txt


 Failing with:
 {code}
 Error Message
 org.apache.hadoop.hbase.HServerAddress cannot be cast to 
 org.apache.hadoop.hbase.ServerName
 Stacktrace
 java.lang.ClassCastException: org.apache.hadoop.hbase.HServerAddress cannot 
 be cast to org.apache.hadoop.hbase.ServerName
   at 
 org.apache.hadoop.hbase.util.HBaseFsckRepair.fixDupeAssignment(HBaseFsckRepair.java:59)
   at 
 org.apache.hadoop.hbase.util.TestHBaseFsck.testHBaseFsckMeta(TestHBaseFsck.java:163)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
 
 {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HBASE-3863) Fix failing TestHBaseFsck.testHBaseFsckMeta unit test

2011-05-06 Thread stack (JIRA)

Fix failing TestHBaseFsck.testHBaseFsckMeta unit test
-

 Key: HBASE-3863
 URL: https://issues.apache.org/jira/browse/HBASE-3863
 Project: HBase
  Issue Type: Bug
Reporter: stack
 Attachments: 3863.txt

Failing with:

{code}
Error Message

org.apache.hadoop.hbase.HServerAddress cannot be cast to 
org.apache.hadoop.hbase.ServerName
Stacktrace

java.lang.ClassCastException: org.apache.hadoop.hbase.HServerAddress cannot be 
cast to org.apache.hadoop.hbase.ServerName
at 
org.apache.hadoop.hbase.util.HBaseFsckRepair.fixDupeAssignment(HBaseFsckRepair.java:59)
at 
org.apache.hadoop.hbase.util.TestHBaseFsck.testHBaseFsckMeta(TestHBaseFsck.java:163)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)

{code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (HBASE-3863) Fix failing TestHBaseFsck.testHBaseFsckMeta unit test

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3863?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-3863.
--

   Resolution: Fixed
Fix Version/s: 0.92.0
 Assignee: stack

Committing small fix.

 Fix failing TestHBaseFsck.testHBaseFsckMeta unit test
 -

 Key: HBASE-3863
 URL: https://issues.apache.org/jira/browse/HBASE-3863
 Project: HBase
  Issue Type: Bug
Reporter: stack
Assignee: stack
 Fix For: 0.92.0

 Attachments: 3863.txt


 Failing with:
 {code}
 Error Message
 org.apache.hadoop.hbase.HServerAddress cannot be cast to 
 org.apache.hadoop.hbase.ServerName
 Stacktrace
 java.lang.ClassCastException: org.apache.hadoop.hbase.HServerAddress cannot 
 be cast to org.apache.hadoop.hbase.ServerName
   at 
 org.apache.hadoop.hbase.util.HBaseFsckRepair.fixDupeAssignment(HBaseFsckRepair.java:59)
   at 
 org.apache.hadoop.hbase.util.TestHBaseFsck.testHBaseFsckMeta(TestHBaseFsck.java:163)
   at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
   at 
 sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
   at 
 sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
   at java.lang.reflect.Method.invoke(Method.java:597)
 
 {code}

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-3855) Performance degradation of memstore because reseek is linear

2011-05-06 Thread dhruba borthakur (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-3855?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13029787#comment-13029787
]

dhruba borthakur commented on HBASE-3855:
-

I have a cf with three columns. I insert 100K puts on a single rowkey with
random column values. all these kvs are cached in the memstore. A single scan
to retrieve the values for two specified columnname takes about 4 millisecons
of CPU without this patch and about 0.025 millieseconds of cpu with this patch.
Unfortunately, I cannot share the test program because it is intertwined with
quite a few non-opensource-able code.

However, I think the patch will benefit all cases because it is better to do a
O(logn) lookup rather then even a small number of sequential scans.

Performance degradation of memstore because reseek is linear

Key: HBASE-3855
URL: https://issues.apache.org/jira/browse/HBASE-3855
Project: HBase
Issue Type: Improvement
Reporter: dhruba borthakur
Assignee: dhruba borthakur
Priority: Blocker
Fix For: 0.94.0

Attachments: memstoreReseek.txt

The scanner use reseek to find the next row (or next column) as part of a
scan. The reseek code iterates over a Set to position itself at the right
place. If there are many thousands of kvs that need to be skipped over, then
the time-cost is very high. In this case, a seek would be far lesser in cost
than a reseek.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-3855) Performance degradation of memstore because reseek is linear

2011-05-06 Thread dhruba borthakur (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

dhruba borthakur updated HBASE-3855:


Attachment: memstoreReseek2.txt

A minor improvement from the previous patch
1.  invoke seek only if the getNext()call returned null and numIterReseek is 
zero.

 Performance degradation of memstore because reseek is linear
 

 Key: HBASE-3855
 URL: https://issues.apache.org/jira/browse/HBASE-3855
 Project: HBase
  Issue Type: Improvement
Reporter: dhruba borthakur
Assignee: dhruba borthakur
Priority: Blocker
 Fix For: 0.94.0

 Attachments: memstoreReseek.txt, memstoreReseek2.txt


 The scanner use reseek to find the next row (or next column) as part of a 
 scan. The reseek code iterates over a Set to position itself at the right 
 place. If there are many thousands of kvs that need to be skipped over, then 
 the time-cost is very high. In this case, a seek would be far lesser in cost 
 than a reseek.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-3691) Add compressor support for 'snappy', google's compressor

2011-05-06 Thread Nichole Treadway (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-3691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Nichole Treadway updated HBASE-3691:

Attachment: hbase-snappy-3691-trunk-003.patch

Thanks for the patch...I made a few additional changes in HColumnDescriptor,
and I updated the test files to include snappy.

I noticed there are places in the hbase.avro classes where snappy support would
need to be added in. Is it ok to add these changes in the patch, or do the avro
classes need to be auto-generated somehow?

Add compressor support for 'snappy', google's compressor

Key: HBASE-3691
URL: https://issues.apache.org/jira/browse/HBASE-3691
Project: HBase
Issue Type: Task
Reporter: stack
Priority: Critical
Fix For: 0.92.0

Attachments: hbase-snappy-3691-trunk-002.patch,
hbase-snappy-3691-trunk.patch

http://code.google.com/p/snappy/ is apache licensed.
bq. Snappy is a compression/decompression library. It does not aim for
maximum compression, or compatibility with any other compression library;
instead, it aims for very high speeds and reasonable compression. For
instance, compared to the fastest mode of zlib, Snappy is an order of
magnitude faster for most inputs, but the resulting compressed files are
anywhere from 20% to 100% bigger. On a single core of a Core i7 processor in
64-bit mode, Snappy compresses at about 250 MB/sec or more and decompresses
at about 500 MB/sec or more.
bq. Snappy is widely used inside Google, in everything from BigTable and
MapReduce to our internal RPC systems. (Snappy has previously been referred
to as Zippy in some presentations and the likes.)
Lets get it in.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-3691) Add compressor support for 'snappy', google's compressor

2011-05-06 Thread Nichole Treadway (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Nichole Treadway updated HBASE-3691:


Attachment: (was: hbase-snappy-3691-trunk-003.patch)

 Add compressor support for 'snappy', google's compressor
 

 Key: HBASE-3691
 URL: https://issues.apache.org/jira/browse/HBASE-3691
 Project: HBase
  Issue Type: Task
Reporter: stack
Priority: Critical
 Fix For: 0.92.0

 Attachments: hbase-snappy-3691-trunk-002.patch, 
 hbase-snappy-3691-trunk.patch


 http://code.google.com/p/snappy/ is apache licensed.
 bq. Snappy is a compression/decompression library. It does not aim for 
 maximum compression, or compatibility with any other compression library; 
 instead, it aims for very high speeds and reasonable compression. For 
 instance, compared to the fastest mode of zlib, Snappy is an order of 
 magnitude faster for most inputs, but the resulting compressed files are 
 anywhere from 20% to 100% bigger. On a single core of a Core i7 processor in 
 64-bit mode, Snappy compresses at about 250 MB/sec or more and decompresses 
 at about 500 MB/sec or more.
 bq. Snappy is widely used inside Google, in everything from BigTable and 
 MapReduce to our internal RPC systems. (Snappy has previously been referred 
 to as Zippy in some presentations and the likes.)
 Lets get it in.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-3691) Add compressor support for 'snappy', google's compressor

2011-05-06 Thread Nichole Treadway (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-3691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]

Nichole Treadway updated HBASE-3691:

Attachment: hbase-snappy-3691-trunk-003.patch

Accidentally selected wrong license option.

Add compressor support for 'snappy', google's compressor

Key: HBASE-3691
URL: https://issues.apache.org/jira/browse/HBASE-3691
Project: HBase
Issue Type: Task
Reporter: stack
Priority: Critical
Fix For: 0.92.0

Attachments: hbase-snappy-3691-trunk-002.patch,
hbase-snappy-3691-trunk-003.patch, hbase-snappy-3691-trunk.patch

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-3855) Performance degradation of memstore because reseek is linear

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3855?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-3855:
-

   Resolution: Fixed
Fix Version/s: (was: 0.94.0)
   0.92.0
 Hadoop Flags: [Reviewed]
   Status: Resolved  (was: Patch Available)

Committed to TRUNK.  Thanks for the patch Dhruba.

 Performance degradation of memstore because reseek is linear
 

 Key: HBASE-3855
 URL: https://issues.apache.org/jira/browse/HBASE-3855
 Project: HBase
  Issue Type: Improvement
Reporter: dhruba borthakur
Assignee: dhruba borthakur
Priority: Blocker
 Fix For: 0.92.0

 Attachments: memstoreReseek.txt, memstoreReseek2.txt


 The scanner use reseek to find the next row (or next column) as part of a 
 scan. The reseek code iterates over a Set to position itself at the right 
 place. If there are many thousands of kvs that need to be skipped over, then 
 the time-cost is very high. In this case, a seek would be far lesser in cost 
 than a reseek.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-3850) Log more details when a scanner lease expires

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3850?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-3850:
-

Priority: Critical  (was: Minor)
Tags: noob

 Log more details when a scanner lease expires
 -

 Key: HBASE-3850
 URL: https://issues.apache.org/jira/browse/HBASE-3850
 Project: HBase
  Issue Type: Improvement
  Components: regionserver
Reporter: Benoit Sigoure
Priority: Critical

 The message logged by the RegionServer when a Scanner lease expires isn't as 
 useful as it could be.  {{Scanner 4765412385779771089 lease expired}} - most 
 clients don't log their scanner ID, so it's really hard to figure out what 
 was going on.  I think it would be useful to at least log the name of the 
 region on which the Scanner was open, and it would be great to have the 
 ip:port of the client that had that lease too.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-3417) CacheOnWrite is using the temporary output path for block names, need to use a more consistent block naming scheme

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3417?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-3417:
-

Status: Patch Available  (was: Open)

This patch did not go in.  Want me to massage it in?

 CacheOnWrite is using the temporary output path for block names, need to use 
 a more consistent block naming scheme
 --

 Key: HBASE-3417
 URL: https://issues.apache.org/jira/browse/HBASE-3417
 Project: HBase
  Issue Type: Bug
  Components: io, regionserver
Affects Versions: 0.92.0
Reporter: Jonathan Gray
Assignee: Jonathan Gray
Priority: Critical
 Fix For: 0.92.0

 Attachments: HBASE-3417-v1.patch, HBASE-3417-v2.patch, 
 HBASE-3417-v5.patch


 Currently the block names used in the block cache are built using the 
 filesystem path.  However, for cache on write, the path is a temporary output 
 file.
 The original COW patch actually made some modifications to block naming stuff 
 to make it more consistent but did not do enough.  Should add a separate 
 method somewhere for generating block names using some more easily mocked 
 scheme (rather than just raw path as we generate a random unique file name 
 twice, once for tmp and then again when moved into place).

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Assigned] (HBASE-2765) Master should display currently unassigned regions in UI

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-2765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack reassigned HBASE-2765:


Assignee: Todd Lipcon

You have fixed this, haven't you Todd?  I see RIT in master UI now.  And we 
have the hbck tool.  Can we close this issue out?

 Master should display currently unassigned regions in UI
 

 Key: HBASE-2765
 URL: https://issues.apache.org/jira/browse/HBASE-2765
 Project: HBase
  Issue Type: New Feature
  Components: master
Affects Versions: 0.90.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon

 The master often knows that a region is unassigned, but thinks it's getting 
 assigned. It would be very useful to display the current set of unassigned 
 regions in the UI, as well as their current state (eg where it's supposed to 
 be getting assigned, how long ago it last heard from it, etc). This would be 
 useful in debugging/diagnosing cluster issues (eg I just noticed that a 
 client had hung, took me a while to figure out that I had a region that was 
 repeatedly failing to open with an NPE due to HBASE-2729)

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-2675) Quick smoke tests testsuite

2011-05-06 Thread stack (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-2675?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13030021#comment-13030021
]

stack commented on HBASE-2675:
--

Is the following enough?

{code}
mvn clean test -Dtest=TestFromClientSide
{code}

It excercises the UI from the client side -- it starts up a minicluster --
doing about 40 odd little tests.

Quick smoke tests testsuite
-

Key: HBASE-2675
URL: https://issues.apache.org/jira/browse/HBASE-2675
Project: HBase
Issue Type: Test
Reporter: Benoit Sigoure
Priority: Minor

It would be nice if there was a known subset of the tests that run fast (e.g.
not more than a few seconds) and quickly help us check whether the code isn't
horribly broken. This way one could run those tests at a frequent interval
when iterating and only run the entire testsuite at the end, when they think
they're done, since doing so is very time consuming.
Someone would need to identify which tests really focus on the core
functionality and add a target in the build system to just run those tests.
As a bonus, it would be awesome++ if the core tests ran, say, 10x faster than
they currently do. There's a lot of sleep-based synchronization in the
tests and it would be nice to remove some of that where possible to make the
tests run as fast as the machine can handle them.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-2631) Decide between InMB and MB as suffix for field names in ClusterStatus objects

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-2631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-2631:
-

Labels: noob  (was: )

 Decide between InMB and MB as suffix for field names in ClusterStatus 
 objects
 -

 Key: HBASE-2631
 URL: https://issues.apache.org/jira/browse/HBASE-2631
 Project: HBase
  Issue Type: Improvement
Reporter: Jeff Hammerbacher
  Labels: noob

 We vacillate between InMB (e.g. HServerLoad.getMemStoreSizeInMB(), 
 HServerLoad.getStorefileIndexSizeInMB()) and MB (e.g. 
 HServerLoad.getUsedHeapMB()) in the various ClusterStatus objects 
 (HServerInfo, HServerLoad, HServerLoad.RegionLoad). We should probably pick 
 one and stick to it.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (HBASE-2646) Compaction requests should be prioritized to prevent blocking

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-2646?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-2646.
--

   Resolution: Fixed
Fix Version/s: 0.90.0
 Assignee: Jeff Whiting
 Hadoop Flags: [Reviewed]

This was applied a while back.  Resolving.  Thanks for the patch Jeff (Assigned 
it to you).

 Compaction requests should be prioritized to prevent blocking
 -

 Key: HBASE-2646
 URL: https://issues.apache.org/jira/browse/HBASE-2646
 Project: HBase
  Issue Type: Improvement
  Components: regionserver
Affects Versions: 0.20.4
 Environment: ubuntu server 10; hbase 0.20.4; 4 machine cluster (each 
 machine is an 8 core xeon with 16 GB of ram and 6TB of storage); ~250 Million 
 rows;
Reporter: Jeff Whiting
Assignee: Jeff Whiting
Priority: Critical
  Labels: compaction, split
 Fix For: 0.90.0

 Attachments: 2646-fix-race-condition-r1004349.txt, 2646-v2.txt, 
 2646-v3.txt, PriorityQueue-r996664.patch, prioritycompactionqueue-0.20.4.patch


 While testing the write capacity of a 4 machine hbase cluster we were getting 
 long and frequent client pauses as we attempted to load the data.  Looking 
 into the problem we'd get a relatively large compaction queue and when a 
 region hit the hbase.hstore.blockingStoreFiles limit it would get block the 
 client and the compaction request would get put on the back of the queue 
 waiting for many other less important compactions.  The client is basically 
 stuck at that point until a compaction is done.  Prioritizing the compaction 
 requests and allowing the request that is blocking other actions go first 
 would help solve the problem.
 You can see the problem by looking at our log files:
 You'll first see an event such as a too many HLog which will put a lot of 
 requests on the compaction queue.
 {noformat}
 2010-05-25 10:53:26,570 INFO org.apache.hadoop.hbase.regionserver.HLog: Too 
 many hlogs: logs=33, maxlogs=32; forcing flush of 22 regions(s): 
 responseCounts,RS_6eZzLtdwhGiTwHy,1274232223324, 
 responses,RS_0qhkL5rUmPCbx3K-1274213057242,1274513189592, 
 responses,RS_1ANYnTegjzVIsHW-12742177419
 21,1274511001873, responses,RS_1HQ4UG5BdOlAyuE-1274216757425,1274726323747, 
 responses,RS_1Y7SbqSTsZrYe7a-1274328697838,1274478031930, 
 responses,RS_1ZH5TB5OdW4BVLm-1274216239894,1274538267659, 
 responses,RS_3BHc4KyoM3q72Yc-1274290546987,1274502062319, 
 responses,RS_3ra9BaBMAXFAvbK-127421457
 9958,1274381552543, responses,RS_6SDrGNuyyLd3oR6-1274219941155,1274385453586, 
 responses,RS_8AGCEMWbI6mZuoQ-1274306857429,1274319602718, 
 responses,RS_8C8T9DN47uwTG1S-1274215381765,1274289112817, 
 responses,RS_8J5wmdmKmJXzK6g-1274299593861,1274494738952, 
 responses,RS_8e5Sz0HeFPAdb6c-1274288
 641459,1274495868557, 
 responses,RS_8rjcnmBXPKzI896-1274306981684,1274403047940, 
 responses,RS_9FS3VedcyrF0KX2-1274245971331,1274754745013, 
 responses,RS_9oZgPtxO31npv3C-1274214027769,1274396489756, 
 responses,RS_a3FdO2jhqWuy37C-1274209228660,1274399508186, 
 responses,RS_a3LJVxwTj29MHVa-12742
 {noformat}
 Then you see the too many log files:
 {noformat}
 2010-05-25 10:53:31,364 DEBUG 
 org.apache.hadoop.hbase.regionserver.CompactSplitThread: Compaction requested 
 for region 
 responses-index,--1274799047787--R_cBKrGxx0FdWjPso,1274804575862/783020138 
 because: regionserver/192.168.0.81:60020.cacheFlusher
 2010-05-25 10:53:32,364 WARN 
 org.apache.hadoop.hbase.regionserver.MemStoreFlusher: Region 
 responses-index,--1274799047787--R_cBKrGxx0FdWjPso,1274804575862 has too many 
 store files, putting it back at the end of the flush queue.
 {noformat}
 Which leads to this: 
 {noformat}
 2010-05-25 10:53:27,061 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
 Blocking updates for 'IPC Server handler 60 on 60020' on region 
 responses-index,--1274799047787--R_cBKrGxx0FdWjPso,1274804575862: memstore 
 size 128.0m is = than blocking 128.0m size
 2010-05-25 10:53:27,061 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
 Blocking updates for 'IPC Server handler 84 on 60020' on region 
 responses-index,--1274799047787--R_cBKrGxx0FdWjPso,1274804575862: memstore 
 size 128.0m is = than blocking 128.0m size
 2010-05-25 10:53:27,065 INFO org.apache.hadoop.hbase.regionserver.HRegion: 
 Blocking updates for 'IPC Server handler 1 on 60020' on region 
 responses-index,--1274799047787--R_cBKrGxx0FdWjPso,1274804575862: memstore 
 size 128.0m is = than blocking 128.0m size
 {noformat}
 Once the compaction / split is done a flush is able to happen which unblocks 
 the IPC allowing writes to continue.  Unfortunately this process can take 
 upwards of 15+ minutes (the specific case shown here from our logs took about 
 4 minutes).

--
This message is automatically generated by JIRA.
For more information on JIRA,

[jira] [Resolved] (HBASE-2465) HMaster should not contact each RS on startup

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-2465?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-2465.
--

Resolution: Won't Fix

We don't have a verifyClusterState any more and I don't think we contact each 
regionserver on startup any more (Reopen if I have this wrong Todd).

 HMaster should not contact each RS on startup
 -

 Key: HBASE-2465
 URL: https://issues.apache.org/jira/browse/HBASE-2465
 Project: HBase
  Issue Type: Improvement
  Components: master
Reporter: Todd Lipcon

 On startup, in verifyClusterState, the master contacts each region server 
 serially. If a region server is down it will retry for several minutes (if 
 the client retry setting is high). During this period, the master cannot be 
 shut down, and also isn't processing real work.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-3861) MiniZooKeeperCluster.startup() should refer to hbase.zookeeper.property.maxClientCnxns

2011-05-06 Thread Andrew Purtell (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-3861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13030029#comment-13030029
 ] 

Andrew Purtell commented on HBASE-3861:
---

+1, I'll commit with Stack's comment addressed also.

 MiniZooKeeperCluster.startup() should refer to 
 hbase.zookeeper.property.maxClientCnxns
 --

 Key: HBASE-3861
 URL: https://issues.apache.org/jira/browse/HBASE-3861
 Project: HBase
  Issue Type: Improvement
Affects Versions: 0.90.3
Reporter: Eugene Koontz
Assignee: Eugene Koontz
 Fix For: 0.90.3

 Attachments: HBASE-3861.patch, HBASE-3861.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 Currently the number of the client connections is hard-wired to 1000:
 {{{
 standaloneServerFactory = new NIOServerCnxnFactory();
 standaloneServerFactory.configure(new 
 InetSocketAddress(clientPort),1000);
   } catch (BindException e) {
  
 }}}
 This should be set according to the test environment's hbase configuration. 
 The property in 
 question is : hbase.zookeeper.property.maxClientCnxns.
 Currently some tests such as org.apache.hadoop.hbase.client.TestHCM fail 
 because the number of connections used by the HBase client exceeds 1000. 
 Recently MAX_CACHED_HBASE_INSTANCES increased from 31 to 2000 on 0.90 branch:
 http://svn.apache.org/viewvc/hbase/branches/0.90/src/main/java/org/apache/hadoop/hbase/client/HConnectionManager.java?p2=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fhadoop%2Fhbase%2Fclient%2FHConnectionManager.javap1=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fhadoop%2Fhbase%2Fclient%2FHConnectionManager.javar1=1096818r2=1096817view=diffpathrev=1096818
 and correspondingly the hbase config on the Zookeeper server-side also 
 increased in hbase-default.xml:
 http://svn.apache.org/viewvc/hbase/branches/0.90/src/main/resources/hbase-default.xml?p2=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fresources%2Fhbase-default.xmlp1=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fresources%2Fhbase-default.xmlr1=1091594r2=1091593view=diffpathrev=1091594
 So if MiniZKCluster looks at this setting, the test won't have this failure.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-3861) MiniZooKeeperCluster.startup() should refer to hbase.zookeeper.property.maxClientCnxns

2011-05-06 Thread Hbase Build Acct (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-3861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13030036#comment-13030036
 ] 

Hbase Build Acct commented on HBASE-3861:
-

Sounds good



 MiniZooKeeperCluster.startup() should refer to 
 hbase.zookeeper.property.maxClientCnxns
 --

 Key: HBASE-3861
 URL: https://issues.apache.org/jira/browse/HBASE-3861
 Project: HBase
  Issue Type: Improvement
Affects Versions: 0.90.3
Reporter: Eugene Koontz
Assignee: Eugene Koontz
 Fix For: 0.90.3

 Attachments: HBASE-3861.patch, HBASE-3861.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 Currently the number of the client connections is hard-wired to 1000:
 {{{
 standaloneServerFactory = new NIOServerCnxnFactory();
 standaloneServerFactory.configure(new 
 InetSocketAddress(clientPort),1000);
   } catch (BindException e) {
  
 }}}
 This should be set according to the test environment's hbase configuration. 
 The property in 
 question is : hbase.zookeeper.property.maxClientCnxns.
 Currently some tests such as org.apache.hadoop.hbase.client.TestHCM fail 
 because the number of connections used by the HBase client exceeds 1000. 
 Recently MAX_CACHED_HBASE_INSTANCES increased from 31 to 2000 on 0.90 branch:
 http://svn.apache.org/viewvc/hbase/branches/0.90/src/main/java/org/apache/hadoop/hbase/client/HConnectionManager.java?p2=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fhadoop%2Fhbase%2Fclient%2FHConnectionManager.javap1=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fhadoop%2Fhbase%2Fclient%2FHConnectionManager.javar1=1096818r2=1096817view=diffpathrev=1096818
 and correspondingly the hbase config on the Zookeeper server-side also 
 increased in hbase-default.xml:
 http://svn.apache.org/viewvc/hbase/branches/0.90/src/main/resources/hbase-default.xml?p2=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fresources%2Fhbase-default.xmlp1=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fresources%2Fhbase-default.xmlr1=1091594r2=1091593view=diffpathrev=1091594
 So if MiniZKCluster looks at this setting, the test won't have this failure.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (HBASE-2297) Make it so can send email and have it show as comments in JIRA

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-2297?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-2297.
--

Resolution: Fixed
  Assignee: stack

Resolving.  This works now (Though it shows as being from Hbase Build Acct --- 
can open new issue for that).

 Make it so can send email and have it show as comments in JIRA
 --

 Key: HBASE-2297
 URL: https://issues.apache.org/jira/browse/HBASE-2297
 Project: HBase
  Issue Type: Task
Reporter: stack
Assignee: stack

 Some discussion up in mailing list suggests it may be just a matter of 
 removing the 'Reply-To' which has the list in it.  I filed INFRA-2533.  I 
 also tested replying to an issue emission and changing the to to be 
 j...@apache.org instead and my comment went into the issue as a comment.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (HBASE-2240) Balancer should not reassign (because of overloading) a region that was just opened

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-2240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-2240.
--

Resolution: Invalid

Resolving as invalid (Jon argues this in body of this issue and stuff suggested 
in the issue -- upping balancer period -- has been implemented)

 Balancer should not reassign (because of overloading) a region that was just 
 opened
 ---

 Key: HBASE-2240
 URL: https://issues.apache.org/jira/browse/HBASE-2240
 Project: HBase
  Issue Type: Bug
  Components: documentation
Reporter: stack
Priority: Minor

 I'm running a mapreduce job.  I see a region split and its daughters come on 
 line and then 8 seconds later, master judges the regionserver overloaded and 
 so closes the just-opened region.  This messes up clients.   They may have 
 just picked up the new location for their row query and now its moved again 
 and client has to go hunting anew. 
 We need to assign the vintage regions first.
 I'm going to change balancer slop.  Its not sloppy enough and balancing cuts 
 in too early.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (HBASE-2963) based on hadoop0.21.0

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-2963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-2963.
--

Resolution: Duplicate

Duplicate of hbase-2233

 based on hadoop0.21.0
 -

 Key: HBASE-2963
 URL: https://issues.apache.org/jira/browse/HBASE-2963
 Project: HBase
  Issue Type: Improvement
Affects Versions: 0.20.6
Reporter: chenjiajun
Priority: Blocker
 Fix For: 0.92.0


 I upgrade my hadoop from 0.20.2 to 0.21.0，but hbase is does't work!

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (HBASE-2765) Master should display currently unassigned regions in UI

2011-05-06 Thread Todd Lipcon (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-2765?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Todd Lipcon resolved HBASE-2765.


Resolution: Duplicate

oh hey, yea, I forgot I had filed this! Yes, fixed for 0.92

 Master should display currently unassigned regions in UI
 

 Key: HBASE-2765
 URL: https://issues.apache.org/jira/browse/HBASE-2765
 Project: HBase
  Issue Type: New Feature
  Components: master
Affects Versions: 0.90.0
Reporter: Todd Lipcon
Assignee: Todd Lipcon

 The master often knows that a region is unassigned, but thinks it's getting 
 assigned. It would be very useful to display the current set of unassigned 
 regions in the UI, as well as their current state (eg where it's supposed to 
 be getting assigned, how long ago it last heard from it, etc). This would be 
 useful in debugging/diagnosing cluster issues (eg I just noticed that a 
 client had hung, took me a while to figure out that I had a region that was 
 repeatedly failing to open with an NPE due to HBASE-2729)

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (HBASE-2165) Improve fragmentation display and implementation

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-2165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-2165.
--

Resolution: Won't Fix

We don't have this attribute any more.  Resolving issue till it comes back.

 Improve fragmentation display and implementation
 

 Key: HBASE-2165
 URL: https://issues.apache.org/jira/browse/HBASE-2165
 Project: HBase
  Issue Type: Improvement
Affects Versions: 0.20.4, 0.90.0
Reporter: Lars George
Priority: Minor
 Fix For: 0.92.0


 Improve by 
 - moving the blocking FS scan into a thread so that the UI loads fast and 
 initially displays n/a but once it has completed the scan it displays the 
 proper numbers
 - explaining what fragmentation means to the user (better hints or help text 
 in UI)
 - Switch -ROOT- (and maybe even .META.?) to simply say Yes or a tick that 
 it is fragmented as it only has 0% or 100% available (since it has only a 
 single region)
 - also computing the space occupied by each table and the total and - if 
 easily done - add a graph to display it (Google Pie Chart would be nice but 
 is an external link)

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (HBASE-2010) Ant task to create the test jar is inconsistent with hadoop

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-2010?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-2010.
--

Resolution: Invalid

Marking as invalid since move to maven (We are no longer like hadoop -- it uses 
ivy)

 Ant task to create the test jar is inconsistent with hadoop
 ---

 Key: HBASE-2010
 URL: https://issues.apache.org/jira/browse/HBASE-2010
 Project: HBase
  Issue Type: Improvement
  Components: scripts
Affects Versions: 0.90.0
Reporter: Cosmin Lehene
Priority: Trivial
 Fix For: 0.92.0

 Attachments: HBASE-2010.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 In hadoop projects the jar-test task is used to compile the test jars. 
 In hbase this task is called compile-core-test. 
 Shouldn't we have a jar-test task that depends on the compile-core-test?

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-3861) MiniZooKeeperCluster.startup() should refer to hbase.zookeeper.property.maxClientCnxns

2011-05-06 Thread Andrew Purtell (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell updated HBASE-3861:
--

   Resolution: Fixed
Fix Version/s: 0.92.0
 Release Note:   (was: added new patch with numClientCnxns set to 5000 
(enough for TestHCM to pass).)
 Hadoop Flags: [Reviewed]
   Status: Resolved  (was: Patch Available)

Committed to trunk and 0.90 branch. Tests pass locally.

 MiniZooKeeperCluster.startup() should refer to 
 hbase.zookeeper.property.maxClientCnxns
 --

 Key: HBASE-3861
 URL: https://issues.apache.org/jira/browse/HBASE-3861
 Project: HBase
  Issue Type: Improvement
Affects Versions: 0.90.3
Reporter: Eugene Koontz
Assignee: Eugene Koontz
 Fix For: 0.90.3, 0.92.0

 Attachments: HBASE-3861.patch, HBASE-3861.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 Currently the number of the client connections is hard-wired to 1000:
 {{{
 standaloneServerFactory = new NIOServerCnxnFactory();
 standaloneServerFactory.configure(new 
 InetSocketAddress(clientPort),1000);
   } catch (BindException e) {
  
 }}}
 This should be set according to the test environment's hbase configuration. 
 The property in 
 question is : hbase.zookeeper.property.maxClientCnxns.
 Currently some tests such as org.apache.hadoop.hbase.client.TestHCM fail 
 because the number of connections used by the HBase client exceeds 1000. 
 Recently MAX_CACHED_HBASE_INSTANCES increased from 31 to 2000 on 0.90 branch:
 http://svn.apache.org/viewvc/hbase/branches/0.90/src/main/java/org/apache/hadoop/hbase/client/HConnectionManager.java?p2=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fhadoop%2Fhbase%2Fclient%2FHConnectionManager.javap1=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fhadoop%2Fhbase%2Fclient%2FHConnectionManager.javar1=1096818r2=1096817view=diffpathrev=1096818
 and correspondingly the hbase config on the Zookeeper server-side also 
 increased in hbase-default.xml:
 http://svn.apache.org/viewvc/hbase/branches/0.90/src/main/resources/hbase-default.xml?p2=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fresources%2Fhbase-default.xmlp1=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fresources%2Fhbase-default.xmlr1=1091594r2=1091593view=diffpathrev=1091594
 So if MiniZKCluster looks at this setting, the test won't have this failure.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HBASE-3864) Rename of hfile.min.blocksize.size in HBASE-2899 reverted in HBASE-1861

2011-05-06 Thread Aaron T. Myers (JIRA)

Rename of hfile.min.blocksize.size in HBASE-2899 reverted in HBASE-1861
---

 Key: HBASE-3864
 URL: https://issues.apache.org/jira/browse/HBASE-3864
 Project: HBase
  Issue Type: Bug
  Components: mapreduce
Reporter: Aaron T. Myers


HBASE-2899 renamed {{hfile.min.blocksize.size}} to 
{{hbase.mapreduce.hfileoutputformat.blocksize}}. However, HBASE-1861 (committed 
after HBASE-2899) reverted this change.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-3864) Rename of hfile.min.blocksize.size in HBASE-2899 reverted in HBASE-1861

2011-05-06 Thread Aaron T. Myers (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron T. Myers updated HBASE-3864:
--

Attachment: hbase-3864.0.patch

Patch which changes the config back from {{hfile.min.blocksize.size}} to 
{{hbase.mapreduce.hfileoutputformat.blocksize}}.

 Rename of hfile.min.blocksize.size in HBASE-2899 reverted in HBASE-1861
 ---

 Key: HBASE-3864
 URL: https://issues.apache.org/jira/browse/HBASE-3864
 Project: HBase
  Issue Type: Bug
  Components: mapreduce
Reporter: Aaron T. Myers
 Attachments: hbase-3864.0.patch


 HBASE-2899 renamed {{hfile.min.blocksize.size}} to 
 {{hbase.mapreduce.hfileoutputformat.blocksize}}. However, HBASE-1861 
 (committed after HBASE-2899) reverted this change.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-3864) Rename of hfile.min.blocksize.size in HBASE-2899 reverted in HBASE-1861

2011-05-06 Thread Aaron T. Myers (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aaron T. Myers updated HBASE-3864:
--

Status: Patch Available  (was: Open)

 Rename of hfile.min.blocksize.size in HBASE-2899 reverted in HBASE-1861
 ---

 Key: HBASE-3864
 URL: https://issues.apache.org/jira/browse/HBASE-3864
 Project: HBase
  Issue Type: Bug
  Components: mapreduce
Reporter: Aaron T. Myers
 Attachments: hbase-3864.0.patch


 HBASE-2899 renamed {{hfile.min.blocksize.size}} to 
 {{hbase.mapreduce.hfileoutputformat.blocksize}}. However, HBASE-1861 
 (committed after HBASE-2899) reverted this change.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-3744) createTable blocks until all regions are out of transition

2011-05-06 Thread stack (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-3744?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13030114#comment-13030114
]

stack commented on HBASE-3744:
--

I applied the addendum w/ some massaging, no refactoring. Thanks for the patch
Ted. Lets see if it cures the failing TestAdmin in 0.90.

createTable blocks until all regions are out of transition
--

Key: HBASE-3744
URL: https://issues.apache.org/jira/browse/HBASE-3744
Project: HBase
Issue Type: Bug
Components: master
Affects Versions: 0.90.1
Reporter: Todd Lipcon
Assignee: Ted Yu
Priority: Critical
Fix For: 0.90.3

Attachments: 3744-addendum-for-TestAdmin.patch, 3744-addendum.txt,
3744-v2.txt, 3744-v3.txt, 3744.txt, create_big_tables.rb,
create_big_tables.rb, create_big_tables.rb

In HBASE-3305, the behavior of createTable was changed and introduced this
bug: createTable now blocks until all regions have been assigned, since it
uses BulkStartupAssigner. BulkStartupAssigner.waitUntilDone calls
assignmentManager.waitUntilNoRegionsInTransition, which waits across all
regions, not just the regions of the table that has just been created.
We saw an issue where one table had a region which was unable to be opened,
so it was stuck in RegionsInTransition permanently (every open was failing).
Since this was the case, waitUntilDone would always block indefinitely even
though the newly created table had been assigned.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-3865) Failing TestWALReplay

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-3865:
-

Attachment: 3865.txt

 Failing TestWALReplay
 -

 Key: HBASE-3865
 URL: https://issues.apache.org/jira/browse/HBASE-3865
 Project: HBase
  Issue Type: Bug
Reporter: stack
 Attachments: 3865.txt


 Failed last few times up on jenkins.  The change to the signature of the 
 internalFlushCache to add passing of a MonitorVisitor meant the override here 
 was no longer called.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HBASE-3865) Failing TestWALReplay

2011-05-06 Thread stack (JIRA)

Failing TestWALReplay
-

 Key: HBASE-3865
 URL: https://issues.apache.org/jira/browse/HBASE-3865
 Project: HBase
  Issue Type: Bug
Reporter: stack
 Attachments: 3865.txt

Failed last few times up on jenkins.  The change to the signature of the 
internalFlushCache to add passing of a MonitorVisitor meant the override here 
was no longer called.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (HBASE-3865) Failing TestWALReplay

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-3865.
--

   Resolution: Fixed
Fix Version/s: 0.92.0
 Assignee: stack

Applied to TRUNK

 Failing TestWALReplay
 -

 Key: HBASE-3865
 URL: https://issues.apache.org/jira/browse/HBASE-3865
 Project: HBase
  Issue Type: Bug
Reporter: stack
Assignee: stack
 Fix For: 0.92.0

 Attachments: 3865.txt


 Failed last few times up on jenkins.  The change to the signature of the 
 internalFlushCache to add passing of a MonitorVisitor meant the override here 
 was no longer called.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-3861) MiniZooKeeperCluster.startup() should refer to hbase.zookeeper.property.maxClientCnxns

2011-05-06 Thread stack (JIRA)


[ 
https://issues.apache.org/jira/browse/HBASE-3861?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13030154#comment-13030154
 ] 

stack commented on HBASE-3861:
--

I just backed this out of TRUNK because its OOME'ing TestHCM up on hudson and 
locally on two different platforms (I can't see why it would do it though... 
must be some interaction w/ hbase-3777).  It runs fine for me on branch.

 MiniZooKeeperCluster.startup() should refer to 
 hbase.zookeeper.property.maxClientCnxns
 --

 Key: HBASE-3861
 URL: https://issues.apache.org/jira/browse/HBASE-3861
 Project: HBase
  Issue Type: Improvement
Affects Versions: 0.90.3
Reporter: Eugene Koontz
Assignee: Eugene Koontz
 Fix For: 0.90.3, 0.92.0

 Attachments: HBASE-3861.patch, HBASE-3861.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 Currently the number of the client connections is hard-wired to 1000:
 {{{
 standaloneServerFactory = new NIOServerCnxnFactory();
 standaloneServerFactory.configure(new 
 InetSocketAddress(clientPort),1000);
   } catch (BindException e) {
  
 }}}
 This should be set according to the test environment's hbase configuration. 
 The property in 
 question is : hbase.zookeeper.property.maxClientCnxns.
 Currently some tests such as org.apache.hadoop.hbase.client.TestHCM fail 
 because the number of connections used by the HBase client exceeds 1000. 
 Recently MAX_CACHED_HBASE_INSTANCES increased from 31 to 2000 on 0.90 branch:
 http://svn.apache.org/viewvc/hbase/branches/0.90/src/main/java/org/apache/hadoop/hbase/client/HConnectionManager.java?p2=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fhadoop%2Fhbase%2Fclient%2FHConnectionManager.javap1=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fhadoop%2Fhbase%2Fclient%2FHConnectionManager.javar1=1096818r2=1096817view=diffpathrev=1096818
 and correspondingly the hbase config on the Zookeeper server-side also 
 increased in hbase-default.xml:
 http://svn.apache.org/viewvc/hbase/branches/0.90/src/main/resources/hbase-default.xml?p2=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fresources%2Fhbase-default.xmlp1=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fresources%2Fhbase-default.xmlr1=1091594r2=1091593view=diffpathrev=1091594
 So if MiniZKCluster looks at this setting, the test won't have this failure.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Reopened] (HBASE-3861) MiniZooKeeperCluster.startup() should refer to hbase.zookeeper.property.maxClientCnxns

2011-05-06 Thread Andrew Purtell (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3861?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andrew Purtell reopened HBASE-3861:
---


Caused TestHCM to OOME on trunk up on Jenkins. Backed out for now. Reopening to 
investigate / fix.

 MiniZooKeeperCluster.startup() should refer to 
 hbase.zookeeper.property.maxClientCnxns
 --

 Key: HBASE-3861
 URL: https://issues.apache.org/jira/browse/HBASE-3861
 Project: HBase
  Issue Type: Improvement
Affects Versions: 0.90.3
Reporter: Eugene Koontz
Assignee: Eugene Koontz
 Fix For: 0.90.3, 0.92.0

 Attachments: HBASE-3861.patch, HBASE-3861.patch

   Original Estimate: 1h
  Remaining Estimate: 1h

 Currently the number of the client connections is hard-wired to 1000:
 {{{
 standaloneServerFactory = new NIOServerCnxnFactory();
 standaloneServerFactory.configure(new 
 InetSocketAddress(clientPort),1000);
   } catch (BindException e) {
  
 }}}
 This should be set according to the test environment's hbase configuration. 
 The property in 
 question is : hbase.zookeeper.property.maxClientCnxns.
 Currently some tests such as org.apache.hadoop.hbase.client.TestHCM fail 
 because the number of connections used by the HBase client exceeds 1000. 
 Recently MAX_CACHED_HBASE_INSTANCES increased from 31 to 2000 on 0.90 branch:
 http://svn.apache.org/viewvc/hbase/branches/0.90/src/main/java/org/apache/hadoop/hbase/client/HConnectionManager.java?p2=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fhadoop%2Fhbase%2Fclient%2FHConnectionManager.javap1=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fjava%2Forg%2Fapache%2Fhadoop%2Fhbase%2Fclient%2FHConnectionManager.javar1=1096818r2=1096817view=diffpathrev=1096818
 and correspondingly the hbase config on the Zookeeper server-side also 
 increased in hbase-default.xml:
 http://svn.apache.org/viewvc/hbase/branches/0.90/src/main/resources/hbase-default.xml?p2=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fresources%2Fhbase-default.xmlp1=%2Fhbase%2Fbranches%2F0.90%2Fsrc%2Fmain%2Fresources%2Fhbase-default.xmlr1=1091594r2=1091593view=diffpathrev=1091594
 So if MiniZKCluster looks at this setting, the test won't have this failure.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Resolved] (HBASE-1242) HBase Design Overview

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-1242?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack resolved HBASE-1242.
--

Resolution: Later

Marking as later.  No work done on this issue in a while.

 HBase Design Overview
 -

 Key: HBASE-1242
 URL: https://issues.apache.org/jira/browse/HBASE-1242
 Project: HBase
  Issue Type: Task
  Components: documentation
Reporter: Evgeny Ryabitskiy
Assignee: Evgeny Ryabitskiy
Priority: Minor
 Attachments: HBase_Arc6.jpeg


 I'm going to make wiki page contain HBase Design overview. That will cover 
  - architecture of HBase,
  - used solutions
  - unresolved problems
 It should be something like http://labs.google.com/papers/bigtable.html 
 focused on Implementation part.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Commented] (HBASE-3732) New configuration option for client-side compression

2011-05-06 Thread Karthick Sankarachary (JIRA)

[
https://issues.apache.org/jira/browse/HBASE-3732?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13030167#comment-13030167
]

Karthick Sankarachary commented on HBASE-3732:
--

Oh, I see. I like the idea of keeping the value in a compressed form until the
client tries to get it. Perhaps we can compress the value depending on
whether it's fatter than a certain threshold? Also, given that the value
typically accounts for most of the {{KeyValue}}'s size, do we need to call
{{HFile#getCompressingStream}} if the value is already compressed up front?

New configuration option for client-side compression

Key: HBASE-3732
URL: https://issues.apache.org/jira/browse/HBASE-3732
Project: HBase
Issue Type: New Feature
Reporter: Jean-Daniel Cryans
Fix For: 0.92.0

Attachments: compressed_streams.jar

We have a case here where we have to store very fat cells (arrays of
integers) which can amount into the hundreds of KBs that we need to read
often, concurrently, and possibly keep in cache. Compressing the values on
the client using java.util.zip's Deflater before sending them to HBase proved
to be in our case almost an order of magnitude faster.
There reasons are evident: less data sent to hbase, memstore contains
compressed data, block cache contains compressed data too, etc.
I was thinking that it might be something useful to add to a family schema,
so that Put/Result do the conversion for you. The actual compression algo
should also be configurable.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Assigned] (HBASE-3864) Rename of hfile.min.blocksize.size in HBASE-2899 reverted in HBASE-1861

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack reassigned HBASE-3864:


Assignee: Aaron T. Myers

 Rename of hfile.min.blocksize.size in HBASE-2899 reverted in HBASE-1861
 ---

 Key: HBASE-3864
 URL: https://issues.apache.org/jira/browse/HBASE-3864
 Project: HBase
  Issue Type: Bug
  Components: mapreduce
Reporter: Aaron T. Myers
Assignee: Aaron T. Myers
 Attachments: hbase-3864.0.patch


 HBASE-2899 renamed {{hfile.min.blocksize.size}} to 
 {{hbase.mapreduce.hfileoutputformat.blocksize}}. However, HBASE-1861 
 (committed after HBASE-2899) reverted this change.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-3864) Rename of hfile.min.blocksize.size in HBASE-2899 reverted in HBASE-1861

2011-05-06 Thread stack (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3864?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

stack updated HBASE-3864:
-

   Resolution: Fixed
Fix Version/s: 0.92.0
 Hadoop Flags: [Reviewed]
   Status: Resolved  (was: Patch Available)

Committed to TRUNK.  Thanks for the patch Aaron.

 Rename of hfile.min.blocksize.size in HBASE-2899 reverted in HBASE-1861
 ---

 Key: HBASE-3864
 URL: https://issues.apache.org/jira/browse/HBASE-3864
 Project: HBase
  Issue Type: Bug
  Components: mapreduce
Reporter: Aaron T. Myers
Assignee: Aaron T. Myers
 Fix For: 0.92.0

 Attachments: hbase-3864.0.patch


 HBASE-2899 renamed {{hfile.min.blocksize.size}} to 
 {{hbase.mapreduce.hfileoutputformat.blocksize}}. However, HBASE-1861 
 (committed after HBASE-2899) reverted this change.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Created] (HBASE-3866) Script to add regions gradually to a new regionserver.

2011-05-06 Thread Aravind Gottipati (JIRA)

Script to add regions gradually to a new regionserver.
--

 Key: HBASE-3866
 URL: https://issues.apache.org/jira/browse/HBASE-3866
 Project: HBase
  Issue Type: Improvement
  Components: scripts
Affects Versions: 0.90.2
Reporter: Aravind Gottipati
Priority: Minor


When a new region server is brought online, the current balancer kicks off a 
whole bunch of region moves and causes a lot of regions to be un-available 
right away.  A slower balancer that gradually balances the cluster is probably 
a good script to have.  I have an initial version that mooches off the 
region_mover script to do this.



--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

[jira] [Updated] (HBASE-3866) Script to add regions gradually to a new regionserver.

2011-05-06 Thread Aravind Gottipati (JIRA)


 [ 
https://issues.apache.org/jira/browse/HBASE-3866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Aravind Gottipati updated HBASE-3866:
-

Attachment: slow_balancer.rb

This script uses a lot of the code from region_mover.rb.  The script should be 
invoked like this.

 HBASE_NOEXEC=true $HBASE_HOME/bin/hbase org.jruby.Main 
$HBASE_HOME/bin/slow_balancer.rb --debug -l 2

The -l option is the target difference between the server with the maximum 
regions and the server with the minimum regions.  Once the delta reaches this 
point, the script exits.  If -l is not passed, it defaults to the number of 
region servers in your environment.


 Script to add regions gradually to a new regionserver.
 --

 Key: HBASE-3866
 URL: https://issues.apache.org/jira/browse/HBASE-3866
 Project: HBase
  Issue Type: Improvement
  Components: scripts
Affects Versions: 0.90.2
Reporter: Aravind Gottipati
Priority: Minor
 Attachments: slow_balancer.rb


 When a new region server is brought online, the current balancer kicks off a 
 whole bunch of region moves and causes a lot of regions to be un-available 
 right away.  A slower balancer that gradually balances the cluster is 
 probably a good script to have.  I have an initial version that mooches off 
 the region_mover script to do this.

--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira

73 matches

Mail list logo