Defect or work as designed? stop-hbase.sh hangs there and have to manually kill the master daemon

2011-03-22 Thread Yu Li
Hi all, I noticed that using hbase 0.20.6, if we set HBASE_MANAGES_ZK to false in hbase-env.sh, and set the hbase.zookeeper.quorum property in hbase-site.xml, but don't start any zookeeper service in the cluster, then hbase master/regionserver would keep on trying to connect to the zookeeper

Regionserver exit quitely

2011-03-22 Thread 茅旭峰
Hi, I saw logs at regionserver like == 2011-03-22 15:58:26,890 DEBUG org.apache.hadoop.hbase.regionserver.handler.CloseRegionHandler: Processing close of table1,j-sLRcrI_bZ1NbfoRH-0fz-g9m0=,1300726276228.b13159778f4e190b0ee99c655de6d928. 2011-03-22 15:58:26,890 DEBUG

it does't use the parameter about hbase.regionserver.flushlogentries

2011-03-22 Thread Gaojinchao
I want to improve the writing performance through adjusting hbase.regionserver.flushlogentries. But it seems no changing. In version 0.90.1, why does remove this parameter from file hbase-default.xml?

Anybody Familiar with REST+Hbase?

2011-03-22 Thread sreejith P. K.
Hello, Is there anyone familiar with stargate REST interface to HBASE and PHP? Can anybody provide materials for the same other than http://wiki.apache.org/hadoop/Hbase/Stargate ? Thanks -- Sreejith PK Nesote Technologies (P) Ltd

Re: schema WAS: Stargate and Hbase

2011-03-22 Thread sreejith P. K.
Hi Ted Yu, Thank you for your reply. It works fine using default Scan filter functions in Java. How can I make it possible using PHP? Thanks On Sat, Mar 19, 2011 at 9:56 PM, Ted Yu yuzhih...@gmail.com wrote: sreejith: I leave your second question to other experts. Let me try to answer schema

Re: Regionserver exit quitely

2011-03-22 Thread 茅旭峰
Sorry, I found some logs indicating lost connection to zookeeper, and be treated as dead node, far away from the previous logs. I'll check further, please ignore this thread. Mao Xu-Feng On Tue, Mar 22, 2011 at 4:52 PM, 茅旭峰 m9s...@gmail.com wrote: Hi, I saw logs at regionserver like ==

HRegionserver memory usage

2011-03-22 Thread Shahnawaz Saifi
Hi, While uploading data to hbase memory usage getting too high (95%). we have a cluster of seven nodes (hadoop master namenode, secondary namenode, hbase master node, zookeeper node, slave datanode1, 2,3). Slave data nodes are getting memory usage upto 99% and same time later some of them are

Re: importing dataset, some problems and performance issues

2011-03-22 Thread Ferdy Galema
HBase already makes my life better, so no worries there :) I agree the topic of this thread is not clear anymore. I also already know how to tackle my problem. So just for the record let me explain what I was thinking/doing: The original intend was to clean up my HBase installation (remove

Re: Anybody Familiar with REST+Hbase?

2011-03-22 Thread Ted Yu
See: http://hbase.apache.org/docs/current/api/org/apache/hadoop/hbase/rest/package-summary.html On Tue, Mar 22, 2011 at 2:07 AM, sreejith P. K. sreejit...@nesote.comwrote: Hello, Is there anyone familiar with stargate REST interface to HBASE and PHP? Can anybody provide materials for the same

Solved: Region servers dying without logs (hardware errors)

2011-03-22 Thread M.Deniz OKTAR
Hi all, We were having stability issues with yahoo benchmark and hbase on our servers. Region servers were dying without trace and even sometimes end up with system freeze. After trying everything, we have switched to a new cluster and our problems solved. Our previous cluster was a sandy bridge

Re: Solved: Region servers dying without logs (hardware errors)

2011-03-22 Thread Stack
Thank you for updating the list Deniz. St.Ack On Tue, Mar 22, 2011 at 7:25 AM, M.Deniz OKTAR deniz.ok...@gmail.com wrote: Hi all, We were having stability issues with yahoo benchmark and hbase on our servers. Region servers were dying without trace and even sometimes end up with system

Re: HRegionserver memory usage

2011-03-22 Thread Stack
Can you provide more detail on versions and memory sizes. In general, the JVM grows to occupy the allotted heap so your reports below about its using 95% and 99% could be just 'normal' behavior. When its 'unresponsive', what do the logs say? What do you mean by 'unresponsive'? DN won't take

Re: schema WAS: Stargate and Hbase

2011-03-22 Thread Stack
On Tue, Mar 22, 2011 at 2:16 AM, sreejith P. K. sreejit...@nesote.com wrote: How can I make it possible using PHP? You would go via REST or thrift. Study the APIs on these Interfaces to see if they offer you sufficient to implement what Ted described above. St.Ack

Re: Defect or work as designed? stop-hbase.sh hangs there and have to manually kill the master daemon

2011-03-22 Thread Stack
Sounds like a bug. Can you try on 0.90.1 and see if it has same issue? Thanks, St.Ack On Mon, Mar 21, 2011 at 11:55 PM, Yu Li car...@gmail.com wrote: Hi all, I noticed that using hbase 0.20.6, if we set HBASE_MANAGES_ZK to false in hbase-env.sh, and set the hbase.zookeeper.quorum property

Re: File formats in Hadoop

2011-03-22 Thread Weishung Chung
My fellow superb hbase experts, Looking at the HFile specs and have some questions. How is a particular table cell in a HBase table being represented in the HFile? Does the key of the key value pair represent the rowkey+column family:qualifier+timestamp and the value represent the corresponding

Re: HRegionserver memory usage

2011-03-22 Thread Shahnawaz Saifi
we are using hbase-0.20.6 hbasemaster, master secondary nn has 36GB of Memory, zookeeper slave datanodes got 16GB. And heap size is default in hbase-env.sh. unresponsive: box is not accessible it hangs and shows java error on screen. DN uses entire available resources. Client are able to

Re: File formats in Hadoop

2011-03-22 Thread Vivek Krishna
http://nosql.mypopescu.com/post/3220921756/hbase-internals-hfile-explained might help. Viv On Tue, Mar 22, 2011 at 11:43 AM, Weishung Chung weish...@gmail.com wrote: My fellow superb hbase experts, Looking at the HFile specs and have some questions. How is a particular table cell in a

Re: File formats in Hadoop

2011-03-22 Thread Weishung Chung
I also found this informative article http://cloudepr.blogspot.com/2009/09/hfile-block-indexed-file-format-to.html http://cloudepr.blogspot.com/2009/09/hfile-block-indexed-file-format-to.htmlis the key value pair be eg column family1 with one qualifier 1 with 2 versions key1 : rowkey1+column

We're seeing problems loading data into HBase using MR

2011-03-22 Thread h
Hey everyone, I've got a situation where my data loads to HBase are failing. The data is sent an isolated HBase cluster from a different hadoop cluster. What I see is that the performance is pretty bad (around 40k burst, 1k average inserts - with about 200 byte payloads). If I were to write

JobControl and HBase MR chaining

2011-03-22 Thread Vishal Kapoor
with JobControl not rewritten under mapreduce and continues to be under mapred( hadoop version 20.2) expecting subclasses of Mapper from mapred how do we use TableMapper and TableReducer to chain jobs? I am utterly confused, any ray of hope? thanks, Vishal

Re: Hbck errors

2011-03-22 Thread Adam Phelps
On 3/21/11 10:13 PM, Stack wrote: On Mon, Mar 21, 2011 at 7:19 PM, Adam Phelpsa...@opendns.com wrote: It looks like we've come up against a problem that looks identical to the one you described. How did you go about manually inserting the two child regions? You know the daughter regions

Re: We're seeing problems loading data into HBase using MR

2011-03-22 Thread Stack
What do the regionserver logs say Dirk? If you jstack one of them a few times, do you see anything -- are they hanging up on any call? Post your configs to pastebin and we'll take a look. St.Ack On Tue, Mar 22, 2011 at 10:59 AM, h hb...@patientcentral.com wrote: Hey everyone, I've got a

Re: Region server crashes when using replication

2011-03-22 Thread Jean-Daniel Cryans
First issue: UnknownHostException is unforgiving, your machines need to be able to talk to haddop2-zk3 (is that a typo?) and it seems that at least that one can't. The reason the machine dies is that we usually try to fail fast in HBase. Second issue: There's not enough information, all I see is

Re: Scan isn't processing all rows

2011-03-22 Thread Sean Sechrist
Okay, so I figured out what was going wrong: The property hbase.regionserver.lease.period was 120s on the machine I submitted the job from, but was only 60s on the RegionServer. This caused the scanner to timeout on the region server. But when the next HTable.ClientScanner.next() call got the

Re: Scan isn't processing all rows

2011-03-22 Thread Stack
Good man Sean for figuring it. Please make an issue. Lets try and fix it so no one else has the pain you just did. St.Ack On Tue, Mar 22, 2011 at 11:27 AM, Sean Sechrist ssechr...@gmail.com wrote: Okay, so I figured out what was going wrong: The property hbase.regionserver.lease.period was

Manual Region Splitting Question.

2011-03-22 Thread Vivek Krishna
I have GBs of data to be dumped to HBase. After lots of trials and reading through the mailing list, I figured out creating regions manually is a good option because all data was hitting one node initially... My approach to creating regions is as follow. - I sampled like about 1% of the

Re: Manual Region Splitting Question.

2011-03-22 Thread Jean-Daniel Cryans
It depends if you are also inserting in an ordered fashion right? Even if you have regions a through z, but you start inserting only keys with starting with a, then you'll only hit the first regions. J-D On Tue, Mar 22, 2011 at 11:46 AM, Vivek Krishna vivekris...@gmail.com wrote: I have GBs of

Re: Region server crashes when using replication

2011-03-22 Thread Eran Kutner
Thanks, J-D. As for the first issue, why does this behavior make sense? What happens when the connection between the two cluster fails? Will the region servers of the primary fail as well? or at least won't be able to start? Seems very radical. Regarding the second issue, I didn't see anything

Re: Region server crashes when using replication

2011-03-22 Thread Jean-Daniel Cryans
Inline. J-D On Tue, Mar 22, 2011 at 11:51 AM, Eran Kutner e...@gigya.com wrote: Thanks, J-D. As for the first issue, why does this behavior make sense? What happens when the connection between the two cluster fails? Will the region servers of the primary fail as well? or at least won't be

Re: Manual Region Splitting Question.

2011-03-22 Thread Vivek Krishna
For eg., lets assume I have keys in range of aa, ab,ac..zz Using the sample data I create regions like this aa-ba region 1 ca-da region 2 etc., The reason why I did not create region bb-bz because I did not encounter in the sample.q But when I encounter a key like bb, it does not fall in the

Re: Manual Region Splitting Question.

2011-03-22 Thread Jean-Daniel Cryans
bb will fall into the first region since the next start key is ca and bb is smaller than that. J-D On Tue, Mar 22, 2011 at 12:07 PM, Vivek Krishna vivekris...@gmail.com wrote: For eg., lets assume I have keys in range of aa, ab,ac..zz Using the sample data I create regions like this aa-ba

RE: JobControl and HBase MR chaining

2011-03-22 Thread Doug Meil
You need to execute two Jobs serially that use TableMapper in a thread. Can't use JobControl. -Original Message- From: Vishal Kapoor [mailto:vishal.kapoor...@gmail.com] Sent: Tuesday, March 22, 2011 1:34 PM To: user@hbase.apache.org Subject: JobControl and HBase MR chaining with

Re: Scan isn't processing all rows

2011-03-22 Thread Sean Sechrist
Okay, see the ticket here: https://issues.apache.org/jira/browse/HBASE-3686. Thanks, Sean On Tue, Mar 22, 2011 at 2:40 PM, Stack st...@duboce.net wrote: Good man Sean for figuring it. Please make an issue. Lets try and fix it so no one else has the pain you just did. St.Ack On Tue, Mar

Re: Region server crashes when using replication

2011-03-22 Thread Eran Kutner
Actually, it will probably be connection timeout, not connection refused when there is no connection between the two clusters. Is there a workaround I can implement now for HBASE-3664, can I write something in ZK so the server has an old entry to delete and is happy with it? -eran On Tue,

Re: JobControl and HBase MR chaining

2011-03-22 Thread Vishal Kapoor
sorry about the confusion on job chaining, i have few set of map reduce jobs which are interdependent. not multiple mappers for a single task. Vishal On Tue, Mar 22, 2011 at 3:16 PM, Doug Meil doug.m...@explorysmedical.com wrote: You need to execute two Jobs serially that use TableMapper in a

auto-restart regionservers

2011-03-22 Thread Geoff Hendrey
Is there a way in which regionservers can be made to automatically restart when they crash/shutdown. We see several shutdowns every day...just wondering if there is any mode in which the daemons can autorestart. BTW, I'd be happy to pursue the *cause* of the crash/shutdown, but it just seems to be

Re: Region server crashes when using replication

2011-03-22 Thread Jean-Daniel Cryans
You can apply the patch that I included there and that I also committed to the 0.90 branch. J-D On Tue, Mar 22, 2011 at 12:37 PM, Eran Kutner e...@gigya.com wrote: Actually, it will probably be connection timeout, not connection refused when there is no connection between the two clusters.

Re: auto-restart regionservers

2011-03-22 Thread Jean-Daniel Cryans
My 100 region servers run happily almost all the time, so I think you should definitely investigate! We preferred to leave the restart to your favorite cluster management tool, no need to reinvent the wheel. J-D On Tue, Mar 22, 2011 at 12:43 PM, Geoff Hendrey ghend...@decarta.com wrote: Is

Re: File formats in Hadoop

2011-03-22 Thread Weishung Chung
I found this useful article that explains the internal storage of HFile http://www.larsgeorge.com/2009/10/hbase-architecture-101-storage.html http://www.larsgeorge.com/2009/10/hbase-architecture-101-storage.html On Tue, Mar 22, 2011 at 11:31 AM, Weishung Chung weish...@gmail.com wrote: I also

Re: importing dataset, some problems and performance issues

2011-03-22 Thread Jean-Daniel Cryans
Usually the logs are pretty chatty about what's blocking them, here's one example of my going through my own logs: http://search-hadoop.com/m/fJ0vh6ojHm1 J-D On Tue, Mar 22, 2011 at 4:18 AM, Ferdy Galema ferdy.gal...@kalooga.com wrote: HBase already makes my life better, so no worries there :)

Using split command in shell

2011-03-22 Thread Vivek Krishna
The command is `split table or region row` How to find what the region row is? I tried the ones shown in node:60030 webpage. Does not work. Viv

Re: Using split command in shell

2011-03-22 Thread Jean-Daniel Cryans
Please define does not work, does it give you an error message or it just doesn't split? If latter, maybe the split was rejected, depending on the file layout. You would find such message in the region server log. J-D On Tue, Mar 22, 2011 at 2:00 PM, Vivek Krishna vivekris...@gmail.com wrote:

Re: auto-restart regionservers

2011-03-22 Thread Stack
Geoff: Tell us more. Hardware, sizings, configs., versions, type of failures. How many TBs? How many regions? Thanks, St.Ack On Tue, Mar 22, 2011 at 12:43 PM, Geoff Hendrey ghend...@decarta.com wrote: Is there a way in which regionservers can be made to automatically restart when they

Re: Cleanup partitions_ files from HFileOutputFormat

2011-03-22 Thread Stack
You should be able to clean them Michael. Make sure their content have been moved into place under the hbase.rootdir first though. Thereafter they should not be referenced. We're done with them. St.Ack On Tue, Mar 22, 2011 at 2:18 PM, Michael Naseef mtnaseef...@gmail.com wrote: Hi all, is

Re: Using split command in shell

2011-03-22 Thread Vivek Krishna
when I type `split region_name`, I don't know what to use as region name? I tried entering the region name as seen in nodeName:60030 webpage, it does not split the region. It outputs it completed in 1.234 seconds, but nothing happens. What ever I type as region_name, it outputs completed

Re: Using split command in shell

2011-03-22 Thread Stack
On Tue, Mar 22, 2011 at 3:11 PM, Vivek Krishna vivekris...@gmail.com wrote: `split region_name`,     I don't know what to use as region name? hbase(main):001:0 help 'split' Split table or pass a region row to split individual region  I tried entering the region name as seen in nodeName:60030

Re: Using split command in shell

2011-03-22 Thread Vivek Krishna
When I entered the row key, it did nothing. Even the logs weren't helpful. When I typed arbitrary asdf as region row it started splitting. To clarify, my question was what is defined as a region row, is it the first key of the region? the last key? any key in the region? or a combo of table name

Re: File formats in Hadoop

2011-03-22 Thread Ryan Rawson
Curious, why do you mention SequenceFile and TFile. Neither of those are either in the hbase.io, and TFile is not used anywhere in HBase. -ryan On Sat, Mar 19, 2011 at 9:01 AM, Weishung Chung weish...@gmail.com wrote: I am browsing through the hadoop.io package and was wondering what other

Re: File formats in Hadoop

2011-03-22 Thread Weishung Chung
They are used in hadoop org.apache.hadoop.io.SequenceFile org.apache.hadoop.io.file.tfile.TFile On Tue, Mar 22, 2011 at 10:06 PM, Ryan Rawson ryano...@gmail.com wrote: Curious, why do you mention SequenceFile and TFile. Neither of those are either in the hbase.io, and TFile is not used