Re: HBase load distribution vs. scan efficiency

2014-01-20 Thread Amit Sela
The number of scans depends on the number of regions a day's data uses. You need to manage compaction and splitting manually. If a days data is 100MB and you want regions to be no more than 200MB than it's two regions to scan per day, if it's 1GB than 10 etc. Compression will help you maximize

Re: Region server logs not getting updated..

2014-01-20 Thread Sandeep B A
Hi Ted, Sorry for the late reply. The log location which was listed using ps command, I checked. Thing is after some time, looks like logs not getting updated... In the logs also not much information why region server stopped updating logs.. How do I make region server to start updating logs...?

Re: Unable to find region for hello_world,,99999999999999 after 10 tries.

2014-01-20 Thread Fernando Iwamoto - Plannej
anybody can explain the causes of unable to find region after 10 tries?. For example, one of the cause is that table is not create. So i can troubleshoot my problem. 2014/1/16 Fernando Iwamoto - Plannej fernando.iwam...@plannej.com.br I have fixed the e-mail sent and RegionServers logs

Re: Questions about HBase load balancing and HFile

2014-01-20 Thread Bill Q
Hi Ted and Bharath, Thanks a lot for the replies. For question #1, if there is a RS is under heavy load by serving to hot regions, the HMaster will move one of the two regions to another RS, or HMaster will split both of them and move the newly crated halves to other RSs? For question #3, does

Re: Questions about HBase load balancing and HFile

2014-01-20 Thread Ted Yu
bq. under heavy load by serving to hot regions Did you mean 'two hot regions' ? If so, the master will move one of them to another RS. Cheers On Mon, Jan 20, 2014 at 6:17 AM, Bill Q bill.q@gmail.com wrote: Hi Ted and Bharath, Thanks a lot for the replies. For question #1, if there is

Re: Unable to find region for hello_world,,99999999999999 after 10 tries.

2014-01-20 Thread Fernando Iwamoto - Plannej
I found this debug, this might the problem? Whenever i created a table, it closes the region server, any thoughts? 2014-01-20 16:03:02,118 INFO [RpcServer.handler=12,port=16000] master.HMaster: Client=ubuntu//50.50.1.4 create 'hello_world', {NAME = 'test', DATA_BLOCK_ENCODING = 'NONE',

Re: Unable to find region for hello_world,,99999999999999 after 10 tries.

2014-01-20 Thread Ted Yu
bq. 2014-01-20 16:03:02,216 INFO [RegionOpenAndInitThread-hello_world-1] regionserver.HRegion: Closed region was closed, not the region server. Cheers On Mon, Jan 20, 2014 at 8:14 AM, Fernando Iwamoto - Plannej fernando.iwam...@plannej.com.br wrote: I found this debug, this might the

Re: Region server logs not getting updated..

2014-01-20 Thread Ted Yu
Is this region server still serving requests ? You can scan one of the regions hosted by this server, or issue compaction request to such region to see if there is new log showing up. Cheers On Mon, Jan 20, 2014 at 2:08 AM, Sandeep B A sandytheb...@gmail.com wrote: Hi Ted, Sorry for the late

Issue connectiong to HBase using Pig's HBaseStorage: Unable to find region for my_table

2014-01-20 Thread Russell Jurney
I'm having trouble connecting to HBase from Pig's HBaseStoragehttp://pig.apache.org/docs/r0.12.0/api/org/apache/pig/backend/hadoop/hbase/HBaseStorage.htmlcommand. Any help would be appreciated. I'm running this command: time_series = LOAD '/tmp/time_series.txt' AS (date_time:chararray,

Re: Issue connectiong to HBase using Pig's HBaseStorage: Unable to find region for my_table

2014-01-20 Thread Fernando Iwamoto - Plannej
What kind of HBase and Pig version are you using? I'm having the same problem as you. 2014/1/20 Russell Jurney russell.jur...@gmail.com I'm having trouble connecting to HBase from Pig's HBaseStorage http://pig.apache.org/docs/r0.12.0/api/org/apache/pig/backend/hadoop/hbase/HBaseStorage.html

Re: Questions about HBase load balancing and HFile

2014-01-20 Thread Ted Yu
For question #4, see also http://hbase.apache.org/book.html#regions.arch.locality Cheers On Sun, Jan 19, 2014 at 10:49 PM, Bharath Vissapragada bhara...@cloudera.com wrote: For question #3, The block size Lars talks about is the blocksize inside a HFile which is different from HDFS block

Re: Issue connectiong to HBase using Pig's HBaseStorage: Unable to find region for my_table

2014-01-20 Thread Russell Jurney
I'm on CDH 4.4, so it is HBase 0.94.6. On Mon, Jan 20, 2014 at 9:31 AM, Fernando Iwamoto - Plannej fernando.iwam...@plannej.com.br wrote: What kind of HBase and Pig version are you using? I'm having the same problem as you. 2014/1/20 Russell Jurney russell.jur...@gmail.com I'm having

Re: Issue connectiong to HBase using Pig's HBaseStorage: Unable to find region for my_table

2014-01-20 Thread Fernando Iwamoto - Plannej
My issue is the same with Russel, and as for me, yes. scan 'my_table' and ui shows my table(on 60010) 2014/1/20 Stack st...@duboce.net Does the region/table exist on your cluster? Can you see it in the UI and scan it from shell? St.Ack On Mon, Jan 20, 2014 at 10:12 AM, Russell Jurney

Re: Issue connectiong to HBase using Pig's HBaseStorage: Unable to find region for my_table

2014-01-20 Thread Russell Jurney
Yes, the table exists on the cluster. Yes, I can see it in hbase shell and can scan it. On Mon, Jan 20, 2014 at 1:33 PM, Stack st...@duboce.net wrote: Does the region/table exist on your cluster? Can you see it in the UI and scan it from shell? St.Ack On Mon, Jan 20, 2014 at 10:12 AM,

HBase Thrift2 does not support createTable() API anymore?

2014-01-20 Thread Tianying Chang
Hi, It seems some API that is supported by Thrift are not in the Thrift2 anymore, e.g. createTable, deleteTable, getTableRegions, and so on? Basically, these are those API that supported by Thrift2. How can I createTable through Thrift2? Am I missing something here? Thanks Tian-Ying print

Re: HBase Thrift2 does not support createTable() API anymore?

2014-01-20 Thread Ted Yu
I assume you were referring to 0.96+ The 3 methods you mentioned belong to HBaseAdmin Please take a look at hbase-thrift/src/test/java/org/apache/hadoop/hbase/thrift2/TestThriftHBaseServiceHandler.java where you can find examples for table creation, etc. Cheers On Mon, Jan 20, 2014 at 4:56

Re: HBase load distribution vs. scan efficiency

2014-01-20 Thread James Taylor
Hi William, Phoenix uses this bucket mod solution as well ( http://phoenix.incubator.apache.org/salted.html). For the scan, you have to run it in every possible bucket. You can still do a range scan, you just have to prepend the bucket number to the start/stop key of each scan you do, and then you

Re: HBase Thrift2 does not support createTable() API anymore?

2014-01-20 Thread Tianying Chang
Thanks Ted. I am trying to use Python to access the HBase Thrift2 server. Is HBaseAdmin exposed in Hbase.thrift? I could not find it in HBase.thrift? Thanks Tian-Ying On Mon, Jan 20, 2014 at 5:14 PM, Ted Yu yuzhih...@gmail.com wrote: I assume you were referring to 0.96+ The 3 methods you

Finding the latest updated rows

2014-01-20 Thread William Kang
Hi, In HBase, the time stamp is set for each column, not for the entire row. If somehow I want to find the latest updated (put new row, or update only certain columns in some rows, etc) rows, is there an efficient way to do it? Many thanks. William

Re: Region server logs not getting updated..

2014-01-20 Thread Sandeep B A
Hi Ted, Yes, region servers are up and running. I scan I get the rows but nothing gets updated in the logs. I issued major compaction and nothing gets updated in the logs. I have also tried to give new location to the logs directory but there also not updating. I verified all the permission are

Re: Region server logs not getting updated..

2014-01-20 Thread ramkrishna vasudevan
What is your log level? Check that. So what is the log that you are currently seeing in the logs? Regards Ram On Tue, Jan 21, 2014 at 9:40 AM, Sandeep B A sandytheb...@gmail.com wrote: Hi Ted, Yes, region servers are up and running. I scan I get the rows but nothing gets updated in the

Re: Region server logs not getting updated..

2014-01-20 Thread Bharath Vissapragada
Hi, You can check if you have any overrides for log4j from CM (search for log4j regionserver in hbase configs from CM UI). Also check the logging threshold as Ramkrishna suggested. - Bharath On Tue, Jan 21, 2014 at 9:40 AM, Sandeep B A sandytheb...@gmail.com wrote: Hi Ted, Yes, region

Re: HBase load distribution vs. scan efficiency

2014-01-20 Thread James Taylor
The salt byte is a stable hash of the rest of the row key. The system has to remember the total number of buckets, as that's what's used to mod the hash value with. Adding new regions/regions servers is fine, as it's orthogonal to the bucket number, though typically the cluster size determines the

Re: HBase Thrift2 does not support createTable() API anymore?

2014-01-20 Thread Tianying Chang
Ram, thanks for your confirmation! Probably very few people are using Thrift2? Is that Thrift2 not being supported at production level? I have seen people using finagle thrift server more often than the native HBase Thrift server. Thanks Tian-Ying On Mon, Jan 20, 2014 at 7:53 PM, ramkrishna

Fw: [ANNOUNCE] HBase 0.94.16 is available for download

2014-01-20 Thread lars hofhansl
- Forwarded Message - From: lars hofhansl la...@apache.org To: hbase-dev d...@hbase.apache.org Sent: Monday, January 20, 2014 5:30 PM Subject: [ANNOUNCE] HBase 0.94.16 is available for download The HBase Team is pleased to announce the immediate release of HBase 0.94.16. Download it

Re: Region server logs not getting updated..

2014-01-20 Thread Sandeep B A
Hi, @ramkrishna: Log level currently set is INFO. When it was last updated, i see INFO level logs. @Bharath: From cloudera manager there was no overriden logging. While trying on coprocessors, we added few jars, when i removed those, region server logs started getting updated. There was a

Re: Region server logs not getting updated..

2014-01-20 Thread ramkrishna vasudevan
Now the issue is resolved. Nice.. and Thank you for the update. On Tue, Jan 21, 2014 at 12:06 PM, Sandeep B A sandytheb...@gmail.comwrote: Hi, @ramkrishna: Log level currently set is INFO. When it was last updated, i see INFO level logs. @Bharath: From cloudera manager there was no

cannot start hregionserver

2014-01-20 Thread LEI Xiaofeng
Hi, I added a new node to my HBase cluster. The PC is the same as the old ones. It has 24G memory. When I tried to start regionserver on that node, I got error below: Could not create the Java virtual machine. Exception in thread main java.lang.NoClassDefFoundError:

Re: cannot start hregionserver

2014-01-20 Thread haosdent
Use free cmd to get more details about you available memory. On Tue, Jan 21, 2014 at 2:44 PM, LEI Xiaofeng le...@ihep.ac.cn wrote: Hi, I added a new node to my HBase cluster. The PC is the same as the old ones. It has 24G memory. When I tried to start regionserver on that node, I got error

Re: Finding the latest updated rows

2014-01-20 Thread Joshi, Rekha
Hi Wiliam, The timestamp part of rowkey schema design caters to this., usually efficient but your SLA may differ. http://hbase.apache.org/book.html#reverse.timestamp http://hbase.apache.org/book.html#schema.casestudies http://hbase.apache.org/book.html#timeseries Thanks Rekha On 21/01/14