hbase table creation

2011-05-01 Thread Priya A
I'm new to HBase. I get some errors, as follows, while creating an HBase table and trying to display it: hbase(main):010:0> create 'test' 11/05/01 12:40:47 DEBUG zookeeper.ZooKeeperWrapper: Read ZNode /hbase/root-region-server got 192.168.1.72:60020 11/05/01 12:40:47 DEBUG

RE: HFileOutputFormat: writing to multiple columns for each row (not multiple column families)

2011-05-01 Thread Panayotis Antonopoulos
It seems that I was not sorting the KeyValues properly as I was not using the KeyValueSortReducer that comes with HBase. From: antonopoulos...@hotmail.com To: user@hbase.apache.org Subject: HFileOutputFormat: writing to multiple columns for each row (not multiple column families) Date:
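The fix described above comes down to emitting each row's cells in sorted order before they reach HFileOutputFormat. The following is a minimal plain-Java sketch of that idea only; it does not use the HBase API. The `Cell` class is a hypothetical stand-in for an HBase KeyValue, and comparing qualifiers as unsigned bytes is an assumption about the ordering that matters here (a real KeyValue ordering also involves row, family, timestamp and type).

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Comparator;
import java.util.List;
import java.util.TreeSet;

public class SortedCellsSketch {
    /** Hypothetical stand-in for an HBase KeyValue: one column of one row. */
    static final class Cell {
        final byte[] qualifier;
        final String value;

        Cell(String qualifier, String value) {
            this.qualifier = qualifier.getBytes(StandardCharsets.UTF_8);
            this.value = value;
        }

        String qualifierString() {
            return new String(qualifier, StandardCharsets.UTF_8);
        }
    }

    // Assumption: cells of one row are ordered by qualifier, compared as unsigned bytes.
    static final Comparator<Cell> BY_QUALIFIER =
            (a, b) -> Arrays.compareUnsigned(a.qualifier, b.qualifier);

    /** Collects the cells of a single row into a sorted set and returns the qualifiers in order. */
    static List<String> sortedQualifiers(List<Cell> cellsForOneRow) {
        // A TreeSet keeps its elements sorted; cells with equal qualifiers collapse to one.
        TreeSet<Cell> sorted = new TreeSet<>(BY_QUALIFIER);
        sorted.addAll(cellsForOneRow);
        List<String> out = new ArrayList<>();
        for (Cell c : sorted) {
            out.add(c.qualifierString());
        }
        return out;
    }

    public static void main(String[] args) {
        List<Cell> row = Arrays.asList(
                new Cell("name", "x"), new Cell("age", "1"), new Cell("city", "y"));
        System.out.println(sortedQualifiers(row)); // prints [age, city, name]
    }
}
```

In a real job the reducer that ships with HBase would do this sorting, so a sketch like this is only useful for seeing why unsorted multi-column output fails.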

Row count without iterating over ResultScanner?

2011-05-01 Thread Wojciech Langiewicz
Hi, I would like to know if there's a way to quickly count the number of rows in a scan result. Right now I'm iterating over the ResultScanner like this: int count = 0; for (Result rr = scanner.next(); rr != null; rr = scanner.next()) { ++count; } But with number of rows reaching millions this

Re: Row count without iterating over ResultScanner?

2011-05-01 Thread Himanshu Vashishtha
If you are interested in the row count only (and do not want to fetch the table rows to your client side), you can also try out https://issues.apache.org/jira/browse/HBASE-1512. PS: Which version are you on? The above patch is in main trunk as of now, so to use it you would have to checkout the code and

Re: Row count without iterating over ResultScanner?

2011-05-01 Thread Wojciech Langiewicz
Yes, I was using the default caching; setting this value to a few thousand made a significant difference in performance. I'll experiment more with this option. Right now I want to stay away from MR, mainly because of cluster warm-up time, and I want to get results in almost real time (a few seconds max).
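Why raising scanner caching helps so much can be seen with some simple arithmetic. The sketch below is a rough plain-Java model, not HBase code: it assumes each round trip to a region server returns at most `caching` rows and ignores region boundaries and the final call that detects the end of the scan. The row count of 5,000,000 and the class and method names are made up for illustration.

```java
public class ScannerCachingModel {
    /**
     * Rough model (an assumption, not the HBase implementation): each round trip
     * returns up to `caching` rows, so the count of round trips is ceil(rows / caching).
     */
    static long roundTrips(long rows, int caching) {
        if (rows < 0 || caching <= 0) {
            throw new IllegalArgumentException("rows must be >= 0 and caching must be > 0");
        }
        return (rows + caching - 1) / caching;
    }

    public static void main(String[] args) {
        long rows = 5_000_000L; // hypothetical table size
        for (int caching : new int[] {1, 100, 1000, 5000}) {
            System.out.printf("caching=%d -> %d round trips%n",
                    caching, roundTrips(rows, caching));
        }
    }
}
```

Under this model, going from one row per round trip to a few thousand cuts the number of round trips by the same factor, which matches the speed-up reported above. Larger caching values also mean more client memory per fetch, so the best value is something to measure rather than assume.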

Re: Row count without iterating over ResultScanner?

2011-05-01 Thread Himanshu Vashishtha
Yes, you can define your scan object on the client side and pass it to AggregationClient.rowCount. You can refer to the AggregationClient javadoc and the associated TestAggregateProtocol test methods to get an idea. Thanks, Himanshu On Sun, May 1, 2011 at 12:29 PM, Wojciech Langiewicz

RE: Row count without iterating over ResultScanner?

2011-05-01 Thread Doug Meil
Another thing: be careful about which CFs/attributes you have in the Scan. If you add a column family (scan.addFamily), it will pull *all* the attributes of that column family. If you only care about a row count, pick only one very small attribute from the row. -Original Message-

one of our datanodes stops working after few hours

2011-05-01 Thread Jack Levin
I took a jstack (http://pastebin.com/5v6mHg3t). After a few hours, it literally staggers to a halt and gets very, very slow... Any ideas what it's blocking on? (The main issue is that fsreads for the RS get really slow when that happens.) -Jack

RE: one of our datanodes stops working after few hours

2011-05-01 Thread Michael Segel
What's your xceivers set to? What's the ulimit -n set for hdfs/hadoop user... (You didn't say which release/version you were using.) Date: Sun, 1 May 2011 17:47:18 -0700 Subject: one of our datanodes stops working after few hours From: magn...@gmail.com To: user@hbase.apache.org I

Re: one of our datanodes stops working after few hours

2011-05-01 Thread Jack Levin
Version: 0.20.2+320 hdfs, .89 HBASE. ulimit is 32k, xcievers is 5k. Note from the jstack, I am not exceeding xcievers. -Jack On Sun, May 1, 2011 at 6:19 PM, Michael Segel michael_se...@hotmail.com wrote: What's your xceivers set to? What's the ulimit -n set for hdfs/hadoop user...

Re: Row count without iterating over ResultScanner?

2011-05-01 Thread Michel Segel
Hi, There's a row counter app in the hbase release that's a m/r job. You could also do a dynamic counter too. Sent from a remote device. Please excuse any typos... Mike Segel On May 1, 2011, at 8:44 AM, Wojciech Langiewicz wlangiew...@gmail.com wrote: Hi, I would like to know if there's a

Re: hbase test library

2011-05-01 Thread Patrick Angeles
I still think static mocks are easier to work with (and read), but yes, in their absence Mockito and friends make a huge difference. I'm okay with using mocking tools here or rolling my own static mocks for HTableInterface, etc. But yes, I'm thinking more of a 'fake' in-process and in-memory

Re: hbase table creation

2011-05-01 Thread sulabh choudhury
You have to define at least one column family name when creating a table: hbase(main):010:0> create 'test', {NAME => 'columnFamily'} On Sun, May 1, 2011 at 12:40 AM, Priya A priya8...@gmail.com wrote: I'm new to HBase. I get some errors, as follows, while creating an HBase table and