Re: Re: Any fast way to random access hbase data?

2014-08-13 Thread leiwang...@gmail.com
Sorry, the real region server config is this: hfile.block.cache.size 0.25 hbase-site.xml leiwang...@gmail.com From: Esteban Gutierrez Date: 2014-08-14 01:05 To: user@hbase.apache.org Subject: Re: Re: Any fast way to random access hbase data? Hi Lei, Any chance for you to provide the

Re: Re: Any fast way to random access hbase data?

2014-08-13 Thread lars hofhansl
From: "leiwang...@gmail.com" To: user Sent: Wednesday, August 13, 2014 2:23 AM Subject: Re: Re: Any fast way to random access hbase data? Haven't tried yet. Only one thread. 10 region servers, 2555 regions in total. I am just new to HBase and not s

Re: Re: Any fast way to random access hbase data?

2014-08-13 Thread Esteban Gutierrez
> > 'ROWCOL', REPLICATION_SCOPE => '0', VERSIONS => '1', COMPRESSION => 'LZ4', MIN_VERSIONS => '0', TTL => '2147483647', KEEP_DELETED_CELLS => 'false', BLOCKSIZE => '65

Re: Re: Any fast way to random access hbase data?

2014-08-13 Thread Jean-Marc Spaggiari
> 'NONE', BLOOMFILTER => 'ROWCOL', REPLICATION_SCOPE => '0', VERSIONS => '1', COMPRESSION => 'LZ4', MIN_VERSIONS => '0', TTL => '2147483647', KEEP_DELETED_CELLS => 'false', BLOCKSIZ

Re: Re: Any fast way to random access hbase data?

2014-08-13 Thread leiwang...@gmail.com
COMPRESSION => 'LZ4', MIN_VERSIONS => '0', TTL => '2147483647', KEEP_DELETED_CELLS => 'false', BLOCKSIZE => '65536', IN_MEMORY => 'false', ENCODE_ON_DISK => 'true', BLOCKCACHE => 'true'}]}

Re: Any fast way to random access hbase data?

2014-08-13 Thread Esteban Gutierrez
Hello Lei, Have you tried a larger batch size? How many threads or tasks are you using to fetch data? Could you please describe your HBase cluster in a little more detail, e.g. how many region servers and how many regions per RS? What's the hit ratio of the block cache? Any chance for you to share the table
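
The two knobs raised in this reply, batch size and number of fetching threads, can be illustrated with a minimal sketch. It is not code from the thread: `ParallelBatchFetcher`, `partition`, `fetchAll` and the `fetchBatch` callback are hypothetical names, and `fetchBatch` merely stands in for whatever batched lookup the client actually performs against the table.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.Function;

/** Hypothetical helper: splits row ids into batches and fetches the batches on several threads. */
final class ParallelBatchFetcher {

    /** Splits ids into consecutive chunks of at most batchSize elements. */
    static <T> List<List<T>> partition(List<T> ids, int batchSize) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        List<List<T>> batches = new ArrayList<>();
        for (int start = 0; start < ids.size(); start += batchSize) {
            int end = Math.min(start + batchSize, ids.size());
            batches.add(new ArrayList<>(ids.subList(start, end)));
        }
        return batches;
    }

    /**
     * Applies fetchBatch to every batch using a fixed pool of `threads` workers
     * and returns the results in the same order as the input ids.
     * fetchBatch is a placeholder (assumption) for the real batched lookup.
     */
    static <T, R> List<R> fetchAll(List<T> ids, int batchSize, int threads,
                                   Function<List<T>, List<R>> fetchBatch) {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<List<R>>> futures = new ArrayList<>();
            for (List<T> batch : partition(ids, batchSize)) {
                Callable<List<R>> task = () -> fetchBatch.apply(batch);
                futures.add(pool.submit(task));
            }
            List<R> results = new ArrayList<>();
            for (Future<List<R>> future : futures) {
                results.addAll(future.get()); // blocks until that batch is done
            }
            return results;
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException("interrupted while fetching", e);
        } catch (ExecutionException e) {
            throw new IllegalStateException("a batch lookup failed", e);
        } finally {
            pool.shutdown();
        }
    }
}
```

With a single thread, as reported elsewhere in the thread, every batch waits for the previous one; a pool like this lets batches that land on different region servers proceed concurrently. Suitable values for `batchSize` and `threads` depend on the cluster and would need to be measured.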

Any fast way to random access hbase data?

2014-08-13 Thread leiwang...@gmail.com
I have an HBase table with more than 2G rows. Every hour, 5M~10M row ids arrive and I must get all the row info from the HBase table. But even when I use the batch call (1000 row ids as a list) as described here http://stackoverflow.com/questions/13310434/hbase-api-get-data-rows-information-by-l
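
To put the numbers from this question in perspective, here is a small back-of-the-envelope sketch. The class and method names are hypothetical, and it assumes the lookups for one hour's ids must finish within that hour, which the post implies but does not state outright.

```java
/** Hypothetical helper for rough sizing of the hourly lookup workload. */
final class LookupBudget {

    /** Number of batch calls needed for rowIds ids at batchSize ids per call (rounded up). */
    static long batchCalls(long rowIds, int batchSize) {
        return (rowIds + batchSize - 1) / batchSize;
    }

    /** Minimum sustained rows per second to finish rowIds lookups within windowSeconds. */
    static double minRowsPerSecond(long rowIds, long windowSeconds) {
        return (double) rowIds / windowSeconds;
    }

    public static void main(String[] args) {
        long[] hourlyLoads = {5_000_000L, 10_000_000L}; // 5M~10M ids per hour, from the post
        for (long load : hourlyLoads) {
            System.out.printf("%d ids: %d batch calls of 1000, at least %.0f rows/s%n",
                    load, batchCalls(load, 1000), minRowsPerSecond(load, 3600));
        }
    }
}
```

At 1000 ids per call this comes to 5,000 to 10,000 batch calls per hour, or roughly 1,400 to 2,800 rows per second sustained, which is the rate any combination of batch size and thread count has to reach.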