RE: ResultScanner performance

Dai, Kevin Tue, 26 Aug 2014 20:19:55 -0700

Hi, Ted

We have a cluster of 48 machines and at least 100T data(which is still 
increasing).
The problem is that we have a lot of row keys (about tens of thousands ) to 
query in the meantime and we don't fetch all the data at once, instead we fetch 
them when needed,
so we may hold tens of thousands ResultScanner in the meantime.
I want to know whether it will hurt the performance and network resources and 
if so, is there any way to solve it?

Best regards,
Kevin.
-----Original Message-----
From: Ted Yu [mailto:[email protected]] 
Sent: 2014年8月26日 16:49
To: [email protected]
Cc: [email protected]; Huang, Jianshi
Subject: Re: ResultScanner performance

Can you give a bit more detail ?
What size is the cluster / dataset ?
What problem are you solving ?
Would using coprocessor help reduce the usage of ResultScanner ?

Cheers

On Aug 26, 2014, at 12:13 AM, "Dai, Kevin" <[email protected]> wrote:

> Hi, everyone
> 
> My application will hold tens of thousands of ResultScanner to get Data. Will 
> it hurt the performance and network resources?
> If so, is there any way to solve it?
> Thanks,
> Kevin.

RE: ResultScanner performance

Reply via email to