Hi, Ted We have a cluster of 48 machines and at least 100T data(which is still increasing). The problem is that we have a lot of row keys (about tens of thousands ) to query in the meantime and we don't fetch all the data at once, instead we fetch them when needed, so we may hold tens of thousands ResultScanner in the meantime. I want to know whether it will hurt the performance and network resources and if so, is there any way to solve it?
Best regards, Kevin. -----Original Message----- From: Ted Yu [mailto:[email protected]] Sent: 2014年8月26日 16:49 To: [email protected] Cc: [email protected]; Huang, Jianshi Subject: Re: ResultScanner performance Can you give a bit more detail ? What size is the cluster / dataset ? What problem are you solving ? Would using coprocessor help reduce the usage of ResultScanner ? Cheers On Aug 26, 2014, at 12:13 AM, "Dai, Kevin" <[email protected]> wrote: > Hi, everyone > > My application will hold tens of thousands of ResultScanner to get Data. Will > it hurt the performance and network resources? > If so, is there any way to solve it? > Thanks, > Kevin.
