Very similar. We setup a column range (we're using ColumnRangeFilter right now), and we want the first column in the range.
The problem we have a lot of rows. If there's no such capability, then we need to control the parallelism ourselves. Shall I sort the rows first before scanning? Will a random order be more efficient if we have many servers? Jianshi On Thu, Aug 28, 2014 at 1:44 AM, Ted Yu <[email protected]> wrote: > So you want to specify several columns. e.g. c2, c3, and c4, the GET is > supposed to return the first one of them (doesn't have to be c2, can be c3 > if c2 is absent) ? > > To my knowledge there is no such capability now. > > Cheers > > > On Wed, Aug 27, 2014 at 10:28 AM, Jianshi Huang <[email protected]> > wrote: > > > On Thu, Aug 28, 2014 at 1:20 AM, Jianshi Huang <[email protected]> > > wrote: > > > > > > > > There's a special but common case that for each row we only need the > > first > > > column. Is there a better way to do this than multiple scans + take(1)? > > > > > > > We still need to set a column range, is there a way to get the first > column > > value of a range using GET? > > > > > > -- > > Jianshi Huang > > > > LinkedIn: jianshi > > Twitter: @jshuang > > Github & Blog: http://huangjs.github.com/ > > > -- Jianshi Huang LinkedIn: jianshi Twitter: @jshuang Github & Blog: http://huangjs.github.com/
