Hi,

Thanks, Steven and George, for sharing your experience.

George, I understand that you have quite a lot of data, but have you tried
counting all of the rows?

The problem occurs when selecting all columns; selecting a subset of them
returns the correct count.
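
For anyone who wants to try reproducing this, here is a minimal sketch that generates HBase shell commands for a table shaped like the one in the original report (2 column families, 3 columns each, 100,000 rows, all the same value), plus the two Drill queries to compare. The table name `drill_count_test`, the column names `c0`..`c2`, the value `v`, and the storage plugin name `hbase` are my assumptions, not details of the original setup.

```python
# Sketch only: table name, column names, cell value and the Drill storage
# plugin name ("hbase") are illustrative assumptions.

def hbase_shell_commands(table, families, cols_per_family, rows, value="v"):
    """Yield HBase shell lines that create `table` and fill every cell with `value`."""
    quoted_families = ", ".join(f"'{cf}'" for cf in families)
    yield f"create '{table}', {quoted_families}"
    for r in range(rows):
        row_key = f"row{r:06d}"
        for cf in families:
            for c in range(cols_per_family):
                yield f"put '{table}', '{row_key}', '{cf}:c{c}', '{value}'"


def drill_queries(table, family, column, plugin="hbase"):
    """Return two Drill queries: a count over all columns and a count over one column."""
    count_all = f"SELECT COUNT(*) FROM {plugin}.`{table}`"
    count_one = f"SELECT COUNT(t.{family}.{column}) FROM {plugin}.`{table}` t"
    return count_all, count_one


if __name__ == "__main__":
    # Print the shell commands first, then the queries as shell comments.
    for line in hbase_shell_commands("drill_count_test", ["cf0", "cf1"], 3, 100000):
        print(line)
    for query in drill_queries("drill_count_test", "cf0", "c0"):
        print(f"# {query};")
```

The output can be saved to a file and fed to the HBase shell. Since every row has a value in every column, both queries should return 100,000 if the counts are right.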



In my job, I used Drill to query HBase all the time. The tables contain
billions of rows and over 20 columns, and I never noticed any count errors.

On Sat, Jul 11, 2015 at 3:57 AM, Steven Phillips <[email protected]>
wrote:

> This looks like a bug to me. You should file a JIRA and include as
> much information as possible so that we can reproduce the issue.
>
> On Thu, Jul 9, 2015 at 8:03 PM, Carrot Hu -CIC <[email protected]>
> wrote:
>
> > Hi,
> >
> >
> >
> > I created a test table in HBase with 2 column families [‘cf0’,
> > ‘cf1’] and 3 columns in each, all with the same value, 100,000 rows
> > in total.
> >
> >
> >
> > SELECT COUNT(*) does not return the correct row count (it is much
> > lower, only a few thousand).
> >
> > However, doing SELECT on a subset of those columns gives the right count.
> >
> >
> >
> > If I reduce the number of columns to 2 in this case, SELECT COUNT(*)
> > gives the right count.
> >
> >
> >
> > Later I tried tables with only one column family; Drill returns the
> > right count only when the number of columns is < 6.
> >
> >
> >
> > What could explain this?
> >
> > Have I missed any Drill configuration?
> >
> >
> >
> > *Carrot Hu*,
> >
> > *R&D Engineer*
> >
>
>
>
> --
>  Steven Phillips
>  Software Engineer
>
>  mapr.com
>

