Hi, Thanks Steven, and George for sharing your experience.
George, I understand that you have quite a lot of data, but have you tried to count them all ? As this matter occurs when selecting all columns, selection on a subset of them is correct. > > > > > > > > In my Job, I used Drill to query HBase all the time and the HBase contains billions of rows and over 20 columns, I never noticed any count errors. On Sat, Jul 11, 2015 at 3:57 AM, Steven Phillips <[email protected]> wrote: > This looks like a bug to me. You should file a jira, and include as > much information as possible for us to reproduce the issue. > > On Thu, Jul 9, 2015 at 8:03 PM, Carrot Hu -CIC <[email protected]> > wrote: > > > Hi, > > > > > > > > I created a test table in HBASE with 2 column families [‘cf0’, > > ‘cf1’] and > > 3 columns for each, all same value, 100,000 rows in total. > > > > > > > > SELECT COUNT(*) will not return the correct row count (much less, a > > few thousands). > > > > However, doing SELECT on a subset of those columns gives the right count. > > > > > > > > If I reduced the column number to 2 in this case, SELECT COUNT(*) > > gives the right count. > > > > > > > > Later I tried tables with only one column family, drill returns the > > right count only when column number < 6. > > > > > > > > What could be the explanations of this matter? > > > > Have I missed any drill configurations? > > > > > > > > *Carrot Hu, > > > > *R&D Engineer* > > > > > > > > > > > > > > > > -- > Steven Phillips > Software Engineer > > mapr.com >
