I hit a very similar issue recently. Via HBase shell, i was able to fetch all records, whereas I was only able to see a small subset of records when queried from Drill. Each time I inserted 1000 records, only about 50 of those would show up.
Although I could repro' the problem consistently, it was resolved once i updated my Hadoop setup. My guess is that it was a HBase bug which got resolved. Although strange as it seems, it might not have to do with Drill itself. -Abhishek On Tue, Jan 12, 2016 at 7:52 AM, Jason Altekruse <[email protected]> wrote: > I'm not sure why this is happening, we have tests in our automated suite > that I believe run some pretty large queries against Hbase and verify the > results. > > Aditya, do you have some time available to try to reproduce this and > diagnose the problem? > > On Wed, Jan 6, 2016 at 2:03 PM, Kumiko Yada <[email protected]> wrote: > > > I'm having the same issue. Is there any workaround for this? > > > > Thanks > > Kumiko > > > > -----Original Message----- > > From: Kevin Verhoeven [mailto:[email protected]] > > Sent: Monday, December 21, 2015 10:37 AM > > To: [email protected] > > Subject: Drill query does not return all results from HBase > > > > We have a problem where a Drill query against HBase does not return all > > results. The following query should return over 100,000 rows, but we only > > get about 1,030 back. > > > > SELECT row_key FROM `hbase`.`customer_staged` WHERE customer_number = 800 > > > > If we scan directly using the hbase shell we see over 100,000 rows, but > > the same Drill query does not return a fraction of the expected results. > We > > have also run a count against the table and Drill returns the same 1,030 > > number, which is far less than expect. What could be going wrong? > > > > We are running Drill 1.2 on Ubuntu 14.04 against CDH 5.4.3 (HBase 1.0). > We > > run HBase on six RegionServers, the table has about 1.3 billion rows. > > > > Thanks, > > > > Kevin > > > > >
