I hit a very similar issue recently. Via HBase shell, i was able to fetch
all records, whereas I was only able to see a small subset of records when
queried from Drill. Each time I inserted 1000 records, only about 50 of
those would show up.

Although I could repro' the problem consistently, it was resolved once i
updated my Hadoop setup. My guess is that it was a HBase bug which got
resolved. Although strange as it seems, it might not have to do with Drill
itself.

-Abhishek

On Tue, Jan 12, 2016 at 7:52 AM, Jason Altekruse <[email protected]>
wrote:

> I'm not sure why this is happening, we have tests in our automated suite
> that I believe run some pretty large queries against Hbase and verify the
> results.
>
> Aditya, do you have some time available to try to reproduce this and
> diagnose the problem?
>
> On Wed, Jan 6, 2016 at 2:03 PM, Kumiko Yada <[email protected]> wrote:
>
> > I'm having the same issue.  Is there any workaround for this?
> >
> > Thanks
> > Kumiko
> >
> > -----Original Message-----
> > From: Kevin Verhoeven [mailto:[email protected]]
> > Sent: Monday, December 21, 2015 10:37 AM
> > To: [email protected]
> > Subject: Drill query does not return all results from HBase
> >
> > We have a problem where a Drill query against HBase does not return all
> > results. The following query should return over 100,000 rows, but we only
> > get about 1,030 back.
> >
> > SELECT row_key FROM `hbase`.`customer_staged` WHERE customer_number = 800
> >
> > If we scan directly using the hbase shell we see over 100,000 rows, but
> > the same Drill query does not return a fraction of the expected results.
> We
> > have also run a count against the table and Drill returns the same 1,030
> > number, which is far less than expect. What could be going wrong?
> >
> > We are running Drill 1.2 on Ubuntu 14.04 against CDH 5.4.3 (HBase 1.0).
> We
> > run HBase on six RegionServers, the table has about 1.3 billion rows.
> >
> > Thanks,
> >
> > Kevin
> >
> >
>

Reply via email to