I've been experimenting with Drill 1.0 against MapR-DB. I expect I'd see similar behavior with HBase, although I haven't tried it yet.
I have an 11 GB table that is split into 8 regions (not perfectly balanced; some have twice as many records as others). When I run a Drill query that results in a full table scan (e.g. select *..), I notice in the profile that only 8 minor fragments are assigned. This is consistent with other tests I've run, which also appear to assign one minor fragment per region.

Is this expected? Are there any ways to increase the parallelism of table scans (e.g. one fragment every XX records, or XX fragments per region)?

--
Andy Pernsteiner
Manager, Field Enablement
ph: 206.228.0737
www.mapr.com
