Hi I recently use drill to join hbase table. I have one question, does there a way to make the scan concurrently?
We have a big table and scan very slow. I researched the hbase storage engine and find there have a similar logic in HBaseGroupScan class which created a sub scan list for each incomingendpoint. But our business logic is more complicate. So I want to created a scan list in HBaseGroupScan based on our need. I added the subsan into endpointFragmentMapping successfully , but I can not make it run concurrently as the ScanBatch class run the subscan list in single thread. So I have no idea how to solve it. I think the key point is change the incomingendpoint parameter. It seems this parameter determine the concurrent thread. Anyone can help me?
