Baike Xia created IMPALA-11600:
----------------------------------
Summary: Support Bucketed Table And Related Optimizations
Key: IMPALA-11600
URL: https://issues.apache.org/jira/browse/IMPALA-11600
Project: IMPALA
Issue Type: New Feature
Components: Backend, Distributed Exec, Frontend
Reporter: Baike Xia
In Hive, we can create bucket tables, divide data in fine-grained ways, and
publish data to different files based on bucket columns. Like this, we can make
specific optimizations to the Query to speed up the Query.
I think it would be exciting for Impala to have support for bucket table
creation and related optimizations.
The following document is a design document that supports the creation of
bucket tables. If you are interested, welcome to give some suggestions.
[Support Bucketed Table And Related
Optimizations|https://docs.google.com/document/d/1-hvGK-Ng-GtPqxbgB7rTPfrkCtYLVFDehn9ybL-mGUc/edit#heading=h.3y9ae6d7rbnq]
--
This message was sent by Atlassian Jira
(v8.20.10#820010)