additinal suggestion: 1、support at least two level partition 2、build the B+Tree by partition column shoud split the segment and make it small and may speed load data in carbondata 3、delete data by partition column
best regards fish At 2017-03-31 23:42:07, "QiangCai" <qiang...@qq.com> wrote: >Hi all, > > Let's start the discussion regarding the partition table. > > To support partition table, what we should do? > > 1. create table with partition to support Range Partitioning, Hash >Partitioning, List Partitioning and Composite Partitioning, write the >partition info to schema. > > 2. during data loading, re-partition the input data, start a task process >a partition, write partition information to footer and index file. > > 3. during data query, prune B+Tree by partition if the filter contain the >partition column. or prune data blocks by partition when there is only >partition column predicate. > > 4. optimizer the join performance of two partition tables if partition >column is the join column. > > Any thoughts, comments and questions ? > > Thanks! > >Best Regards >David > > > >-- >View this message in context: >http://apache-carbondata-mailing-list-archive.1130556.n5.nabble.com/DISCUSSION-support-new-feature-Partition-Table-tp9935.html >Sent from the Apache CarbonData Mailing List archive mailing list archive at >Nabble.com.