[
https://issues.apache.org/jira/browse/DRILL-3333?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14600757#comment-14600757
]
Steven Phillips commented on DRILL-3333:
----------------------------------------
When I was working on this, I originally had planned to simply include
DRILL-1950 to handle the partition pruning. But I discussed with a couple
others, and we decided not to do it that way, but instead use the Partition
Pruning rules, exposing the column information. The main reason for choosing
this is that it will allow pruning when the filter expression contains any
arbitrary drill function on the data. This is because we use the drill function
interpreter to evaluate whether the partition can be pruned.
The filter pushdown code in DRILL-1950 will of course still be useful for
pruning more general data distributions. The pruning here was really designed
specifically to work along-side the CTAS-partitioning feature, which uses
single values for the partition columns.
> Add support for auto-partitioning in parquet writer
> ---------------------------------------------------
>
> Key: DRILL-3333
> URL: https://issues.apache.org/jira/browse/DRILL-3333
> Project: Apache Drill
> Issue Type: Bug
> Reporter: Steven Phillips
> Assignee: Steven Phillips
> Attachments: DRILL-3333.patch, DRILL-3333.patch,
> DRILL-3333_2015-06-22_15:22:11.patch, DRILL-3333_2015-06-23_17:38:32.patch
>
>
> When a table is created with a partition by clause, the parquet writer will
> create separate files for the different partition values. The data will first
> be sorted by the partition keys, and the parquet writer will create new file
> when it encounters a new value for the partition columns.
> When data is queried against the data that was created this way, partition
> pruning will work if the filter contains a partition column. And unlike
> directory based partitioning, no view is required, nor is it necessary to
> reference the dir* column names.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)