This is an automated email from the ASF dual-hosted git repository.
bridgetb pushed a commit to branch gh-pages
in repository https://gitbox.apache.org/repos/asf/drill.git
The following commit(s) were added to refs/heads/gh-pages by this push:
new cefc5fb Fixing broken links to CTAS and Explain Commands - DRILL 6749
cefc5fb is described below
commit cefc5fb02af86826840f41ba9df50427b030f652
Author: Nitin S <[email protected]>
AuthorDate: Thu Sep 20 19:10:13 2018 -0700
Fixing broken links to CTAS and Explain Commands - DRILL 6749
Fixing broken links to CTAS and Explain Commands
---
_docs/performance-tuning/026-parquet-filter-pushdown.md | 6 +++---
1 file changed, 3 insertions(+), 3 deletions(-)
diff --git a/_docs/performance-tuning/026-parquet-filter-pushdown.md
b/_docs/performance-tuning/026-parquet-filter-pushdown.md
index 466f083..0afdcee 100644
--- a/_docs/performance-tuning/026-parquet-filter-pushdown.md
+++ b/_docs/performance-tuning/026-parquet-filter-pushdown.md
@@ -24,7 +24,7 @@ Parquet filter pushdown is similar to partition pruning in
that it reduces the a
The query planner looks at the minimum and maximum values in each row group
for an intersection. If no intersection exists, the planner can prune the row
group in the table. If the minimum and maximum value range is too large, Drill
does not apply Parquet filter pushdown. The query planner can typically prune
more data when the tables in the Parquet file are sorted by row groups.
##Using Parquet Filter Pushdown
-Currently, Parquet filter pushdown only supports filters that reference
columns from a single table (local filters). Parquet filter pushdown requires
the minimum and maximum values in the Parquet file metadata. All Parquet files
created in Drill using the CTAS statement contain the necessary metadata. If
your Parquet files were created using another tool, you may need to use Drill
to read and rewrite the files using the [CTAS
command]({{site.baseurl}}/docs/create-table-as-ctas-command/).
+Currently, Parquet filter pushdown only supports filters that reference
columns from a single table (local filters). Parquet filter pushdown requires
the minimum and maximum values in the Parquet file metadata. All Parquet files
created in Drill using the CTAS statement contain the necessary metadata. If
your Parquet files were created using another tool, you may need to use Drill
to read and rewrite the files using the [CTAS
command]({{site.baseurl}}/docs/create-table-as-ctas/).
Parquet filter pushdown works best if you presort the data. You do not have to
sort the entire data set at once. You can sort a subset of the data set, sort
another subset, and so on.
@@ -39,7 +39,7 @@ The following table lists the Parquet filter pushdown options
with their descrip
| "planner.store.parquet.rowgroup.filter.pushdown.threshold" | Sets the number
of row groups that a table can have. You can increase the threshold if the
filter can prune many row groups. However, if this setting is too high, the
filter evaluation overhead increases. Base this setting on the data set.
Reduce this setting if the planning time is significant, or you do not see
any benefit at runtime. | 10,000 |
###Viewing the Query Plan
-Because Drill applies Parquet filter pushdown during the query planning phase,
you can view the query execution plan to see if Drill pushes down the filter
when a query on a Parquet file contains a filter expression. You can run the
[EXPLAIN PLAN command]({{site.baseurl}}/docs/explain-commands/) to see the
execution plan for the query, as shown in the following example.
+Because Drill applies Parquet filter pushdown during the query planning phase,
you can view the query execution plan to see if Drill pushes down the filter
when a query on a Parquet file contains a filter expression. You can run the
[EXPLAIN PLAN command]({{site.baseurl}}/docs/explain/) to see the execution
plan for the query, as shown in the following example.
**Example**
@@ -79,4 +79,4 @@ The following table lists the supported and unsupported
clauses, operators, data
- a dynamic star in the sub-query or queries that include the WITH statement.
- several filter predicates with the OR logical operator.
- more than one EXISTS operator (instead of JOIN operators).
-- INNER JOIN and local filtering with a several conditions.
\ No newline at end of file
+- INNER JOIN and local filtering with a several conditions.