[
https://issues.apache.org/jira/browse/DRILL-5093?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Robert Hou updated DRILL-5093:
------------------------------
Attachment: drill.parquet_metadata
> Explain plan shows all partitions when query scans all partitions, and filter
> pushdown is used with metadata caching.
> ---------------------------------------------------------------------------------------------------------------------
>
> Key: DRILL-5093
> URL: https://issues.apache.org/jira/browse/DRILL-5093
> Project: Apache Drill
> Issue Type: Bug
> Components: Query Planning & Optimization
> Affects Versions: 1.9.0
> Reporter: Robert Hou
> Assignee: Jinfeng Ni
> Attachments: 0_0_1.parquet, 0_0_2.parquet, 0_0_3.parquet,
> 0_0_4.parquet, 0_0_5.parquet, drill.parquet_metadata
>
>
> This query scans all the partitions because the partitions cannot be pruned.
> When metadata caching is used, the explain plan shows all the partitions,
> when it should only show the parent.
> 0: jdbc:drill:zk=10.10.100.186:5181/drill/rho> explain plan for select \*
> from orders_parts_metadata;
> +------+------+
> | text | json |
> +------+------+
> | 00-00 Screen
> 00-01 Project(*=[$0])
> 00-02 Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath
> [path=/drill/testdata/filter/orders_parts_metadata/0_0_1.parquet],
> ReadEntryWithPath
> [path=/drill/testdata/filter/orders_parts_metadata/0_0_3.parquet],
> ReadEntryWithPath
> [path=/drill/testdata/filter/orders_parts_metadata/0_0_4.parquet],
> ReadEntryWithPath
> [path=/drill/testdata/filter/orders_parts_metadata/0_0_5.parquet],
> ReadEntryWithPath
> [path=/drill/testdata/filter/orders_parts_metadata/0_0_2.parquet]],
> selectionRoot=/drill/testdata/filter/orders_parts_metadata, numFiles=5,
> usedMetadataFile=true,
> cacheFileRoot=/drill/testdata/filter/orders_parts_metadata, columns=[`*`]]])
> Here is the same query with a table that does not have metadata caching.
> 0: jdbc:drill:zk=10.10.100.186:5181/drill/rho> explain plan for select \*
> from orders_parts;
> +------+------+
> | text | json |
> +------+------+
> | 00-00 Screen
> 00-01 Project(*=[$0])
> 00-02 Scan(groupscan=[ParquetGroupScan [entries=[ReadEntryWithPath
> [path=maprfs:///drill/testdata/filter/orders_parts]],
> selectionRoot=maprfs:/drill/testdata/filter/orders_parts, numFiles=1,
> usedMetadataFile=false, columns=[`*`]]])
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)