[ 
https://issues.apache.org/jira/browse/DRILL-4519?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Miroslav Holubec updated DRILL-4519:
------------------------------------
    Description: 
We have parquet files in folders with following convention YYYY/MM/DD/HH.
Without drill's parquet metadata directory prunning works seamlessly.
{noformat}
select dir0, dir1, dir2 from hdfs.test.indexed;
dir0 = YYYY,  dir1 = MM, dir2 = DD, dir3 = HH
{noformat}
After creating metadata and executing same query, dir0 contains HH folder name 
instead yearly folder name. dir1...3 are null.
{noformat}
refresh table metadata hdfs.test.indexed;
select dir0, dir1, dir2 from hdfs.test.indexed;
dir0 = HH,  dir1 = null, dir2 = null, dir3 = null
{noformat}



  was:
We have parquet files in folders with following convention YYYY/MM/DD/HH.
Without drill's parquet metadata directory prunning works seamlessly.
{noformat}
select dir0, dir1, dir2 from hdfs.test.indexed;
dir0 = YYYY,  dir1 = MM, dir2 = DD, dir3 = HH
{noformat}
After creating metadata and executing same query, dir0 contains HH folder name 
instead yearly folder name. dir1...3 are null.
{noformat}
select dir0, dir1, dir2 from hdfs.test.indexed;
dir0 = HH,  dir1 = null, dir2 = null, dir3 = null
{noformat}




> File system directory-based partition pruning doesn't work correctly with 
> parquet metadata
> ------------------------------------------------------------------------------------------
>
>                 Key: DRILL-4519
>                 URL: https://issues.apache.org/jira/browse/DRILL-4519
>             Project: Apache Drill
>          Issue Type: Bug
>    Affects Versions: 1.4.0, 1.5.0
>            Reporter: Miroslav Holubec
>
> We have parquet files in folders with following convention YYYY/MM/DD/HH.
> Without drill's parquet metadata directory prunning works seamlessly.
> {noformat}
> select dir0, dir1, dir2 from hdfs.test.indexed;
> dir0 = YYYY,  dir1 = MM, dir2 = DD, dir3 = HH
> {noformat}
> After creating metadata and executing same query, dir0 contains HH folder 
> name instead yearly folder name. dir1...3 are null.
> {noformat}
> refresh table metadata hdfs.test.indexed;
> select dir0, dir1, dir2 from hdfs.test.indexed;
> dir0 = HH,  dir1 = null, dir2 = null, dir3 = null
> {noformat}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to