[
https://issues.apache.org/jira/browse/DRILL-3692?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Sungwook Yoon updated DRILL-3692:
---------------------------------
Description:
We are trying to use Hive parquet stored files partitioned by some column year.
So, the directory structure is partitioned with year=value
Let's say there are 5 years, so dir0 are like year=2010,
year=2011,year=2012,year=2013,year=2014
We did like following
select * from dfs.root.`/user/hive/warehouse/table` d where d.dir0 =
'year=2012';
I get nothing.
Apparently, there are parquet files in the directory though.
Sometimes it picks up e.g., year=2010,
That is,
select * from dfs.root.`/user/hive/warehouse/table` d where d.dir0 =
'year=2010';
retrieves values.
Not all subdirectories in dir0 are correctly picked up.
I think the files under every dir0 are picked up, just the names of dir0 are
not correctly picked up.
was:
We are trying to use Hive parquet stored files partitioned by some column year.
So, the directory structure is partitioned with year=value
Let's say there are 5 years, so dir0 are like year=2010,
year=2011,year=2012,year=2013,year=2014
We did like following
select * from dfs.root.`/user/hive/warehouse/table` d where d.dir0 =
'year=2012';
I get nothing.
Apparently, there are parquet files in the directory though.
Sometimes it picks up e.g., year=2010,
That is,
select * from dfs.root.`/user/hive/warehouse/table` d where d.dir0 =
'year=2010';
retrieves values.
Not all subdirectories in dir0 are correctly picked up.
> Some subdirectories are not correctly picked up as dir0 for Hive partitioned
> by dirs
> ------------------------------------------------------------------------------------
>
> Key: DRILL-3692
> URL: https://issues.apache.org/jira/browse/DRILL-3692
> Project: Apache Drill
> Issue Type: Bug
> Components: Functions - Drill, Storage - Parquet
> Affects Versions: 1.1.0
> Environment: MapR 5.0, Drill 1.1.0 and Sqlline through Zookeeper
> Reporter: Sungwook Yoon
> Assignee: Mehant Baid
>
> We are trying to use Hive parquet stored files partitioned by some column
> year.
> So, the directory structure is partitioned with year=value
> Let's say there are 5 years, so dir0 are like year=2010,
> year=2011,year=2012,year=2013,year=2014
> We did like following
> select * from dfs.root.`/user/hive/warehouse/table` d where d.dir0 =
> 'year=2012';
> I get nothing.
> Apparently, there are parquet files in the directory though.
> Sometimes it picks up e.g., year=2010,
> That is,
> select * from dfs.root.`/user/hive/warehouse/table` d where d.dir0 =
> 'year=2010';
> retrieves values.
> Not all subdirectories in dir0 are correctly picked up.
> I think the files under every dir0 are picked up, just the names of dir0 are
> not correctly picked up.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)