Hi All,
interval year column is marked as interval in parquet schema for CTAS
created parquet file. This is seen on Drill 1.9.0, git commit id : f3c26e34
0: jdbc:drill:schema=dfs.tmp> create table dur_day(dr_day) as select
cast(columns[0] as* interval day*) dr_day from `duration.csv`;
+-----------+----------------------------+
| Fragment | Number of records written |
+-----------+----------------------------+
| 0_0 | 23 |
+-----------+----------------------------+
1 row selected (0.306 seconds)
0: jdbc:drill:schema=dfs.tmp> select * from dur_day;
+--------------+
| dr_day |
+--------------+
| P20DT4562S |
| P1DT29212S |
| P24DT8553S |
| P1DT36164S |
| P23DT4041S |
| P15DT5347S |
| P10DT36726S |
| P10DT12817S |
| P12DT43008S |
| P20DT9056S |
| P25DT41883S |
| P8DT4496S |
| P4DT24315S |
| P21DT21058S |
| P14DT16591S |
| P26DT4589S |
| P12DT18910S |
| P29DT24828S |
| P13DT16655S |
| P9DT34464S |
| P12DT6110S |
| P26DT16721S |
| P23DT20649S |
+--------------+
23 rows selected (0.133 seconds)
Looking at the parquet schema for above CTAS created parquet file, we see
that INTERVAL YEAR column is marked as INTERVAL in the parquet schema.
[root@centos-01 duration]# ~/parquet-tools/parquet-schema 0_0_0.parquet
message root {
optional fixed_len_byte_array(12) dr_day (INTERVAL);
}
Thanks,
Khurram