[
https://issues.apache.org/jira/browse/DRILL-4505?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15195059#comment-15195059
]
Tobias commented on DRILL-4505:
-------------------------------
Sadly neither options seems to work for us.
When can we expect the untyped null work to land in drill proper?
> Can't group by or sort across files with different schema
> ---------------------------------------------------------
>
> Key: DRILL-4505
> URL: https://issues.apache.org/jira/browse/DRILL-4505
> Project: Apache Drill
> Issue Type: Bug
> Components: Storage - Parquet
> Affects Versions: 1.5.0
> Environment: Java 1.8
> Reporter: Tobias
>
> We are currently trying out the support for querying across parquet files
> with different schemas.
> Simple selects work well but when we wan't to do sort or group by Drill
> returns "UNSUPPORTED_OPERATION ERROR: Sort doesn't currently support sorts
> with changing schemas Fragment 0:0 [Error Id:
> ff490670-64c1-4fb8-990e-a02aa44ac010 on zookeeper-1:31010]"
> This is despite not even including the new columns in the query.
> Expected result would be to treat the non existing columns in certain files
> as either null or default value and allow them to be grouped and sorted
> Example
> SELECT APPLICATION_ID ,dir0 AS year_ FROM dfs.`/PRO/UTC/1` WHERE dir2
> >='2016-01-01' AND dir2<'2016-04-02' work with changing schema
> but SELECT max(APPLICATION_ID ),dir0 AS year_ FROM dfs.`/PRO/UTC/1` WHERE
> dir2 >='2016-01-01' AND dir2<'2016-04-02' group by dir0 does not work
> For us this hampers any possibility to have an evolving schema with moderatly
> complex queries
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)