Hi Fawze,

In Impala, only scalar types are allowed in the select list because Impala
always produces result sets with all scalar values, i.e. simple tables.
The collect_set() function in Hive returns an array, and Impala cannot put
an array into a single cell of an output table.

If you want to write files that contain complex data I'm afraid you'll need
Hive.

BR,
    Zoltan




On Mon, Jul 30, 2018 at 10:21 AM Fawze Abujaber <fawz...@gmail.com> wrote:

> Hi everyone!
>
>
>
> could anybody tell, how I can replace collect_set Hive function in impala?
>
>
>
> Query like this:
>
>
>
> select
> col1,
>
> collect_set(distinct col2)
>
> from dpi_parquet_gzip
> group by
> col1
>
>
>
> thanks a lot!
>
> --
> Take Care
> Fawze Abujaber
>

Reply via email to