Hi Fawze, In Impala, only scalar types are allowed in the select list because Impala always produces result sets with all scalar values, i.e. simple tables. The collect_set() function in Hive returns an array, and Impala cannot put an array into a single cell of an output table.
If you want to write files that contain complex data I'm afraid you'll need Hive. BR, Zoltan On Mon, Jul 30, 2018 at 10:21 AM Fawze Abujaber <fawz...@gmail.com> wrote: > Hi everyone! > > > > could anybody tell, how I can replace collect_set Hive function in impala? > > > > Query like this: > > > > select > col1, > > collect_set(distinct col2) > > from dpi_parquet_gzip > group by > col1 > > > > thanks a lot! > > -- > Take Care > Fawze Abujaber >