Hi Anup,

The Impala documentation contains a good description of UDFs:
https://impala.apache.org/docs/build/html/topics/impala_udf.html

You might also want to take a look at this repo, which contains a couple of examples:
https://github.com/cloudera/impala-udf-samples

For best performance, I recommend writing your UDF in C++.

Cheers,
Zoltan

On Fri, Jan 17, 2020 at 8:24 PM Anup Tiwari <anupsdtiw...@gmail.com> wrote:

> Thanks Zoltán for the update.
> Can you provide me some links on UDF development for such a use case?
>
> Regards,
> Anup Tiwari
>
> On Mon, Jan 13, 2020 at 9:59 PM Zoltán Borók-Nagy <borokna...@cloudera.com> wrote:
>
>> Hi Anup,
>>
>> Impala added support for the DATE type in version 3.3.
>>
>> Parquet represents dates as 32-bit integers, storing the number of days
>> from the Unix epoch, 1 January 1970.
>> For pre-3.3 versions of Impala this means that even if you could read the
>> numbers from the Parquet file, you'd still need to write a UDF that converts
>> them to strings.
>>
>> Cheers,
>> Zoltan
>>
>> On Mon, Jan 13, 2020 at 8:54 AM Anup Tiwari <anupsdtiw...@gmail.com> wrote:
>>
>>> Hi Team,
>>>
>>> I have table data written in Parquet format by Hive which contains the
>>> DATE datatype.
>>> When I read it in Impala, it gives me an error due to the data type.
>>> So do we have any workaround for this, like cast("datecolumn" as string)?
>>>
>>> Regards,
>>> Anup Tiwari
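
For reference, the conversion discussed in this thread (a Parquet DATE stored as a 32-bit count of days since 1 January 1970, turned into a string) could look roughly like the sketch below. It is plain C++ and does not use the Impala UDF API. The function name `DaysToDateString` is made up for illustration, and the calendar arithmetic is a commonly used days-to-civil-date algorithm for the proleptic Gregorian calendar, not something taken from Impala itself. Wrapping it in an actual UDF would follow the documentation and sample repo linked above.

```cpp
#include <cstdint>
#include <cstdio>
#include <string>

// Hypothetical helper (not part of Impala): converts a day count relative to
// 1970-01-01 into a "YYYY-MM-DD" string, assuming the proleptic Gregorian
// calendar.
std::string DaysToDateString(int32_t days_since_epoch) {
  // Shift the origin to 0000-03-01 so that the leap day is the last day of
  // each shifted year, which keeps the arithmetic below uniform.
  const int64_t z = static_cast<int64_t>(days_since_epoch) + 719468;
  // A 400-year Gregorian cycle ("era") has exactly 146097 days.
  const int64_t era = (z >= 0 ? z : z - 146096) / 146097;
  const int64_t doe = z - era * 146097;  // day of era, 0..146096
  const int64_t yoe =
      (doe - doe / 1460 + doe / 36524 - doe / 146096) / 365;  // year of era, 0..399
  const int64_t doy = doe - (365 * yoe + yoe / 4 - yoe / 100);  // day of shifted year
  const int64_t mp = (5 * doy + 2) / 153;  // shifted month, 0 = March
  const int64_t day = doy - (153 * mp + 2) / 5 + 1;    // 1..31
  const int64_t month = mp < 10 ? mp + 3 : mp - 9;     // 1..12
  // January and February belong to the next calendar year after the shift.
  const int64_t year = yoe + era * 400 + (month <= 2 ? 1 : 0);

  char buf[32];
  std::snprintf(buf, sizeof(buf), "%04lld-%02lld-%02lld",
                static_cast<long long>(year), static_cast<long long>(month),
                static_cast<long long>(day));
  return std::string(buf);
}
```

For example, `DaysToDateString(0)` yields `"1970-01-01"` and `DaysToDateString(18274)` yields `"2020-01-13"`. Years before 0 or beyond 9999 are not formatted specially in this sketch.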