Hi Anup,

The Impala documentation contains a good description about UDFs:
https://impala.apache.org/docs/build/html/topics/impala_udf.html
You might also want to take a look at this repo that contains a couple of
examples: https://github.com/cloudera/impala-udf-samples

For best performance, I recommend you to write your UDF in C++.

Cheers,
     Zoltan


On Fri, Jan 17, 2020 at 8:24 PM Anup Tiwari <anupsdtiw...@gmail.com> wrote:

> Thanks Zoltán for update.
> Can you provide me some links for UDF development for such use case ?
>
>
> Regards,
> Anup Tiwari
>
>
> On Mon, Jan 13, 2020 at 9:59 PM Zoltán Borók-Nagy <borokna...@cloudera.com>
> wrote:
>
>> Hi Anup,
>>
>> Impala added support for the DATE type in version 3.3.
>>
>> Parquet represents dates as 32-bit integers, storing the number of days
>> from the Unix epoch, 1 January 1970.
>> For pre-3.3 versions of Impala it means even if you could read the
>> numbers from the Parquet file you'd still need to write a UDF that converts
>> them to strings.
>>
>> Cheers,
>>     Zoltan
>>
>>
>> On Mon, Jan 13, 2020 at 8:54 AM Anup Tiwari <anupsdtiw...@gmail.com>
>> wrote:
>>
>>> Hi Team,
>>>
>>> I have a table data written in parquet format by hive which contains
>>> DATE datatype.
>>> When i am reading it in Impala, it is giving me an error due to data
>>> type.
>>> So do we have any workaround for this like cast("datecolumn" as string)?
>>>
>>> Regards,
>>> Anup Tiwari
>>>
>>

Reply via email to