[ 
https://issues.apache.org/jira/browse/IMPALA-7658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17176448#comment-17176448
 ] 

Daniel Becker commented on IMPALA-7658:
---------------------------------------

Query performance test:

Used a UDF from TestUdf.java that adds three integers:
{code:java}
create function tpch15_parquet.sum3(int, int, int) returns int
location '/test-warehouse/impala-hive-udfs.jar'
symbol='org.apache.impala.TestUdf';{code}
 Used the following query on the master branch and the change's branch:
{code:java}
set num_nodes=1; set mt_dop=1;
select min(tpch15_parquet.sum3(cast(l_orderkey as int), cast(l_partkey as int), 
cast(l_suppkey as int)))
from tpch15_parquet.lineitem;{code}
Results averaged over 100 runs after warmup:
Master: 20.6346s, stddev: 0.3132411856765332
Change: 19.0256s, stddev: 0.42039019873436


This is a ~7.8% improvement.

> Proper codegen for HiveUdfCall
> ------------------------------
>
>                 Key: IMPALA-7658
>                 URL: https://issues.apache.org/jira/browse/IMPALA-7658
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Backend
>            Reporter: Tim Armstrong
>            Assignee: Daniel Becker
>            Priority: Major
>              Labels: codegen, performance
>
> This function uses GetCodegendComputeFnWrapper() to call the interpreted path 
> but instead we could codegen the Evaluate() function to reduce the overhead. 
> I think this is likely to be a little involved since there's a loop to 
> unroll, so the solution might end up looking like IMPALA-5168



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to