[
https://issues.apache.org/jira/browse/IMPALA-7658?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17176448#comment-17176448
]
Daniel Becker commented on IMPALA-7658:
---------------------------------------
Query performance test:
Used a UDF from TestUdf.java that adds three integers:
{code:java}
create function tpch15_parquet.sum3(int, int, int) returns int
location '/test-warehouse/impala-hive-udfs.jar'
symbol='org.apache.impala.TestUdf';{code}
Used the following query on the master branch and the change's branch:
{code:java}
set num_nodes=1; set mt_dop=1;
select min(tpch15_parquet.sum3(cast(l_orderkey as int), cast(l_partkey as int),
cast(l_suppkey as int)))
from tpch15_parquet.lineitem;{code}
Results averaged over 100 runs after warmup:
Master: 20.6346s, stddev: 0.3132411856765332
Change: 19.0256s, stddev: 0.42039019873436
This is a ~7.8% improvement.
> Proper codegen for HiveUdfCall
> ------------------------------
>
> Key: IMPALA-7658
> URL: https://issues.apache.org/jira/browse/IMPALA-7658
> Project: IMPALA
> Issue Type: Improvement
> Components: Backend
> Reporter: Tim Armstrong
> Assignee: Daniel Becker
> Priority: Major
> Labels: codegen, performance
>
> This function uses GetCodegendComputeFnWrapper() to call the interpreted path
> but instead we could codegen the Evaluate() function to reduce the overhead.
> I think this is likely to be a little involved since there's a loop to
> unroll, so the solution might end up looking like IMPALA-5168
--
This message was sent by Atlassian Jira
(v8.3.4#803005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]