[ 
https://issues.apache.org/jira/browse/IMPALA-12134?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17723881#comment-17723881
 ] 

ASF subversion and git services commented on IMPALA-12134:
----------------------------------------------------------

Commit 607ae742e5b12df12e2e2e64c890190a02beb765 in impala's branch 
refs/heads/master from Kurt Deschler
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=607ae742e ]

IMPALA-12134: Optimize row materialization time

This patch improves row materialization time by providing specialized
formatting logic for default date and timestamp formats. For Beeswax
protocol, performance is also improved by caching deserialized column
metadata to avoid unnecessary per-row cost.

Benchmarks:
- Manually tested mixed datatype table showed ~20% reduction in row
  materialization time
- Added cases to date-benchmark for new formatters. Date formatting
  improved by 3x and timestamp by 2x

Machine Info: Intel(R) Core(TM) i7-8700 CPU @ 3.20GHz
ToYearMonthDay:            Function  iters/ms
-------------------------------------------------------
             TestCctzToYearMonthDay               16.5
                 TestToYearMonthDay               61.1
                         TestToYear                280
                       TestToString                 18
          TestToString_stringstream               1.86
           TestDefaultDateToCharBuf               25.5
              TestTimestampToString               11.7
      TestDefaultTimestampToCharBuf               15.7

Testing:
- Ran core tests

Change-Id: I1ef5e4137fa6c2d0a5f08b430e01e3fb7de86330
Reviewed-on: http://gerrit.cloudera.org:8080/19875
Reviewed-by: Impala Public Jenkins <[email protected]>
Tested-by: Impala Public Jenkins <[email protected]>


> Optimize row materialization time
> ---------------------------------
>
>                 Key: IMPALA-12134
>                 URL: https://issues.apache.org/jira/browse/IMPALA-12134
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Backend
>            Reporter: Kurt Deschler
>            Assignee: Kurt Deschler
>            Priority: Major
>
> IMPALA-12111 addressed the most significant contributors to slow row 
> materialization. However, there is still room for significant improvement 
> with the following optimizations:
>  * Specialized implementation for default Data and Timestamp formatting.
>  * Caching deserialized column metadata



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to