[ 
https://issues.apache.org/jira/browse/IMPALA-7171?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16773656#comment-16773656
 ] 

ASF subversion and git services commented on IMPALA-7171:
---------------------------------------------------------

Commit fb127f8f7e1d0d8d6c1d72702fd10c23e119e41c in impala's branch 
refs/heads/2.x from Alex Rodoni
[ https://gitbox.apache.org/repos/asf?p=impala.git;h=fb127f8 ]

IMPALA-7171: [DOCS] Hints for Kudu insert and upsert

Change-Id: I04378e6f2b17d4d6e844192807d946b9045e2927
Reviewed-on: http://gerrit.cloudera.org:8080/10737
Reviewed-by: Thomas Marshall <[email protected]>
Tested-by: Impala Public Jenkins <[email protected]>


> Add docs for Kudu insert partitioning/sorting
> ---------------------------------------------
>
>                 Key: IMPALA-7171
>                 URL: https://issues.apache.org/jira/browse/IMPALA-7171
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Docs
>            Reporter: Thomas Tauber-Marshall
>            Assignee: Alex Rodoni
>            Priority: Major
>              Labels: docs
>             Fix For: Impala 2.13.0, Impala 3.1.0
>
>
> On the page: 
> http://impala.apache.org/docs/build3x/html/topics/impala_kudu.html, at the 
> end of the section: "Impala DML Support for Kudu Tables (INSERT, UPDATE, 
> DELETE, UPSERT)", we should add text like:
> Starting from Impala 2.9, Impala will automatically add a partition and sort 
> step to INSERTs before sending the rows to Kudu. Since Kudu partitions and 
> sorts rows on write, pre-partitioning and sorting takes some of the load off 
> of Kudu, and helps ensure that large INSERTs complete without timing out, but 
> it may slow down the end-to-end performance of the INSERT. Starting from 
> Impala 2.10, the hints "/* +noshuffle,noclustered */" may be used to turn 
> this pre-partitioning and sorting off. Additionally, since sorting may 
> consume a lot of memory, users should consider setting a "mem_limit" for 
> these queries.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to