[jira] [Commented] (KUDU-2437) Generate ScanToken from small chunks in tablet

Xu Yao (JIRA) Tue, 15 May 2018 06:54:23 -0700

    [ 
https://issues.apache.org/jira/browse/KUDU-2437?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16475866#comment-16475866
 ]


Xu Yao commented on KUDU-2437:
------------------------------

https://gerrit.cloudera.org/#/c/10406/

> Generate ScanToken from small chunks in tablet
> ----------------------------------------------
>
>                 Key: KUDU-2437
>                 URL: https://issues.apache.org/jira/browse/KUDU-2437
>             Project: Kudu
>          Issue Type: Improvement
>          Components: client, master, tablet
>            Reporter: Xu Yao
>            Priority: Major
>
> When reading data in a kudu table using spark, if there is a large amount of 
> data in the tablet, reading the data takes a long time.
> The reason is that KuduRDD uses a tablet to generate the scanToken, so a 
> spark task needs to process all the data in a tablet. So:
>  # TS report the DRS bounds info to Master
>  # Client get the bounds info from Master
>  # Client generate the scanToken by bounds info of tablet（set 
> LowerBoundPrimaryKey and UpperBoundPrimaryKey）



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

[jira] [Commented] (KUDU-2437) Generate ScanToken from small chunks in tablet

Reply via email to