[ 
https://issues.apache.org/jira/browse/IMPALA-14523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Smith resolved IMPALA-14523.
------------------------------------
    Fix Version/s: Impala 5.0.0
       Resolution: Fixed

> Optimize JDBC table for Hive Multistream Driver
> -----------------------------------------------
>
>                 Key: IMPALA-14523
>                 URL: https://issues.apache.org/jira/browse/IMPALA-14523
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Backend
>    Affects Versions: Impala 4.4.0
>            Reporter: Kurt Deschler
>            Assignee: Pranav Yogi Lodha
>            Priority: Major
>             Fix For: Impala 5.0.0
>
>
> HIVE-27872 added multi-stream fetch capabilites to the HS2 JDBC driver which 
> facilitates very fast transport of bulk data over JDBC. However, to achieve 
> high performance it is necessary for the client to consume data quickly from 
> the single-threaded JDBC client. This requires minimizing the (synchronous) 
> work done after fetching data from JDBC and performing any expensive 
> processing using multiple threads. It is possible to achive this result 
> either by fetching data on a single thread and handing it off for consumption 
> or using locking to serialize fetching. In either case, the fetch path must 
> copy data in an efficent way from the cursor to local memory and defer any 
> expensive encoding/decoding/conversion to a multi-threaded codepath. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to