[jira] [Commented] (IMPALA-12909) Generate distributed plan for query accessing multiple JDBC tables

Kurt Deschler (Jira) Thu, 19 Dec 2024 06:43:06 -0800


    [ 
https://issues.apache.org/jira/browse/IMPALA-12909?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17907059#comment-17907059
 ]


Kurt Deschler commented on IMPALA-12909:
----------------------------------------

The consensus seems to be that parallelizing individual tables is not feasible 
and that parallelizing separate tables is not worth pursuing at this time. We 
can leave this ticket open for future work.

> Generate distributed plan for query accessing multiple JDBC tables
> ------------------------------------------------------------------
>
>                 Key: IMPALA-12909
>                 URL: https://issues.apache.org/jira/browse/IMPALA-12909
>             Project: IMPALA
>          Issue Type: Sub-task
>          Components: Frontend
>            Reporter: Wenzhe Zhou
>            Assignee: Pranav Yogi Lodha
>            Priority: Major
>
> For a query which access multiple JDBC tables, Planner generate single node 
> plan. It's better to generate distributed plan so that JDBC read could be 
> scheduled on executors. This restriction is due to current design of External 
> data source framework because scan is single threaded. DataSourceScanNode 
> cannot run in node other than coordinator. 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[jira] [Commented] (IMPALA-12909) Generate distributed plan for query accessing multiple JDBC tables

Reply via email to