[ 
https://issues.apache.org/jira/browse/IMPALA-7871?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joe McDonnell resolved IMPALA-7871.
-----------------------------------
       Resolution: Fixed
    Fix Version/s: Impala 3.2.0

> Don't load Hive builtin jars for dataload
> -----------------------------------------
>
>                 Key: IMPALA-7871
>                 URL: https://issues.apache.org/jira/browse/IMPALA-7871
>             Project: IMPALA
>          Issue Type: Improvement
>          Components: Infrastructure
>    Affects Versions: Impala 3.1.0
>            Reporter: Joe McDonnell
>            Assignee: Joe McDonnell
>            Priority: Major
>             Fix For: Impala 3.2.0
>
>
> One step in dataload is "Loading Hive Builtins", which copies a large number 
> of jars into HDFS (or whatever storage). This step takes a couple minutes on 
> HDFS dataload and 8 minutes on S3. Despite its name, I can't find any 
> indication that Hive or anything else uses these jars. Dataload and core 
> tests run fine without it. S3 can load data without it. There's no indication 
> that this is needed.
> Unless we find something using these jars, we should remove this step.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org

Reply via email to