[ 
https://issues.apache.org/jira/browse/IMPALA-15103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Joe McDonnell resolved IMPALA-15103.
------------------------------------
    Fix Version/s: Impala 5.0.0
       Resolution: Fixed

> Minicluster Hive should respect the HDFS_REPLICATION env variable
> -----------------------------------------------------------------
>
>                 Key: IMPALA-15103
>                 URL: https://issues.apache.org/jira/browse/IMPALA-15103
>             Project: IMPALA
>          Issue Type: Task
>          Components: Infrastructure
>    Affects Versions: Impala 5.0.0
>            Reporter: Joe McDonnell
>            Assignee: Joe McDonnell
>            Priority: Major
>             Fix For: Impala 5.0.0
>
>
> HDFS replication defaults to 3, which dramatically increases disk space for 
> tables. We have an existing HDFS_REPLICATION environment variable, but Hive 
> does not use it in its configurations. We should change Hive's configurations 
> to respect this environment variable. This is particularly useful for 
> perf-AB-test jobs where we are loading large scale TPC-H or TPC-DS and want 
> to avoid the disk space use.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to