Joe McDonnell created IMPALA-15103:
--------------------------------------

             Summary: Minicluster Hive should respect the HDFS_REPLICATION env 
variable
                 Key: IMPALA-15103
                 URL: https://issues.apache.org/jira/browse/IMPALA-15103
             Project: IMPALA
          Issue Type: Task
          Components: Infrastructure
    Affects Versions: Impala 5.0.0
            Reporter: Joe McDonnell
            Assignee: Joe McDonnell


HDFS replication defaults to 3, which dramatically increases disk space for 
tables. We have an existing HDFS_REPLICATION environment variable, but Hive 
does not use it in its configurations. We should change Hive's configurations 
to respect this environment variable. This is particularly useful for 
perf-AB-test jobs where we are loading large scale TPC-H or TPC-DS and want to 
avoid the disk space use.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to