Joe McDonnell created IMPALA-15103:
--------------------------------------
Summary: Minicluster Hive should respect the HDFS_REPLICATION env variable
Key: IMPALA-15103
URL: https://issues.apache.org/jira/browse/IMPALA-15103
Project: IMPALA
Issue Type: Task
Components: Infrastructure
Affects Versions: Impala 5.0.0
Reporter: Joe McDonnell
Assignee: Joe McDonnell
HDFS replication defaults to 3, so every table takes roughly three times its
data size on disk. We have an existing HDFS_REPLICATION environment variable,
but Hive's configurations do not use it. We should change Hive's configurations
to respect this environment variable. This is particularly useful for
perf-AB-test jobs, where we load large-scale TPC-H or TPC-DS datasets and want
to avoid the extra disk usage.
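
As a rough illustration of the idea, the sketch below derives a Hive
configuration override from HDFS_REPLICATION. The helper name
hive_replication_overrides is hypothetical, and setting the standard Hadoop
property dfs.replication in Hive's configuration is an assumption about how
the change could be made, not the actual patch.

```python
import os


def hive_replication_overrides(env=None):
    """Hypothetical helper: build Hive config overrides from HDFS_REPLICATION.

    Returns an empty dict when the variable is unset, so that the HDFS
    default replication factor (3) stays in effect.
    """
    env = os.environ if env is None else env
    value = env.get("HDFS_REPLICATION", "").strip()
    if not value:
        return {}
    replication = int(value)
    if replication < 1:
        raise ValueError("HDFS_REPLICATION must be a positive integer")
    # Assumption: Hive passes dfs.replication through to the HDFS client.
    return {"dfs.replication": str(replication)}


if __name__ == "__main__":
    print(hive_replication_overrides())
```

With HDFS_REPLICATION=1 in the environment, the returned mapping would ask
for a single copy of each block, which is the disk saving this task is after.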