[
https://issues.apache.org/jira/browse/IMPALA-15103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joe McDonnell resolved IMPALA-15103.
------------------------------------
Fix Version/s: Impala 5.0.0
Resolution: Fixed
> Minicluster Hive should respect the HDFS_REPLICATION env variable
> -----------------------------------------------------------------
>
> Key: IMPALA-15103
> URL: https://issues.apache.org/jira/browse/IMPALA-15103
> Project: IMPALA
> Issue Type: Task
> Components: Infrastructure
> Affects Versions: Impala 5.0.0
> Reporter: Joe McDonnell
> Assignee: Joe McDonnell
> Priority: Major
> Fix For: Impala 5.0.0
>
>
> HDFS replication defaults to 3, which dramatically increases disk space for
> tables. We have an existing HDFS_REPLICATION environment variable, but Hive
> does not use it in its configurations. We should change Hive's configurations
> to respect this environment variable. This is particularly useful for
> perf-AB-test jobs where we are loading large scale TPC-H or TPC-DS and want
> to avoid the disk space use.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)