Rajesh Balamohan created HADOOP-18347: -----------------------------------------
Summary: Restrict vectoredIO threadpool to reduce memory pressure Key: HADOOP-18347 URL: https://issues.apache.org/jira/browse/HADOOP-18347 Project: Hadoop Common Issue Type: Sub-task Components: common, fs, fs/adl, fs/s3 Reporter: Rajesh Balamohan https://github.com/apache/hadoop/blob/trunk/hadoop-tools/hadoop-aws/src/main/java/org/apache/hadoop/fs/s3a/S3AInputStream.java#L964-L967 Currently, it fetches all the ranges with unbounded threadpool. This will not cause memory pressures with standard benchmarks like TPCDS. However, when large number of ranges are present with large files, this could potentially spike up memory usage of the task. Limiting the threadpool size could reduce the memory usage. -- This message was sent by Atlassian Jira (v8.20.10#820010) --------------------------------------------------------------------- To unsubscribe, e-mail: common-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-dev-h...@hadoop.apache.org