Rajesh Balamohan created HADOOP-18347:
-----------------------------------------

             Summary: Restrict vectoredIO threadpool to reduce memory pressure
                 Key: HADOOP-18347
                 URL: https://issues.apache.org/jira/browse/HADOOP-18347
             Project: Hadoop Common
          Issue Type: Sub-task
          Components: common, fs, fs/adl, fs/s3
            Reporter: Rajesh Balamohan


https://github.com/apache/hadoop/blob/trunk/hadoop-tools/hadoop-aws/src/main/java/org/apache/hadoop/fs/s3a/S3AInputStream.java#L964-L967

Currently, it fetches all the ranges with unbounded threadpool. This will not 
cause memory pressures with standard benchmarks like TPCDS. However, when large 
number of ranges are present with large files, this could potentially spike up 
memory usage of the task. Limiting the threadpool size could reduce the memory 
usage.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: common-issues-h...@hadoop.apache.org

Reply via email to