[ https://issues.apache.org/jira/browse/HDFS-12777?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16293598#comment-16293598 ]
Hudson commented on HDFS-12777:
-------------------------------

SUCCESS: Integrated in Jenkins build Hadoop-trunk-Commit #13391 (See [https://builds.apache.org/job/Hadoop-trunk-Commit/13391/])
HDFS-12777. [READ] Reduce memory and CPU footprint for PROVIDED volumes. (cdouglas: rev e1a28f95b8ffcb86300148f10a23b710f8388341)
* (edit) hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/ReplicaBuilder.java
* (edit) hadoop-hdfs-project/hadoop-hdfs/src/test/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/impl/TestProvidedImpl.java
* (edit) hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/fsdataset/impl/ProvidedVolumeImpl.java
* (edit) hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/DirectoryScanner.java
* (edit) hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/ProvidedReplica.java
* (edit) hadoop-hdfs-project/hadoop-hdfs/src/main/java/org/apache/hadoop/hdfs/server/datanode/FinalizedProvidedReplica.java

> [READ] Reduce memory and CPU footprint for PROVIDED volumes.
> ------------------------------------------------------------
>
>                 Key: HDFS-12777
>                 URL: https://issues.apache.org/jira/browse/HDFS-12777
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>            Reporter: Virajith Jalaparti
>            Assignee: Virajith Jalaparti
>         Attachments: HDFS-12777-HDFS-9806.001.patch,
> HDFS-12777-HDFS-9806.002.patch, HDFS-12777-HDFS-9806.003.patch,
> HDFS-12777-HDFS-9806.004.patch
>
>
> As opposed to local blocks, each DN keeps track of all blocks in PROVIDED
> storage. This can be millions of blocks for 100s of TBs of PROVIDED data.
> Storing the data for these blocks can lead to a large memory footprint.
> Further, with so many blocks, {{DirectoryScanner}} running on a PROVIDED
> volume can increase the memory and CPU utilization.
> To reduce these overheads, this JIRA aims to (a) disable the
> {{DirectoryScanner}} on PROVIDED volumes (as HDFS-9806 focuses on only
> read-only data in PROVIDED volumes), (b) reduce the space occupied by
> {{FinalizedProvidedReplicaInfo}} by using a common URI prefix across all
> PROVIDED blocks.
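
The prefix sharing described in item (b) of the issue can be sketched roughly as below. This is a minimal illustration under stated assumptions, not the code from the committed patch: the class name PrefixedReplica, its fields and methods, and the example URI are all hypothetical. The idea shown is that every replica on a volume references one shared prefix object and stores only its own suffix, rebuilding the full URI on demand.

```java
import java.net.URI;

/** Hypothetical replica record; not a class from the Hadoop code base. */
final class PrefixedReplica {
  // A single prefix object is referenced by every replica on the volume,
  // so its string is held in memory once rather than once per block.
  private final URI sharedPrefix;
  // Only the part of the location that differs between blocks is stored.
  private final String suffix;
  private final long blockId;

  private PrefixedReplica(long blockId, URI sharedPrefix, String suffix) {
    this.blockId = blockId;
    this.sharedPrefix = sharedPrefix;
    this.suffix = suffix;
  }

  /** Splits a full block URI into the shared prefix and a per-block suffix. */
  static PrefixedReplica of(long blockId, URI sharedPrefix, URI full) {
    String prefix = sharedPrefix.toString();
    String location = full.toString();
    if (!location.startsWith(prefix)) {
      throw new IllegalArgumentException(full + " is not under " + sharedPrefix);
    }
    return new PrefixedReplica(
        blockId, sharedPrefix, location.substring(prefix.length()));
  }

  /** Rebuilds the full URI when it is needed instead of keeping it around. */
  URI getBlockURI() {
    return URI.create(sharedPrefix.toString() + suffix);
  }

  String getSuffix() {
    return suffix;
  }

  long getBlockId() {
    return blockId;
  }
}
```

With millions of blocks under the same remote path, the per-replica cost in this sketch drops from a full URI to a short suffix plus one reference, at the price of a string concatenation each time the full location is requested.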