[
https://issues.apache.org/jira/browse/HDFS-10616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15375423#comment-15375423
]
Daryn Sharp commented on HDFS-10616:
------------------------------------
Will be an umbrella for sub-tasks to incrementally integrate large internal
patches. In combination with other internal changes (forthcoming IPC
optimizations, other object allocation reductions), heap growth has
dramatically slowed.
> Improve performance of path handling
> ------------------------------------
>
> Key: HDFS-10616
> URL: https://issues.apache.org/jira/browse/HDFS-10616
> Project: Hadoop HDFS
> Issue Type: Bug
> Components: hdfs
> Affects Versions: 2.0.0-alpha
> Reporter: Daryn Sharp
> Assignee: Daryn Sharp
>
> Path handling in the namesystem and directory is very inefficient. The path
> is repeatedly resolved, decomposed into path components, recombined to a full
> path. parsed again, throughout the system. This is directly inefficient for
> general performance, and indirectly via unnecessary pressure on young gen GC.
> The namesystem should only operate on paths, parse it once into inodes, and
> the directory should only operate on inodes.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]