[
https://issues.apache.org/jira/browse/HDFS-10616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15386358#comment-15386358
]
Daryn Sharp commented on HDFS-10616:
------------------------------------
I chose 2.9 solely because branch-2 and trunk have drifted quite a bit
from 2.7 and earlier. Given the sheer volume of internal optimizations I'm
pushing out (I've barely started), I don't have the time to back-port, but
feel free to pitch in if you like!
> Improve performance of path handling
> ------------------------------------
>
> Key: HDFS-10616
> URL: https://issues.apache.org/jira/browse/HDFS-10616
> Project: Hadoop HDFS
> Issue Type: Bug
> Components: hdfs
> Affects Versions: 2.0.0-alpha
> Reporter: Daryn Sharp
> Assignee: Daryn Sharp
>
> Path handling in the namesystem and directory is very inefficient. The path
> is repeatedly resolved, decomposed into path components, recombined into a
> full path, and parsed again throughout the system. This is directly
> inefficient for general performance, and indirectly inefficient via
> unnecessary pressure on young gen GC. The namesystem should only operate on
> paths, parsing each one once into inodes, and the directory should only
> operate on inodes.
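
As a rough illustration of the "parse once, then operate on inodes" idea in the
description above, here is a minimal sketch. Every name in it (INode,
ResolvedPath, resolve, and so on) is hypothetical and chosen for illustration
only; none of it is the actual HDFS namesystem or directory code.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class ResolveOnceSketch {
    /** Hypothetical inode: a name plus its named children. */
    static final class INode {
        final String name;
        final Map<String, INode> children = new HashMap<>();

        INode(String name) { this.name = name; }

        INode addChild(String childName) {
            INode child = new INode(childName);
            children.put(childName, child);
            return child;
        }
    }

    /** Hypothetical result of a single resolution: components and the inodes found. */
    static final class ResolvedPath {
        final List<String> components;
        final List<INode> inodes; // inodes.get(0) is always the root

        ResolvedPath(List<String> components, List<INode> inodes) {
            this.components = components;
            this.inodes = inodes;
        }

        /** True when every component mapped to an existing inode. */
        boolean isFullyResolved() {
            return inodes.size() == components.size() + 1;
        }

        /** Last inode of a fully resolved path, or null otherwise. */
        INode last() {
            return isFullyResolved() ? inodes.get(inodes.size() - 1) : null;
        }
    }

    /** Namesystem side: split the path string and walk the tree exactly once. */
    static ResolvedPath resolve(INode root, String path) {
        List<String> components = new ArrayList<>();
        for (String part : path.split("/")) {
            if (!part.isEmpty()) {
                components.add(part);
            }
        }
        List<INode> inodes = new ArrayList<>();
        inodes.add(root);
        INode current = root;
        for (String name : components) {
            current = current.children.get(name);
            if (current == null) {
                break; // stop at the first missing component
            }
            inodes.add(current);
        }
        return new ResolvedPath(components, inodes);
    }

    /** Directory side: takes resolved inodes and never re-parses a path string. */
    static int childCount(ResolvedPath resolved) {
        INode target = resolved.last();
        return target == null ? -1 : target.children.size();
    }

    public static void main(String[] args) {
        INode root = new INode("");
        INode home = root.addChild("user").addChild("daryn");
        home.addChild("a.txt");
        home.addChild("b.txt");

        // Resolve once, then hand the same result to every directory call.
        ResolvedPath resolved = resolve(root, "/user/daryn");
        System.out.println("resolved: " + resolved.isFullyResolved());
        System.out.println("children: " + childCount(resolved));
    }
}
```

The point of the sketch is the boundary: the string path is split and walked
in one place, and everything downstream receives the resolved inodes, so no
intermediate path strings or component arrays are rebuilt per call.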
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)