[ 
https://issues.apache.org/jira/browse/HDFS-10616?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15386358#comment-15386358
 ] 

Daryn Sharp commented on HDFS-10616:
------------------------------------

I chose 2.9 solely because branch-2 and trunk have drifted often quite a bit 
from 2.7 and earlier.  Given the sheer volume of internal optimizations I'm 
pushing out (I've barely started), I don't have the time to back-port but feel 
free to pitch in if you like!

> Improve performance of path handling
> ------------------------------------
>
>                 Key: HDFS-10616
>                 URL: https://issues.apache.org/jira/browse/HDFS-10616
>             Project: Hadoop HDFS
>          Issue Type: Bug
>          Components: hdfs
>    Affects Versions: 2.0.0-alpha
>            Reporter: Daryn Sharp
>            Assignee: Daryn Sharp
>
> Path handling in the namesystem and directory is very inefficient.  The path 
> is repeatedly resolved, decomposed into path components, recombined to a full 
> path. parsed again, throughout the system.  This is directly inefficient for 
> general performance, and indirectly via unnecessary pressure on young gen GC.
> The namesystem should only operate on paths, parse it once into inodes, and 
> the directory should only operate on inodes.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to