[
https://issues.apache.org/jira/browse/HDFS-6690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrew Wang updated HDFS-6690:
------------------------------
Attachment: hdfs-6690.001.patch
Patch attached. I looked at some heap dumps to verify. With an image with 10k
xattrs with 15KB names, I saw 307MB of heap dominated by XAttr. With this
change, I see a total heap of 27MB, so much improved.
> Deduplicate xattr names in memory
> ---------------------------------
>
> Key: HDFS-6690
> URL: https://issues.apache.org/jira/browse/HDFS-6690
> Project: Hadoop HDFS
> Issue Type: Improvement
> Components: namenode
> Affects Versions: 2.5.0
> Reporter: Andrew Wang
> Assignee: Andrew Wang
> Attachments: hdfs-6690.001.patch
>
>
> When the same string is used repeatedly for an xattr name, we could
> potentially save some NN memory by deduplicating the strings.
--
This message was sent by Atlassian JIRA
(v6.2#6252)