[ https://issues.apache.org/jira/browse/HADOOP-14688?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16102183#comment-16102183 ]
Daryn Sharp commented on HADOOP-14688: -------------------------------------- Where does the interning play a meaningful part in the process of the related jira? Haven't dug into the code, but isn't this information typically encoded in a xattr and only transiently exists during a namesystem operation? If yes, the overhead of interning is likely unworth it. Ie. you already have a unique string, why bother with effectively a hash lookup to sub it with another string if the unique instance is being gc'ed soon anyway. > Intern strings in KeyVersion and EncryptedKeyVersion > ---------------------------------------------------- > > Key: HADOOP-14688 > URL: https://issues.apache.org/jira/browse/HADOOP-14688 > Project: Hadoop Common > Issue Type: Improvement > Components: kms > Reporter: Xiao Chen > Assignee: Xiao Chen > Attachments: HADOOP-14688.01.patch > > > This is inspired by [~mi...@cloudera.com]'s work on HDFS-11383. > The key names and key version names are usually the same for a bunch of > {{KeyVersion}} and {{EncryptedKeyVersion}}. We should not create duplicate > objects for them. > This is more important to HDFS-10899, where we try to re-encrypt all files' > EDEKs in a given EZ. Those EDEKs all has the same key name, and mostly using > no more than a couple of key version names. -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: common-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: common-issues-h...@hadoop.apache.org