[ 
https://issues.apache.org/jira/browse/HDFS-5851?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13973462#comment-13973462
 ] 

Andrew Wang commented on HDFS-5851:
-----------------------------------

I'd really like to integrate this with HDFS-4949 where possible. One concern is 
carving yet another pool of memory off from the cluster. HDFS-4949's cache 
pools were designed to eventually integrate with YARN, but this proposal might 
introduce a separate pool with its own memory quota, putting us back in the 
same place.

There are also plans to move towards sub-block caching. Whole-block caching is 
wasteful for columnar formats like ORC and Parquet. With sub-block caching, 
automatic cache replacement (another planned feature) looks a lot more 
attractive. We can support both with HDFS-4949's infrastructure. I'm not sure 
the same is true of HSM.

It'd also be nice if apps could do zero-copy reads (ZCR) of these memory-only 
replicas, ideally reusing the existing auto-ZCR infrastructure.

Anyway, whether we pull this towards HDFS-4949 or vice versa, we should figure 
out these details before moving ahead. I'll echo Colin's desire for a meeting 
to discuss this. We're willing to host at our Palo Alto office.

> Support memory as a storage medium
> ----------------------------------
>
>                 Key: HDFS-5851
>                 URL: https://issues.apache.org/jira/browse/HDFS-5851
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>          Components: datanode
>    Affects Versions: 3.0.0
>            Reporter: Arpit Agarwal
>            Assignee: Arpit Agarwal
>
> Memory can be used as a storage medium for smaller/transient files for fast 
> write throughput.
> More information/design will be added later.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
