[ 
https://issues.apache.org/jira/browse/HDFS-8401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14563531#comment-14563531
 ] 

Arpit Agarwal commented on HDFS-8401:
-------------------------------------

bq. The administrator simply set a list of files and directories to be cached. 
When applications read those files or directories, they were retrieved from the 
cache.
It's impractical to involve the administrator every time a new file is to be 
cached. We've heard this requirement makes caching difficult to use. There are 
a couple of other things can help with usability e.g. de-duplication of cache 
directives, predictability of cache locality.

bq. If we just want a generic FS caching layer in Hadoop, we could do that in 
hadoop-common.
That was my intention. I'll move the jira to common.

> Memfs - a layered file system for in-memory storage in HDFS
> -----------------------------------------------------------
>
>                 Key: HDFS-8401
>                 URL: https://issues.apache.org/jira/browse/HDFS-8401
>             Project: Hadoop HDFS
>          Issue Type: Bug
>            Reporter: Arpit Agarwal
>            Assignee: Arpit Agarwal
>
> We propose creating a layered filesystem that can provide in-memory storage 
> using existing features within HDFS. memfs will use lazy persist writes 
> introduced by HDFS-6581. For reads, memfs can use the Centralized Cache 
> Management feature introduced in HDFS-4949 to load hot data to memory.
> Paths in memfs and hdfs will correspond 1:1 so memfs will require no 
> additional metadata and it can be implemented entirely as a client-side 
> library.
> The advantage of a layered file system is that it requires little or no 
> changes to existing applications. e.g. Applications can use something like 
> {{memfs://}} instead of {{hdfs://}} for files targeted to memory storage. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to