[ 
https://issues.apache.org/jira/browse/HDFS-4750?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13641309#comment-13641309
 ] 

Brock Noland commented on HDFS-4750:
------------------------------------

Hi Brandon,

Great to see this proposal! I am fine with using a new JIRA for this, but if we 
do so, should HDFS-252 be closed as a duplicate? As you know I created an 
Apache Licensed NFS4 proxy for HDFS 
(https://github.com/cloudera/hdfs-nfs-proxy). I have a couple 
questions/comments:

The proposal says that waiting "10 milliseconds" should be able to convert most 
writes to sequential writes. I am curious has this been tested under load on 
modern kernels? The reason I ask is that I found that often the NFS4 proxy has 
to wait much longer than 10 milliseconds to receive the pre-requisite writes. 
It's possible that behavior is NFS4 only.

Before implementing the NFS4 proxy I implemented a NFS3 proxy as you propose. 
Unfortunately I deleted the git repo when I became frustrated with the mismatch 
between NFS3 and HDFS semantics. If I remember correctly, one example was that 
when I had a small file, a small append resulted in a write of the entire file. 
I cannot remember exactly how it behaved with larger files. Have you 
encountered this? If so, how will it be handled?

Another problem I ran into was that since NFS3 doesn't have a close, I was 
never sure when to close the HDFS file handle. I see that you plan to handle 
this by idle closing file handles. I thought about this approach as well but my 
concern was it will often be the case that there is data which has not been 
"synced" to HDFS when the native program has closed the file. Therefore there 
are races with other clients being able to see that data. I am not 100% up the 
latest of when a file length is updated in HDFS, but I believe there is a 
similar issue with the length metadata as well. How will this be handled?

Once again, great work on the proposal!

Cheers,
Brock
                
> Support NFSv3 interface to HDFS
> -------------------------------
>
>                 Key: HDFS-4750
>                 URL: https://issues.apache.org/jira/browse/HDFS-4750
>             Project: Hadoop HDFS
>          Issue Type: New Feature
>    Affects Versions: 3.0.0
>            Reporter: Brandon Li
>            Assignee: Brandon Li
>         Attachments: HADOOP-NFS-Proposal.pdf
>
>
> Access HDFS is usually done through HDFS Client or webHDFS. Lack of seamless 
> integration with client’s file system makes it difficult for users and 
> impossible for some applications to access HDFS. NFS interface support is one 
> way for HDFS to support such easy integration.
> This JIRA is to track the NFS protocol support for accessing HDFS. With HDFS 
> client, webHDFS and the NFS interface, HDFS will be easier to access and be 
> able support more applications and use cases. 
> We will upload the design document and the initial implementation. 

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to