[ 
https://issues.apache.org/jira/browse/HDFS-7729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14318358#comment-14318358
 ] 

Kai Zheng commented on HDFS-7729:
---------------------------------

I just took a quick look at the new patch checking how it uses the raw coder. 
Some comments:
# Similar to {{blockGroupDataBlocks}}, we can have {{blockGroupParityBlocks}}, 
instead of {{blockGroupSize}}.
# Better to call this version, encode(ECChunk[], ECChunk[]), which can be more 
efficient. Sure we can get this done when change to use block erasure coder as 
follow up.
# In {{void encode(ByteBuffer[] buffers)}}, looks like data buffers and parity 
buffers are combined together so we need to split. Why we combine them ? For a 
certain file, the stripping EC schema is fixed, we can have fixed data and 
parity buffers array as members for each time to fill, to prepare for the 
encoding.
# We need to call {{release()}} for the coder to release underlying encoding 
buffers or resources.

> Add logic to DFSOutputStream to support writing a file in striping layout 
> --------------------------------------------------------------------------
>
>                 Key: HDFS-7729
>                 URL: https://issues.apache.org/jira/browse/HDFS-7729
>             Project: Hadoop HDFS
>          Issue Type: Sub-task
>            Reporter: Li Bo
>            Assignee: Li Bo
>         Attachments: Codec-tmp.patch, HDFS-7729-001.patch, 
> HDFS-7729-002.patch, HDFS-7729-003.patch, HDFS-7729-004.patch, 
> HDFS-7729-005.patch, HDFS-7729-006.patch, HDFS-7729-007.patch, 
> HDFS-7729-008.patch
>
>
> If client wants to directly write a file striping layout, we need to add some 
> logic to DFSOutputStream.  DFSOutputStream needs multiple DataStreamers to 
> write each cell of a stripe to a remote datanode. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to