[
https://issues.apache.org/jira/browse/HDFS-7729?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14318358#comment-14318358
]
Kai Zheng commented on HDFS-7729:
---------------------------------
I just took a quick look at the new patch checking how it uses the raw coder.
Some comments:
# Similar to {{blockGroupDataBlocks}}, we can have {{blockGroupParityBlocks}},
instead of {{blockGroupSize}}.
# Better to call this version, encode(ECChunk[], ECChunk[]), which can be more
efficient. Sure we can get this done when change to use block erasure coder as
follow up.
# In {{void encode(ByteBuffer[] buffers)}}, looks like data buffers and parity
buffers are combined together so we need to split. Why we combine them ? For a
certain file, the stripping EC schema is fixed, we can have fixed data and
parity buffers array as members for each time to fill, to prepare for the
encoding.
# We need to call {{release()}} for the coder to release underlying encoding
buffers or resources.
> Add logic to DFSOutputStream to support writing a file in striping layout
> --------------------------------------------------------------------------
>
> Key: HDFS-7729
> URL: https://issues.apache.org/jira/browse/HDFS-7729
> Project: Hadoop HDFS
> Issue Type: Sub-task
> Reporter: Li Bo
> Assignee: Li Bo
> Attachments: Codec-tmp.patch, HDFS-7729-001.patch,
> HDFS-7729-002.patch, HDFS-7729-003.patch, HDFS-7729-004.patch,
> HDFS-7729-005.patch, HDFS-7729-006.patch, HDFS-7729-007.patch,
> HDFS-7729-008.patch
>
>
> If client wants to directly write a file striping layout, we need to add some
> logic to DFSOutputStream. DFSOutputStream needs multiple DataStreamers to
> write each cell of a stripe to a remote datanode.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)