Andrew Wang created HDFS-12534:
----------------------------------
Summary: Provide logical BlockLocations for EC files for better
split calculation
Key: HDFS-12534
URL: https://issues.apache.org/jira/browse/HDFS-12534
Project: Hadoop HDFS
Issue Type: Bug
Components: erasure-coding
Affects Versions: 3.0.0-beta1
Reporter: Andrew Wang
Assignee: Andrew Wang
I talked to [~vanzin] and [~alex.behm] some more about split calculation with
erasure-coded (EC) files. It turns out HDFS-12222 was resolved prematurely.
Applications depend on HDFS BlockLocations to understand where the split points
are. The current scheme of returning one BlockLocation per block group loses
this information.
We should change this to provide logical blocks: divide the file length by the
block size and return one BlockLocation per resulting logical block, each with
a virtual offset and length.
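
As a rough illustration of the arithmetic only (not the actual patch), the
sketch below divides a file length by a block size and emits one
{offset, length} pair per logical block. The class and method names
(LogicalBlocks, logicalRanges) are hypothetical, and the sketch deliberately
leaves out how hosts would be assigned to each logical block, since that is not
settled here.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper for illustration; not part of the HDFS code base.
final class LogicalBlocks {

    /**
     * Returns one {offset, length} pair per logical block of a file.
     * Every block is blockSize bytes long except possibly the last one.
     */
    static List<long[]> logicalRanges(long fileLength, long blockSize) {
        if (fileLength < 0 || blockSize <= 0) {
            throw new IllegalArgumentException(
                "fileLength must be >= 0 and blockSize must be > 0");
        }
        List<long[]> ranges = new ArrayList<>();
        for (long offset = 0; offset < fileLength; offset += blockSize) {
            // The final logical block may be shorter than blockSize.
            long length = Math.min(blockSize, fileLength - offset);
            ranges.add(new long[] {offset, length});
        }
        return ranges;
    }

    public static void main(String[] args) {
        long blockSize = 128L * 1024 * 1024;   // assumed 128 MiB block size
        long fileLength = 300L * 1024 * 1024;  // assumed 300 MiB file
        for (long[] r : logicalRanges(fileLength, blockSize)) {
            System.out.println("offset=" + r[0] + " length=" + r[1]);
        }
    }
}
```

With these virtual ranges, an application computing splits would see the same
block boundaries it would see for a replicated file of the same length and
block size.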
I'm not marking this as incompatible, since changing it this way would in fact
make it more compatible from the perspective of applications that are
scheduling against replicated files. Thus, it'd be good for beta1 if possible,
but okay for later too.