Andrew Wang created HDFS-12534:
----------------------------------

             Summary: Provide logical BlockLocations for EC files for better 
split calculation
                 Key: HDFS-12534
                 URL: https://issues.apache.org/jira/browse/HDFS-12534
             Project: Hadoop HDFS
          Issue Type: Bug
          Components: erasure-coding
    Affects Versions: 3.0.0-beta1
            Reporter: Andrew Wang
            Assignee: Andrew Wang


I talked to [~vanzin] and [~alex.behm] some more about split calculation with 
EC. It turns out HDFS-12222 was resolved prematurely. Applications depend on 
HDFS BlockLocation to understand where the split points are. The current scheme 
of returning one BlockLocation per block group loses this information.

We should change this to provide logical blocks. Divide the file length by the 
block size and provide suitable BlockLocations to match, with virtual offsets 
and lengths too.

I'm not marking this as incompatible, since changing it this way would in fact 
make it more compatible from the perspective of applications that are 
scheduling against replicated files. Thus, it'd be good for beta1 if possible, 
but okay for later too.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: hdfs-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: hdfs-issues-h...@hadoop.apache.org

Reply via email to