[ 
https://issues.apache.org/jira/browse/HADOOP-15446?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thomas Marquardt updated HADOOP-15446:
--------------------------------------
    Release Note: WASB: Bug fix to support non-sequential page blob reads.  
Required for HBASE replication.
          Status: Patch Available  (was: Open)

The patch is attached as HADOOP-15446-001.patch.  The patch adds 11 new tests.  
In particular it adds tests that use PageBlobInputStream.skip and 
PageBlobInputStream.seek to perform random reads and validate that page blobs 
now support random reads.  The patch was also validated against an HBASE 
replication scenario.

All WASB test cases are passing against my Azure storage account.  Running in 
IntelliJ, there were 814 tests run and they all passed.  I don't fully grep the 
new parallel test execution options, which seem endless, but all the tests 
passed when I ran the following:

*$ mvn -T 1C -Dparallel-tests clean verify*


Tests run: 252, Failures: 0, Errors: 0, Skipped: 11
Tests run: 1, Failures: 0, Errors: 0, Skipped: 0
Tests run: 418, Failures: 0, Errors: 0, Skipped: 55
Tests run: 126, Failures: 0, Errors: 0, Skipped: 10

 

> WASB: PageBlobInputStream.skip breaks HBASE replication
> -------------------------------------------------------
>
>                 Key: HADOOP-15446
>                 URL: https://issues.apache.org/jira/browse/HADOOP-15446
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: fs/azure
>    Affects Versions: 3.0.2, 2.9.0
>            Reporter: Thomas Marquardt
>            Assignee: Thomas Marquardt
>            Priority: Major
>         Attachments: HADOOP-15446-001.patch
>
>
> Page Blobs are primarily used by HBASE.  HBASE replication, which apparently 
> has not been used with WASB until recently, performs non-sequential reads on 
> log files using PageBlobInputStream.  There are bugs in this stream 
> implementation which prevent skip and seek from working properly, and 
> eventually the stream state becomes corrupt and unusable.
> I believe this bug affects all releases of WASB/HADOOP.  It appears to be a 
> day-0 bug in PageBlobInputStream.  There were similar bugs opened in the past 
> (HADOOP-15042) but the issue was not properly fixed, and no test coverage was 
> added.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to