[ 
https://issues.apache.org/jira/browse/HADOOP-17853?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17405791#comment-17405791
 ] 

Sneha Vijayarajan commented on HADOOP-17853:
--------------------------------------------

Hi [[email protected]],

Feature update in ABFS driver is to pass Http Request URL, queryparams and 
headers to APIs exposed by a maven artifact that takes up the responsibility to 
effectively use the feature path for read. In addition to the changes to 
redirect the requests, a class which holds on to the session details will also 
be added. The feature will be in dormant state and will have to be enabled 
consciously in environments that are suitable for the feature. One such 
requirements is that the Hadoop cluster needs to be on Azure VM.

The test update does not change or reduce the existing tests that run over 
REST. The read related tests for the feature are triggered separately for the 
respective test scenario.  Considering devs on ABFS driver need not have access 
to feature enabled environment, all the read tests validating the feature will 
run even if the feature flag is off and will rely on test mock code to 
replicate feature flow. This will ensure that future checkins by any dev will 
not break REST or the feature path.

On an actual store connected environment, close to 500 looped test runs have 
been completed successfully with no functional failures. This test run count is 
post the feature code freeze in ABFS driver.

Counters that IOStatistics collects remain valid while the feature is 
exercised. 

If the feature were to be enabled on an environment that isnt suitable to 
utilize it, request will fallback to REST and for the lifetime of that 
InputStream instance reads will only be attempted on REST. Fallbacks are in 
place even for the case should feature hit an irrecoverable issue. Hence 
callers are not exposed to any new error scenarios than what already exists.

Store connectivity over REST will continue to be a workflow that we optimize 
and support. 

Asserts in all the new tests added will use AssertJ.

I have closed the PR that I had initially linked to this JIRA. The changes 
included few test fixes and some refactoring needed for the feature as well. 
[~sumangala] has split the test fixes that we had to 4 PRs. Code updates that 
intent to refactor and make it easy to add the feature are split into 2 
different PRs. JIRAs of those PRs have been added here as children. Once these 
get checked in, we are left with just 1 PR that brings in the feature code.

The pending change is as atomic as possible for the feature. Committing it to a 
separate branch will need us to commit to trunk subsequently with no more add 
ons from the driver. Hence I would prefer to raise the PR on trunk directly.

 

> ABFS: Enable optional store connectivity over azure specific protocol for 
> data egress
> -------------------------------------------------------------------------------------
>
>                 Key: HADOOP-17853
>                 URL: https://issues.apache.org/jira/browse/HADOOP-17853
>             Project: Hadoop Common
>          Issue Type: Sub-task
>          Components: fs/azure
>    Affects Versions: 3.4.0
>            Reporter: Sneha Vijayarajan
>            Assignee: Sneha Vijayarajan
>            Priority: Major
>          Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> This Jira is to provide an option to enable store access on read path over an 
> Azure specific protocol. This will only work on Azure VMs and hence will be 
> disabled by default.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to