[ 
https://issues.apache.org/jira/browse/HADOOP-17990?focusedWorklogId=676987&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-676987
 ]

ASF GitHub Bot logged work on HADOOP-17990:
-------------------------------------------

                Author: ASF GitHub Bot
            Created on: 05/Nov/21 11:40
            Start Date: 05/Nov/21 11:40
    Worklog Time Spent: 10m 
      Work Description: steveloughran commented on pull request #3620:
URL: https://github.com/apache/hadoop/pull/3620#issuecomment-961824234


   had a quick look at the code
   
   1. hadoop trunk has its own VisibleForTesting as we are trying do cut guava 
out of a lot of source (at least isolate it)
   2. see the testing abfs requirements & declare which azure endpoint you ran 
the test suite, ideally including options
   
   example https://github.com/apache/hadoop/pull/3611#issuecomment-961180315


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]


Issue Time Tracking
-------------------

    Worklog Id:     (was: 676987)
    Time Spent: 1h 20m  (was: 1h 10m)

> Failing concurrent FS.initialize commands when 
> fs.azure.createRemoteFileSystemDuringInitialization is enabled on 
> hadoop-azure ABFS
> ----------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: HADOOP-17990
>                 URL: https://issues.apache.org/jira/browse/HADOOP-17990
>             Project: Hadoop Common
>          Issue Type: Bug
>          Components: fs/azure
>    Affects Versions: 3.3.1
>            Reporter: Zamil Majdy
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> *Bug description:*
> When {{fs.azure.createRemoteFileSystemDuringInitialization}} is enabled, the 
> filesystem will create a container if it does not already exist inside the 
> {{initialize}} method. The current flow of creating the container will fail 
> in the case of concurrent {{initialize}} methods being executed 
> simultaneously (only one request can create the container, the rest will fail 
> instead of moving on). This is happen due to the `checkException` method that 
> is not catching the Hadoop `FileAlreadyExists` exception.
> Stacktrace:
> {{Caused by: org.apache.hadoop.fs.FileAlreadyExistsException: Operation 
> failed: "The specified filesystem already exists.", 409, PUT, 
> https://<REDACTED>.dfs.core.windows.net/project?resource=filesystem, 
> FilesystemAlreadyExists, "The specified filesystem already exists. 
> RequestId:<REDACTED> Time:2021-10-18T13:46:05.7504906Z"}}
>  {{ {{at 
> org.apache.hadoop.fs.azurebfs.AzureBlobFileSystem.checkException(AzureBlobFileSystem.java:1182)}}}}
>  {{ {{at 
> org.apache.hadoop.fs.azurebfs.AzureBlobFileSystem.createFileSystem(AzureBlobFileSystem.java:1067)}}}}
>  {{ {{at 
> org.apache.hadoop.fs.azurebfs.AzureBlobFileSystem.initialize(AzureBlobFileSystem.java:126)}}}}
>  {{ {{at 
> org.apache.hadoop.fs.FileSystem.createFileSystem(FileSystem.java:2669)}}}}
> *To reproduce:*
>  * Set `fs.azure.createRemoteFileSystemDuringInitialization` to `true`
>  * Run two concurrent `initialize` commands with the root to the non existing 
> container/filesystem.
>  
> *Proposed fix:*
> [https://github.com/apache/hadoop/pull/3620]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to