[ 
https://issues.apache.org/jira/browse/NIFI-4826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16350460#comment-16350460
 ] 

ASF GitHub Bot commented on NIFI-4826:
--------------------------------------

Github user pvillard31 commented on the issue:

    https://github.com/apache/nifi/pull/2440
  
    LGTM, merging to master, thanks @zenfenan 


> ListAzureBlobStorage doesn't write azure.blobname properly
> ----------------------------------------------------------
>
>                 Key: NIFI-4826
>                 URL: https://issues.apache.org/jira/browse/NIFI-4826
>             Project: Apache NiFi
>          Issue Type: Improvement
>          Components: Extensions
>    Affects Versions: 1.2.0, 1.3.0, 1.4.0, 1.5.0
>            Reporter: zenfenan
>            Priority: Minor
>
> ListAzureBlobStorage as of now takes the substring from the blob's primary 
> URI i.e. primaryUri.lastIndexOf('/') + 1 and writes that as azure.blobname. 
> For ex, if the blob is in the path 
> "mystorageaccountname.blob.core.windows.net/container-name/path/to/the/blob". 
> It will write azure.blobname as "blob". So if we have the blob located under 
> a multiple hierarchy directory structure such as the above one, it will be 
> troublesome in the downstream processors like FetchAzureBlobStorage which 
> expects the full blob name to be given i.e. "path/to/the/blob". Giving just 
> "blob" here will fail.
> A workaround that can be followed right now, is to use "ExecuteScript" and 
> get the substring from primary URI i.e. everything after the 
> "https://"+storageAccountName+"/"+containerName+"/";. A better approach would 
> be to make use of the CloudBlob.getName() API provided in Azure SDK. It 
> should be a minor change since we are already using this SDK and the said 
> class in our processor.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to