[ 
https://issues.apache.org/jira/browse/NIFI-4971?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16451426#comment-16451426
 ] 

ASF GitHub Bot commented on NIFI-4971:
--------------------------------------

Github user ijokarumawak commented on the issue:

    https://github.com/apache/nifi/pull/2542
  
    @MikeThomsen I may have missed your points, but let me respond to your 
comments.
    
    Regarding the lineage graphs showing two outgoing links from /tmp/in/test 
to two `GetFile, PutFile` processes going to the final /temp/out/test, I assume 
your concern is that there are two processes. Can you share the `qualified 
name` attributes of those two `nifi_flow_path` entities? I guess they have 
different qualified names, and that one was created by 'simple path' and the 
other by 'complete path', if you ran the same flow with both strategies.
    
    As for not seeing any lineage with 'simple_path': Atlas does not draw 
lineage when the selected entity is a subclass of 'Process'. When you 
selected the 'nifi_flow_path' entity, didn't its inputs/outputs attributes 
have links to the `fs_path` entities? If you follow one of those links, the 
lineage will be shown from the `fs_path` entity, which is a subclass of 
'DataSet'.
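    
    The navigation described above can be sketched with a toy model. This is 
only an illustration and not the Atlas API: the `Entity` class, its 
`super_type` field, and the `lineage_entry_points` function are hypothetical 
names, and the entity values are taken from the example in this thread.

```python
from dataclasses import dataclass, field


@dataclass
class Entity:
    """Toy stand-in for an Atlas entity (hypothetical, not the Atlas API)."""
    name: str
    type_name: str
    super_type: str  # 'Process' or 'DataSet'
    inputs: list = field(default_factory=list)
    outputs: list = field(default_factory=list)


def lineage_entry_points(entity):
    """Return the entities from which a lineage graph can be opened."""
    if entity.super_type == "DataSet":
        return [entity]
    # For a Process, follow its inputs/outputs links to the DataSet entities.
    linked = entity.inputs + entity.outputs
    return [e for e in linked if e.super_type == "DataSet"]


file_in = Entity("/tmp/in/test", "fs_path", "DataSet")
file_out = Entity("/temp/out/test", "fs_path", "DataSet")
flow_path = Entity("GetFile, PutFile", "nifi_flow_path", "Process",
                   inputs=[file_in], outputs=[file_out])

entry_names = [e.name for e in lineage_entry_points(flow_path)]
print(entry_names)
```

    In this sketch, a `nifi_flow_path` whose inputs/outputs are empty yields 
no entry points, which corresponds to the case where no lineage can be reached 
from the flow path.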
    
    Please let me know whether the descriptions above address your issues. Thanks!


> ReportLineageToAtlas 'complete path' strategy can miss one-time lineages
> ------------------------------------------------------------------------
>
>                 Key: NIFI-4971
>                 URL: https://issues.apache.org/jira/browse/NIFI-4971
>             Project: Apache NiFi
>          Issue Type: Bug
>          Components: Extensions
>    Affects Versions: 1.5.0
>            Reporter: Koji Kawamura
>            Assignee: Koji Kawamura
>            Priority: Major
>
> For the simplest example, take GetFlowFile (GFF) -> PutFlowFile (PFF), where 
> GFF gets files and PFF saves those files into a different directory. The 
> following provenance events will be generated:
>  # GFF RECEIVE file1
>  # PFF SEND file2
> From the above provenance events, the following entities and lineages should 
> be created in Atlas (labels in brackets are Atlas type names):
> {code}
> file1 (fs_path) -> GFF, PFF (nifi_flow_path) -> file2 (fs_path)
> {code}
> The entities shown in the above graph are created. However, the 
> 'nifi_flow_path' entity does not have inputs/outputs referencing 'fs_path', 
> so the lineage cannot be seen in the Atlas UI.
> This issue was discovered by [~nayakmahesh616]



