> On Oct. 16, 2019, 4:28 p.m., Anand Patil wrote:
> > addons/models/1000-Hadoop/1111-ml_model.json
> > Lines 10 (patched)
> > <https://reviews.apache.org/r/71619/diff/1/?file=2169131#file2169131line10>
> >
> >     Why make project a subtype of dataset?

Atlas server has a lot of code that build lineage based on Process and DataSet 
types

1) Process only handle types derives from `DataSet`
   {
      "name": "Process",
      "superTypes": [
        "Asset"
      ],
      "serviceType": "atlas_core",
      "typeVersion": "1.1",
      "attributeDefs": [
        {
          "name": "inputs",
          "typeName": "array<DataSet>",
          "cardinality": "SET",
          "isIndexable": false,
          "isOptional": true,
          "isUnique": false
        },
        {
          "name": "outputs",
          "typeName": "array<DataSet>",
          "cardinality": "SET",
          "isIndexable": false,
          "isOptional": true,
          "isUnique": false
        }
      ]
    }
    
2) Lineage is only built between Process type and DataSet type. 
https://github.com/apache/atlas/blob/master/repository/src/main/java/org/apache/atlas/discovery/EntityLineageService.java#L117
3) If we don't want to re-implement the lineage in Atlas, it is better to 
deriver action type from Process type, and data type from DataSet.


> On Oct. 16, 2019, 4:28 p.m., Anand Patil wrote:
> > addons/models/1000-Hadoop/1111-ml_model.json
> > Lines 51 (patched)
> > <https://reviews.apache.org/r/71619/diff/1/?file=2169131#file2169131line51>
> >
> >     businessUseCase, modelFramework, modelAlgorithms, githubRepoURL, 
> > notebookURL and resourceURL feel prematurely opinionated to me. Some 
> > projects will not use all these fields, and for some projects fields other 
> > than these will probably be more important.
> >     
> >     Would it be possible in Atlas to move these to a single "metadata" 
> > attribute whose value is key-value pairs? We can then move to stronger 
> > typing as common patterns emerge.

replace those attributes by

        {
          "name": "metadata",
          "description": "Contains key-value pairs that provide project 
metadata",
          "typeName": "map<string,string>",
          "cardinality": "SINGLE",
          "isIndexable": false,
          "isOptional": true,
          "isUnique": false
        }


- Na


-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/71619/#review218231
-----------------------------------------------------------


On Oct. 16, 2019, 12:30 a.m., Na Li wrote:
> 
> -----------------------------------------------------------
> This is an automatically generated e-mail. To reply, visit:
> https://reviews.apache.org/r/71619/
> -----------------------------------------------------------
> 
> (Updated Oct. 16, 2019, 12:30 a.m.)
> 
> 
> Review request for atlas, Austin Nobis, Ashutosh Mestry, Karthik Manamcheri, 
> Sridhar K, Madhan Neethiraj, and Sarath Subramanian.
> 
> 
> Bugs: atlas-3464
>     https://issues.apache.org/jira/browse/atlas-3464
> 
> 
> Repository: atlas
> 
> 
> Description
> -------
> 
> Define entities used for Machine Learning Governance
> 
> 
> Diffs
> -----
> 
>   addons/models/1000-Hadoop/1111-ml_model.json PRE-CREATION 
> 
> 
> Diff: https://reviews.apache.org/r/71619/diff/1/
> 
> 
> Testing
> -------
> 
> verified it is valid json file
> 
> 
> Thanks,
> 
> Na Li
> 
>

Reply via email to