-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/70462/
-----------------------------------------------------------

(Updated April 18, 2019, 8:51 p.m.)


Review request for atlas, Ashutosh Mestry, Sridhar K, Le Ma, Madhan Neethiraj, 
and Sarath Subramanian.


Bugs: ATLAS-3133
    https://issues.apache.org/jira/browse/ATLAS-3133


Repository: atlas


Description
-------

Adding support for Process Executions in Atlas


Diffs (updated)
-----

  
addons/hive-bridge/src/main/java/org/apache/atlas/hive/hook/AtlasHiveHookContext.java
 44c6437fe 
  addons/hive-bridge/src/main/java/org/apache/atlas/hive/hook/HiveHook.java 
ee02285df 
  
addons/hive-bridge/src/main/java/org/apache/atlas/hive/hook/events/BaseHiveEvent.java
 31346d095 
  
addons/hive-bridge/src/main/java/org/apache/atlas/hive/hook/events/CreateHiveProcess.java
 d61f1d74e 
  
addons/hive-bridge/src/main/java/org/apache/atlas/hive/hook/events/CreateTable.java
 674a89f6e 
  
addons/hive-bridge/src/main/java/org/apache/atlas/hive/model/HiveDataTypes.java 
94010d0cb 
  addons/hive-bridge/src/test/java/org/apache/atlas/hive/HiveITBase.java 
002b90839 
  addons/hive-bridge/src/test/java/org/apache/atlas/hive/hook/HiveHookIT.java 
142e35561 
  addons/models/0000-Area0/0010-base_model.json 8b41ee89f 
  addons/models/1000-Hadoop/1030-hive_model.json e96443382 
  webapp/src/main/java/org/apache/atlas/examples/QuickStartV2.java 6fcea9ed4 


Diff: https://reviews.apache.org/r/70462/diff/6/

Changes: https://reviews.apache.org/r/70462/diff/5-6/


Testing
-------

We want to add support for Process Executions in Atlas. With the help of 
process executions attributes specific to each execution(like startTime, 
endTime, queryText) will be captured in the execution object. With the current 
model of Atlas this is lost since each execution overrides the previous one. To 
solve this problem. We are creating 2 new entity definitions and one relations 
definition. We create an entity definition called a process execution and 
another one called hive process execution. The hive process execution has all 
the attributes which are specific to each execution. We also create a 
relationship defintion between a hive process and a hive process execution as a 
many to one relation. The lineage does not get affected in this since the 
inputs and outputs are still attached to the hive process. Hive Process can now 
be thought of as a grouping of multiple executions. There is no lineage shown 
for a hive process execution. The criteria for grouping is based on the qualifie
 dName of a Hive Process. As long as the qualified Name of a hive process 
remains the same, each execution gets mapped to the same hive process.


Thanks,

Aadarsh Jajodia

Reply via email to