-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/70462/
-----------------------------------------------------------
(Updated April 18, 2019, 8:51 p.m.)
Review request for atlas, Ashutosh Mestry, Sridhar K, Le Ma, Madhan Neethiraj,
and Sarath Subramanian.
Bugs: ATLAS-3133
https://issues.apache.org/jira/browse/ATLAS-3133
Repository: atlas
Description
-------
Adding support for Process Executions in Atlas
Diffs (updated)
-----
addons/hive-bridge/src/main/java/org/apache/atlas/hive/hook/AtlasHiveHookContext.java
44c6437fe
addons/hive-bridge/src/main/java/org/apache/atlas/hive/hook/HiveHook.java
ee02285df
addons/hive-bridge/src/main/java/org/apache/atlas/hive/hook/events/BaseHiveEvent.java
31346d095
addons/hive-bridge/src/main/java/org/apache/atlas/hive/hook/events/CreateHiveProcess.java
d61f1d74e
addons/hive-bridge/src/main/java/org/apache/atlas/hive/hook/events/CreateTable.java
674a89f6e
addons/hive-bridge/src/main/java/org/apache/atlas/hive/model/HiveDataTypes.java
94010d0cb
addons/hive-bridge/src/test/java/org/apache/atlas/hive/HiveITBase.java
002b90839
addons/hive-bridge/src/test/java/org/apache/atlas/hive/hook/HiveHookIT.java
142e35561
addons/models/0000-Area0/0010-base_model.json 8b41ee89f
addons/models/1000-Hadoop/1030-hive_model.json e96443382
webapp/src/main/java/org/apache/atlas/examples/QuickStartV2.java 6fcea9ed4
Diff: https://reviews.apache.org/r/70462/diff/6/
Changes: https://reviews.apache.org/r/70462/diff/5-6/
Testing
-------
We want to add support for process executions in Atlas. With process
executions, attributes specific to each execution (such as startTime, endTime,
and queryText) are captured in an execution object. In the current Atlas model
this information is lost, because each execution overwrites the previous one.
To solve this, we are creating two new entity definitions and one relationship
definition. The entity definitions are a process execution and a hive process
execution; the hive process execution holds all the attributes that are
specific to each execution. The relationship definition links hive process
executions to a hive process as a many-to-one relation. Lineage is not
affected, since the inputs and outputs are still attached to the hive process.
A hive process can now be thought of as a grouping of multiple executions. No
lineage is shown for a hive process execution. Executions are grouped by the
qualifiedName of the hive process: as long as the qualifiedName of a hive
process remains the same, each execution is mapped to the same hive process.
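
The grouping rule described above can be illustrated with a minimal sketch. It
is not the code in this patch and does not use any Atlas API: the classes
ProcessExecution, HiveProcessGroup, and ProcessExecutionSketch, the method
names, and the sample qualifiedName strings are all hypothetical stand-ins that
use plain Java collections. It only shows how executions with the same
qualifiedName end up under one process while each keeps its own attributes.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Hypothetical stand-in for a hive process execution (not an Atlas class).
// It holds only the per-execution attributes named in the description.
class ProcessExecution {
    final long startTime;
    final long endTime;
    final String queryText;

    ProcessExecution(long startTime, long endTime, String queryText) {
        this.startTime = startTime;
        this.endTime = endTime;
        this.queryText = queryText;
    }
}

// Hypothetical stand-in for a hive process: a grouping of executions.
// In the real model, inputs and outputs (lineage) would stay at this level.
class HiveProcessGroup {
    final String qualifiedName;
    final List<ProcessExecution> executions = new ArrayList<>();

    HiveProcessGroup(String qualifiedName) {
        this.qualifiedName = qualifiedName;
    }
}

class ProcessExecutionSketch {
    // One process per qualifiedName; many executions per process.
    private final Map<String, HiveProcessGroup> processesByQualifiedName =
            new LinkedHashMap<>();

    // Appends the execution to the process with this qualifiedName,
    // creating the process on first use instead of overwriting it.
    HiveProcessGroup recordExecution(String qualifiedName, ProcessExecution execution) {
        HiveProcessGroup process = processesByQualifiedName
                .computeIfAbsent(qualifiedName, HiveProcessGroup::new);
        process.executions.add(execution);
        return process;
    }

    int processCount() {
        return processesByQualifiedName.size();
    }

    public static void main(String[] args) {
        ProcessExecutionSketch sketch = new ProcessExecutionSketch();
        // The qualifiedName values below are made up for illustration.
        sketch.recordExecution("example_process_a",
                new ProcessExecution(1000L, 2000L, "first run of query A"));
        HiveProcessGroup a = sketch.recordExecution("example_process_a",
                new ProcessExecution(3000L, 4000L, "second run of query A"));
        sketch.recordExecution("example_process_b",
                new ProcessExecution(5000L, 6000L, "first run of query B"));

        System.out.println("processes: " + sketch.processCount());
        System.out.println("executions of " + a.qualifiedName + ": "
                + a.executions.size());
    }
}
```

In this sketch the second run of the same query is appended to the existing
group rather than replacing it, which is the behaviour the new model is meant
to provide for startTime, endTime, and queryText.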
Thanks,
Aadarsh Jajodia