-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/74713/
-----------------------------------------------------------

(Updated Nov. 3, 2023, 5:35 a.m.)


Review request for atlas, Jayendra Parab, Pinal Shah, and Sidharth Mishra.


Bugs: ATLAS-4803
    https://issues.apache.org/jira/browse/ATLAS-4803


Repository: atlas


Description
-------

Kafka lag was not decreasing for ATLAS_HOOK topics, create Entity API was 
taking 50-60 sec per request.

Hive_table count was 10mn record.

Impala_lineage_column count was 26mn count.

Able to reproduce the issue.

Metrics


This difference exists because earlier even fromVertex did not have any edges, 
the search would iterate through all the edges of the toVertex and timeConsume 
was high. This condition is detected in the algorithm and used to reduce the 
search cycle.


Before: "getRelationshipEdge":{"count":100000,"timeTaken":50000}
After removing if condition for toVertex.hasEdge: 
"getRelationshipEdge":{"count":100000,"timeTaken":80}


Diffs
-----

  
graphdb/janus/src/main/java/org/apache/atlas/repository/graphdb/janus/AtlasJanusGraph.java
 0dd573b89 
  
repository/src/main/java/org/apache/atlas/repository/store/graph/v2/AtlasRelationshipStoreV2.java
 ef0313e02 


Diff: https://reviews.apache.org/r/74713/diff/1/


Testing
-------

Volume testing
Initiate kafka dump and lag started decreasing.


Thanks,

Paresh Devalia

Reply via email to