mkolod edited a comment on issue #11325: Added TensorRT runtime integration
URL: https://github.com/apache/incubator-mxnet/pull/11325#issuecomment-404320678
 
 
   @reminisce Regarding your question about the cycle check, I'd like to 
juxtapose two approaches, one of which is the one taken here, and the other is 
more typical but probably less efficient for large graphs.
   
   Here's your graph (starred nodes are TensorRT-compatible, unstarred ones are 
not):
   
   ```
   A* →   D
   ↓          ↓
   B* →  C*
   ```
   
   Approach 1 - naively growing the subgraph and checking for cycles each time
   
   This approach tests adding a new node and if a cycle is formed, the node is 
rejected. Taking your graph as an example, say we start with A*. Clearly 
TensorRT only helps when there are 2 or more nodes, due to its benefit being 
mainly due to fusion. So, we get (A*, B*) in the set of nodes in the TensorRT 
graph. This is fine, there's no cycle, because the edge is from B* to C*, not 
the other way around. Now we try adding C*, to get (A*, B*, C*) in the TensorRT 
subgraph. Let's call this subgraph T*. Now D is the only node outside the 
subgraph and we have a cycle from T* to D, back to T*. This was an illegal 
contraction, so the last step is rejected, and only A* → B* gets substituted 
with T*. This is in fact how TensorRT integration with TensorFlow works, or 
used to, the last time I checked.
   
   
   Approach 2 - two search passes, examining sets of pairs
   
   Instead of the naive subgraph growth, let's enumerate all pairs of nodes 
that are TensorRT-compatible or incompatible. A pair compatible if both nodes 
between the edge are TensorRT-compatible - only then can they go into the 
TensorRT subgraph. 
   
   For the graph above, this would be A* → B* and B* → C*.
   
   A* → D is incompatible because D is incompatible, so the whole pair is 
incompatible.
   Similarly, D → C* is incompatible because D is incompatible.
   
   However, if there is a node which is part of both a compatible pair and an 
incompatible pair, it's no longer a candidate for the TensorRT graph. Such a 
node is A*. That's because even though it's compatible in one pair (A* → B*), 
it's incompatible in another (A* → D). So, the only edge that can be collapsed 
is B* → C* and these two nodes become T*. Now the graph after the rewrite will 
look as follows:
   
   ```
   A*  →   D 
    ↘  T* ↙
   ```
   
   The graph partitioner for this commit takes Approach 2. We will add a unit 
test for this to show that it works.
   
   I hope this clarifies your question as to how we handle cycles.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

Reply via email to