[GitHub] mkolod edited a comment on issue #11325: Added TensorRT runtime integration

GitBox Wed, 11 Jul 2018 14:56:09 -0700

mkolod edited a comment on issue #11325: Added TensorRT runtime integration
URL: https://github.com/apache/incubator-mxnet/pull/11325#issuecomment-404320678
 
 
   @reminisce Regarding your question about the cycle check, I'd like to 
juxtapose two approaches, one of which is the one taken here, and the other is 
more typical but probably less efficient for large graphs.
   
   Here's your graph (starred nodes are TensorRT-compatible, unstarred ones are 
not):
   
   ```
   A* →   D
   ↓        ↓
   B* →  C*
   ```
   
   Approach 1 - naively growing the subgraph and checking for cycles each time
   
   This approach tests adding a new node while greedily growing the subgraph 
and if a cycle is formed while adding a new node, the node is rejected. Taking 
your graph as an example, say we start with A*. Clearly TensorRT only helps 
when there are 2 or more nodes, due to its benefit being mainly due to fusion. 
So, we get (A*, B*) in the set of nodes in the TensorRT graph. This is fine, 
there's no cycle, because the edge from the node replacing the subgraph is from 
B* to C*, not the other way around. So, we have the following:
   
   ```
       ↗    D  
   T*       ↓
       ↘    C*
   ```
   
   Now we try adding C*, to get (A*, B*, C*) in the TensorRT subgraph. Let's 
call this subgraph T*. Now D is the only node outside the subgraph and we have 
a cycle from T* to D, back to T*. This was an illegal contraction, so the last 
step is rejected, and only A* → B* gets substituted with T*. This is in fact 
how TensorRT integration with TensorFlow works, or used to, the last time I 
checked.
   
   
   Approach 2 - two search passes, examining sets of pairs
   
   Instead of the naive subgraph growth, let's enumerate all pairs of nodes 
that are TensorRT-compatible or incompatible. A pair compatible if both nodes 
between the edge are TensorRT-compatible - only then can they go into the 
TensorRT subgraph. 
   
   For the graph above, this would be A* → B* and B* → C*.
   
   A* → D is incompatible because D is incompatible, so the whole pair is 
incompatible.
   Similarly, D → C* is incompatible because D is incompatible.
   
   However, if there is a node which is part of both a compatible pair and an 
incompatible pair, it's no longer a candidate for the TensorRT graph. Such a 
node is A*. That's because even though it's compatible in one pair (A* → B*), 
it's incompatible in another (A* → D). So, the only edge that can be collapsed 
is B* → C* and these two nodes become T*. Now the graph after the rewrite will 
look as follows:
   
   ```
   A*  →   D 
    ↘  T* ↙
   ```
   
   The graph partitioner for this commit takes Approach 2. We will add a unit 
test for this to show that it works.
   
   I hope this clarifies your question as to how we handle cycles.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

[GitHub] mkolod edited a comment on issue #11325: Added TensorRT runtime integration

Reply via email to