FrozenGene commented on issue #7563: URL: https://github.com/apache/tvm/issues/7563#issuecomment-796780183
> > @guyzsarun There is no mxnet model attachment. So I can not reproduce your issue. > > @FrozenGene You can use [this model](https://drive.google.com/drive/folders/1yGXAKebez7R_ZMjWWxaSBD5ixB68GUyR?usp=sharing) as already reported in [TVM discuss](https://discuss.tvm.apache.org/t/bug-performance-drop-with-batch-and-opt-level-3/9193/2). I could reproduce this issue use the model mentioned. Besides `CanonicalizeOps`, if we just disable `AlterOpLayout` we could pass it too. If we set the target be `cuda`, we could pass too. The difference I could think of is when we set target be `llvm` and turn on ``AlterOpLayout`, the layout is `NCHWc`. Maybe we trigger some bug of `NCHWc` layout when batch size is not 1. ---------------------------------------------------------------- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. For queries about this service, please contact Infrastructure at: [email protected]
