shingjan commented on PR #12141:
URL: https://github.com/apache/tvm/pull/12141#issuecomment-1197495761
Local integration test for resnet18/llvm:
```
ID |
Name | FLOP | Weight | Speed (GFLOPS) | Latency (us) | Weighted Latency
(us) | Trials | Terminated
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
0 |
fused_nn_conv2d_add | 12870144 | 1 | 375.4210 | 34.2819 |
34.2819 | 32 | Y
1 |
fused_nn_conv2d_add_1 | 12895232 | 1 | 398.5375 | 32.3564 |
32.3564 | 32 | Y
2 |
fused_nn_conv2d_add_2 | 12945408 | 1 | 464.8020 | 27.8514 |
27.8514 | 32 | Y
3 |
fused_layout_transform | 1 | 1 | 0.0002 | 5.7608 |
5.7608 | 2 | Y
4 |
fused_nn_conv2d_add_nn_relu | 237633536 | 1 | 387.8015 |
612.7711 | 612.7711 | 32 | Y
5 |
fused_nn_max_pool2d | 1806336 | 1 | 157.2717 | 11.4854 |
11.4854 | 32 | Y
6 |
fused_nn_conv2d_add_nn_relu_1 | 231612416 | 2 | 383.7106 |
603.6122 | 1207.2245 | 32 | Y
7 |
fused_nn_conv2d_add_add_nn_relu | 231813120 | 2 | 442.1804 |
524.2501 | 1048.5002 | 32 | Y
8 |
fused_nn_conv2d_add_nn_relu_2 | 115806208 | 1 | 362.1544 |
319.7703 | 319.7703 | 32 | Y
9 |
fused_nn_contrib_conv2d_winograd_without_weight_transform_add_nn_relu |
93227008 | 1 | 293.8712 | 317.2377 | 317.2377 |
32 | Y
10 |
fused_nn_contrib_conv2d_winograd_without_weight_transform_add_add_nn_relu |
93327360 | 2 | 281.1145 | 331.9906 | 663.9812 |
32 | Y
11 |
fused_nn_conv2d_add_nn_relu_3 | 115705856 | 1 | 437.5283 |
264.4534 | 264.4534 | 32 | Y
12 |
fused_nn_contrib_conv2d_winograd_without_weight_transform_add_nn_relu_1 |
98600960 | 1 | 330.2098 | 298.6010 | 298.6010 |
32 | Y
13 |
fused_nn_contrib_conv2d_winograd_without_weight_transform_add_add_nn_relu_1 |
98651136 | 2 | 298.1799 | 330.8444 | 661.6887 |
32 | Y
14 |
fused_nn_conv2d_add_nn_relu_4 | 115655680 | 1 | 381.0609 |
303.5097 | 303.5097 | 32 | Y
15 |
fused_nn_conv2d_add_nn_relu_5 | 231261184 | 1 | 408.4514 |
566.1902 | 566.1902 | 32 | Y
16 |
fused_nn_conv2d_add_add_nn_relu_1 | 231286272 | 2 | 332.2502 |
696.1209 | 1392.2417 | 32 | Y
17 |
fused_nn_adaptive_avg_pool2d | 25600 | 1 | 5.7029 |
4.4890 | 4.4890 | 32 | Y
18 |
fused_layout_transform_reshape_squeeze | 1 | 1 | 0.0003 |
3.6907 | 3.6907 | 1 |
19 |
fused_nn_dense_add | 1025000 | 1 | 161.2829 | 6.3553 |
6.3553 | 32 | Y
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
```
Profiler table:
```
ID | Name | Time (min) | Percentage
----------------------------------------------------------------------------
| Total | 61.9612 | 100.0000
1 | EvoSearch/Evolve/Mutation | 30.6629 | 49.4873
2 | EvoSearch/SampleInitPopulation | 8.4259 | 13.5987
3 | SendToBuilder | 8.4254 | 13.5979
4 | EvoSearch/Evolve/Misc | 6.0477 | 9.7604
5 | EvoSearch/Evolve/PredictNormalizedScore | 3.3436 | 5.3962
6 | SendToRunner | 2.3616 | 3.8115
7 | ApplyHistoryBest | 1.5547 | 2.5091
8 | TaskExtraction | 0.4576 | 0.7386
9 | MeasureCallback/UpdateCostModel | 0.1178 | 0.1901
10 | InitializeTask | 0.1092 | 0.1762
11 | MeasureCallback/AddToDatabase | 0.0181 | 0.0292
12 | EvoSearch/PickWithEpsGreedy | 0.0152 | 0.0245
13 | EvoSearch/PickBestFromDatabase | 0.0149 | 0.0241
14 | MeasureCallback/EchoStatistics | 0.0050 | 0.0081
15 | MeasureCallback/RemoveBuildArtifact | 0.0009 | 0.0015
16 | JoinRunnerFutures | 0.0003 | 0.0005
17 | EvoSearch/Evolve/Misc/CopyMeasuredWorkloads | 0.0000 | 0.0000
----------------------------------------------------------------------------
```
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]