This is an automated email from the ASF dual-hosted git repository. jxie pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/incubator-mxnet.git
The following commit(s) were added to refs/heads/master by this push: new db4de44 Update performance page. (#10876) db4de44 is described below commit db4de44f170df08a1500feabccb55a63266d7ae2 Author: Da Zheng <zhengda1...@gmail.com> AuthorDate: Tue May 15 11:23:32 2018 -0700 Update performance page. (#10876) * Update perf. * add float16. * update header. --- docs/faq/perf.md | 56 ++++++++++++++++++++++++++++++++++++++++---------------- 1 file changed, 40 insertions(+), 16 deletions(-) diff --git a/docs/faq/perf.md b/docs/faq/perf.md index ce74391..ad81b5d 100644 --- a/docs/faq/perf.md +++ b/docs/faq/perf.md @@ -36,7 +36,7 @@ to measure the performance on different AWS EC2 machines. AWS EC2 C5.18xlarge: -| Batch | Alexnet | VGG | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | +| Batch | Alexnet | VGG 16 | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | |-------|---------|--------|--------------|--------------|-----------|------------| | 1 | 390.53 | 81.57 | 124.13 | 62.26 | 76.22 | 32.92 | | 2 | 596.45 | 100.84 | 206.58 | 93.36 | 119.55 | 46.80 | @@ -48,7 +48,7 @@ AWS EC2 C5.18xlarge: AWS EC2 C5.9xlarge: -| Batch | Alexnet | VGG | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | +| Batch | Alexnet | VGG 16 | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | |-------|---------|-------|--------------|--------------|-----------|------------| | 1 | 257.77 | 50.61 | 130.99 | 66.95 | 75.38 | 32.33 | | 2 | 410.60 | 63.02 | 195.14 | 87.84 | 102.67 | 41.57 | @@ -60,7 +60,7 @@ AWS EC2 C5.9xlarge: AWS EC2 C5.4xlarge: -| Batch | Alexnet | VGG | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | +| Batch | Alexnet | VGG 16 | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | |-------|---------|-------|--------------|--------------|-----------|------------| | 1 | 214.15 | 29.32 | 114.97 | 47.96 | 61.01 | 23.92 | | 2 | 310.04 | 34.81 | 150.09 | 60.89 | 71.16 | 27.92 | @@ -72,7 +72,7 @@ AWS EC2 C5.4xlarge: AWS EC2 C5.2xlarge: -| Batch | Alexnet | VGG | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | +| Batch | Alexnet | VGG 16 | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | |-------|---------|-------|--------------|--------------|-----------|------------| | 1 | 131.01 | 15.67 | 78.75 | 31.12 | 37.30 | 14.75 | | 2 | 182.29 | 18.01 | 98.59 | 39.13 | 45.98 | 17.84 | @@ -84,7 +84,7 @@ AWS EC2 C5.2xlarge: AWS EC2 C5.xlarge: -| Batch | Alexnet | VGG | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | +| Batch | Alexnet | VGG 16 | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | |-------|---------|------|--------------|--------------|-----------|------------| | 1 | 36.64 | 3.93 | 27.06 | 10.09 | 12.98 | 5.06 | | 2 | 49.21 | 4.49 | 29.67 | 10.80 | 12.94 | 5.14 | @@ -117,7 +117,7 @@ and [MXNet-1.2.0.rc1](https://github.com/apache/incubator-mxnet/releases/downlo - K80 (single GPU) -| Batch | Alexnet | VGG | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | +| Batch | Alexnet | VGG 16 | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | |-------|---------|--------|--------------|--------------|-----------|------------| | 1 | 243.93 | 43.59 | 68.62 | 35.52 | 67.41 | 23.65 | | 2 | 338.16 | 49.14 | 113.41 | 56.29 | 93.35 | 33.88 | @@ -125,10 +125,13 @@ and [MXNet-1.2.0.rc1](https://github.com/apache/incubator-mxnet/releases/downlo | 8 | 683.52 | 70.50 | 190.49 | 86.23 | 131.32 | 50.54 | | 16 | 1004.66 | 109.01 | 254.20 | 105.70 | 155.40 | 62.55 | | 32 | 1238.55 | 114.98 | 285.49 | 116.79 | 159.42 | 64.99 | +| 64 | 1346.72 | 123.56 | 308.73 | 122.21 | 167.58 | 70.21 | +| 128 | 1416.91 | OOM | 320.98 | 123.11 | 171.55 | 71.85 | +| 256 | 1462.97 | OOM | 329.16 | 127.53 | 153.01 | 57.23 | - M60 -| Batch | Alexnet | VGG | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | +| Batch | Alexnet | VGG 16 | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | |-------|---------|--------|--------------|--------------|-----------|------------| | 1 | 243.49 | 59.95 | 101.97 | 48.30 | 95.46 | 39.29 | | 2 | 491.04 | 69.14 | 170.35 | 80.27 | 142.61 | 60.17 | @@ -136,19 +139,38 @@ and [MXNet-1.2.0.rc1](https://github.com/apache/incubator-mxnet/releases/downlo | 8 | 1077.73 | 109.34 | 343.42 | 152.82 | 208.74 | 87.27 | | 16 | 1447.21 | 144.93 | 390.25 | 166.32 | 220.73 | 92.41 | | 32 | 1797.66 | 151.86 | 416.69 | 176.56 | 230.19 | 97.03 | +| 64 | 1779.38 | 150.18 | 427.51 | 183.47 | 239.12 | 101.59 | +| 128 | 1787.36 | OOM | 439.04 | 185.29 | 243.31 | 103.39 | +| 256 | 1899.10 | OOM | 450.22 | 183.42 | 242.36 | 100.98 | - V100 -| Batch | Alexnet | VGG | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | +| Batch | Alexnet | VGG 16 | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | |-------|---------|--------|--------------|--------------|-----------|------------| -| 1 | 659.51 | 205.16 | 136.91 | 76.54 | 162.15 | 61.38 | -| 2 | 1248.21 | 265.40 | 261.85 | 144.23 | 293.74 | 116.30 | -| 4 | 2122.41 | 333.97 | 477.22 | 270.03 | 479.14 | 195.17 | -| 8 | 3894.30 | 420.26 | 831.09 | 450.68 | 699.39 | 294.19 | -| 16 | 5815.58 | 654.16 | 1332.26 | 658.97 | 947.45 | 398.79 | -| 32 | 7906.09 | 708.43 | 1784.23 | 817.33 | 1076.81 | 451.82 | - +| 1 | 659.51 | 205.16 | 157.37 | 87.71 | 162.15 | 61.38 | +| 2 | 1248.21 | 265.40 | 297.34 | 159.24 | 293.74 | 116.30 | +| 4 | 2122.41 | 333.97 | 520.91 | 279.84 | 479.14 | 195.17 | +| 8 | 3894.30 | 420.26 | 898.09 | 455.03 | 699.39 | 294.19 | +| 16 | 5815.58 | 654.16 | 1430.97 | 672.54 | 947.45 | 398.79 | +| 32 | 7906.09 | 708.43 | 1847.26 | 814.59 | 1076.81 | 451.82 | +| 64 | 9486.26 | 701.59 | 2134.89 | 899.01 | 1168.37 | 480.44 | +| 128 | 10177.84 | 703.30 | 2318.32 | 904.33 | 1233.15 | 511.79 | +| 256 | 10990.46 | 473.62 | 2425.28 | 960.20 | 1155.07 | 449.35 | + +Below is the performance result on V100 using float 16. + +| Batch | VGG 16 | Inception-BN | Inception-v3 | Resnet 50 | Resnet 152 | +| ----- | ------- | ------------ | ------------ | --------- | ---------- | +| 1 | 276.29 | 155.53 | 150.99 | 270.89 | 96.79 | +| 2 | 476.91 | 296.45 | 282.02 | 493.99 | 176.88 | +| 4 | 711.92 | 525.05 | 492.45 | 851.15 | 321.52 | +| 8 | 1047.11 | 900.26 | 807.94 | 1282.36 | 517.66 | +| 16 | 1299.88 | 1441.41 | 1192.21 | 1722.97 | 724.57 | +| 32 | 1486.63 | 1854.30 | 1512.08 | 2085.51 | 887.34 | +| 64 | 1219.65 | 2138.61 | 1687.35 | 2341.67 | 1002.90 | +| 128 | 1169.81 | 2317.39 | 1818.26 | 2355.04 | 1046.98 | +| 256 | 764.16 | 2425.16 | 1653.74 | 1991.88 | 976.73 | ### Training results @@ -160,7 +182,7 @@ where the batch size for Alexnet is increased by 16x. - K80 (single GPU) - | Batch | Alexnet(\*8) | Inception-v3 | Resnet 50 | + | Batch | Alexnet(\*16) | Inception-v3 | Resnet 50 | | --- | --- | --- | --- | | 1 | 300.30 | 10.48 | 15.61 | | 2 | 406.08 | 16.00 | 23.88 | @@ -190,6 +212,8 @@ where the batch size for Alexnet is increased by 16x. | 8 | 2919.02 | 118.43 | 174.81 | | 16 | 2994.32 | 173.15 | 251.22 | | 32 | 2585.61 | 214.48 | 298.51 | + | 64 | 1984.21 | 247.43 | 343.19 | + | 128 | OOM | 253.68 | 363.69 | ## Multiple Devices -- To stop receiving notification emails like this one, please contact j...@apache.org.