I regret to say that I can't reproduce my initial result showing 9%
improvement on 3.2.0rc1 against 3.0.4. Both versions show the same speed
now.

I also compared 3.0.4 against trunk in another environment:
Ubuntu 18.04.2 LTS (GNU/Linux 4.15.0-1014-gcp x86_64)
cpu model name: Intel(R) Xeon(R) CPU @ 2.00GHz

This is the raw result from 3.0.4:
640 Examples seen. Accuracy:0.1006 Error:   1.79914 Loss:2.31176 Threads: 4
Forward time:  0.99s Backward time:  0.77s Step time:  1.51s
1280 Examples seen. Accuracy:0.1025 Error:   1.78724 Loss:2.26048 Threads:
4 Forward time:  0.99s Backward time:  0.75s Step time:  1.49s
1920 Examples seen. Accuracy:0.1087 Error:   1.78000 Loss:2.26476 Threads:
4 Forward time:  0.99s Backward time:  0.77s Step time:  1.49s

This is the raw result from trunk:
640 Examples seen. Accuracy:0.1175 Error:   1.79696 Loss:2.30112 Threads: 4
Forward time:  0.94s Backward time:  0.72s Step time:  1.46s
1280 Examples seen. Accuracy:0.1203 Error:   1.79009 Loss:2.27688 Threads:
4 Forward time:  0.94s Backward time:  0.73s Step time:  1.44s
1920 Examples seen. Accuracy:0.1226 Error:   1.76832 Loss:2.20816 Threads:
4 Forward time:  0.93s Backward time:  0.74s Step time:  1.44s

I usually look at the "Step time" for comparisons.

Tested with:
https://github.com/joaopauloschuler/neural-api/blob/master/examples/SimpleImageClassifier/SimpleImageClassifier.lpr
_______________________________________________
fpc-devel maillist  -  fpc-devel@lists.freepascal.org
https://lists.freepascal.org/cgi-bin/mailman/listinfo/fpc-devel

Reply via email to