I regret to say that I can't reproduce my initial result showing a 9% improvement with 3.2.0rc1 over 3.0.4. Both versions now run at the same speed.
I also compared 3.0.4 against trunk in another environment:

Ubuntu 18.04.2 LTS (GNU/Linux 4.15.0-1014-gcp x86_64)
cpu model name: Intel(R) Xeon(R) CPU @ 2.00GHz

This is the raw result from 3.0.4:

640 Examples seen. Accuracy:0.1006 Error: 1.79914 Loss:2.31176 Threads: 4 Forward time: 0.99s Backward time: 0.77s Step time: 1.51s
1280 Examples seen. Accuracy:0.1025 Error: 1.78724 Loss:2.26048 Threads: 4 Forward time: 0.99s Backward time: 0.75s Step time: 1.49s
1920 Examples seen. Accuracy:0.1087 Error: 1.78000 Loss:2.26476 Threads: 4 Forward time: 0.99s Backward time: 0.77s Step time: 1.49s

This is the raw result from trunk:

640 Examples seen. Accuracy:0.1175 Error: 1.79696 Loss:2.30112 Threads: 4 Forward time: 0.94s Backward time: 0.72s Step time: 1.46s
1280 Examples seen. Accuracy:0.1203 Error: 1.79009 Loss:2.27688 Threads: 4 Forward time: 0.94s Backward time: 0.73s Step time: 1.44s
1920 Examples seen. Accuracy:0.1226 Error: 1.76832 Loss:2.20816 Threads: 4 Forward time: 0.93s Backward time: 0.74s Step time: 1.44s

I usually look at the "Step time" for comparisons.

Tested with:
https://github.com/joaopauloschuler/neural-api/blob/master/examples/SimpleImageClassifier/SimpleImageClassifier.lpr
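
For anyone who wants to put a number on the "Step time" difference, here is a minimal sketch that pulls the step times out of the two logs above and compares their means. It is written in Python only for convenience (it is not part of the neural-api example), the helper names `step_times` and `relative_reduction` are made up for illustration, and with only three samples per compiler it says nothing about run-to-run noise.

```python
import re
from statistics import mean

# Raw log lines copied from the two runs quoted above.
LOG_304 = """\
640 Examples seen. Accuracy:0.1006 Error: 1.79914 Loss:2.31176 Threads: 4 Forward time: 0.99s Backward time: 0.77s Step time: 1.51s
1280 Examples seen. Accuracy:0.1025 Error: 1.78724 Loss:2.26048 Threads: 4 Forward time: 0.99s Backward time: 0.75s Step time: 1.49s
1920 Examples seen. Accuracy:0.1087 Error: 1.78000 Loss:2.26476 Threads: 4 Forward time: 0.99s Backward time: 0.77s Step time: 1.49s
"""

LOG_TRUNK = """\
640 Examples seen. Accuracy:0.1175 Error: 1.79696 Loss:2.30112 Threads: 4 Forward time: 0.94s Backward time: 0.72s Step time: 1.46s
1280 Examples seen. Accuracy:0.1203 Error: 1.79009 Loss:2.27688 Threads: 4 Forward time: 0.94s Backward time: 0.73s Step time: 1.44s
1920 Examples seen. Accuracy:0.1226 Error: 1.76832 Loss:2.20816 Threads: 4 Forward time: 0.93s Backward time: 0.74s Step time: 1.44s
"""

# Matches the "Step time: 1.51s" field and captures the number of seconds.
STEP_RE = re.compile(r"Step time:\s*([0-9.]+)s")


def step_times(log):
    """Return every step time (in seconds) found in a log, in order."""
    return [float(value) for value in STEP_RE.findall(log)]


def relative_reduction(baseline_log, candidate_log):
    """Fraction by which the mean step time drops from baseline to candidate."""
    baseline = mean(step_times(baseline_log))
    candidate = mean(step_times(candidate_log))
    return (baseline - candidate) / baseline


if __name__ == "__main__":
    print(f"3.0.4 mean step time: {mean(step_times(LOG_304)):.3f}s")
    print(f"trunk mean step time: {mean(step_times(LOG_TRUNK)):.3f}s")
    print(f"reduction on trunk:   {relative_reduction(LOG_304, LOG_TRUNK):.1%}")
```

On these six lines the mean step time drops from roughly 1.497s to 1.447s, which is about 3.3% in favour of trunk.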
_______________________________________________
fpc-devel maillist - fpc-devel@lists.freepascal.org
https://lists.freepascal.org/cgi-bin/mailman/listinfo/fpc-devel