WDRshadow commented on PR #2050: URL: https://github.com/apache/systemds/pull/2050#issuecomment-2241065765
@phaniarnab We have implemented test cases based on `EfficientNet`. Each test contains exactly the same training procedure. Parfor-based forward tests are performed using random datasets with the same seed. The number of datasets ranges from `10k` to `500k`. The results using single and dual GPUs are shown below: | test_id | num_interation | 1_gpu_exec_time_sec | 2_gpu_exec_time_sec | |:---------------------|-----------------:|----------------------:|----------------------:| | test01_gpuTest_10k | 10000 | 11.723 | 11.189 | | test01_gpuTest_20k | 20000 | 13.714 | 12.866 | | test01_gpuTest_50k | 50000 | 19.755 | 15.616 | | test01_gpuTest_100k | 100000 | 29.141 | 23.026 | | test01_gpuTest_200k | 200000 | 49.409 | 37.987 | | test01_gpuTest_500k | 500000 | 108.874 | 77.917 | Test environment: - CPU: `24 vCPU Intel(R) Xeon(R) Platinum 8255C CPU @ 2.50GHz` - GPU: `RTX2080Ti` * 2 - RAM: `80G` - OS: `Ubuntu 18.04` - CUDA: `10.2` -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr...@systemds.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org