Hi, I am trying to benchmark the performance of my code on 8 processors and find where most of the time is spent. When I look at the breakdown of the time required by each stage, the totals add up to ~7 s; however, the main stage time is ~350 s. I have not been able to find the stage that is taking so much extra time. Could you please suggest something?
Thanks.

Time (sec):           3.508e+02      1.00000   3.508e+02
Objects:              3.310e+02      1.00000   3.310e+02
Flops:                1.279e+08      1.03856   1.267e+08  1.014e+09
Flops/sec:            3.646e+05      1.03856   3.612e+05  2.890e+06
Memory:               2.817e+07      1.06078              2.221e+08
MPI Messages:         3.150e+02      1.26506   2.985e+02  2.388e+03
MPI Message Lengths:  4.011e+06      1.83678   1.191e+04  2.843e+07
MPI Reductions:       6.970e+02      1.00000

VecMDot             51 1.0 6.0108e-02   2.4 5.65e+06 1.0 0.0e+00 0.0e+00 5.1e+01  0  4  0  0  7   0  4  0  0 11  752
VecNorm             67 1.0 1.1708e-02   1.1 7.41e+05 1.0 0.0e+00 0.0e+00 6.7e+01  0  1  0  0 10   0  1  0  0 14  507
VecScale            75 1.0 1.2923e-03   1.0 3.69e+05 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0 2282
VecCopy             20 1.0 6.3189e-04   1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0    0
VecSet              66 1.0 5.0619e-03   1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0    0
VecAXPY             51 1.0 8.9883e-04   1.1 2.95e+05 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0 2625
VecWAXPY            30 1.0 2.0204e-03   1.2 1.09e+05 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  430
VecMAXPY            54 1.0 8.1123e-03   1.0 6.28e+06 1.0 0.0e+00 0.0e+00 0.0e+00  0  5  0  0  0   0  5  0  0  0 6192
VecAssemblyBegin     3 1.0 4.9893e-04   3.0 0.00e+00 0.0 0.0e+00 0.0e+00 9.0e+00  0  0  0  0  1   0  0  0  0  2    0
VecAssemblyEnd       3 1.0 2.0169e-05   1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0    0
VecPointwiseMult     9 1.0 1.7315e-04   1.0 1.84e+04 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  852
VecScatterBegin    118 1.0 7.5530e-03   1.5 0.00e+00 0.0 2.2e+03 6.0e+03 0.0e+00  0  0 93 47  0   0  0 93 47  0    0
VecScatterEnd      118 1.0 3.5505e-02   3.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0    0
VecNormalize        54 1.0 7.4800e-03   1.4 9.95e+05 1.0 0.0e+00 0.0e+00 5.4e+01  0  1  0  0  8   0  1  0  0 11 1065
MatMult             66 1.0 6.4065e-01   1.1 5.67e+07 1.1 9.7e+02 1.1e+04 0.0e+00  0 44 41 38  0   0 44 41 38  0  694
MatSolve            81 1.0 3.9077e-01   1.0 4.40e+07 1.0 0.0e+00 0.0e+00 0.0e+00  0 35  0  0  0   0 35  0  0  0  901
MatLUFactorSym       8 1.0 1.0749e-05   1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0    0
MatLUFactorNum       9 1.0 4.7825e-02   1.0 1.38e+07 1.0 0.0e+00 0.0e+00 0.0e+00  0 11  0  0  0   0 11  0  0  0 2288
MatILUFactorSym      1 1.0 1.5453e-02   1.0 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+00  0  0  0  0  0   0  0  0  0  1    0
MatConvert           2 1.0 3.7113e-02   1.0 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+01  0  0  0  0  1   0  0  0  0  2    0
MatScale            16 1.0 3.7416e-05   1.2 4.90e+03 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0 1047
MatAssemblyBegin    59 1.0 3.1884e+00 484.1 0.00e+00 0.0 4.2e+01 3.6e+05 6.0e+00  0  0  2 53  1   0  0  2 53  1    0
MatAssemblyEnd      59 1.0 1.2491e+00   1.0 0.00e+00 0.0 7.6e+01 1.5e+03 2.1e+01  0  0  3  0  3   0  0  3  0  4    0
MatGetValues       792 1.0 1.7098e-03   1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0    0
MatGetRowIJ          1 1.0 5.9657e-06   2.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0    0
MatGetOrdering       1 1.0 2.0580e-03   1.0 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00  0  0  0  0  1   0  0  0  0  1    0
MatZeroEntries       1 1.0 5.6988e-04   1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0    0
MatTranspose        16 1.0 6.8201e-04   1.1 0.00e+00 0.0 0.0e+00 0.0e+00 3.2e+01  0  0  0  0  5   0  0  0  0  7    0
MatMatMult          32 1.0 1.3172e-03   1.1 0.00e+00 0.0 0.0e+00 0.0e+00 6.4e+01  0  0  0  0  9   0  0  0  0 14    0
MatMatSolve          8 1.0 1.3842e-03   1.1 3.60e+02 1.0 0.0e+00 0.0e+00 3.2e+01  0  0  0  0  5   0  0  0  0  7    2
KSPGMRESOrthog      51 1.0 6.8248e-02   2.1 1.13e+07 1.0 0.0e+00 0.0e+00 5.1e+01  0  9  0  0  7   0  9  0  0 11 1325
KSPSetup             4 1.0 2.2410e-03   1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0    0
KSPSolve             8 1.0 1.0945e+00   1.0 1.21e+08 1.0 8.8e+02 1.1e+04 1.9e+02  0 95 37 34 27   0 95 37 34 40  877
PCSetUp              3 1.0 6.8884e-02   1.0 1.38e+07 1.0 0.0e+00 0.0e+00 1.1e+01  0 11  0  0  2   0 11  0  0  2 1588
PCSetUpOnBlocks      3 1.0 6.7468e-02   1.0 1.38e+07 1.0 0.0e+00 0.0e+00 7.0e+00  0 11  0  0  1   0 11  0  0  1 1622
PCApply             57 1.0 4.0561e-01   1.0 4.40e+07 1.0 0.0e+00 0.0e+00 5.7e+01  0 35  0  0  8   0 35  0  0 12  868
------------------------------------------------------------------------------------------------------------------------
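
To make the mismatch concrete, here is a minimal Python sketch of the arithmetic, using the total time and three of the larger event times copied from the log. Treating these three events as non-overlapping is an assumption made only for illustration, and the variable names are hypothetical, not part of PETSc.

```python
# Wall-clock time from the "Time (sec)" line of the log (max over ranks).
total_time = 3.508e+02

# Max time per event, copied from the log rows.
# Assumption: these three events do not overlap one another,
# so their times can simply be added.
event_times = {
    "KSPSolve": 1.0945e+00,
    "MatAssemblyBegin": 3.1884e+00,
    "MatAssemblyEnd": 1.2491e+00,
}

logged = sum(event_times.values())
unaccounted = total_time - logged

print(f"logged events: {logged:.2f} s")
print(f"unaccounted:   {unaccounted:.2f} s "
      f"({100 * unaccounted / total_time:.1f}% of total)")
```

Whichever subset of events is summed, the logged time is only a few seconds, so nearly all of the ~350 s appears to fall outside any logged event.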
