[petsc-users] Multigrid

Karthik Duraisamy Wed, 2 May 2012 20:21:57 -0700 (PDT)

Fair enough. I wiped out all references to petsc in my code and started from 
scratch and used just the options you mentioned.. and I get good convergence as 
first reported in the PCMG reference! I have attached the output. Now that it 
is behaving, could you recommend some options to exercise the multigrid?


Thanks a lot for your time (and patience!)

----------------------

KSP Object: 8 MPI processes
  type: gmres
    GMRES: restart=30, using Classical (unmodified) Gram-Schmidt 
Orthogonalization with no iterative refinement
    GMRES: happy breakdown tolerance 1e-30
  maximum iterations=1, initial guess is zero
  tolerances:  relative=1e-05, absolute=1e-50, divergence=1e+10
  left preconditioning
  using PRECONDITIONED norm type for convergence test
PC Object: 8 MPI processes
  type: bjacobi
    block Jacobi: number of blocks = 8
    Local solve is same for all blocks, in the following KSP and PC objects:
  KSP Object:  (sub_)   1 MPI processes  
    type: preonly
    maximum iterations=10000, initial guess is zero
    tolerances:  relative=1e-05, absolute=1e-50, divergence=10000
    left preconditioning
    using NONE norm type for convergence test
  PC Object:  (sub_)   1 MPI processes  
    type: ilu
      ILU: out-of-place factorization
      0 levels of fill
      tolerance for zero pivot 1e-12
      using diagonal shift to prevent zero pivot
      matrix ordering: natural
      factor fill ratio given 1, needed 1
        Factored matrix follows:
          Matrix Object:           1 MPI processes          
            type: seqaij
            rows=9015, cols=9015
            package used to perform factorization: petsc
            total: nonzeros=517777, allocated nonzeros=517777
            total number of mallocs used during MatSetValues calls =0
              using I-node routines: found 3476 nodes, limit used is 5
    linear system matrix = precond matrix:
    Matrix Object:     1 MPI processes    
      type: seqaij
      rows=9015, cols=9015
      total: nonzeros=517777, allocated nonzeros=517777
      total number of mallocs used during MatSetValues calls =0
        using I-node routines: found 3476 nodes, limit used is 5
  linear system matrix = precond matrix:
  Matrix Object:   8 MPI processes  
    type: mpiaij
    rows=75000, cols=75000
    total: nonzeros=4427800, allocated nonzeros=4427800
    total number of mallocs used during MatSetValues calls =0
      using I-node (on process 0) routines: found 3476 nodes, limit used is 5




----- Original Message -----
From: "Barry Smith" <[email protected]>
To: "PETSc users list" <petsc-users at mcs.anl.gov>
Sent: Wednesday, May 2, 2012 10:34:17 AM
Subject: Re: [petsc-users] Multigrid


   Run with -pc_type ml -ksp_type fgmres   -ksp_max_it 100 -ksp_view   and 
NOTHING else. Don'ts set any KSP or PC options in the code!  Then send us ALL 
the output.

    Barry

  I am getting all confused with "I ran with these options and this happened, I 
ran with these options and this happened, I ran with these options and this 
happened". I am getting information overload with too much information and yet 
not enough information.


On May 2, 2012, at 12:27 PM, Karthik Duraisamy wrote:

> Hello,
> 
>    With PCML, I tried 
> 
>    -mg_coarse_pc_type  redundant ; -mg_coarse_ksp_type preonly ;   
> KSPSetType(ksp, KSPGMRES);
> 
> and the residual still diverges after a while. Strangely, if I use PCMG (with 
> 0 levels), it works OK, but not with PCML.
> 
> This is definitely strange. Here is the entire piece of relevant code.
> 
> When initializing Petsc, I do the following: (let's call this the first bit 
> and I do this only once) 
> 
>    KSPSetOperators(ksp, A_, A_, SAME_NONZERO_PATTERN);
>    KSPSetInitialGuessKnoll(ksp, PETSC_TRUE);
>    KSPSetType(ksp, KSPFGMRES);
>    KSPGMRESSetRestart(ksp, 100);
>    KSPSetFromOptions(ksp);
> 
> 
> Then for each outer solver iteration: (let's call this the second bit and I 
> do this every outer iteration)
> 
>    PC pc;
>    KSPGetPC(ksp,&pc);
>    PCSetType(pc, PCMG);
>    KSPSetOperators(ksp, A_, A_, SAME_NONZERO_PATTERN);
>    PetscOptionsSetValue("-ksp_initial_guess_nonzero", "true");
>    KSPSetType(ksp, KSPGMRES);
>    KSPGMRESSetRestart(ksp, 100);
>    KSPSetFromOptions(ksp);
> 
> 
> I have tried various combinations of preconditioners and smoothers (including 
> fgmres, bcgsl, etc) but the only combination that works is the above. Any of 
> the PCML options that I have tried failed. 
> 
> Notes:
> - If I keep both of the above bits, it converges very well
> 
> - If I keep both of the above bits, but replace PCMG with PCML, it diverges 
> quickly
> 
> - If I removed the first bit and kept the second bit, the residuals diverge 
> even with PCMG 
> 
> - If I keep the first bit and remove the second bit, the residuals diverge 
> eventually, but at a slow rate
> 
> 
> I am very confident with the rest of the code as I've been using PETSc (with 
> just the first bit) for 3 years on 100s of problems.
> 
> Regards,
> Karthik
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> 
> ----- Original Message -----
> From: "Barry Smith" <bsmith at mcs.anl.gov>
> To: "PETSc users list" <petsc-users at mcs.anl.gov>
> Sent: Wednesday, May 2, 2012 6:02:06 AM
> Subject: Re: [petsc-users] Multigrid
> 
> 
>   This is likely helping a great deal:   
> 
>  KSPGMRESSetRestart(ksp, 100);
> 
>   You can try this with ml also
> 
> On May 2, 2012, at 1:33 AM, Karthik Duraisamy wrote:
> 
>> Hello again,
>> 
>>     I played with BoomerAMG and ML (for the 2D problem with a reasonable 
>> condition number), and so far, they haven't worked (the residual eventually 
>> diverges). The options that I am using are (for instance)
>> 
>>   PC pc; 
>>   KSPGetPC(ksp,&pc);
>>   PetscOptionsSetValue("-ksp_type", "gmres");
> 
>   Here you NEED  fgmres since the smoother is gmres by default. You can see 
> this with -ksp_view
> 
>>   KSPSetOperators(ksp, A_, A_, SAME_NONZERO_PATTERN);
>>   PetscOptionsSetValue("-ksp_initial_guess_nonzero", "true");
>>   PCSetType(pc, PCML);
>>   PetscOptionsSetValue("-pc_ml_maxNlevels", "3");
>>   PetscOptionsSetValue("-mg_coarse_ksp_type","richardson");
>>   PetscOptionsSetValue("-mg_coarse_pc_type","sor");
>>   PetscOptionsSetValue("-mg_coarse_pc_sor_its","8");
> 
> Just use -mg_coarse_pc_type  redundant -mg_coarse_ksp_type preonly instead of 
> the above.
> 
> 
>   Barry
> 
>>   PetscOptionsSetValue("-ksp_monitor","1");
>>   KSPSetFromOptions(ksp);
> 
> 
>> 
>> I have replaced KSP with bcgsl, fgmres, etc.
>> 
>> As I mentioned earlier, the following works like a charm:
>> 
>>   PC pc; 
>>   KSPGetPC(ksp,&pc);
>>   PCSetType(pc, PCMG);
>>   KSPSetOperators(ksp, A_, A_, SAME_NONZERO_PATTERN);
>>   PetscOptionsSetValue("-ksp_initial_guess_nonzero", "true");
>>   KSPSetType(ksp, KSPGMRES);
>>   KSPGMRESSetRestart(ksp, 100);
>>   KSPSetFromOptions(ksp);
>> 
>> 
>> Regards,
>> Karthik.
>> 
>> 
>> 
>> ----- Original Message -----
>> From: "Barry Smith" <bsmith at mcs.anl.gov>
>> To: "PETSc users list" <petsc-users at mcs.anl.gov>
>> Sent: Tuesday, May 1, 2012 9:14:05 PM
>> Subject: Re: [petsc-users] Multigrid
>> 
>> 
>> On May 1, 2012, at 6:00 PM, Karthik Duraisamy wrote:
>> 
>>> So as I understand it, GMRES is used as a preconditioner and as a solver 
>>> when I use PCMG with defaults. If this is the case, I should be able to 
>>> recreate this set up without the PCMG. Any pointers as to how this can be 
>>> done?
>>> 
>>> Also, yes indeed, my mesh is completely unstructured, so I will have to use 
>>> ml or boomeramg. 
>>> 
>>> The problems that I am attempting involve RANS of compressible turbulent 
>>> combustion (finite volume, steady). The high condition numbers are because 
>>> of the extreme grid stretching and stiff source terms (in the turbulence 
>>> and combustion model).
>> 
>>  With a condition number like that I wouldn't even consider using double 
>> precision, I think you would just be computing meaningless noise. With 
>> petsc-dev you can use quad precision ./configure --with-precision=__float128 
>> using recent GNU compilers
>> (no we haven't tested the gfortran sides of things). 
>> 
>>  Barry
>> 
>> 
>>> I have been trying a reduced problem in these 2D test cases, in which the 
>>> condition number is only 1e7.
>>> 
>>> Thanks,
>>> Karthik.
>>> 
>>> 
>>> ----- Original Message -----
>>> From: "Mark F. Adams" <mark.adams at columbia.edu>
>>> To: "PETSc users list" <petsc-users at mcs.anl.gov>
>>> Sent: Tuesday, May 1, 2012 3:50:31 PM
>>> Subject: Re: [petsc-users] Multigrid
>>> 
>>> 
>>> Also, note that PCMG can not create coarse grid spaces for an (MPI)AIJ 
>>> matrix. If you use regular grids (DA?) then PETSc can construct geometric 
>>> multigrid coarse grid spaces, although I don't know if PCMG will construct 
>>> these for you (I don't think it will and I can see from your output that 
>>> PCMG just used one grid). 'ml', hypre' and 'gamg' (a native AMG solver) 
>>> will do real AMG solvers for you. All three can work on a similar class of 
>>> problems. 
>>> 
>>> 
>>> Also, you mention that you have a condition number of 1.e20. That is 
>>> astronomical for such a small problem. How did you compute that number? Do 
>>> you know where the ill-conditioning comes from? Is this an elliptic 
>>> operator? 
>>> 
>>> 
>>> Mark 
>>> 
>>> 
>>> 
>>> 
>>> On May 1, 2012, at 6:31 PM, Matthew Knepley wrote: 
>>> 
>>> 
>>> 
>>> On Tue, May 1, 2012 at 6:27 PM, Karthik Duraisamy < dkarthik at 
>>> stanford.edu > wrote: 
>>> 
>>> 
>>> 
>>> The following was output for the very first iteration whereas what I had 
>>> attached earlier was output every iteration. I am still a bit perplexed 
>>> because PCMG drops the residual like a rock (after the first few iterations 
>>> whereas with no PCMG, it is very slow) 
>>> 
>>> 
>>> 
>>> Because the smoother IS the solver you were using before. Just like I said 
>>> last time, what you are doing is 
>>> wrapping up the same solver you used before, sticking it in another GMRES 
>>> loop, and only looking at the 
>>> outer loop. This has nothing to do with MG. 
>>> 
>>> 
>>> Matt 
>>> 
>>> 
>>> KSP Object: 8 MPI processes 
>>> type: gmres 
>>> GMRES: restart=100, using Classical (unmodified) Gram-Schmidt 
>>> Orthogonalization with no iterative refinement 
>>> GMRES: happy breakdown tolerance 1e-30 
>>> maximum iterations=1, initial guess is zero 
>>> using preconditioner applied to right hand side for initial guess 
>>> tolerances: relative=0.01, absolute=1e-08, divergence=1e+10 
>>> left preconditioning 
>>> using DEFAULT norm type for convergence test 
>>> PC Object: 8 MPI processes 
>>> type: mg 
>>> MG: type is MULTIPLICATIVE, levels=1 cycles=v 
>>> Cycles per PCApply=1 
>>> Not using Galerkin computed coarse grid matrices 
>>> Coarse grid solver -- level ------------------------------- 
>>> KSP Object: (mg_levels_0_) 8 MPI processes 
>>> type not yet set 
>>> maximum iterations=1, initial guess is zero 
>>> tolerances: relative=1e-05, absolute=1e-50, divergence=10000 
>>> left preconditioning 
>>> using DEFAULT norm type for convergence test 
>>> PC Object: (mg_levels_0_) 8 MPI processes 
>>> type not yet set 
>>> linear system matrix = precond matrix: 
>>> Matrix Object: 8 MPI processes 
>>> type: mpiaij 
>>> rows=75000, cols=75000 
>>> total: nonzeros=4427800, allocated nonzeros=4427800 
>>> total number of mallocs used during MatSetValues calls =0 
>>> using I-node (on process 0) routines: found 3476 nodes, limit used is 5 
>>> 
>>> 
>>> ----- Original Message ----- 
>>> From: "Matthew Knepley" < knepley at gmail.com > 
>>> To: "PETSc users list" < petsc-users at mcs.anl.gov > 
>>> Sent: Tuesday, May 1, 2012 3:22:56 PM 
>>> Subject: Re: [petsc-users] Multigrid 
>>> 
>>> 
>>> On Tue, May 1, 2012 at 6:18 PM, Karthik Duraisamy < dkarthik at 
>>> stanford.edu > wrote: 
>>> 
>>> 
>>> 
>>> Hello, 
>>> 
>>> Sorry (and thanks for the reply). I've attached the no multigrid case. I 
>>> didn't include it because (at least to the untrained eye, everything looks 
>>> the same). 
>>> 
>>> 
>>> 
>>> Did you send all the output from the MG case? There must be a PC around it. 
>>> By default its GMRES, so there would be 
>>> an extra GMRES loop compared to the case without MG. 
>>> 
>>> 
>>> Matt 
>>> 
>>> 
>>> Regards, 
>>> Karthik 
>>> 
>>> KSP Object: 8 MPI processes 
>>> type: gmres 
>>> GMRES: restart=100, using Classical (unmodified) Gram-Schmidt 
>>> Orthogonalization with no iterative refinement 
>>> GMRES: happy breakdown tolerance 1e-30 
>>> maximum iterations=1 
>>> using preconditioner applied to right hand side for initial guess 
>>> tolerances: relative=1e-05, absolute=1e-50, divergence=1e+10 
>>> left preconditioning 
>>> using nonzero initial guess 
>>> using PRECONDITIONED norm type for convergence test 
>>> PC Object: 8 MPI processes 
>>> type: bjacobi 
>>> block Jacobi: number of blocks = 8 
>>> Local solve is same for all blocks, in the following KSP and PC objects: 
>>> KSP Object: (sub_) 1 MPI processes 
>>> type: preonly 
>>> maximum iterations=10000, initial guess is zero 
>>> tolerances: relative=1e-05, absolute=1e-50, divergence=10000 
>>> left preconditioning 
>>> using NONE norm type for convergence test 
>>> PC Object: (sub_) 1 MPI processes 
>>> type: ilu 
>>> ILU: out-of-place factorization 
>>> 0 levels of fill 
>>> tolerance for zero pivot 1e-12 
>>> using diagonal shift to prevent zero pivot 
>>> matrix ordering: natural 
>>> factor fill ratio given 1, needed 1 
>>> Factored matrix follows: 
>>> Matrix Object: 1 MPI processes 
>>> type: seqaij 
>>> rows=9015, cols=9015 
>>> package used to perform factorization: petsc 
>>> total: nonzeros=517777, allocated nonzeros=517777 
>>> total number of mallocs used during MatSetValues calls =0 
>>> using I-node routines: found 3476 nodes, limit used is 5 
>>> linear system matrix = precond matrix: 
>>> Matrix Object: 1 MPI processes 
>>> type: seqaij 
>>> rows=9015, cols=9015 
>>> total: nonzeros=517777, allocated nonzeros=517777 
>>> total number of mallocs used during MatSetValues calls =0 
>>> using I-node routines: found 3476 nodes, limit used is 5 
>>> linear system matrix = precond matrix: 
>>> Matrix Object: 8 MPI processes 
>>> type: mpiaij 
>>> rows=75000, cols=75000 
>>> total: nonzeros=4427800, allocated nonzeros=4427800 
>>> total number of mallocs used during MatSetValues calls =0 
>>> using I-node (on process 0) routines: found 3476 nodes, limit used is 5 
>>> 
>>> 
>>> ----- Original Message ----- 
>>> From: "Matthew Knepley" < knepley at gmail.com > 
>>> To: "PETSc users list" < petsc-users at mcs.anl.gov > 
>>> Sent: Tuesday, May 1, 2012 3:15:14 PM 
>>> Subject: Re: [petsc-users] Multigrid 
>>> 
>>> 
>>> On Tue, May 1, 2012 at 6:12 PM, Karthik Duraisamy < dkarthik at 
>>> stanford.edu > wrote: 
>>> 
>>> 
>>> 
>>> Hello Barry, 
>>> 
>>> Thank you for your super quick response. I have attached the output of 
>>> ksp_view and it is practically the same as that when I don't use PCMG. The 
>>> part I don't understand is how PCMG able to function at the zero grid level 
>>> and still produce a much better convergence than when using the default PC. 
>>> Is there any additional smoothing or interpolation going on? 
>>> 
>>> 
>>> 
>>> You only included one output, so I have no way of knowing what you used 
>>> before. However, this is running GMRES/ILU. 
>>> 
>>> 
>>> Also, for Algebraic Multigrid, would you recommend BoomerAMG or ML ? 
>>> 
>>> 
>>> 
>>> They are different algorithms. Its not possible to say generally that one 
>>> is better. Try them both. 
>>> 
>>> 
>>> Matt 
>>> 
>>> 
>>> Best regards, 
>>> Karthik. 
>>> 
>>> type: mg 
>>> MG: type is MULTIPLICATIVE, levels=1 cycles=v 
>>> Cycles per PCApply=1 
>>> Not using Galerkin computed coarse grid matrices 
>>> Coarse grid solver -- level ------------------------------- 
>>> KSP Object: (mg_levels_0_) 8 MPI processes 
>>> type: gmres 
>>> GMRES: restart=30, using Classical (unmodified) Gram-Schmidt 
>>> Orthogonalization with no iterative refinement 
>>> GMRES: happy breakdown tolerance 1e-30 
>>> maximum iterations=1, initial guess is zero 
>>> tolerances: relative=1e-05, absolute=1e-50, divergence=10000 
>>> left preconditioning 
>>> using PRECONDITIONED norm type for convergence test 
>>> PC Object: (mg_levels_0_) 8 MPI processes 
>>> type: bjacobi 
>>> block Jacobi: number of blocks = 8 
>>> Local solve is same for all blocks, in the following KSP and PC objects: 
>>> KSP Object: (mg_levels_0_sub_) 1 MPI processes 
>>> type: preonly 
>>> maximum iterations=10000, initial guess is zero 
>>> tolerances: relative=1e-05, absolute=1e-50, divergence=10000 
>>> left preconditioning 
>>> using NONE norm type for convergence test 
>>> PC Object: (mg_levels_0_sub_) 1 MPI processes 
>>> type: ilu 
>>> ILU: out-of-place factorization 
>>> 0 levels of fill 
>>> tolerance for zero pivot 1e-12 
>>> using diagonal shift to prevent zero pivot 
>>> matrix ordering: natural 
>>> factor fill ratio given 1, needed 1 
>>> Factored matrix follows: 
>>> Matrix Object: 1 MPI processes 
>>> type: seqaij 
>>> rows=9015, cols=9015 
>>> package used to perform factorization: petsc 
>>> total: nonzeros=517777, allocated nonzeros=517777 
>>> total number of mallocs used during MatSetValues calls =0 
>>> using I-node routines: found 3476 nodes, limit used is 5 
>>> linear system matrix = precond matrix: 
>>> Matrix Object: 1 MPI processes 
>>> type: seqaij 
>>> rows=9015, cols=9015 
>>> total: nonzeros=517777, allocated nonzeros=517777 
>>> total number of mallocs used during MatSetValues calls =0 
>>> using I-node routines: found 3476 nodes, limit used is 5 
>>> linear system matrix = precond matrix: 
>>> Matrix Object: 8 MPI processes 
>>> type: mpiaij 
>>> rows=75000, cols=75000 
>>> total: nonzeros=4427800, allocated nonzeros=4427800 
>>> total number of mallocs used during MatSetValues calls =0 
>>> using I-node (on process 0) routines: found 3476 nodes, limit used is 5 
>>> linear system matrix = precond matrix: 
>>> Matrix Object: 8 MPI processes 
>>> type: mpiaij 
>>> rows=75000, cols=75000 
>>> total: nonzeros=4427800, allocated nonzeros=4427800 
>>> total number of mallocs used during MatSetValues calls =0 
>>> using I-node (on process 0) routines: found 3476 nodes, limit used is 5 
>>> 
>>> 
>>> 
>>> ----- Original Message ----- 
>>> From: "Barry Smith" < bsmith at mcs.anl.gov > 
>>> To: "PETSc users list" < petsc-users at mcs.anl.gov > 
>>> Sent: Tuesday, May 1, 2012 1:39:26 PM 
>>> Subject: Re: [petsc-users] Multigrid 
>>> 
>>> 
>>> On May 1, 2012, at 3:37 PM, Karthik Duraisamy wrote: 
>>> 
>>>> Hello, 
>>>> 
>>>> I have been using PETSc for a couple of years with good success, but 
>>>> lately as my linear problems have become stiffer (condition numbers of the 
>>>> order of 1.e20), I am looking to use better preconditioners. I tried using 
>>>> PCMG with all the default options (i.e., I just specified my 
>>>> preconditioner as PCMG and did not add any options to it) and I am 
>>>> immediately seeing better convergence. 
>>>> 
>>>> What I am not sure of is why? I would like to know more about the default 
>>>> parameters (the manual is not very explicit) and more importantly, want to 
>>>> know why it is working even when I haven't specified any grid levels and 
>>>> coarse grid operators. Any 
>>>> help in this regard will be appreciated. 
>>> 
>>> First run with -ksp_view to see what solver it is actually using. 
>>> 
>>> Barry 
>>> 
>>>> 
>>>> Also, ultimately I want to use algebraic multigrid so is PCML a better 
>>>> option than BoomerAMG? I tried BoomerAMG with mixed results. 
>>>> 
>>>> Thanks, 
>>>> Karthik 
>>>> 
>>>> 
>>>> 
>>>> -- 
>>>> 
>>>> ======================================= 
>>>> Karthik Duraisamy 
>>>> Assistant Professor (Consulting) 
>>>> Durand Building Rm 357 
>>>> Dept of Aeronautics and Astronautics 
>>>> Stanford University 
>>>> Stanford CA 94305 
>>>> 
>>>> Phone: 650-721-2835 
>>>> Web: www.stanford.edu/~dkarthik 
>>>> ======================================= 
>>> 
>>> 
>>> 
>>> 
>>> 
>>> -- 
>>> What most experimenters take for granted before they begin their 
>>> experiments is infinitely more interesting than any results to which their 
>>> experiments lead. 
>>> -- Norbert Wiener 
>>> 
>>> 
>>> 
>>> 
>>> -- 
>>> What most experimenters take for granted before they begin their 
>>> experiments is infinitely more interesting than any results to which their 
>>> experiments lead. 
>>> -- Norbert Wiener 
>>> 
>>> 
>>> 
>>> 
>>> -- 
>>> What most experimenters take for granted before they begin their 
>>> experiments is infinitely more interesting than any results to which their 
>>> experiments lead. 
>>> -- Norbert Wiener 
>>> 
>> 
>

[petsc-users] Multigrid

Reply via email to