I'm running GAMG with CUDA, and I'm wondering how the nominally serial smoother algorithms are implemented on GPU? Specifically SOR/GS and ILU(0) -- in e.g. AMGx these are applied by first creating a coloring, and the smoother passes are done color by color. Is this how it's done in petsc AMG?
Tangential, AMGx and OpenFOAM offer something called "DILU", diagonal ILU. Is there an equivalent in petsc? Thanks, Mark
