BTW, a -device_view would be great.

On Tue, Jan 25, 2022 at 12:30 PM Mark Adams <mfad...@lbl.gov> wrote:
> On Tue, Jan 25, 2022 at 11:56 AM Jed Brown <j...@jedbrown.org> wrote:
>
>> Barry Smith <bsm...@petsc.dev> writes:
>>
>> > Thanks Mark, far more interesting. I've improved the formatting to make it easier to read (and fixed width font for email reading)
>> >
>> > * Can you do same run with say 10 iterations of Jacobi PC?
>> >
>> > * PCApply performance (looks like GAMG) is terrible! Problems too small?
>>
>> This is -pc_type jacobi.
>>
>> > * VecScatter time is completely dominated by SFPack! Junchao what's up with that? Lots of little kernels in the PCApply? PCJACOBI run will help clarify where that is coming from.
>>
>> It's all in MatMult.
>>
>> I'd like to see a run that doesn't wait for the GPU.
>>
>
> Not sure what you mean. Can I do that?