We wrote a custom sparse triangular solve. In some case it performed a little better (at most 15-20%). Sometimes it performed worse by a similar amount. I may revive our custom solve at some point. For now, Petsc only uses CUSPARSE.
-Paul > Is this a custom sparse triangular solve or is it from CUSPARSE? If > different, how does the performance compare with the CUSPARSE > implementation?