Re: [OpenJDK 2D-Dev] Initial implementation for batched AA-mask upload for xrender

2018-03-08 Thread Sergey Bylokhov
On 08/03/2018 05:03, Laurent Bourgès wrote: Does anybody know good statistical profiles these days? I have netbeans a try (worked well in the past) but it seems broken, VisualVM reports all time is spent in SunGraphics2D.fill/draw without providing any further details regardin

Re: [OpenJDK 2D-Dev] Initial implementation for batched AA-mask upload for xrender

2018-03-08 Thread Sergey Bylokhov
On 08/03/2018 03:21, Clemens Eisserer wrote: Hi Sergey, Thank you for contribution! If there is a chance to implement it soon(at an early test stage) then it is possible to push this to jdk11 and enable it by default to expose all possible issues. If no issues will be found, then we can release

Re: [OpenJDK 2D-Dev] Initial implementation for batched AA-mask upload for xrender

2018-03-08 Thread Laurent Bourgès
Hi Clemens, I finally got my hands on a 10-year old NVidia GPU (8800GTS) and can > confirm Laurent's findings. > The proprietary nvidia driver is awesome for the MaskBlit workload, > for x11perf -shmput10 it is 32x faster > than my kaveri APU solution (despite it has to copy data via PCIe). > Yes

Re: [OpenJDK 2D-Dev] Initial implementation for batched AA-mask upload for xrender

2018-03-08 Thread Clemens Eisserer
Hi Sergey, > Thank you for contribution! If there is a chance to implement it soon(at an > early test stage) then it is possible to push this to jdk11 and enable it by > default to expose all possible issues. If no issues will be found, then we > can release it as-is, otherwise we can disable it b

Re: [OpenJDK 2D-Dev] Initial implementation for batched AA-mask upload for xrender

2018-03-08 Thread Clemens Eisserer
Hi everybody, I finally got my hands on a 10-year old NVidia GPU (8800GTS) and can confirm Laurent's findings. The proprietary nvidia driver is awesome for the MaskBlit workload, for x11perf -shmput10 it is 32x faster than my kaveri APU solution (despite it has to copy data via PCIe). > Finally,

Re: [OpenJDK 2D-Dev] Initial implementation for batched AA-mask upload for xrender

2018-03-05 Thread Sergey Bylokhov
Hi, Clemens. On 21/02/2018 23:41, Clemens Eisserer wrote: It is still in prototype state with a few rough edges and a few corner-cases unimplemented (e.g. extra alpha with antialiasing), but should be able to run most workloads: http://93.83.133.214/webrev/ https://sourceforge.net/p/xrender-defer

Re: [OpenJDK 2D-Dev] Initial implementation for batched AA-mask upload for xrender

2018-02-27 Thread Clemens Eisserer
Hi Laurent, Thanks a lot for taking the time to test the deferred xrender pipeline. Especially since the proprietary nvdia driver is the only one of the accelerated xrender implementations I didnt test / benchmark against. > On my linux laptop (i7 + nvidia quadro), xrender is already faster than

Re: [OpenJDK 2D-Dev] Initial implementation for batched AA-mask upload for xrender

2018-02-27 Thread Laurent Bourgès
Hi Clemens, Sorry this is a long email giving my feedback on your xrender efforts. After achieving huge speedups with Marlin Laurent Bourgès recently > proposed increasing the AA tile size of MaskBlit/MaskFill operations. > The 128x64 tiles size should help the Xrender pipeline a lot for > larger

Re: [OpenJDK 2D-Dev] Initial implementation for batched AA-mask upload for xrender

2018-02-23 Thread Laurent Bourgès
Hi Clemens, As I am enjoying winter holidays, I will try your patch once at home. It seems very promising and will try understanding changes to C code. I will also test on my linux machines with nvidia cards (quadro 610 & 1070). Cheers, Laurent Le 22 févr. 2018 8:42 AM, "Clemens Eisserer" a écr