[Beignet] [PATCH] enable CL_UNSIGNED_INT8 for CL_RG to fix regression

2015-07-01 Thread Guo Yejun
the regression is caused when only enable CL_UNORM_INT8 for CL_RG, the reason is that during the image copy implementation with internal kernel, all formats are considerd as integer format, it becomes unknown since CL_UNSIGNED_INT8 is not enabled yet. Signed-off-by: Guo Yejun --- src/cl_image.c

Re: [Beignet] [PATCH 19/19] runtime: Add fp16 extension to BDW later platform.

2015-07-01 Thread He Junyan
The half float can work for BSW, I will send a standalone patch to enable it later. On 2015年06月19日 15:18, Yang, Rong R wrote: One concern: Does cherryview support half? The other part of the patchset LGTM. -Original Message- From: Beignet [mailto:beignet-boun...@lists.freedesktop.org

Re: [Beignet] thread safety and OpenMP

2015-07-01 Thread Gerald Baier
On 07/01/2015 07:28 AM, He, Junyan wrote: So far as we know, beignet is thread safe. Every thead has its own command buffer and do not have relationship with each others. Do you use subbuffer to divide the image? No, as far as I can tell I don't use anything special or advanced. If you can prov

Re: [Beignet] [PATCH] Set the loop unroll's threshold to 1024.

2015-07-01 Thread Song, Ruiling
Selecting a reasonable threshold seems a difficult job. I am ok with 1024. LGTM. > -Original Message- > From: Beignet [mailto:beignet-boun...@lists.freedesktop.org] On Behalf Of > Yang Rong > Sent: Wednesday, July 01, 2015 11:31 AM > To: beignet@lists.freedesktop.org > Cc: Yang, Rong R > S

[Beignet] [PATCH] GBE: optimize phi elimination.

2015-07-01 Thread Ruiling Song
This is special optimization for below situation: bb1: ... bb2: x = phi [x1, bb1], [x2, bb2] x2 = x+1; after de-ssa: bb2: mov x, x-copy add x2, x, 1 mov x-copy, x2 obviously x2, x-copy and x2 can be mapped to same virtual register. Signed-off-by: Ruiling Song --- backend