Re: [Mesa-dev] [RFC PATCH] gallium/nir: Add shader-based blending helpers

2018-11-12 Thread Alyssa Rosenzweig
> I would love to see this up in src/compiler/nir. Is there a compelling reason to have it in NIR rather in Gallium, adding an extra layer of translation overhead / indirection when coming from Gallium blend state? (Mythical future Vulkan support, I guess?) > Also, we should probably convert vc4

Re: [Mesa-dev] [PATCH 2/2] kmsro: Extend to include hx8357d.

2018-11-12 Thread Eric Anholt
Emil Velikov writes: > On Tue, 30 Oct 2018 at 17:49, Eric Anholt wrote: >> >> Emil Velikov writes: >> >> > Hi Eric, >> > >> > On Thu, 25 Oct 2018 at 17:39, Eric Anholt wrote: >> >> >> >> This allows vc4 to initialize on the Adafruit PiTFT 3.5" touchscreen with >> >> the new tinydrm driver I

Re: [Mesa-dev] [PATCH] nir: add support for removing redundant stores to copy prop var

2018-11-12 Thread Jason Ekstrand
On Thu, Nov 8, 2018 at 1:39 AM Timothy Arceri wrote: > For example the following type of thing is seen in TCS from > a number of Vulkan and DXVK games: > > vec1 32 ssa_557 = deref_var (shader_out float) > vec1 32 ssa_558 = intrinsic load_deref (ssa_557) () > vec1 32

[Mesa-dev] [PATCH v2] nir: add support for removing redundant stores to copy prop var

2018-11-12 Thread Timothy Arceri
For example the following type of thing is seen in TCS from a number of Vulkan and DXVK games: vec1 32 ssa_557 = deref_var (shader_out float) vec1 32 ssa_558 = intrinsic load_deref (ssa_557) () vec1 32 ssa_559 = deref_var @42 (shader_out float) vec1 32 ssa_560 =

[Mesa-dev] [PATCH v3] nir: add support for removing redundant stores to copy prop var

2018-11-12 Thread Timothy Arceri
For example the following type of thing is seen in TCS from a number of Vulkan and DXVK games: vec1 32 ssa_557 = deref_var (shader_out float) vec1 32 ssa_558 = intrinsic load_deref (ssa_557) () vec1 32 ssa_559 = deref_var @42 (shader_out float) vec1 32 ssa_560 =

Re: [Mesa-dev] [PATCH v4] nir: Don't lower the local work group size if it's variable.

2018-11-12 Thread Karol Herbst
Reviewed-by: Karol Herbst On Tue, Nov 13, 2018 at 3:51 AM Jason Ekstrand wrote: > > On Mon, Nov 12, 2018 at 6:10 PM Karol Herbst wrote: >> >> it shouldn't make a difference. This pass lowers load_derefs into >> whatever we want here. If we handle the system value explicitly >> "sysval" gets

[Mesa-dev] [PATCH 2/2] i965/compute: Emit GPGPU_WALKER in genX_state_upload

2018-11-12 Thread Jordan Justen
Signed-off-by: Jordan Justen --- src/mesa/drivers/dri/i965/brw_compute.c | 131 +- src/mesa/drivers/dri/i965/brw_context.h | 2 + src/mesa/drivers/dri/i965/genX_state_upload.c | 102 ++ 3 files changed, 105 insertions(+), 130 deletions(-) diff --git

[Mesa-dev] [PATCH 1/2] i965/genX_state: Add register access functions

2018-11-12 Thread Jordan Justen
Signed-off-by: Jordan Justen --- src/mesa/drivers/dri/i965/genX_state_upload.c | 31 +++ 1 file changed, 31 insertions(+) diff --git a/src/mesa/drivers/dri/i965/genX_state_upload.c b/src/mesa/drivers/dri/i965/genX_state_upload.c index 5acd0922922..6495862e700 100644 ---

Re: [Mesa-dev] [PATCH v2] i965/glsl: don't add unused aoa element to the program resource list

2018-11-12 Thread Timothy Arceri
Sorry for not getting back sooner on this one. I'm leaning towards a NAK on this one. This is just under 300 new lines of code to work around a possibly over strict piglit test. While the test is not wrong an implementation is also not required to optimise away these unused elements. If

Re: [Mesa-dev] [PATCH v2] nir: add support for removing redundant stores to copy prop var

2018-11-12 Thread Jason Ekstrand
On November 12, 2018 19:21:42 Timothy Arceri wrote: For example the following type of thing is seen in TCS from a number of Vulkan and DXVK games: vec1 32 ssa_557 = deref_var (shader_out float) vec1 32 ssa_558 = intrinsic load_deref (ssa_557) () vec1 32 ssa_559 = deref_var @42 (shader_out

Re: [Mesa-dev] [PATCH v3] nir: add support for removing redundant stores to copy prop var

2018-11-12 Thread Jason Ekstrand
On Mon, Nov 12, 2018 at 8:57 PM Timothy Arceri wrote: > For example the following type of thing is seen in TCS from > a number of Vulkan and DXVK games: > > vec1 32 ssa_557 = deref_var (shader_out float) > vec1 32 ssa_558 = intrinsic load_deref (ssa_557) () > vec1 32

Re: [Mesa-dev] [PATCH v4] nir: Don't lower the local work group size if it's variable.

2018-11-12 Thread Jason Ekstrand
On Mon, Nov 12, 2018 at 6:10 PM Karol Herbst wrote: > it shouldn't make a difference. This pass lowers load_derefs into > whatever we want here. If we handle the system value explicitly > "sysval" gets set. If not, we fetch the op through > nir_intrinsic_from_system_value and do the load based

Re: [Mesa-dev] [PATCH 2/2] util/ralloc: Make sizeof(linear_header) a multiple of 8

2018-11-12 Thread Matt Turner
On Mon, Nov 12, 2018 at 3:07 PM Eric Anholt wrote: > > Matt Turner writes: > > > Prior to this patch sizeof(linear_header) was 20 bytes in a > > non-debug build on 32-bit platforms. We do some pointer arithmetic to > > calculate the next available location with > > > >ptr =

Re: [Mesa-dev] [PATCH v4] nir: Don't lower the local work group size if it's variable.

2018-11-12 Thread Karol Herbst
it shouldn't make a difference. This pass lowers load_derefs into whatever we want here. If we handle the system value explicitly "sysval" gets set. If not, we fetch the op through nir_intrinsic_from_system_value and do the load based on that. We just take a different path, but fundamentally we do

Re: [Mesa-dev] [PATCH] st/mesa: don't do L3 thread pinning for Blender

2018-11-12 Thread Marek Olšák
Hi Edmondo, can you test the two attached patches? They re-enable and rework the thread pinning. Thanks, Marek On Mon, Nov 12, 2018 at 4:31 PM Edmondo Tommasina < edmondo.tommas...@gmail.com> wrote: > On Mon, Nov 12, 2018 at 6:43 PM Michel Dänzer wrote: > >> On 2018-11-08 6:23 a.m., Marek

[Mesa-dev] [PATCH 1/2] st/mesa: regularly re-pin driver threads to the CCX where the app thread is

2018-11-12 Thread Marek Olšák
From: Marek Olšák This is used when glthread is disabled. Mesa pretty much chases the app thread on the CPU. The performance is the same as pinning the app thread. --- src/mesa/state_tracker/st_context.h | 2 ++ src/mesa/state_tracker/st_draw.c| 32 + 2 files

[Mesa-dev] [PATCH 2/2] st/mesa: pin driver threads to a fixed CCX when glthread is enabled

2018-11-12 Thread Marek Olšák
From: Marek Olšák radeonsi has 3 driver threads (glthread, gallium, winsys), other drivers may have 2 (glthread, gallium), so it makes sense to pin them to a random CCX and keep that irrespective of the app thread. --- src/gallium/auxiliary/util/u_helpers.c | 63 --

Re: [Mesa-dev] [RFC PATCH] gallium/nir: Add shader-based blending helpers

2018-11-12 Thread Eric Anholt
Alyssa Rosenzweig writes: >> I would love to see this up in src/compiler/nir. > > Is there a compelling reason to have it in NIR rather in Gallium, adding > an extra layer of translation overhead / indirection when coming from > Gallium blend state? (Mythical future Vulkan support, I guess?)

Re: [Mesa-dev] [PATCH v4] nir: Don't lower the local work group size if it's variable.

2018-11-12 Thread Jason Ekstrand
I think we still want to skip the lowering of SYSTEM_VALUE_LOCAL_GROUP_SIZE when that flag is set. I think this works, but we'll end up deleting one load_local_group_size intrinsic and replacing it with another which is pointless. --Jason On Mon, Nov 12, 2018 at 4:02 PM Plamena Manolova <

Re: [Mesa-dev] [RFC PATCH] gallium/nir: Add shader-based blending helpers

2018-11-12 Thread Eric Anholt
Alyssa Rosenzweig writes: > Some mobile GPUs lack fixed-function hardware for blending, instead > emulating blending via internal shaders. In particular for us, vc4 lacks > most of the fragment pipeline, implementing blending in the epilogue of > the fragment shader. Newer Malis supported by

Re: [Mesa-dev] [radeonsi] Blender/vsraytrace/fsraytrace/gsraytrace - GPUShader: compile error

2018-11-12 Thread Dieter Nützel
GREAT hint Tim! Yes, of course. /home/dieter> gcc --version gcc (SUSE Linux) 8.2.1 20181025 [gcc-8-branch revision 265488] So I have to ping SUSE to push the fix, too. Thanks a lot. Dieter Am 12.11.2018 08:28, schrieb Timothy Arceri: I'm guessing your using GCC 8.2.1 to compile Mesa? There

[Mesa-dev] [PATCH] radv: binding streamout buffers doesn't change context regs

2018-11-12 Thread Samuel Pitoiset
Signed-off-by: Samuel Pitoiset --- src/amd/vulkan/radv_cmd_buffer.c | 9 +++-- 1 file changed, 7 insertions(+), 2 deletions(-) diff --git a/src/amd/vulkan/radv_cmd_buffer.c b/src/amd/vulkan/radv_cmd_buffer.c index ee5373950f..1f22fda7c5 100644 --- a/src/amd/vulkan/radv_cmd_buffer.c +++

Re: [Mesa-dev] [PATCH] radv: set optimal OVERWRITE_COMBINER_WATERMARK on GFX9

2018-11-12 Thread Bas Nieuwenhuizen
On Thu, Nov 8, 2018 at 2:46 PM Samuel Pitoiset wrote: > > Ported from RadeonSI. > > Signed-off-by: Samuel Pitoiset > --- > src/amd/vulkan/radv_cmd_buffer.c | 7 +++ > src/amd/vulkan/radv_device.c | 12 > src/amd/vulkan/radv_private.h| 2 ++ >

[Mesa-dev] [PATCH v2] radv: set optimal OVERWRITE_COMBINER_WATERMARK on GFX9

2018-11-12 Thread Samuel Pitoiset
Ported from RadeonSI. v2: - use the current subpass instead of the whole fb Signed-off-by: Samuel Pitoiset --- src/amd/vulkan/radv_cmd_buffer.c | 21 + src/amd/vulkan/si_cmd_buffer.c | 3 --- 2 files changed, 21 insertions(+), 3 deletions(-) diff --git

Re: [Mesa-dev] [PATCH] radv: make use of num_good_cu_per_sh in si_emit_graphics() too

2018-11-12 Thread Bas Nieuwenhuizen
Reviewed-by: Bas Nieuwenhuizen On Thu, Nov 8, 2018 at 2:46 PM Samuel Pitoiset wrote: > > Signed-off-by: Samuel Pitoiset > --- > src/amd/vulkan/si_cmd_buffer.c | 3 +-- > 1 file changed, 1 insertion(+), 2 deletions(-) > > diff --git a/src/amd/vulkan/si_cmd_buffer.c

Re: [Mesa-dev] [PATCH v2] radv: set optimal OVERWRITE_COMBINER_WATERMARK on GFX9

2018-11-12 Thread Bas Nieuwenhuizen
Reviewed-by: Bas Nieuwenhuizen On Mon, Nov 12, 2018 at 9:44 AM Samuel Pitoiset wrote: > > Ported from RadeonSI. > > v2: - use the current subpass instead of the whole fb > > Signed-off-by: Samuel Pitoiset > --- > src/amd/vulkan/radv_cmd_buffer.c | 21 + >

Re: [Mesa-dev] [PATCH 2/2] radv: clean up setting partial_es_wave for distributed tess on VI

2018-11-12 Thread Bas Nieuwenhuizen
Reviewed-by: Bas Nieuwenhuizen On Thu, Nov 8, 2018 at 1:58 PM Samuel Pitoiset wrote: > > Only needed when the pipeline actually uses tessellation. I don't > think that changes anything, except improving readability. > > Signed-off-by: Samuel Pitoiset > --- > src/amd/vulkan/radv_pipeline.c | 11

[Mesa-dev] [Bug 108713] Gallium: use after free with transform feedback

2018-11-12 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=108713 --- Comment #1 from Gert Wollny --- I'll send a patch shortly. -- You are receiving this mail because: You are the QA Contact for the bug. You are the assignee for the bug.___ mesa-dev mailing list

[Mesa-dev] [PATCH v2] nir: Allow to skip integer ops in nir_lower_to_source_mods

2018-11-12 Thread Gert Wollny
From: Gert Wollny Some hardware supports source mods only for float operations. Make it possible to skip lowering to source mods in these cases. v2: use option flags instead of a boolean (Jason Ekstrand) Signed-off-by: Gert Wollny --- src/compiler/nir/nir.h | 10 ++-

[Mesa-dev] [Bug 108720] System crash vulkan cemu

2018-11-12 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=108720 Bug ID: 108720 Summary: System crash vulkan cemu Product: Mesa Version: git Hardware: Other OS: All Status: NEW Severity: major Priority:

[Mesa-dev] [PATCH 2/4] st/xa: Support higher color precision for solid pictures

2018-11-12 Thread Thomas Hellstrom
The only solid fill picture type we supported only had 8 bit color channels. Add a new solid picture type that supports float channels. Signed-off-by: Thomas Hellstrom Reviewed-by: Brian Paul --- src/gallium/state_trackers/xa/xa_composite.c | 106 ++-

[Mesa-dev] [PATCH 1/4] st/xa: Render update. Better support for solid pictures

2018-11-12 Thread Thomas Hellstrom
Remove unused and obsolete code for gradients and component-alpha Support solid source- and mask pictures using a variable number of samplers in the composite pipeline rather than the fixed number we used before. Tested using rendercheck for XA. Signed-off-by: Thomas Hellstrom Reviewed-by:

[Mesa-dev] [PATCH 3/4] st/xa: Support a couple of new formats

2018-11-12 Thread Thomas Hellstrom
Signed-off-by: Thomas Hellstrom Reviewed-by: Brian Paul --- src/gallium/state_trackers/xa/xa_tracker.c| 33 ++- src/gallium/state_trackers/xa/xa_tracker.h.in | 5 +++ 2 files changed, 29 insertions(+), 9 deletions(-) diff --git a/src/gallium/state_trackers/xa/xa_tracker.c

[Mesa-dev] [PATCH 4/4] st/xa: Bump minor

2018-11-12 Thread Thomas Hellstrom
Bump minor to signal support for new formats and higher precision solid pictures. Signed-off-by: Thomas Hellstrom Reviewed-by: Brian Paul --- configure.ac | 2 +- src/gallium/state_trackers/xa/meson.build | 2 +- 2 files changed, 2 insertions(+), 2 deletions(-)

Re: [Mesa-dev] [PATCH 2/5] intel/decoders: read ring buffer length

2018-11-12 Thread Toni Lönnberg
Reviewed-by: Toni Lönnberg On Fri, Nov 09, 2018 at 04:49:10PM +, Lionel Landwerlin wrote: > Use this value to limit reading the ring buffer. > > Signed-off-by: Lionel Landwerlin > --- > src/intel/tools/aubinator.c | 4 +++- > src/intel/tools/aubinator_viewer.cpp | 3 ++- > 2

[Mesa-dev] [PATCH mesa] xmlpool: update translation po files

2018-11-12 Thread Eric Engestrom
These files are close to 4 years out of date; a lot's changed since. Let's just check in a recently-regenerated version. Changes generated by running `ninja xmlpool-{pot,update-po,gmo}`. Cc: Dylan Baker Signed-off-by: Eric Engestrom --- src/util/xmlpool/ca.po | 178

Re: [Mesa-dev] [PATCH 3/4] i965: avoid 'unused variable' and 'may be used uninitialized' warnings

2018-11-12 Thread andrey simiklit
On Fri, Nov 9, 2018 at 3:19 PM Eric Engestrom wrote: > On Tuesday, 2018-09-11 15:42:06 +0300, asimiklit.w...@gmail.com wrote: > > From: Andrii Simiklit > > > > 1. brw_blorp.c:1502:4: warning: > > ‘num_layers’ may be used uninitialized in this function > > 2. brw_blorp.c:1502:4: warning: > >

Re: [Mesa-dev] [PATCH mesa 2/3] REVIEWERS: add Emil as EGL reviewer

2018-11-12 Thread Emil Velikov
On Wed, 31 Oct 2018 at 16:51, Emil Velikov wrote: > > On Wed, 31 Oct 2018 at 14:54, Eric Engestrom wrote: > > > > Cc: Emil Velikov > > Signed-off-by: Eric Engestrom > > Acked-by: Emil Velikov > Should be made it clearer - the ack is for the whole series. HTH Emil

[Mesa-dev] [PATCH] mesa: Reference count shaders that are used by transform feedback objects

2018-11-12 Thread Gert Wollny
From: Gert Wollny Transform feedback objects may hold a pointer to a shader program, and at least in Gallium, this must be a valid pointer until ctx->Driver.EndTransformFeedback in glEndTransformFeedback has been called - which is conform with the spec that any program that is part of a current

Re: [Mesa-dev] [PATCH 16/28] Replace IROUND_POS with _mesa_roundevenf

2018-11-12 Thread Erik Faye-Lund
On Fri, 2018-11-09 at 10:40 -0800, Dylan Baker wrote: > Which has the same behavior. Does it? I'm not so sure... IROUND_POS seems to round to nearest integer depending on the FPU rounding mode, _mesa_roundevenf rounds to the nearest *even* value regardless of the FPU rounding mode, no? I'm not

Re: [Mesa-dev] [PATCH mesa 3/3] meson: fix wayland-less builds

2018-11-12 Thread Emil Velikov
On Sun, 11 Nov 2018 at 18:39, Eric Engestrom wrote: > > On Thursday, 2018-10-11 16:36:15 +0100, Eric Engestrom wrote: > > Those empty variables in the !wayland case are useless and running that > > meson.build with them breaks the build: > > > > [287/850] Generating

Re: [Mesa-dev] [PATCH v2] nir: Handle variables dependent on the local work group size.

2018-11-12 Thread Manolova, Plamena
Hi Jason, Thank you so much for reviewing! In my initial series for ARB_compute_variable_group_size (https://patchwork.freedesktop.org/patch/228130) from which this is extracted, I moved lowering these variables to brw_nir_lower_cs_intrinsics and did what you're suggesting i.e. I used the

[Mesa-dev] [PATCH v3] virgl: native fence fd support

2018-11-12 Thread Robert Foss
From: Gustavo Padovan Following the support for fences on the virtio driver add support for native fence on virgl. This was somewhat based on the freedeno one. Signed-off-by: Gustavo Padovan Signed-off-by: Robert Foss --- Retransmission: Missed CC-ing mesa-dev Linux virtgpu fences branch:

[Mesa-dev] [PATCH 07/14] freedreno/ir3: move disasm and optmsgs debug flags

2018-11-12 Thread Rob Clark
Move them to IR3_SHADER_DEBUG so we can remove ir3's dependency on fd_mesa_debug. Signed-off-by: Rob Clark --- src/gallium/drivers/freedreno/freedreno_screen.c | 3 +-- src/gallium/drivers/freedreno/freedreno_util.h | 2 +- src/gallium/drivers/freedreno/ir3/ir3_cmdline.c | 6 +++---

[Mesa-dev] [PATCH 13/14] freedreno/ir3: remove u_inlines usage

2018-11-12 Thread Rob Clark
Signed-off-by: Rob Clark --- .../drivers/freedreno/ir3/ir3_compiler_nir.c | 20 +-- 1 file changed, 10 insertions(+), 10 deletions(-) diff --git a/src/gallium/drivers/freedreno/ir3/ir3_compiler_nir.c b/src/gallium/drivers/freedreno/ir3/ir3_compiler_nir.c index

[Mesa-dev] [PATCH 05/14] freedreno: remove shader_stage_name()

2018-11-12 Thread Rob Clark
Signed-off-by: Rob Clark --- src/gallium/drivers/freedreno/disasm.h| 19 --- .../drivers/freedreno/ir3/ir3_compiler_nir.c | 2 +- .../drivers/freedreno/ir3/ir3_shader.c| 3 ++- 3 files changed, 3 insertions(+), 21 deletions(-) diff --git

[Mesa-dev] [PATCH 09/14] freedreno/ir3: use env_var_as_unsigned()

2018-11-12 Thread Rob Clark
Signed-off-by: Rob Clark --- src/gallium/drivers/freedreno/freedreno_util.h | 12 src/gallium/drivers/freedreno/ir3/ir3_nir.c| 4 ++-- 2 files changed, 2 insertions(+), 14 deletions(-) diff --git a/src/gallium/drivers/freedreno/freedreno_util.h

[Mesa-dev] [PATCH 12/14] freedreno/ir3: split up ir3_shader

2018-11-12 Thread Rob Clark
Split the parts that are gallium specific into ir3_gallium so the rest can move to a common location outside of gallium. Signed-off-by: Rob Clark --- .../drivers/freedreno/Makefile.sources| 2 + src/gallium/drivers/freedreno/a3xx/fd3_emit.h | 2 +-

[Mesa-dev] [PATCH 06/14] freedreno: FD_SHADER_DEBUG -> IR3_SHADER_DEBUG

2018-11-12 Thread Rob Clark
Only used by ir3, so move it into ir3 to be more self contained. Signed-off-by: Rob Clark --- src/gallium/drivers/freedreno/disasm.h| 21 --- .../drivers/freedreno/freedreno_screen.c | 12 --- .../drivers/freedreno/ir3/ir3_compiler.c | 13

[Mesa-dev] [PATCH 03/14] freedreno/ir3: standalone compiler updates

2018-11-12 Thread Rob Clark
Signed-off-by: Rob Clark --- .../drivers/freedreno/ir3/ir3_cmdline.c | 33 +++ 1 file changed, 27 insertions(+), 6 deletions(-) diff --git a/src/gallium/drivers/freedreno/ir3/ir3_cmdline.c b/src/gallium/drivers/freedreno/ir3/ir3_cmdline.c index bf801953d00..39f6c12c6bc

[Mesa-dev] [PATCH 14/14] freedreno: move ir3 to common location

2018-11-12 Thread Rob Clark
Move (most of) the ir3 compiler to src/freedreno/ir3 so that it can be re-used by some future vulkan driver. The parts that are gallium specific have been refactored out and remain in the gallium driver. Getting the move done now so that it can happen before further refactoring to support a6xx

[Mesa-dev] [PATCH 11/14] freedreno/ir3: remove pipe_stream_output_info dependency

2018-11-12 Thread Rob Clark
A bit annoying to have to copy into our own struct. But this is something the compiler really needs to know, at least on earlier generations where streamout is implemented in shader. Signed-off-by: Rob Clark --- src/gallium/drivers/freedreno/a5xx/fd5_emit.c | 2 +-

[Mesa-dev] [PATCH 01/14] freedreno/drm: remove dependency on gallium driver

2018-11-12 Thread Rob Clark
Prep work to move drm to a common location. Slightly hacky, but the softpin debug flag is only temporary. Signed-off-by: Rob Clark --- src/gallium/drivers/freedreno/drm/msm_pipe.c | 13 +++-- 1 file changed, 11 insertions(+), 2 deletions(-) diff --git

Re: [Mesa-dev] [PATCH] mesa: Reference count shaders that are used by transform feedback objects

2018-11-12 Thread Emil Velikov
On Mon, 12 Nov 2018 at 11:34, Gert Wollny wrote: > > From: Gert Wollny > > Transform feedback objects may hold a pointer to a shader program, and > at least in Gallium, this must be a valid pointer until > ctx->Driver.EndTransformFeedback in glEndTransformFeedback has been called > - which is

[Mesa-dev] [PATCH 00/14] freedreno: move all the things

2018-11-12 Thread Rob Clark
Moves drm and the ir3 compiler to src/freedreno so it can be eventually re-used by vulkan driver. I mostly just wanted to get the splitup and move of ir3 out of the way now before I embark on some ir3 refactoring to handle some of the instructions that use a different encoding on a6xx+ vs earlier

[Mesa-dev] [PATCH 02/14] freedreno: move drm to common location

2018-11-12 Thread Rob Clark
So that we can re-use at least parts of it for vulkan driver, and so that we can move ir3 to a common location (which uses fd_bo to allocate storage for shaders) Signed-off-by: Rob Clark --- configure.ac | 2 + src/Makefile.am | 4

[Mesa-dev] [PATCH 04/14] freedreno: shader_t -> gl_shader_stage

2018-11-12 Thread Rob Clark
Just massive search/replace for the most part. Step towards removing ir3 dependency on disasm.h which is shared by a2xx. One step closer to being able to move ir3 out of gallium. Signed-off-by: Rob Clark --- .../drivers/freedreno/a2xx/disasm-a2xx.c | 10 ++---

[Mesa-dev] [PATCH 08/14] util: env_var_as_unsigned() helper

2018-11-12 Thread Rob Clark
So I can drop env2u() helper from freedreno_util.h and get rid of one small ir3 dependency on gallium/freedreno Signed-off-by: Rob Clark --- src/util/debug.c | 12 src/util/debug.h | 2 ++ 2 files changed, 14 insertions(+) diff --git a/src/util/debug.c b/src/util/debug.c index

[Mesa-dev] [PATCH 10/14] freedreno/ir3: some header file cleanup

2018-11-12 Thread Rob Clark
Clean up some of the low-hanging-fruit usages of freedreno_util.h Signed-off-by: Rob Clark --- .../drivers/freedreno/ir3/disasm-a3xx.c| 7 ++- src/gallium/drivers/freedreno/ir3/instr-a3xx.h | 3 +++ src/gallium/drivers/freedreno/ir3/ir3.c| 3 ++-

Re: [Mesa-dev] [PATCH 2/3] nir: combine fmul and fadd across ffma operations

2018-11-12 Thread Jason Ekstrand
In general, you're not supposed to mess around with the precision of fma... What we do in the Intel drivers is to leave fma split, apply operations, and then we have a special mul+add fusion pass we run at the end. Leaving them split allows for exactly this kind of optimization without mixing up

Re: [Mesa-dev] [PATCH 1/2] nir: add lowering for ffloor

2018-11-12 Thread Jason Ekstrand
On Mon, Nov 12, 2018 at 1:29 PM Christian Gmeiner < christian.gmei...@gmail.com> wrote: > Hi Jason > > Am Sa., 1. Sep. 2018 um 21:23 Uhr schrieb Jason Ekstrand < > ja...@jlekstrand.net>: > > > > I don't think either of these work for negative numbers > > > > I would like to land this patch - can

Re: [Mesa-dev] [PATCH 1/3] nir: add fceil lowering

2018-11-12 Thread Jason Ekstrand
Reviewed-by: Jason Ekstrand On Mon, Nov 12, 2018 at 12:17 PM Jonathan Marek wrote: > lowers ceil(x) as -floor(-x) > > Signed-off-by: Jonathan Marek > --- > src/compiler/nir/nir.h| 3 +++ > src/compiler/nir/nir_opt_algebraic.py | 1 + > 2 files changed, 4 insertions(+) > >

Re: [Mesa-dev] [PATCH 1/2] radv: cleanup and document a Hawaii bug with offchip buffers

2018-11-12 Thread Bas Nieuwenhuizen
Not sure I agree this is a cleanup. The code here is independent of hardware, while the initialization of tess_offchip_block_dw_size in radv_device.c has the specific hardware this applies to. We add unnecessary asserts here and I'd argue the hardware bug should be explained when

[Mesa-dev] [Bug 108713] Gallium: use after free with transform feedback

2018-11-12 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=108713 Bug ID: 108713 Summary: Gallium: use after free with transform feedback Product: Mesa Version: git Hardware: Other OS: All Status: NEW Severity: normal

Re: [Mesa-dev] [PATCH 1/4] intel/tools: avoid 'ignoring return value' and 'unused variable' warnings

2018-11-12 Thread andrey simiklit
On Fri, Nov 9, 2018 at 2:45 PM Eric Engestrom wrote: > On Tuesday, 2018-09-11 15:42:04 +0300, asimiklit.w...@gmail.com wrote: > > From: Andrii Simiklit > > > > 1. tools/i965_disasm.c:58:4: warning: > > ignoring return value of ‘fread’, > > declared with attribute warn_unused_result > >

Re: [Mesa-dev] [PATCH 1/2] nir: add lowering for ffloor

2018-11-12 Thread Christian Gmeiner
Hi Jason Am Sa., 1. Sep. 2018 um 21:23 Uhr schrieb Jason Ekstrand : > > I don't think either of these work for negative numbers > I would like to land this patch - can you provide some details why this does not work for negative numbers? > On September 1, 2018 14:16:11 Christian Gmeiner >

Re: [Mesa-dev] [PATCH 1/3] nir: add fceil lowering

2018-11-12 Thread Christian Gmeiner
Am Mo., 12. Nov. 2018 um 19:17 Uhr schrieb Jonathan Marek : > > lowers ceil(x) as -floor(-x) > > Signed-off-by: Jonathan Marek Reviewed-by: Christian Gmeiner > --- > src/compiler/nir/nir.h| 3 +++ > src/compiler/nir/nir_opt_algebraic.py | 1 + > 2 files changed, 4

[Mesa-dev] [ANNOUNCE] Mesa 18.2.5 release candidate

2018-11-12 Thread Juan A. Suarez Romero
Hello list, The candidate for the Mesa 18.2.5 is now available. Currently we have: - 25 queued - 0 nominated (outstanding) - and 2 rejected patch The current queue consists of: A patch for nine state tracker that fixes several crashes using nine's thread_submit feature. There are other

Re: [Mesa-dev] [PATCH mesa 3/3] meson: fix wayland-less builds

2018-11-12 Thread Dylan Baker
Quoting Eric Engestrom (2018-11-11 10:39:16) > On Thursday, 2018-10-11 16:36:15 +0100, Eric Engestrom wrote: > > Those empty variables in the !wayland case are useless and running that > > meson.build with them breaks the build: > > > > [287/850] Generating wayland-drm-client-protocol.h with a

[Mesa-dev] [Bug 105328] Can't include gl and gles headers simultaneously on non-64 bit architectures

2018-11-12 Thread bugzilla-daemon
https://bugs.freedesktop.org/show_bug.cgi?id=105328 --- Comment #8 from Emil Velikov --- Matt has noticed that this is causing problems on Gentoo https://bugs.gentoo.org/660594 Although it's related to building gst the problem is the same. The gst check was introduced in 2017 with

Re: [Mesa-dev] [PATCH v2] nir: Handle variables dependent on the local work group size.

2018-11-12 Thread Jason Ekstrand
On November 12, 2018 08:25:50 "Manolova, Plamena" wrote: Hi Jason, Thank you so much for reviewing! In my initial series for ARB_compute_variable_group_size (https://patchwork.freedesktop.org/patch/228130) from which this is extracted, I moved lowering these variables to

[Mesa-dev] [PATCH 3/3] glsl/nir: glsl_to_nir fixes for native_integers=false

2018-11-12 Thread Jonathan Marek
Two parts: 1. for intructions that have a BOOL source, insert b2f to so that the backend can identify the source as a BOOL and perform the conversion from NIR_TRUE/NIR_FALSE 2. add missing type conversions (out_type is always GLSL_TYPE_FLOAT, so we are missing some conversion instructions): float

[Mesa-dev] [PATCH 2/3] nir: combine fmul and fadd across ffma operations

2018-11-12 Thread Jonathan Marek
This works by moving the fadd up across the ffma operations, so that it can eventually can be combined with a fmul. I'm not sure it works in all cases, but it works in all the common cases. Example: matrix * vec4(coord, 1.0) is compiled as: fmul, ffma, ffma, fadd and with this patch:

[Mesa-dev] [PATCH 1/3] nir: add fceil lowering

2018-11-12 Thread Jonathan Marek
lowers ceil(x) as -floor(-x) Signed-off-by: Jonathan Marek --- src/compiler/nir/nir.h| 3 +++ src/compiler/nir/nir_opt_algebraic.py | 1 + 2 files changed, 4 insertions(+) diff --git a/src/compiler/nir/nir.h b/src/compiler/nir/nir.h index dc3c729dee..f9b32a5daf 100644 ---

Re: [Mesa-dev] [PATCH mesa] xmlpool: update translation po files

2018-11-12 Thread Emil Velikov
On Mon, 12 Nov 2018 at 14:24, Eric Engestrom wrote: > > These files are close to 4 years out of date; a lot's changed since. > Let's just check in a recently-regenerated version. > Worth removing them from git and letting the build regenerate them as needed? Either way, the patch is Acked-by:

Re: [Mesa-dev] [PATCH 16/28] Replace IROUND_POS with _mesa_roundevenf

2018-11-12 Thread Dylan Baker
Quoting Erik Faye-Lund (2018-11-12 04:51:47) > On Fri, 2018-11-09 at 10:40 -0800, Dylan Baker wrote: > > Which has the same behavior. > > Does it? I'm not so sure... IROUND_POS seems to round to nearest > integer depending on the FPU rounding mode, _mesa_roundevenf rounds to > the nearest *even*

[Mesa-dev] [PATCH 1/6] radv: remove unnecessary goto in the fast clear paths

2018-11-12 Thread Samuel Pitoiset
Signed-off-by: Samuel Pitoiset --- src/amd/vulkan/radv_meta_clear.c | 52 +++- 1 file changed, 24 insertions(+), 28 deletions(-) diff --git a/src/amd/vulkan/radv_meta_clear.c b/src/amd/vulkan/radv_meta_clear.c index c8a7f80d55..43da0c8dfe 100644 ---

[Mesa-dev] [PATCH 4/6] radv: check allowed fast HTILE clears a bit earlier

2018-11-12 Thread Samuel Pitoiset
Signed-off-by: Samuel Pitoiset --- src/amd/vulkan/radv_meta_clear.c | 5 + 1 file changed, 5 insertions(+) diff --git a/src/amd/vulkan/radv_meta_clear.c b/src/amd/vulkan/radv_meta_clear.c index 47030549a5..5beb4cd7be 100644 --- a/src/amd/vulkan/radv_meta_clear.c +++

[Mesa-dev] [PATCH 6/6] radv: implement fast HTILE clears for depth or stencil only on GFX9

2018-11-12 Thread Samuel Pitoiset
This allows to fast clear the depth part (or the stencil part) of a depth+stencil surface when HTILE is enabled. I didn't test on GFX8, so it's disabled currently. This gives a very nice boost, for example when clearing the depth aspect of a 4096x4096 D32_SFLOAT_S8_UINT image (18x faster).

[Mesa-dev] [PATCH 5/6] radv: rewrite the condition that checks allowed depth/stencil values

2018-11-12 Thread Samuel Pitoiset
Signed-off-by: Samuel Pitoiset --- src/amd/vulkan/radv_meta_clear.c | 12 1 file changed, 4 insertions(+), 8 deletions(-) diff --git a/src/amd/vulkan/radv_meta_clear.c b/src/amd/vulkan/radv_meta_clear.c index 5beb4cd7be..b3128d021d 100644 --- a/src/amd/vulkan/radv_meta_clear.c +++

[Mesa-dev] [PATCH 2/6] radv: add radv_get_htile_fast_clear_value() helper

2018-11-12 Thread Samuel Pitoiset
Signed-off-by: Samuel Pitoiset --- src/amd/vulkan/radv_meta_clear.c | 21 ++--- 1 file changed, 18 insertions(+), 3 deletions(-) diff --git a/src/amd/vulkan/radv_meta_clear.c b/src/amd/vulkan/radv_meta_clear.c index 43da0c8dfe..a44baefd3a 100644 ---

[Mesa-dev] [PATCH 3/6] radv: add radv_is_fast_clear_{depth, stencil}_allowed() helpers

2018-11-12 Thread Samuel Pitoiset
Signed-off-by: Samuel Pitoiset --- src/amd/vulkan/radv_meta_clear.c | 18 -- 1 file changed, 16 insertions(+), 2 deletions(-) diff --git a/src/amd/vulkan/radv_meta_clear.c b/src/amd/vulkan/radv_meta_clear.c index a44baefd3a..47030549a5 100644 ---

Re: [Mesa-dev] [PATCH] st/mesa: don't do L3 thread pinning for Blender

2018-11-12 Thread Michel Dänzer
On 2018-11-08 6:23 a.m., Marek Olšák wrote: > Thanks a lot man. I'll reconsider this depending on the results I receive. > > I may also just pin the Mesa threads and keep the app thread intact. It > should perform OK with glthread, but not without glthread. > > Another option is to have the

Re: [Mesa-dev] [PATCH mesa] xmlpool: update translation po files

2018-11-12 Thread Dylan Baker
Quoting Eric Engestrom (2018-11-12 09:47:22) > On Monday, 2018-11-12 16:56:32 +, Emil Velikov wrote: > > On Mon, 12 Nov 2018 at 14:24, Eric Engestrom > > wrote: > > > > > > These files are close to 4 years out of date; a lot's changed since. > > > Let's just check in a recently-regenerated

Re: [Mesa-dev] [PATCH 4/5] nir: Make nir_lower_clip_vs optionally work with variables.

2018-11-12 Thread Eric Anholt
Kenneth Graunke writes: > The way nir_lower_clip_vs() works with store_output intrinsics makes a > ton of assumptions about the driver_location field. > > In i965, I'd rather do this lowering early and work with variables. > ir3 and vc4 could probably do that as well, but I'm not sure exactly >

Re: [Mesa-dev] [PATCH 6/6] radv: implement fast HTILE clears for depth or stencil only on GFX9

2018-11-12 Thread Connor Abbott
On Mon, Nov 12, 2018 at 9:34 PM Bas Nieuwenhuizen wrote: > > On Mon, Nov 12, 2018 at 5:55 PM Samuel Pitoiset > wrote: > > > > This allows to fast clear the depth part (or the stencil part) > > of a depth+stencil surface when HTILE is enabled. I didn't test > > on GFX8, so it's disabled

[Mesa-dev] [PATCH] radv: Use structured intrinsics instead of indexing workaround for GFX9.

2018-11-12 Thread Bas Nieuwenhuizen
These force the index to be used in the instruction so we don't need the workaround. Totals: SGPRS: 1321642 -> 1321802 (0.01 %) VGPRS: 943664 -> 943788 (0.01 %) Spilled SGPRs: 28468 -> 28480 (0.04 %) Spilled VGPRs: 88 -> 89 (1.14 %) Private memory VGPRs: 0 -> 0 (0.00 %) Scratch size: 80 -> 80

[Mesa-dev] [PATCH v3] nir: Don't lower the local work group size if it's variable.

2018-11-12 Thread Plamena Manolova
If the local work group size is variable it won't be available at compile time so we can't lower it in nir_lower_system_values(). Signed-off-by: Plamena Manolova --- src/compiler/nir/nir_lower_system_values.c | 25 +++-- 1 file changed, 19 insertions(+), 6 deletions(-) diff

[Mesa-dev] [PATCH v4] nir: Don't lower the local work group size if it's variable.

2018-11-12 Thread Plamena Manolova
If the local work group size is variable it won't be available at compile time so we can't lower it in nir_lower_system_values(). Signed-off-by: Plamena Manolova --- src/compiler/nir/nir_lower_system_values.c | 24 ++-- 1 file changed, 18 insertions(+), 6 deletions(-) diff

Re: [Mesa-dev] [PATCH 6/6] radv: implement fast HTILE clears for depth or stencil only on GFX9

2018-11-12 Thread Bas Nieuwenhuizen
On Mon, Nov 12, 2018 at 5:55 PM Samuel Pitoiset wrote: > > This allows to fast clear the depth part (or the stencil part) > of a depth+stencil surface when HTILE is enabled. I didn't test > on GFX8, so it's disabled currently. > > This gives a very nice boost, for example when clearing the depth

[Mesa-dev] [PATCH] radv: set PA.SC_CONSERVATIVE_RASTERIZATION.NULL_SQUAD_AA_MASK_ENABLE

2018-11-12 Thread Samuel Pitoiset
Signed-off-by: Samuel Pitoiset --- src/amd/vulkan/radv_pipeline.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/src/amd/vulkan/radv_pipeline.c b/src/amd/vulkan/radv_pipeline.c index f999383018..a126c03607 100644 --- a/src/amd/vulkan/radv_pipeline.c +++

Re: [Mesa-dev] [PATCH] st/mesa: disable L3 thread pinning

2018-11-12 Thread Edmondo Tommasina
Patch is: Reviewed-by: Edmondo Tommasina On Mon, Nov 12, 2018 at 9:45 PM Marek Olšák wrote: > From: Marek Olšák > > This implementation can have massive drawbacks. > > Cc: 18.3 > --- > src/mesa/state_tracker/st_manager.c | 9 - > 1 file changed, 9 deletions(-) > > diff --git

Re: [Mesa-dev] [PATCH] st/mesa: disable L3 thread pinning

2018-11-12 Thread Marek Olšák
Thanks, pushed. Marek On Mon, Nov 12, 2018 at 4:14 PM Edmondo Tommasina < edmondo.tommas...@gmail.com> wrote: > > Patch is: > Reviewed-by: Edmondo Tommasina > > > On Mon, Nov 12, 2018 at 9:45 PM Marek Olšák wrote: > >> From: Marek Olšák >> >> This implementation can have massive drawbacks.

Re: [Mesa-dev] [PATCH] st/mesa: disable L3 thread pinning

2018-11-12 Thread Axel Davy
Is there any replacement plan with a new feature ? Axel On 12/11/2018 21:45, Marek Olšák wrote: From: Marek Olšák This implementation can have massive drawbacks. Cc: 18.3 --- src/mesa/state_tracker/st_manager.c | 9 - 1 file changed, 9 deletions(-) diff --git

Re: [Mesa-dev] [PATCH] st/mesa: disable L3 thread pinning

2018-11-12 Thread Marek Olšák
Yes, I have a better plan, but I don't know if it will work. Marek On Mon, Nov 12, 2018 at 4:19 PM Axel Davy wrote: > Is there any replacement plan with a new feature ? > > Axel > > On 12/11/2018 21:45, Marek Olšák wrote: > > From: Marek Olšák > > > > This implementation can have massive

Re: [Mesa-dev] [PATCH] st/mesa: don't do L3 thread pinning for Blender

2018-11-12 Thread Edmondo Tommasina
On Mon, Nov 12, 2018 at 6:43 PM Michel Dänzer wrote: > On 2018-11-08 6:23 a.m., Marek Olšák wrote: > > Thanks a lot man. I'll reconsider this depending on the results I > receive. > > > > I may also just pin the Mesa threads and keep the app thread intact. It > > should perform OK with glthread,

[Mesa-dev] [PATCH] st/mesa: disable L3 thread pinning

2018-11-12 Thread Marek Olšák
From: Marek Olšák This implementation can have massive drawbacks. Cc: 18.3 --- src/mesa/state_tracker/st_manager.c | 9 - 1 file changed, 9 deletions(-) diff --git a/src/mesa/state_tracker/st_manager.c b/src/mesa/state_tracker/st_manager.c index 690d5bc2313..076ad42646d 100644 ---

[Mesa-dev] [PATCH 1/2] util/ralloc: Switch from DEBUG to NDEBUG

2018-11-12 Thread Matt Turner
The debug code is all asserts, so protect it with the same thing that controls assert. --- src/util/ralloc.c | 18 -- 1 file changed, 4 insertions(+), 14 deletions(-) diff --git a/src/util/ralloc.c b/src/util/ralloc.c index 5d77f75ee85..745b4cf1226 100644 --- a/src/util/ralloc.c

[Mesa-dev] [PATCH 2/2] util/ralloc: Make sizeof(linear_header) a multiple of 8

2018-11-12 Thread Matt Turner
Prior to this patch sizeof(linear_header) was 20 bytes in a non-debug build on 32-bit platforms. We do some pointer arithmetic to calculate the next available location with ptr = (linear_size_chunk *)((char *)[1] + latest->offset); in linear_alloc_child(). The [1] adds 20 bytes, so an

Re: [Mesa-dev] [PATCH mesa] xmlpool: update translation po files

2018-11-12 Thread Dylan Baker
This is fine with me, Reviewed-by: Dylan Baker Quoting Eric Engestrom (2018-11-12 06:24:33) > These files are close to 4 years out of date; a lot's changed since. > Let's just check in a recently-regenerated version. > > Changes generated by running `ninja xmlpool-{pot,update-po,gmo}`. > > Cc:

  1   2   >