Prior to the GLSL CSE pass, all of our testing happened to have a freshly
computed temporary in op[1], from the multiply by 16 to get a byte offset.
As of CSE you'll get var_refs of a reused value when you've got multiple
loads from the same offset.
Make a proper temporary for computing our tempor
Kenneth Graunke writes:
> Would you mind adding the "mesa: " prefix to the commit title?
I wouldn't mind at all. Thanks for the reminder. Only, I did happen to
push out that commit before I got your message. :-P
>> -ASSERT(!q->Active); /* should be caught earlier */
>> +i
Previously, when packing geometry shader input varyings like this:
in float foo[3];
in float bar[3];
lower_packed_varyings would declare a packed varying like this:
(declare (shader_in flat) (array ivec4 3) packed:foo[0],bar[0])
That's confusing, since the packed varying acutally st
That is indeed insane :)
Reviewed-by: Chris Forbes
On Thu, Oct 31, 2013 at 12:17 PM, Kenneth Graunke wrote:
> Previously, the write of each 32-bit half might land in separate batch
> buffers, which is insane.
>
> Signed-off-by: Kenneth Graunke
> ---
> src/mesa/drivers/dri/i965/gen6_queryobj.c
On Mon, Oct 28, 2013 at 11:59:51AM -0500, Aaron Watry wrote:
> I just ran a quick.tests run on evergreen without any regressions.
>
> Patch looks good to me, and doesn't seem to cause any regressions on
> the hardware I have available to test with.
>
I have pushed this patch, thanks for testing.
On Tue, Oct 29, 2013 at 08:34:16PM +0100, Kai Wasserb??ch wrote:
> Tom Stellard schrieb am 29.10.2013 17:48:
> > On Sun, Oct 27, 2013 at 07:36:07PM +0100, Kai Wasserb??ch wrote:
> >> Fixes a boat load of Piglit tests for me, which crashed like fdo#70913
> >> before.
> >>
> >> Thanks to Michel D??nz
On Wed, Oct 30, 2013 at 08:38:37AM -0500, Aaron Watry wrote:
> Reviewed and Tested-by: Aaron Watry
I have pushed this patch, thanks for testing.
>
> On Tue, Oct 29, 2013 at 11:48 AM, Tom Stellard wrote:
> > From: Tom Stellard
> >
> > The ICD loader should be responsible for installing headers
Previously, when doing intrastage and interstage interface block
linking, we only checked the interface type; this prevented us from
catching some link errors.
We now check the following additional constraints:
- For intrastage linking, the presence/absence of interface names must
match.
- For
Previously, the write of each 32-bit half might land in separate batch
buffers, which is insane.
Signed-off-by: Kenneth Graunke
---
src/mesa/drivers/dri/i965/gen6_queryobj.c | 5 +
1 file changed, 1 insertion(+), 4 deletions(-)
diff --git a/src/mesa/drivers/dri/i965/gen6_queryobj.c
b/src/m
https://bugs.freedesktop.org/show_bug.cgi?id=71022
Emil Velikov changed:
What|Removed |Added
Status|NEW |RESOLVED
Resolution|---
fix: intel_screen.c:1320:4: warning: initialization from
incompatible pointer type [enabled by default]
---
src/mesa/drivers/dri/i965/brw_context.c | 2 +-
src/mesa/drivers/dri/i965/brw_context.h | 2 +-
2 files changed, 2 insertions(+), 2 deletions(-)
diff --git a/src/mesa/drivers/dri/i965/brw_c
https://bugs.freedesktop.org/show_bug.cgi?id=71022
--- Comment #9 from Rob Clark ---
fwiw, this would be nice to push.. it would at least fix my tinderbox:
http://tinderbox.x.org/builds/2013-10-30-0017/logs/libGL/#configure
--
You are receiving this mail because:
You are the assignee for the bu
On 30 October 2013 14:13, Ian Romanick wrote:
> On 10/30/2013 01:01 PM, Chad Versace wrote:
> > For the sake of git-describe, I propose we tag the 10.0 fork
> > point on master as 'mesa-10.0-fork', 'mesa-10.0-base', or
> > something similair. How do people feel about that?
>
> I support this idea
On 10/30/2013 01:01 PM, Chad Versace wrote:
> For the sake of git-describe, I propose we tag the 10.0 fork
> point on master as 'mesa-10.0-fork', 'mesa-10.0-base', or
> something similair. How do people feel about that?
I support this idea.
To avoid confusion with tags on the branch, I think the
From: Marek Olšák
SOPK_32 doesn't use SCC.
This is probably unused by Mesa.
---
Please apply.
lib/Target/R600/SIInstrInfo.td| 5 +
lib/Target/R600/SIInstructions.td | 22 +++---
2 files changed, 16 insertions(+), 11 deletions(-)
diff --git a/lib/Target/R600/SIInstrInf
For the sake of git-describe, I propose we tag the 10.0 fork
point on master as 'mesa-10.0-fork', 'mesa-10.0-base', or
something similair. How do people feel about that?
It's been a long time since git-describe has given useful
info. It's reported snb-magic for a long time.
$ git-describ
Thanks, patch is committed.
___
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev
Please disregard this patch. I've implemented the workaround in Mesa
as Christian suggested.
Marek
On Wed, Oct 30, 2013 at 5:19 PM, Christian König
wrote:
> Off hand I don't know any use case exept constant buffers where we use
> S_BUFFER_LOAD, but anybody who uses it should be aware how to use
From: Marek Olšák
---
The UBO series continues here.
This is an alternative solution to scalar-memory-read hw bug.
src/gallium/drivers/radeonsi/radeonsi_pipe.c | 22 ++
src/gallium/drivers/radeonsi/radeonsi_pipe.h | 1 +
src/gallium/drivers/radeonsi/si_descriptors.c | 6
Vedran,
Just to add a bit more context -- programming a strange viewport
causes us to turn off guardband clipping. I haven't measured the
impact on dota2, but when it was initially enabled it was a nice win
for some other games.
-- Chris
On Thu, Oct 31, 2013 at 3:38 AM, Chad Versace
wrote:
> On
On 30 October 2013 11:55, Rogovin, Kevin wrote:
> Hello all,
>
> I've got some questions and I would appreciate if anyone could help me
> out. Here goes:
>
> I've been digging through brw_try_draw_prims(), and trying to figure out
> what it is doing, so far this is what I see:
>
> 1) it is ess
On 30 October 2013 12:15, Paul Berry wrote:
> On 29 October 2013 13:28, Eric Anholt wrote:
>
>> This is a resend of the series after updates for Paul's review feedback.
>> This time I've done a full piglit run with spilling forced on and no
>> regressions were found. It can also be found at gen
On 29 October 2013 13:28, Eric Anholt wrote:
> This is a resend of the series after updates for Paul's review feedback.
> This time I've done a full piglit run with spilling forced on and no
> regressions were found. It can also be found at gen7-scratch-read of my
> tree.
>
I responded to patch
On 29 October 2013 13:28, Eric Anholt wrote:
> I believe this will never happen in SIMD8 mode, but it could for SIMD16
> when we fix it.
> ---
> src/mesa/drivers/dri/i965/brw_fs.h| 1 +
> src/mesa/drivers/dri/i965/brw_fs_reg_allocate.cpp | 23
> +++
> src/mes
On 28 October 2013 15:34, Eric Anholt wrote:
> When faced with a million instructions that all became candidates at the
> same time (none of which individually reduce register pressure), the ones
> on the critical path are more likely to be the ones that will free up some
> candidates soon.
>
> s
Hello all,
I've got some questions and I would appreciate if anyone could help me out.
Here goes:
I've been digging through brw_try_draw_prims(), and trying to figure out what
it is doing, so far this is what I see:
1) it is essentially called each time a non-degenerate "real drawing"
glDr
Fredrik Höglund writes:
> This extension decouples the vertex attributes from the vertex buffer
> bindings, making it possible to change buffers without having to
> respecify the formats. The mapping between attributes and vertex
> buffer bindings also becomes configurable.
Patch 5 as posted do
On 28 October 2013 14:19, Eric Anholt wrote:
> This only operates on constant/uniform values for now, because otherwise
> I'd
> have to deal with killing my available CSE entries when assignments happen,
> and getting even this working in the tree ir was painful enough.
>
> As is, it has the foll
On Tue, Oct 29, 2013 at 10:04 PM, Paul Berry wrote:
> On 29 October 2013 19:22, Anuj Phogat wrote:
>>
>>
>>
>>
>> On Mon, Oct 28, 2013 at 5:10 PM, Paul Berry
>> wrote:
>>>
>>> On 25 October 2013 16:45, Anuj Phogat wrote:
V2:
- Update comments
- Use fs_reg(0x) in AN
https://bugs.freedesktop.org/show_bug.cgi?id=69874
--- Comment #7 from Emil Velikov ---
(In reply to comment #6)
> bin_PROGRAMS = abc
> abc_SOURCES = sub/main.c sub/abc.c
>
> When you set subdir-objects, it always creates object files in the same dir
> as the source file. E.g: sub/main.o, sub/
On 28 October 2013 11:33, Matt Turner wrote:
> The first six patches have been on the list previously, but it got
> confusing when they changed order and some were dropped.
>
> The last nine patches implement a new SEL-generating peephole, a dead
> flow control elimination pass, and some silly al
On 28 October 2013 11:31, Matt Turner wrote:
> The pass previously searched only backward from an ENDIF (and ELSE) to
> find MOVs with the same destination. This commit extends the pass to
> also search forward from the IF (and ELSE) to find matching MOVs which
> it can replace with SEL instructi
On 30 October 2013 10:23, Paul Berry wrote:
> On 30 October 2013 10:10, Matt Turner wrote:
>
>> On Wed, Oct 30, 2013 at 9:30 AM, Paul Berry
>> wrote:
>> > On 28 October 2013 11:31, Matt Turner wrote:
>> >>
>> >> fs_visitor::try_replace_with_sel optimizes only if statements whose
>> >> "then" a
On 29 October 2013 23:37, Matt Turner wrote:
> Before this patch, the following code would not be optimized even though
> the final two instructions were common to the then and else blocks:
>
>(+f0) IF
>MOV dst2 ...
>MOV dst1 ...
>MOV dst0 ...
>ELSE
>MOV dst3 ...
>MOV
On Wed, Oct 30, 2013 at 08:15:43AM +0100, David Herrmann wrote:
> Hi Tom
>
> On Tue, Oct 29, 2013 at 9:00 PM, Tom Stellard wrote:
> > From: Tom Stellard
> >
> > You can use the --enable-pipe-loader-render-nodes configure flag to
> > make the pipe-loader use render nodes for talking with the devi
On 30 October 2013 10:10, Matt Turner wrote:
> On Wed, Oct 30, 2013 at 9:30 AM, Paul Berry
> wrote:
> > On 28 October 2013 11:31, Matt Turner wrote:
> >>
> >> fs_visitor::try_replace_with_sel optimizes only if statements whose
> >> "then" and "else" bodies contain a single MOV instruction. It a
On Wed, Oct 30, 2013 at 10:06 AM, Paul Berry wrote:
> On 28 October 2013 11:31, Matt Turner wrote:
>>
>> fs_visitor::try_replace_with_sel optimizes only if statements whose
>> "then" and "else" bodies contain a single MOV instruction. It also did
>> could not handle constant arguments, since they
On 28 October 2013 11:31, Matt Turner wrote:
> total instructions in shared programs: 1287488 -> 1287457 (-0.00%)
> instructions in affected programs: 1745 -> 1714 (-1.78%)
> ---
> src/mesa/drivers/dri/i965/brw_fs_sel_peephole.cpp | 38
> +--
> 1 file changed, 21 insertio
On Wed, Oct 30, 2013 at 9:30 AM, Paul Berry wrote:
> On 28 October 2013 11:31, Matt Turner wrote:
>>
>> fs_visitor::try_replace_with_sel optimizes only if statements whose
>> "then" and "else" bodies contain a single MOV instruction. It also did
>> could not handle constant arguments, since they
On 28 October 2013 11:31, Matt Turner wrote:
> fs_visitor::try_replace_with_sel optimizes only if statements whose
> "then" and "else" bodies contain a single MOV instruction. It also did
> could not handle constant arguments, since they cause an extra MOV
> immediate to be generated (since we ha
Hi Vincent,
It turns out that it's not possible to correctly schedule uses and defs
of the OQAP register without proper alias analysis in the MachineScheduler. See
the explanation in the lds-output-queue.ll test case.
Here is an updated patch that fixes all the outstanding LDS scheduling
bugs th
https://bugs.freedesktop.org/show_bug.cgi?id=71042
Matt Turner changed:
What|Removed |Added
CC|matts...@gmail.com |
--
You are receiving this mail because:
https://bugs.freedesktop.org/show_bug.cgi?id=69874
--- Comment #6 from Shriram V ---
bin_PROGRAMS = abc
abc_SOURCES = sub/main.c sub/abc.c
When you set subdir-objects, it always creates object files in the same dir as
the source file. E.g: sub/main.o, sub/abc.o
So, its working as expected, how
On 28 October 2013 11:31, Matt Turner wrote:
> fs_visitor::try_replace_with_sel optimizes only if statements whose
> "then" and "else" bodies contain a single MOV instruction. It also did
> could not handle constant arguments, since they cause an extra MOV
>
s/did could not/could not/
> immedi
The symptom is a VM protection fault with the address of 0 (probably
because the whole descriptor contains zeros), which
should be harmless, but it spams dmesg.
Marek
On Wed, Oct 30, 2013 at 5:19 PM, Christian König
wrote:
> Off hand I don't know any use case exept constant buffers where we use
Off hand I don't know any use case exept constant buffers where we use
S_BUFFER_LOAD, but anybody who uses it should be aware how to use it.
What are the symptoms of issuing a S_BUFFER_LOAD with NumRecords=0?
Hangs or just undefined behaviour?
Christian.
Am 30.10.2013 17:00, schrieb Marek Ol
On Wed, Oct 30, 2013 at 8:59 AM, Paul Berry wrote:
> On 29 October 2013 17:23, Matt Turner wrote:
>>
>> On Tue, Oct 29, 2013 at 2:32 AM, Pohjolainen, Topi
>> wrote:
>> > On Mon, Oct 28, 2013 at 11:31:32AM -0700, Matt Turner wrote:
>> >> ---
>> >> src/mesa/drivers/dri/i965/brw_fs.cpp | 30
>> >>
Yeah, it's unusual.
What if S_BUFFER_LOAD is also used by something else, like texture
buffers, or OpenCL? Will we have to fix that as well?
Marek
On Wed, Oct 30, 2013 at 3:32 PM, Christian König
wrote:
> Mhm, I'm assumed that having NumRecord zero is actually something quite
> unusual. E.g. a
Both patches are:
Reviewed-by: Tom Stellard
On Wed, Oct 30, 2013 at 04:48:18PM +0100, Vincent Lejeune wrote:
> R600/R700 implementation of tex buffer fetch requires the result of the VFETCH
> instruction to be ANDed with R600_BUFFER_INFO_CONST_BUFFER, and the last
> channel
> to be ORed with th
On 29 October 2013 17:23, Matt Turner wrote:
> On Tue, Oct 29, 2013 at 2:32 AM, Pohjolainen, Topi
> wrote:
> > On Mon, Oct 28, 2013 at 11:31:32AM -0700, Matt Turner wrote:
> >> ---
> >> src/mesa/drivers/dri/i965/brw_fs.cpp | 30
> ++
> >> src/mesa/drivers/dri/i965/br
On R600/R700 hw the data are stored from the channel 2 of the second dword.
---
src/gallium/drivers/r600/r600_llvm.c | 15 ++-
1 file changed, 14 insertions(+), 1 deletion(-)
diff --git a/src/gallium/drivers/r600/r600_llvm.c
b/src/gallium/drivers/r600/r600_llvm.c
index d7fa5f8..5afe3
R600/R700 implementation of tex buffer fetch requires the result of the VFETCH
instruction to be ANDed with R600_BUFFER_INFO_CONST_BUFFER, and the last channel
to be ORed with the same const buffer.
---
src/gallium/drivers/r600/r600_llvm.c | 29 +
1 file changed, 29 ins
Please send patches using git-send-email and no other means. Patches
sent as attachments can't get in-line review comments. That said...
NAK this patch. Putting the function call in the assert means that the
function won't get called in non-debug builds. That will break
everything. Maybe just
On 10/30/2013 07:15 AM, Brian Paul wrote:
> On 10/29/2013 07:07 PM, Ian Romanick wrote:
>> From: Ian Romanick
>>
>> These will be used to determine whether to signal a GPU reset after
>> another context in the share group has observed a reset.
>>
>> Signed-off-by: Ian Romanick
>> ---
>> src/mes
Hi Tom
On Tue, Oct 29, 2013 at 9:00 PM, Tom Stellard wrote:
> From: Tom Stellard
>
> You can use the --enable-pipe-loader-render-nodes configure flag to
> make the pipe-loader use render nodes for talking with the device.
> ---
> configure.ac | 6 ++
> src
Hi Tom
On Tue, Oct 29, 2013 at 9:00 PM, Tom Stellard wrote:
> From: Tom Stellard
>
> DRM_IOCTL_VERSION is not currently a legal ioctl for render nodes, so we
> need to hard code the drm version when drmGetVersion() returns NULL.
> ---
> src/gallium/winsys/radeon/drm/radeon_drm_winsys.c | 36
>
All the patches look fine.
I think they are ready for merging (there are few trailing whitespace
errors when applying, but nothing bad).
We want these patches for Wayland Prime support, and it should be useful
too for X Dri3.
Axel Davy
___
mesa-d
On 10/30/2013 06:43 AM, Paul Berry wrote:
On 30 October 2013 00:00, Eric Anholt wrote:
One of the things I was wondering about for dota2 performance was
whether missing the fast clears was a big performance hit --
particularly with the fips numbers indicating a lot of time spent in
clears. Ho
Mhm, I'm assumed that having NumRecord zero is actually something quite
unusual. E.g. a shader that accesses a not defined constant buffer or
something like that. So I would rather optimize for the common use case.
Anyway branch instructions are quite expensive, you can issue something
between
On Wed, Oct 30, 2013 at 2:43 PM, Paul Berry wrote:
> - What percentage of clears are affected by this patch? Even with this
> patch, some clears still won't take the fast path (e.g. stencil clears, MSAA
> color clears, and scissored clears).
>
When I was doing my Wine tweaks for the Windows Dot
diff --git a/src/mesa/drivers/dri/i965/brw_vec4_gs.c
b/src/mesa/drivers/dri/i965/brw_vec4_gs.c
index b52d646..2ae5818 100644
--- a/src/mesa/drivers/dri/i965/brw_vec4_gs.c
+++ b/src/mesa/drivers/dri/i965/brw_vec4_gs.c
@@ -283,9 +283,7 @@ brw_upload_gs_prog(struct brw_context *brw)
if (!brw_sear
diff --git a/src/egl/wayland/wayland-drm/wayland-drm.c
b/src/egl/wayland/wayland-drm/wayland-drm.c
index 2f5acb2..e9c6e0a 100644
--- a/src/egl/wayland/wayland-drm/wayland-drm.c
+++ b/src/egl/wayland/wayland-drm/wayland-drm.c
@@ -247,8 +247,6 @@ bind_drm(struct wl_client *client, void *data, uint32
diff --git a/src/gallium/drivers/r600/evergreen_compute.c
b/src/gallium/drivers/r600/evergreen_compute.c
index ea77f5e..ffdc5c3 100644
--- a/src/gallium/drivers/r600/evergreen_compute.c
+++ b/src/gallium/drivers/r600/evergreen_compute.c
@@ -392,7 +392,6 @@ static void compute_emit_cs(struct r600_c
On 10/29/2013 07:07 PM, Ian Romanick wrote:
From: Ian Romanick
These will be used to determine whether to signal a GPU reset after
another context in the share group has observed a reset.
Signed-off-by: Ian Romanick
---
src/mesa/main/context.c | 1 +
src/mesa/main/mtypes.h | 18 +
I thought that doing S_CMPK followed by S_CBRANCH has less overhead
than doing a memory read. If we used one of
S_BUFFER_LOAD_DWORDX2,4,8,16, it wouldn't be so bad. I don't know.
Marek
On Wed, Oct 30, 2013 at 2:48 PM, Christian König
wrote:
> Am 30.10.2013 14:23, schrieb Marek Olšák:
>
>> From:
On 29 October 2013 23:31, Timothy Arceri wrote:
> On Tue, 2013-10-29 at 13:13 -0700, Eric Anholt wrote:
> > Josh Klint writes:
> >
> > > Hi,
> > >
> > > As I understand it, Mesa is in charge of providing OpenGL drivers for
> > > Intel graphics hardware running on Linux. I'm in the process of
>
Am 30.10.2013 14:23, schrieb Marek Olšák:
From: Marek Olšák
This also fixes scalar compare instructions which were always eliminated,
because they didn't have a destination of SCC.
Uff, that looks like quite a bit of overhead, isn't there a simpler
approach? Like setting the the NumRecord to
On 30 October 2013 00:00, Eric Anholt wrote:
> One of the things I was wondering about for dota2 performance was
> whether missing the fast clears was a big performance hit --
> particularly with the fips numbers indicating a lot of time spent in
> clears. However, applying this patch:
>
> diff
From: Marek Olšák
---
src/gallium/auxiliary/tgsi/tgsi_scan.c | 13 -
src/gallium/auxiliary/tgsi/tgsi_scan.h | 1 +
2 files changed, 13 insertions(+), 1 deletion(-)
diff --git a/src/gallium/auxiliary/tgsi/tgsi_scan.c
b/src/gallium/auxiliary/tgsi/tgsi_scan.c
index 05b7111..0f10556 1
Reviewed and Tested-by: Aaron Watry
On Tue, Oct 29, 2013 at 11:48 AM, Tom Stellard wrote:
> From: Tom Stellard
>
> The ICD loader should be responsible for installing headers.
> ---
> src/gallium/state_trackers/clover/Makefile.am | 21 +++--
> 1 file changed, 11 insertions(+),
From: Marek Olšák
---
src/gallium/drivers/radeonsi/radeonsi_pipe.c | 2 +-
src/gallium/drivers/radeonsi/radeonsi_shader.c | 65 +++---
src/gallium/drivers/radeonsi/si_state.c| 2 +-
src/gallium/drivers/radeonsi/si_state.h| 3 +-
4 files changed, 41 insert
https://bugs.freedesktop.org/show_bug.cgi?id=71042
Chia-I Wu changed:
What|Removed |Added
Status|NEW |RESOLVED
Resolution|---
From: Marek Olšák
This also fixes scalar compare instructions which were always eliminated,
because they didn't have a destination of SCC.
Signed-off-by: Marek Olšák
---
lib/Target/R600/SIISelLowering.cpp | 30 ++
lib/Target/R600/SIInsertWaits.cpp | 6 ++
lib/
https://bugs.freedesktop.org/show_bug.cgi?id=71042
Fabio Pedretti changed:
What|Removed |Added
CC||matts...@gmail.com
--
You are receivin
https://bugs.freedesktop.org/show_bug.cgi?id=71042
Priority: medium
Bug ID: 71042
CC: johannesoberm...@gmx.de, olva...@gmail.com
Assignee: mesa-dev@lists.freedesktop.org
Summary: [ilo] fatal error: brw_structs.h: No such file or
One of the things I was wondering about for dota2 performance was
whether missing the fast clears was a big performance hit --
particularly with the fips numbers indicating a lot of time spent in
clears. However, applying this patch:
diff --git a/src/mesa/drivers/dri/i965/brw_blorp_clear.cpp b/sr
76 matches
Mail list logo