Re: [Intel-gfx] 3.15-rc2: i915 regression: only top 20% of screen works in X

2014-04-24 Thread Pavel Machek
Hi!

After update to 3.15-rc2, only top 20% of screen works on X.
   
00:02.0 VGA compatible controller: Intel Corporation 4 Series Chipset
Integrated Graphics Controller (rev 03)
   
00:02.1 Display controller: Intel Corporation 4 Series Chipset
Integrated Graphics Controller (rev 03)
   Subsystem: Intel Corporation Device d614
   Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop-
   ParErr- Stepping- SERR- FastB2B- DisINTx-
   Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast
   TAbort- TAbort- MAbort- SERR- PERR- INTx-
   Latency: 0
   Region 0: Memory at d040 (64-bit, non-prefetchable)
   [size=1M]
   Capabilities: access denied
   
This worked before. I believe it worked in 3.14. It definitely works
in 3.11-rc2.
   
   Screenshot or more detailed description of what only top 20% of
   screen works in X means?
   Anything in dmesg?
  
  Actually yes, dmesg suggests it is quite
  sick. drivers/gpu/drm/drm_mm.c:767 warning triggered
  repeatedly. Also.. initial framebuffer does not work ; I don't seem to
  see anything before X start up. (This is Debian 6.0.9)
 
 That says that i915.ko failed to initialise the GPU (or rather the GPU
 wasn't responding) and bailed during module load. The key line here is

 [drm:init_ring_common] *ERROR* render ring initialization failed ctl
 0001f001 head 2034 tail  start 0012f000

Actually, I'm not using modules -- everything is build-in. Can you try
that config? Perhaps you can then reproduce the failure.

 Jiri has been seeing a similar issue creep in during resume, but it is
 not reliable enough to bisect. Is your boot failure reliable enough to
 bisect? Also drm-intel-nightly should mitigate this failure and allow
 i915.ko to continue to load and run X, which would be worth testing to
 make sure that works as intended.

So far it failed 100% of time, but this is my main machine, so
bringing it down for extended periods is no-no.

Greetings to Prague :-). Jiri, do you have  i915 a module or build-in?

Best regards,
Pavel
-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) 
http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx


Re: [Intel-gfx] 3.15-rc2: i915 regression: only top 20% of screen works in X

2014-04-24 Thread Pavel Machek
On Thu 2014-04-24 08:03:04, Daniel Vetter wrote:
 On Thu, Apr 24, 2014 at 7:50 AM, Chris Wilson ch...@chris-wilson.co.uk 
 wrote:
  That says that i915.ko failed to initialise the GPU (or rather the GPU
  wasn't responding) and bailed during module load. The key line here is
 
  [drm:init_ring_common] *ERROR* render ring initialization failed ctl
  0001f001 head 2034 tail  start 0012f000
 
  Jiri has been seeing a similar issue creep in during resume, but it is
  not reliable enough to bisect. Is your boot failure reliable enough to
  bisect? Also drm-intel-nightly should mitigate this failure and allow
  i915.ko to continue to load and run X, which would be worth testing to
  make sure that works as intended.
 
 Oh right, g4x going beserk :( Apparently something changed in 3.15
 somewhere which made this much more likely, but like Chris said in
 Jiri's case it's too unreliable to reproduce for a bisect. We've had
 this comego pretty much ever since kms support was merged and never
 tracked it down.

So far I went back to 3.14, and yes, graphics works there.

Back to 3.15, put it to smaller monitor. This time, top 30% of screen
works, and last working scanline is copied to the rest 60% of
screen. [With the exception of mouse cursor, that somehow affects that
magically.]

 The bug is https://bugs.freedesktop.org/show_bug.cgi?id=76554
 
 Like Chris said please test latest drm-intel-nighlty from
 http://cgit.freedesktop.org/drm-intel to make sure that the recently
 merged mitigation measures work properly. But those won't get your gpu
 back, only the display and it's only for 3.16. We're still hunting a
 proper fix for 3.15.

So you know where the bug is or not?

 And if you can indeed reliably reproduce this a bisect could be really useful.

I'm confused. And yes, it seems reliable. But I'm not sure if I see
same bug you are talking about.
Pavel
-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) 
http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx


Re: [Intel-gfx] 3.15-rc2: i915 regression: only top 20% of screen works in X

2014-04-24 Thread Pavel Machek
Hi!

 Like Chris said please test latest drm-intel-nighlty from
 http://cgit.freedesktop.org/drm-intel to make sure that the recently
 merged mitigation measures work properly. But those won't get your gpu
 back, only the display and it's only for 3.16. We're still hunting a

What does it means won't get my gpu back, just my display? Gpu is
the thing driving the display... no?

I checked out drm-intel-nightly. Now I can see some kernel messages
scrolling around on vga console (improvement?), but then end up with
completely blank screen, and dead machine, AFAICT. Can't seem to ping
it.

BTW you might want to fix these...

drivers/gpu/drm/i915/i915_cmd_parser.c: In function ‘i915_parse_cmds’:
drivers/gpu/drm/i915/i915_cmd_parser.c:919: warning: format ‘%td’
expects type ‘ptrdiff_t’, but argument 5 has type ‘long unsigned int’
  CC  drivers/gpu/drm/i915/i915_gem_context.o
  LD [M]  drivers/gpu/drm/udl/udl.o
  CC  drivers/gpu/drm/i915/i915_gem_debug.o
  CC  drivers/gpu/drm/i915/i915_gem_dmabuf.o
  CC  drivers/gpu/drm/i915/i915_gem_evict.o
  CC  drivers/gpu/drm/i915/i915_gem_execbuffer.o
  CC  drivers/gpu/drm/i915/i915_gem_gtt.o
drivers/gpu/drm/i915/i915_gem_gtt.c: In function
‘gen8_ggtt_insert_entries’:
drivers/gpu/drm/i915/i915_gem_gtt.c:1389: warning: ‘addr’ may be used
uninitialized in this function
drivers/gpu/drm/i915/i915_gem_gtt.c: In function
‘gen6_ggtt_insert_entries’:
drivers/gpu/drm/i915/i915_gem_gtt.c:1435: warning: ‘addr’ may be used
uninitialized in this function

Thanks,
Pavel


-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) 
http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx


Re: [Intel-gfx] 3.15-rc2: i915 regression: only top 20% of screen works in X

2014-04-24 Thread Jiri Kosina
On Thu, 24 Apr 2014, Pavel Machek wrote:

   [drm:init_ring_common] *ERROR* render ring initialization failed ctl
   0001f001 head 2034 tail  start 0012f000
  
   Jiri has been seeing a similar issue creep in during resume, but it is
   not reliable enough to bisect. Is your boot failure reliable enough to
   bisect? Also drm-intel-nightly should mitigate this failure and allow
   i915.ko to continue to load and run X, which would be worth testing to
   make sure that works as intended.
  
  Oh right, g4x going beserk :( Apparently something changed in 3.15
  somewhere which made this much more likely, but like Chris said in
  Jiri's case it's too unreliable to reproduce for a bisect. We've had
  this comego pretty much ever since kms support was merged and never
  tracked it down.
 
 So far I went back to 3.14, and yes, graphics works there.
 
 Back to 3.15, put it to smaller monitor. This time, top 30% of screen
 works, and last working scanline is copied to the rest 60% of
 screen. [With the exception of mouse cursor, that somehow affects that
 magically.]
 
  The bug is https://bugs.freedesktop.org/show_bug.cgi?id=76554
  
  Like Chris said please test latest drm-intel-nighlty from
  http://cgit.freedesktop.org/drm-intel to make sure that the recently
  merged mitigation measures work properly. But those won't get your gpu
  back, only the display and it's only for 3.16. We're still hunting a
  proper fix for 3.15.
 
 So you know where the bug is or not?

No, but there is a workaround for cases kernel finds out that it cannot 
execute GPU commands (because the rings failed to initialize), and instead 
it falls back to CPU rendering directly into the framebuffer.

So it's highly sub-optimal, but works around the complete wreckage.

-- 
Jiri Kosina
SUSE Labs

___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx


Re: [Intel-gfx] 3.15-rc2: i915 regression: only top 20% of screen works in X

2014-04-24 Thread Chris Wilson
On Thu, Apr 24, 2014 at 03:43:45PM +0200, Pavel Machek wrote:
 Hi!
 
  Like Chris said please test latest drm-intel-nighlty from
  http://cgit.freedesktop.org/drm-intel to make sure that the recently
  merged mitigation measures work properly. But those won't get your gpu
  back, only the display and it's only for 3.16. We're still hunting a
 
 What does it means won't get my gpu back, just my display? Gpu is
 the thing driving the display... no?

We think of it as discrete functional units, i.e. the display engine is
separate from the GPU. What happens here is that the render engine
doesn't respond to our commands to set it up, but that should not
prevent us from setting the register values in the display engine to drive
the actual outputs.
 
 I checked out drm-intel-nightly. Now I can see some kernel messages
 scrolling around on vga console (improvement?), but then end up with
 completely blank screen, and dead machine, AFAICT. Can't seem to ping
 it.

Eeek. Do you have netconsole handy?
-Chris

-- 
Chris Wilson, Intel Open Source Technology Centre
___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx


Re: [Intel-gfx] 3.15-rc2: i915 regression: only top 20% of screen works in X

2014-04-24 Thread Jiri Kosina
On Thu, 24 Apr 2014, Pavel Machek wrote:

  That says that i915.ko failed to initialise the GPU (or rather the GPU 
  wasn't responding) and bailed during module load. The key line here is
 
  [drm:init_ring_common] *ERROR* render ring initialization failed ctl
  0001f001 head 2034 tail  start 0012f000
 
 Actually, I'm not using modules -- everything is build-in. Can you try
 that config? Perhaps you can then reproduce the failure.

  Jiri has been seeing a similar issue creep in during resume, but it is 
  not reliable enough to bisect. Is your boot failure reliable enough to 
  bisect? Also drm-intel-nightly should mitigate this failure and allow 
  i915.ko to continue to load and run X, which would be worth testing to 
  make sure that works as intended.
 
 So far it failed 100% of time, but this is my main machine, so
 bringing it down for extended periods is no-no.
 
 Greetings to Prague :-). Jiri, do you have  i915 a module or build-in?

Hi there Pavel, long time no see :)

I have i915 built as a module.

The thing is, that I can't really bisect this; the problem is, that I've 
started seeing this quite a long time ago, but only very sporadically 
(something like once in 40 resumes). But it was getting gradually worse 
and worse with newer kernels, and with current tree (both Linus' tree and 
drm-intel-testing), I am getting it 100% reproducibly on every resume.

This makes bisection completely impossible though.

-- 
Jiri Kosina
___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx


Re: [Intel-gfx] 3.15-rc2: i915 regression: only top 20% of screen works in X

2014-04-23 Thread Daniel Vetter
On Wed, Apr 23, 2014 at 10:22 PM, Pavel Machek pa...@ucw.cz wrote:
 After update to 3.15-rc2, only top 20% of screen works on X.

 00:02.0 VGA compatible controller: Intel Corporation 4 Series Chipset
 Integrated Graphics Controller (rev 03)

 00:02.1 Display controller: Intel Corporation 4 Series Chipset
 Integrated Graphics Controller (rev 03)
Subsystem: Intel Corporation Device d614
Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop-
ParErr- Stepping- SERR- FastB2B- DisINTx-
Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast
TAbort- TAbort- MAbort- SERR- PERR- INTx-
Latency: 0
Region 0: Memory at d040 (64-bit, non-prefetchable)
[size=1M]
Capabilities: access denied

 This worked before. I believe it worked in 3.14. It definitely works
 in 3.11-rc2.

Screenshot or more detailed description of what only top 20% of
screen works in X means?
Anything in dmesg?
bisect result presuming that it reproduces reliably?

Thanks, Daniel
-- 
Daniel Vetter
Software Engineer, Intel Corporation
+41 (0) 79 365 57 48 - http://blog.ffwll.ch
___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx


Re: [Intel-gfx] 3.15-rc2: i915 regression: only top 20% of screen works in X

2014-04-23 Thread Pavel Machek
On Wed 2014-04-23 23:09:45, Daniel Vetter wrote:
 On Wed, Apr 23, 2014 at 10:22 PM, Pavel Machek pa...@ucw.cz wrote:
  After update to 3.15-rc2, only top 20% of screen works on X.
 
  00:02.0 VGA compatible controller: Intel Corporation 4 Series Chipset
  Integrated Graphics Controller (rev 03)
 
  00:02.1 Display controller: Intel Corporation 4 Series Chipset
  Integrated Graphics Controller (rev 03)
 Subsystem: Intel Corporation Device d614
 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop-
 ParErr- Stepping- SERR- FastB2B- DisINTx-
 Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast
 TAbort- TAbort- MAbort- SERR- PERR- INTx-
 Latency: 0
 Region 0: Memory at d040 (64-bit, non-prefetchable)
 [size=1M]
 Capabilities: access denied
 
  This worked before. I believe it worked in 3.14. It definitely works
  in 3.11-rc2.
 
 Screenshot or more detailed description of what only top 20% of
 screen works in X means?

Well, top cca 20% is ok, then I got repeated pattern of some part of screen.

 Anything in dmesg?

That will take a look.

 bisect result presuming that it reproduces reliably?

If there's no other chance, I guess I could do bisect. But
first Do you have similar hardware? Does it work for you? Are
there any experimental changes that went in recently and I should
try reverting first?

Pavel
-- 
(english) http://www.livejournal.com/~pavelmachek
(cesky, pictures) 
http://atrey.karlin.mff.cuni.cz/~pavel/picture/horses/blog.html
___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx


Re: [Intel-gfx] 3.15-rc2: i915 regression: only top 20% of screen works in X

2014-04-23 Thread Pavel Machek
Hi!

  After update to 3.15-rc2, only top 20% of screen works on X.
 
  00:02.0 VGA compatible controller: Intel Corporation 4 Series Chipset
  Integrated Graphics Controller (rev 03)
 
  00:02.1 Display controller: Intel Corporation 4 Series Chipset
  Integrated Graphics Controller (rev 03)
 Subsystem: Intel Corporation Device d614
 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop-
 ParErr- Stepping- SERR- FastB2B- DisINTx-
 Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast
 TAbort- TAbort- MAbort- SERR- PERR- INTx-
 Latency: 0
 Region 0: Memory at d040 (64-bit, non-prefetchable)
 [size=1M]
 Capabilities: access denied
 
  This worked before. I believe it worked in 3.14. It definitely works
  in 3.11-rc2.
 
 Screenshot or more detailed description of what only top 20% of
 screen works in X means?
 Anything in dmesg?

Actually yes, dmesg suggests it is quite
sick. drivers/gpu/drm/drm_mm.c:767 warning triggered
repeatedly. Also.. initial framebuffer does not work ; I don't seem to
see anything before X start up. (This is Debian 6.0.9)

Best regards,
Pavel

Initializing cgroup subsys cpu
Linux version 3.15.0-rc1+ (pavel@amd) (gcc version 4.4.5 (Debian 4.4.5-8) ) 
#335 SMP Sat Apr 19 17:58:01 CEST 2014
e820: BIOS-provided physical RAM map:
BIOS-e820: [mem 0x-0x0009ebff] usable
BIOS-e820: [mem 0x0009ec00-0x0009] reserved
BIOS-e820: [mem 0x000e-0x000f] reserved
BIOS-e820: [mem 0x0010-0xbd87dfff] usable
BIOS-e820: [mem 0xbd87e000-0xbd900fff] ACPI NVS
BIOS-e820: [mem 0xbd901000-0xbda42fff] reserved
BIOS-e820: [mem 0xbda43000-0xbda56fff] ACPI NVS
BIOS-e820: [mem 0xbda57000-0xbdb54fff] reserved
BIOS-e820: [mem 0xbdb55000-0xbdb5dfff] ACPI data
BIOS-e820: [mem 0xbdb5e000-0xbdb67fff] ACPI NVS
BIOS-e820: [mem 0xbdb68000-0xbdb88fff] reserved
BIOS-e820: [mem 0xbdb89000-0xbdb8efff] ACPI NVS
BIOS-e820: [mem 0xbdb8f000-0xbdcf] usable
BIOS-e820: [mem 0xfec0-0xfec00fff] reserved
BIOS-e820: [mem 0xfed0-0xfed00fff] reserved
BIOS-e820: [mem 0xfed1c000-0xfed8] reserved
BIOS-e820: [mem 0xfff0-0x] reserved
BIOS-e820: [mem 0x0001-0x00013fff] usable
Notice: NX (Execute Disable) protection cannot be enabled: non-PAE kernel!
SMBIOS 2.4 present.
DMI:  /DG41MJ, BIOS MJG4110H.86A.0006.2009.1223.1155 12/23/2009
e820: update [mem 0x-0x0fff] usable == reserved
e820: remove [mem 0x000a-0x000f] usable
e820: last_pfn = 0xbdd00 max_arch_pfn = 0x10
MTRR default type: uncachable
MTRR fixed ranges enabled:
  0-9 write-back
  A-E7FFF uncachable
  E8000-F write-protect
MTRR variable ranges enabled:
  0 base 0 mask F write-back
  1 base 1 mask FC000 write-back
  2 base 0BDD0 mask 0 write-through
  3 base 0BDE0 mask FFFE0 uncachable
  4 base 0BE00 mask FFE00 uncachable
  5 base 0C000 mask FC000 uncachable
  6 disabled
x86 PAT enabled: cpu 0, old 0x7040600070406, new 0x7010600070106
initial memory mapped: [mem 0x-0x053f]
Base memory trampoline at [c009a000] 9a000 size 16384
init_memory_mapping: [mem 0x-0x000f]
 [mem 0x-0x000f] page 4k
init_memory_mapping: [mem 0x3700-0x373f]
 [mem 0x3700-0x373f] page 4k
BRK [0x05109000, 0x05109fff] PGTABLE
init_memory_mapping: [mem 0x3000-0x36ff]
 [mem 0x3000-0x36ff] page 4k
BRK [0x0510a000, 0x0510afff] PGTABLE
BRK [0x0510b000, 0x0510bfff] PGTABLE
BRK [0x0510c000, 0x0510cfff] PGTABLE
BRK [0x0510d000, 0x0510dfff] PGTABLE
BRK [0x0510e000, 0x0510efff] PGTABLE
init_memory_mapping: [mem 0x0010-0x2fff]
 [mem 0x0010-0x2fff] page 4k
init_memory_mapping: [mem 0x3740-0x377fdfff]
 [mem 0x3740-0x377fdfff] page 4k
ACPI: RSDP 0x000F03C0 24 (v02 INTEL )
ACPI: XSDT 0xBDB5CE18 44 (v01 INTEL  DG41MJ   0006 MSFT 00010013)
ACPI: FACP 0xBDB5BD98 F4 (v04 INTEL  DG41RQ   0006 MSFT 00010013)
ACPI BIOS Warning (bug): 32/64X FACS address mismatch in FADT: 
0xBDB5FF40/0xBDB64F40, using 64-bit address (20140214/tbfadt-271)
ACPI: DSDT 0xBDB55018 005983 (v01 INTEL  DG41MJ   0006 INTL 20051117)
ACPI: FACS 0xBDB64F40 40
ACPI: APIC 0xBDB5BF18 6C (v02 INTEL  DG41MJ   0006 MSFT 00010013)
ACPI: MCFG 0xBDB66E18 3C (v01 INTEL  DG41MJ   0006 MSFT 0097)
ACPI: HPET 0xBDB66D98 38 (v01 INTEL  DG41MJ   0006 AMI. 0003)
ACPI: Local APIC address 0xfee0
2149MB HIGHMEM available.
887MB LOWMEM available.
  mapped low ram: 0 - 377fe000
  low ram: 0 - 377fe000
Zone 

Re: [Intel-gfx] 3.15-rc2: i915 regression: only top 20% of screen works in X

2014-04-23 Thread Chris Wilson
On Thu, Apr 24, 2014 at 12:09:52AM +0200, Pavel Machek wrote:
 Hi!
 
   After update to 3.15-rc2, only top 20% of screen works on X.
  
   00:02.0 VGA compatible controller: Intel Corporation 4 Series Chipset
   Integrated Graphics Controller (rev 03)
  
   00:02.1 Display controller: Intel Corporation 4 Series Chipset
   Integrated Graphics Controller (rev 03)
  Subsystem: Intel Corporation Device d614
  Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop-
  ParErr- Stepping- SERR- FastB2B- DisINTx-
  Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast
  TAbort- TAbort- MAbort- SERR- PERR- INTx-
  Latency: 0
  Region 0: Memory at d040 (64-bit, non-prefetchable)
  [size=1M]
  Capabilities: access denied
  
   This worked before. I believe it worked in 3.14. It definitely works
   in 3.11-rc2.
  
  Screenshot or more detailed description of what only top 20% of
  screen works in X means?
  Anything in dmesg?
 
 Actually yes, dmesg suggests it is quite
 sick. drivers/gpu/drm/drm_mm.c:767 warning triggered
 repeatedly. Also.. initial framebuffer does not work ; I don't seem to
 see anything before X start up. (This is Debian 6.0.9)

That says that i915.ko failed to initialise the GPU (or rather the GPU
wasn't responding) and bailed during module load. The key line here is

[drm:init_ring_common] *ERROR* render ring initialization failed ctl
0001f001 head 2034 tail  start 0012f000

Jiri has been seeing a similar issue creep in during resume, but it is
not reliable enough to bisect. Is your boot failure reliable enough to
bisect? Also drm-intel-nightly should mitigate this failure and allow
i915.ko to continue to load and run X, which would be worth testing to
make sure that works as intended.
-Chris

-- 
Chris Wilson, Intel Open Source Technology Centre
___
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx