Re: null pointer dereference while loading i915

2012-08-10 Thread Mihai Moldovan
* On 10.08.2012 07:44 PM, Mihai Moldovan wrote:
> Hm, OK.
>
> Well, I'm done now.
>
> bisect log:
>
> git bisect start
> # good: [805a6af8dba5dfdd35ec35dc52ec0122400b2610] Linux 3.2
> git bisect good 805a6af8dba5dfdd35ec35dc52ec0122400b2610
> # bad: [28a33cbc24e4256c143dce96c7d93bf423229f92] Linux 3.5
> git bisect bad 28a33cbc24e4256c143dce96c7d93bf423229f92
> # good: [49d99a2f9c4d033cc3965958a1397b1fad573dd3] Merge branch 'for-linus' of
> git://oss.sgi.com/xfs/xfs
> git bisect good 49d99a2f9c4d033cc3965958a1397b1fad573dd3
> # good: [813a95e5b4fa936bbde10ef89188932745dcd7f4] Merge tag 'pinctrl' of
> git://git.kernel.org/pub/scm/linux/kernel/git/arm/arm-soc
> git bisect good 813a95e5b4fa936bbde10ef89188932745dcd7f4
> # bad: [9978306e31a8f89bd81fbc4c49fd9aefb1d30d10] Merge branch 'for-linus' of
> git://oss.sgi.com/xfs/xfs
> git bisect bad 9978306e31a8f89bd81fbc4c49fd9aefb1d30d10
> # good: [927ad551031798d4cba49766549600bbb33872d7] Merge tag
> 'ktest-v3.5-spelling' of
> git://git.kernel.org/pub/scm/linux/kernel/git/rostedt/linux-ktest
> git bisect good 927ad551031798d4cba49766549600bbb33872d7
> # good: [2c01e7bc46f10e9190818437e564f7e0db875ae9] Merge branch 'for-linus' of
> git://git.kernel.org/pub/scm/linux/kernel/git/dtor/input
> git bisect good 2c01e7bc46f10e9190818437e564f7e0db875ae9
> # bad: [5f54d29ee9dace1e2ef4e8c9873ad4dd7a06d11a] drm/nva3/pm: make pll->pll
> mode work
> git bisect bad 5f54d29ee9dace1e2ef4e8c9873ad4dd7a06d11a
> # bad: [8b2e326dc7c5aa6952c88656d04d0d81fd85a6f8] drm/i915: Unconditionally
> initialise the interrupt workers
> git bisect bad 8b2e326dc7c5aa6952c88656d04d0d81fd85a6f8
> # bad: [f637fde434c9e3687798730c7ddd367e93666013] drm/i915: inline
> enable/disable_irq into ring->get/put_irq
> git bisect bad f637fde434c9e3687798730c7ddd367e93666013
> # bad: [23e3f9b37e7368ee8530ba99907508363feebc14] drm/i915: check for disabled
> interrupts on ValleyView
> git bisect bad 23e3f9b37e7368ee8530ba99907508363feebc14
> # good: [8489731c9bd22c27ab17a2190cd7444604abf95f] drm/i915: move clflushing
> into shmem_pread
> git bisect good 8489731c9bd22c27ab17a2190cd7444604abf95f
> # good: [3bd7d90938f1fe77de5991dc4b727843c4980b2a] drm/i915/intel_i2c: 
> refactor
> using intel_gmbus_get_adapter
> git bisect good 3bd7d90938f1fe77de5991dc4b727843c4980b2a
> # bad: [57f350b6722f9569f407872f6ead56e2d221d98a] drm/i915: add DPIO support
> git bisect bad 57f350b6722f9569f407872f6ead56e2d221d98a
> # bad: [93e537a10f2c8c0f2e74409b6cb473fc221758fa] drm/i915: split LVDS update
> code out of i9xx_crtc_mode_set
> git bisect bad 93e537a10f2c8c0f2e74409b6cb473fc221758fa
> # bad: [f2c9677be3158c31ba19f527e2be0f7a519e19d1] drm/i915/intel_i2c: allocate
> gmbus array as part of drm_i915_private
> git bisect bad f2c9677be3158c31ba19f527e2be0f7a519e19d1
> # bad: [2ed06c93a1fce057808894d73167aae03c76deaf] drm/i915/intel_i2c: gmbus
> disabled and reserved ports are invalid
> git bisect bad 2ed06c93a1fce057808894d73167aae03c76deaf

Just to be safe, I also tested git HEAD (3.6.0-rc1-00209-gf62bf17), no dice 
either.

Best regards,


Mihai



smime.p7s
Description: S/MIME Cryptographic Signature


Re: null pointer dereference while loading i915

2012-08-10 Thread Mihai Moldovan
* On 10.08.2012 06:39 PM, Daniel Vetter wrote:
> On Fri, Aug 10, 2012 at 6:05 PM, Mihai Moldovan  wrote:
>> * On 10.08.2012 12:10 PM, Daniel Vetter wrote:
>>> On Wed, Aug 8, 2012 at 6:50 AM, Mihai Moldovan  wrote:
 Hi Daniel, hi list

 ever since version 3.2.0 (maybe even earlier, but 3.0.2 is still working 
 fine),
 my box is crashing when loading the i915 driver (mode-setting enabled.)

 The current version I'm testing with is 3.5.0.

 I was able to get the BUG output (please forgive any errors/flips in the 
 output,
 I have had to transcribe the messages from the screen/images), however, 
 I'm not
 able to find out what's wrong.

 If I see it correctly, there's a null pointer dereference in a printk 
 called
 from inside gmbus_xfer. The only printk calls I can see in
 drivers/gpu/drm/i915/intel_i2c.c gmbus_xfer() however are issued by the
 DRM_DEBUG_KMS() and DRM_INFO() macros.
 Neither call looks wrong to me, I even tried to swap adapter->name with
 bus->adapter.name and make *sure* i < num is true, but haven't had any 
 success.

 I'd really like to see this bug fixed, as it's preventing me from updating 
 the
 kernel for over a year now.

 Also, while 3.0.2 works, it *does* spew error/warning messages related to 
 gmbus
 and I've had corrupted VTs in the past (albeit after a long uptime with 
 multiple
 X restarting and DVI cable unplugging/reattaching events), so maybe 
 there's a
 lot more broken than "expected".
>>> Hm, this is rather strange. gmbus should not be enable on 3.2 nor 3.0,
>>> since exactly this issue might happen. We've re-enabled gmbus again on
>>> 3.5 after having fixed this bug. Are you sure that this is plain 3.2
>>> you're running?
>> Sorry, I messed up the version numbers. Started bisecting yesterday and 
>> noticed,
>> that 3.0 up to 3.2 still work "fine" (see below), instead I've had another
>> problem with 3.2 (completely lockup after the kernel is running for a few
>> minutes, but I have no idea where this issue is coming from. Seems to be
>> happening with 3.2.0 only, so... *shrug*)
>>
>> 3.0.2   => working, gmbus warnings as posted.
>> 3.1-09933/07170 => working, NO gmbus warnings, but render errors (see below)
>> 3.2-rc2 to rc4  => working, NO gmbus warnings, but render errors (see below)
>> --- (stopped bisecting 3.0 to 3.2 as this was pointless) ---
>> --- (restarted bisecting with 3.2 to 3.5) ---
>> 3.3.0-06109 => working, gmbus warnings just like with 3.0, render errors
>> (see below)
>> 3.4.0-07487 => working, gmbus warnings, hang errors (see below)
>> ...
>>
>> I've done more steps, but have not yet finished bisecting, so stay tuned.
>> All those render errors look like that:
>>
>> [drm] capturing error event; look for more information in
>> /debug/dri/0/i915_error_state
>> render error detected, EIR: 0x0010
>>   IPEIR: 0x
>>   IPEHR: 0x0200
>>   INSTDONE: 0x
>>   INSTPS: 0x8001e025
>>   INSTDONE1: 0xbfbb
>>   ACTHD: 0x00a4203c
>> page table error
>>   PGTBL_ER: 0x0010
>> [drm:i915_report_and_clear_eir] *ERROR* EIR stuck: 0x0010, masking
>>
>> I'll finish bisecting (and hope, that my guess was right, concerning the
>> varaiant I wasn't able to build) and will post the bisect log when done.
>>
>> Meanwhile: at least for 3.0.2 and even older versions, gmbus must have been
>> enabled as I'm pretty sure I always saw those errors when booting (just
>> confirmed via logs for 3.0.0, 26.38.6, 2.6.39). Doesn't come up with 2.6.34,
>> 2.6.36.1, 3.1-..., 3.2-... though.
> Yeah, we've enabled gmbus a few times and then disabled it again due
> to bugs. Also, the usual debug messsage says gmbus even when gmbus
> isn't on ... yeah, slightly confusing, but that should be fixed, too.

Hm, OK.

Well, I'm done now.

bisect log:

git bisect start
# good: [805a6af8dba5dfdd35ec35dc52ec0122400b2610] Linux 3.2
git bisect good 805a6af8dba5dfdd35ec35dc52ec0122400b2610
# bad: [28a33cbc24e4256c143dce96c7d93bf423229f92] Linux 3.5
git bisect bad 28a33cbc24e4256c143dce96c7d93bf423229f92
# good: [49d99a2f9c4d033cc3965958a1397b1fad573dd3] Merge branch 'for-linus' of
git://oss.sgi.com/xfs/xfs
git bisect good 49d99a2f9c4d033cc3965958a1397b1fad573dd3
# good: [813a95e5b4fa936bbde10ef89188932745dcd7f4] Merge tag 'pinctrl' of
git://git.kernel.org/pub/scm/linux/kernel/git/arm/arm-soc
git bisect good 813a95e5b4fa936bbde10ef89188932745dcd7f4
# bad: [9978306e31a8f89bd81fbc4c49fd9aefb1d30d10] Merge branch 'for-linus' of
git://oss.sgi.com/xfs/xfs
git bisect bad 9978306e31a8f89bd81fbc4c49fd9aefb1d30d10
# good: [927ad551031798d4cba49766549600bbb33872d7] Merge tag
'ktest-v3.5-spelling' of
git://git.kernel.org/pub/scm/linux/kernel/git/rostedt/linux-ktest
git bisect good 927ad551031798d4cba49766549600bbb33872d7
# good: [2c01e7bc46f10e9190818437e564f7e0db875ae9] Merge branch 'for-linus' of

Re: null pointer dereference while loading i915

2012-08-10 Thread Daniel Vetter
On Fri, Aug 10, 2012 at 6:05 PM, Mihai Moldovan  wrote:
> * On 10.08.2012 12:10 PM, Daniel Vetter wrote:
>> On Wed, Aug 8, 2012 at 6:50 AM, Mihai Moldovan  wrote:
>>> Hi Daniel, hi list
>>>
>>> ever since version 3.2.0 (maybe even earlier, but 3.0.2 is still working 
>>> fine),
>>> my box is crashing when loading the i915 driver (mode-setting enabled.)
>>>
>>> The current version I'm testing with is 3.5.0.
>>>
>>> I was able to get the BUG output (please forgive any errors/flips in the 
>>> output,
>>> I have had to transcribe the messages from the screen/images), however, I'm 
>>> not
>>> able to find out what's wrong.
>>>
>>> If I see it correctly, there's a null pointer dereference in a printk called
>>> from inside gmbus_xfer. The only printk calls I can see in
>>> drivers/gpu/drm/i915/intel_i2c.c gmbus_xfer() however are issued by the
>>> DRM_DEBUG_KMS() and DRM_INFO() macros.
>>> Neither call looks wrong to me, I even tried to swap adapter->name with
>>> bus->adapter.name and make *sure* i < num is true, but haven't had any 
>>> success.
>>>
>>> I'd really like to see this bug fixed, as it's preventing me from updating 
>>> the
>>> kernel for over a year now.
>>>
>>> Also, while 3.0.2 works, it *does* spew error/warning messages related to 
>>> gmbus
>>> and I've had corrupted VTs in the past (albeit after a long uptime with 
>>> multiple
>>> X restarting and DVI cable unplugging/reattaching events), so maybe there's 
>>> a
>>> lot more broken than "expected".
>>
>> Hm, this is rather strange. gmbus should not be enable on 3.2 nor 3.0,
>> since exactly this issue might happen. We've re-enabled gmbus again on
>> 3.5 after having fixed this bug. Are you sure that this is plain 3.2
>> you're running?
>
> Sorry, I messed up the version numbers. Started bisecting yesterday and 
> noticed,
> that 3.0 up to 3.2 still work "fine" (see below), instead I've had another
> problem with 3.2 (completely lockup after the kernel is running for a few
> minutes, but I have no idea where this issue is coming from. Seems to be
> happening with 3.2.0 only, so... *shrug*)
>
> 3.0.2   => working, gmbus warnings as posted.
> 3.1-09933/07170 => working, NO gmbus warnings, but render errors (see below)
> 3.2-rc2 to rc4  => working, NO gmbus warnings, but render errors (see below)
> --- (stopped bisecting 3.0 to 3.2 as this was pointless) ---
> --- (restarted bisecting with 3.2 to 3.5) ---
> 3.3.0-06109 => working, gmbus warnings just like with 3.0, render errors
> (see below)
> 3.4.0-07487 => working, gmbus warnings, hang errors (see below)
> ...
>
> I've done more steps, but have not yet finished bisecting, so stay tuned.
> All those render errors look like that:
>
> [drm] capturing error event; look for more information in
> /debug/dri/0/i915_error_state
> render error detected, EIR: 0x0010
>   IPEIR: 0x
>   IPEHR: 0x0200
>   INSTDONE: 0x
>   INSTPS: 0x8001e025
>   INSTDONE1: 0xbfbb
>   ACTHD: 0x00a4203c
> page table error
>   PGTBL_ER: 0x0010
> [drm:i915_report_and_clear_eir] *ERROR* EIR stuck: 0x0010, masking
>
> I'll finish bisecting (and hope, that my guess was right, concerning the
> varaiant I wasn't able to build) and will post the bisect log when done.
>
> Meanwhile: at least for 3.0.2 and even older versions, gmbus must have been
> enabled as I'm pretty sure I always saw those errors when booting (just
> confirmed via logs for 3.0.0, 26.38.6, 2.6.39). Doesn't come up with 2.6.34,
> 2.6.36.1, 3.1-..., 3.2-... though.

Yeah, we've enabled gmbus a few times and then disabled it again due
to bugs. Also, the usual debug messsage says gmbus even when gmbus
isn't on ... yeah, slightly confusing, but that should be fixed, too.

For the gpu hang, please ensure that you're running the latest stable
release of everything (to avoid hunting down already known issues and
also because recent kernels dump more useful stuff), grab the entire
i915_error_state from debugfs and file a bug report with the usual
details at bugs.freedesktop.org against dri -> drm/intel.

Thanks,

Daniel
-- 
Daniel Vetter
daniel.vet...@ffwll.ch - +41 (0) 79 365 57 48 - http://blog.ffwll.ch
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: null pointer dereference while loading i915

2012-08-10 Thread Mihai Moldovan
* On 10.08.2012 12:10 PM, Daniel Vetter wrote:
> On Wed, Aug 8, 2012 at 6:50 AM, Mihai Moldovan  wrote:
>> Hi Daniel, hi list
>>
>> ever since version 3.2.0 (maybe even earlier, but 3.0.2 is still working 
>> fine),
>> my box is crashing when loading the i915 driver (mode-setting enabled.)
>>
>> The current version I'm testing with is 3.5.0.
>>
>> I was able to get the BUG output (please forgive any errors/flips in the 
>> output,
>> I have had to transcribe the messages from the screen/images), however, I'm 
>> not
>> able to find out what's wrong.
>>
>> If I see it correctly, there's a null pointer dereference in a printk called
>> from inside gmbus_xfer. The only printk calls I can see in
>> drivers/gpu/drm/i915/intel_i2c.c gmbus_xfer() however are issued by the
>> DRM_DEBUG_KMS() and DRM_INFO() macros.
>> Neither call looks wrong to me, I even tried to swap adapter->name with
>> bus->adapter.name and make *sure* i < num is true, but haven't had any 
>> success.
>>
>> I'd really like to see this bug fixed, as it's preventing me from updating 
>> the
>> kernel for over a year now.
>>
>> Also, while 3.0.2 works, it *does* spew error/warning messages related to 
>> gmbus
>> and I've had corrupted VTs in the past (albeit after a long uptime with 
>> multiple
>> X restarting and DVI cable unplugging/reattaching events), so maybe there's a
>> lot more broken than "expected".
>
> Hm, this is rather strange. gmbus should not be enable on 3.2 nor 3.0,
> since exactly this issue might happen. We've re-enabled gmbus again on
> 3.5 after having fixed this bug. Are you sure that this is plain 3.2
> you're running?

Sorry, I messed up the version numbers. Started bisecting yesterday and noticed,
that 3.0 up to 3.2 still work "fine" (see below), instead I've had another
problem with 3.2 (completely lockup after the kernel is running for a few
minutes, but I have no idea where this issue is coming from. Seems to be
happening with 3.2.0 only, so... *shrug*)

3.0.2   => working, gmbus warnings as posted.
3.1-09933/07170 => working, NO gmbus warnings, but render errors (see below)
3.2-rc2 to rc4  => working, NO gmbus warnings, but render errors (see below)
--- (stopped bisecting 3.0 to 3.2 as this was pointless) ---
--- (restarted bisecting with 3.2 to 3.5) ---
3.3.0-06109 => working, gmbus warnings just like with 3.0, render errors
(see below)
3.4.0-07487 => working, gmbus warnings, hang errors (see below)
...

I've done more steps, but have not yet finished bisecting, so stay tuned.
All those render errors look like that:

[drm] capturing error event; look for more information in
/debug/dri/0/i915_error_state
render error detected, EIR: 0x0010
  IPEIR: 0x
  IPEHR: 0x0200
  INSTDONE: 0x
  INSTPS: 0x8001e025
  INSTDONE1: 0xbfbb
  ACTHD: 0x00a4203c
page table error
  PGTBL_ER: 0x0010
[drm:i915_report_and_clear_eir] *ERROR* EIR stuck: 0x0010, masking

I'll finish bisecting (and hope, that my guess was right, concerning the
varaiant I wasn't able to build) and will post the bisect log when done.

Meanwhile: at least for 3.0.2 and even older versions, gmbus must have been
enabled as I'm pretty sure I always saw those errors when booting (just
confirmed via logs for 3.0.0, 26.38.6, 2.6.39). Doesn't come up with 2.6.34,
2.6.36.1, 3.1-..., 3.2-... though.

Best regards,


Mihai




smime.p7s
Description: S/MIME Cryptographic Signature


Re: null pointer dereference while loading i915

2012-08-10 Thread Daniel Vetter
On Wed, Aug 8, 2012 at 6:50 AM, Mihai Moldovan  wrote:
> Hi Daniel, hi list
>
> ever since version 3.2.0 (maybe even earlier, but 3.0.2 is still working 
> fine),
> my box is crashing when loading the i915 driver (mode-setting enabled.)
>
> The current version I'm testing with is 3.5.0.
>
> I was able to get the BUG output (please forgive any errors/flips in the 
> output,
> I have had to transcribe the messages from the screen/images), however, I'm 
> not
> able to find out what's wrong.
>
> If I see it correctly, there's a null pointer dereference in a printk called
> from inside gmbus_xfer. The only printk calls I can see in
> drivers/gpu/drm/i915/intel_i2c.c gmbus_xfer() however are issued by the
> DRM_DEBUG_KMS() and DRM_INFO() macros.
> Neither call looks wrong to me, I even tried to swap adapter->name with
> bus->adapter.name and make *sure* i < num is true, but haven't had any 
> success.
>
> I'd really like to see this bug fixed, as it's preventing me from updating the
> kernel for over a year now.
>
> Also, while 3.0.2 works, it *does* spew error/warning messages related to 
> gmbus
> and I've had corrupted VTs in the past (albeit after a long uptime with 
> multiple
> X restarting and DVI cable unplugging/reattaching events), so maybe there's a
> lot more broken than "expected".

Hm, this is rather strange. gmbus should not be enable on 3.2 nor 3.0,
since exactly this issue might happen. We've re-enabled gmbus again on
3.5 after having fixed this bug. Are you sure that this is plain 3.2
you're running?

Yours, Daniel
-- 
Daniel Vetter
daniel.vet...@ffwll.ch - +41 (0) 79 365 57 48 - http://blog.ffwll.ch
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: null pointer dereference while loading i915

2012-08-10 Thread Daniel Vetter
On Wed, Aug 8, 2012 at 6:50 AM, Mihai Moldovan io...@ionic.de wrote:
 Hi Daniel, hi list

 ever since version 3.2.0 (maybe even earlier, but 3.0.2 is still working 
 fine),
 my box is crashing when loading the i915 driver (mode-setting enabled.)

 The current version I'm testing with is 3.5.0.

 I was able to get the BUG output (please forgive any errors/flips in the 
 output,
 I have had to transcribe the messages from the screen/images), however, I'm 
 not
 able to find out what's wrong.

 If I see it correctly, there's a null pointer dereference in a printk called
 from inside gmbus_xfer. The only printk calls I can see in
 drivers/gpu/drm/i915/intel_i2c.c gmbus_xfer() however are issued by the
 DRM_DEBUG_KMS() and DRM_INFO() macros.
 Neither call looks wrong to me, I even tried to swap adapter-name with
 bus-adapter.name and make *sure* i  num is true, but haven't had any 
 success.

 I'd really like to see this bug fixed, as it's preventing me from updating the
 kernel for over a year now.

 Also, while 3.0.2 works, it *does* spew error/warning messages related to 
 gmbus
 and I've had corrupted VTs in the past (albeit after a long uptime with 
 multiple
 X restarting and DVI cable unplugging/reattaching events), so maybe there's a
 lot more broken than expected.

Hm, this is rather strange. gmbus should not be enable on 3.2 nor 3.0,
since exactly this issue might happen. We've re-enabled gmbus again on
3.5 after having fixed this bug. Are you sure that this is plain 3.2
you're running?

Yours, Daniel
-- 
Daniel Vetter
daniel.vet...@ffwll.ch - +41 (0) 79 365 57 48 - http://blog.ffwll.ch
--
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: null pointer dereference while loading i915

2012-08-10 Thread Mihai Moldovan
* On 10.08.2012 12:10 PM, Daniel Vetter wrote:
 On Wed, Aug 8, 2012 at 6:50 AM, Mihai Moldovan io...@ionic.de wrote:
 Hi Daniel, hi list

 ever since version 3.2.0 (maybe even earlier, but 3.0.2 is still working 
 fine),
 my box is crashing when loading the i915 driver (mode-setting enabled.)

 The current version I'm testing with is 3.5.0.

 I was able to get the BUG output (please forgive any errors/flips in the 
 output,
 I have had to transcribe the messages from the screen/images), however, I'm 
 not
 able to find out what's wrong.

 If I see it correctly, there's a null pointer dereference in a printk called
 from inside gmbus_xfer. The only printk calls I can see in
 drivers/gpu/drm/i915/intel_i2c.c gmbus_xfer() however are issued by the
 DRM_DEBUG_KMS() and DRM_INFO() macros.
 Neither call looks wrong to me, I even tried to swap adapter-name with
 bus-adapter.name and make *sure* i  num is true, but haven't had any 
 success.

 I'd really like to see this bug fixed, as it's preventing me from updating 
 the
 kernel for over a year now.

 Also, while 3.0.2 works, it *does* spew error/warning messages related to 
 gmbus
 and I've had corrupted VTs in the past (albeit after a long uptime with 
 multiple
 X restarting and DVI cable unplugging/reattaching events), so maybe there's a
 lot more broken than expected.

 Hm, this is rather strange. gmbus should not be enable on 3.2 nor 3.0,
 since exactly this issue might happen. We've re-enabled gmbus again on
 3.5 after having fixed this bug. Are you sure that this is plain 3.2
 you're running?

Sorry, I messed up the version numbers. Started bisecting yesterday and noticed,
that 3.0 up to 3.2 still work fine (see below), instead I've had another
problem with 3.2 (completely lockup after the kernel is running for a few
minutes, but I have no idea where this issue is coming from. Seems to be
happening with 3.2.0 only, so... *shrug*)

3.0.2   = working, gmbus warnings as posted.
3.1-09933/07170 = working, NO gmbus warnings, but render errors (see below)
3.2-rc2 to rc4  = working, NO gmbus warnings, but render errors (see below)
--- (stopped bisecting 3.0 to 3.2 as this was pointless) ---
--- (restarted bisecting with 3.2 to 3.5) ---
3.3.0-06109 = working, gmbus warnings just like with 3.0, render errors
(see below)
3.4.0-07487 = working, gmbus warnings, hang errors (see below)
...

I've done more steps, but have not yet finished bisecting, so stay tuned.
All those render errors look like that:

[drm] capturing error event; look for more information in
/debug/dri/0/i915_error_state
render error detected, EIR: 0x0010
  IPEIR: 0x
  IPEHR: 0x0200
  INSTDONE: 0x
  INSTPS: 0x8001e025
  INSTDONE1: 0xbfbb
  ACTHD: 0x00a4203c
page table error
  PGTBL_ER: 0x0010
[drm:i915_report_and_clear_eir] *ERROR* EIR stuck: 0x0010, masking

I'll finish bisecting (and hope, that my guess was right, concerning the
varaiant I wasn't able to build) and will post the bisect log when done.

Meanwhile: at least for 3.0.2 and even older versions, gmbus must have been
enabled as I'm pretty sure I always saw those errors when booting (just
confirmed via logs for 3.0.0, 26.38.6, 2.6.39). Doesn't come up with 2.6.34,
2.6.36.1, 3.1-..., 3.2-... though.

Best regards,


Mihai




smime.p7s
Description: S/MIME Cryptographic Signature


Re: null pointer dereference while loading i915

2012-08-10 Thread Daniel Vetter
On Fri, Aug 10, 2012 at 6:05 PM, Mihai Moldovan io...@ionic.de wrote:
 * On 10.08.2012 12:10 PM, Daniel Vetter wrote:
 On Wed, Aug 8, 2012 at 6:50 AM, Mihai Moldovan io...@ionic.de wrote:
 Hi Daniel, hi list

 ever since version 3.2.0 (maybe even earlier, but 3.0.2 is still working 
 fine),
 my box is crashing when loading the i915 driver (mode-setting enabled.)

 The current version I'm testing with is 3.5.0.

 I was able to get the BUG output (please forgive any errors/flips in the 
 output,
 I have had to transcribe the messages from the screen/images), however, I'm 
 not
 able to find out what's wrong.

 If I see it correctly, there's a null pointer dereference in a printk called
 from inside gmbus_xfer. The only printk calls I can see in
 drivers/gpu/drm/i915/intel_i2c.c gmbus_xfer() however are issued by the
 DRM_DEBUG_KMS() and DRM_INFO() macros.
 Neither call looks wrong to me, I even tried to swap adapter-name with
 bus-adapter.name and make *sure* i  num is true, but haven't had any 
 success.

 I'd really like to see this bug fixed, as it's preventing me from updating 
 the
 kernel for over a year now.

 Also, while 3.0.2 works, it *does* spew error/warning messages related to 
 gmbus
 and I've had corrupted VTs in the past (albeit after a long uptime with 
 multiple
 X restarting and DVI cable unplugging/reattaching events), so maybe there's 
 a
 lot more broken than expected.

 Hm, this is rather strange. gmbus should not be enable on 3.2 nor 3.0,
 since exactly this issue might happen. We've re-enabled gmbus again on
 3.5 after having fixed this bug. Are you sure that this is plain 3.2
 you're running?

 Sorry, I messed up the version numbers. Started bisecting yesterday and 
 noticed,
 that 3.0 up to 3.2 still work fine (see below), instead I've had another
 problem with 3.2 (completely lockup after the kernel is running for a few
 minutes, but I have no idea where this issue is coming from. Seems to be
 happening with 3.2.0 only, so... *shrug*)

 3.0.2   = working, gmbus warnings as posted.
 3.1-09933/07170 = working, NO gmbus warnings, but render errors (see below)
 3.2-rc2 to rc4  = working, NO gmbus warnings, but render errors (see below)
 --- (stopped bisecting 3.0 to 3.2 as this was pointless) ---
 --- (restarted bisecting with 3.2 to 3.5) ---
 3.3.0-06109 = working, gmbus warnings just like with 3.0, render errors
 (see below)
 3.4.0-07487 = working, gmbus warnings, hang errors (see below)
 ...

 I've done more steps, but have not yet finished bisecting, so stay tuned.
 All those render errors look like that:

 [drm] capturing error event; look for more information in
 /debug/dri/0/i915_error_state
 render error detected, EIR: 0x0010
   IPEIR: 0x
   IPEHR: 0x0200
   INSTDONE: 0x
   INSTPS: 0x8001e025
   INSTDONE1: 0xbfbb
   ACTHD: 0x00a4203c
 page table error
   PGTBL_ER: 0x0010
 [drm:i915_report_and_clear_eir] *ERROR* EIR stuck: 0x0010, masking

 I'll finish bisecting (and hope, that my guess was right, concerning the
 varaiant I wasn't able to build) and will post the bisect log when done.

 Meanwhile: at least for 3.0.2 and even older versions, gmbus must have been
 enabled as I'm pretty sure I always saw those errors when booting (just
 confirmed via logs for 3.0.0, 26.38.6, 2.6.39). Doesn't come up with 2.6.34,
 2.6.36.1, 3.1-..., 3.2-... though.

Yeah, we've enabled gmbus a few times and then disabled it again due
to bugs. Also, the usual debug messsage says gmbus even when gmbus
isn't on ... yeah, slightly confusing, but that should be fixed, too.

For the gpu hang, please ensure that you're running the latest stable
release of everything (to avoid hunting down already known issues and
also because recent kernels dump more useful stuff), grab the entire
i915_error_state from debugfs and file a bug report with the usual
details at bugs.freedesktop.org against dri - drm/intel.

Thanks,

Daniel
-- 
Daniel Vetter
daniel.vet...@ffwll.ch - +41 (0) 79 365 57 48 - http://blog.ffwll.ch
--
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: null pointer dereference while loading i915

2012-08-10 Thread Mihai Moldovan
* On 10.08.2012 06:39 PM, Daniel Vetter wrote:
 On Fri, Aug 10, 2012 at 6:05 PM, Mihai Moldovan io...@ionic.de wrote:
 * On 10.08.2012 12:10 PM, Daniel Vetter wrote:
 On Wed, Aug 8, 2012 at 6:50 AM, Mihai Moldovan io...@ionic.de wrote:
 Hi Daniel, hi list

 ever since version 3.2.0 (maybe even earlier, but 3.0.2 is still working 
 fine),
 my box is crashing when loading the i915 driver (mode-setting enabled.)

 The current version I'm testing with is 3.5.0.

 I was able to get the BUG output (please forgive any errors/flips in the 
 output,
 I have had to transcribe the messages from the screen/images), however, 
 I'm not
 able to find out what's wrong.

 If I see it correctly, there's a null pointer dereference in a printk 
 called
 from inside gmbus_xfer. The only printk calls I can see in
 drivers/gpu/drm/i915/intel_i2c.c gmbus_xfer() however are issued by the
 DRM_DEBUG_KMS() and DRM_INFO() macros.
 Neither call looks wrong to me, I even tried to swap adapter-name with
 bus-adapter.name and make *sure* i  num is true, but haven't had any 
 success.

 I'd really like to see this bug fixed, as it's preventing me from updating 
 the
 kernel for over a year now.

 Also, while 3.0.2 works, it *does* spew error/warning messages related to 
 gmbus
 and I've had corrupted VTs in the past (albeit after a long uptime with 
 multiple
 X restarting and DVI cable unplugging/reattaching events), so maybe 
 there's a
 lot more broken than expected.
 Hm, this is rather strange. gmbus should not be enable on 3.2 nor 3.0,
 since exactly this issue might happen. We've re-enabled gmbus again on
 3.5 after having fixed this bug. Are you sure that this is plain 3.2
 you're running?
 Sorry, I messed up the version numbers. Started bisecting yesterday and 
 noticed,
 that 3.0 up to 3.2 still work fine (see below), instead I've had another
 problem with 3.2 (completely lockup after the kernel is running for a few
 minutes, but I have no idea where this issue is coming from. Seems to be
 happening with 3.2.0 only, so... *shrug*)

 3.0.2   = working, gmbus warnings as posted.
 3.1-09933/07170 = working, NO gmbus warnings, but render errors (see below)
 3.2-rc2 to rc4  = working, NO gmbus warnings, but render errors (see below)
 --- (stopped bisecting 3.0 to 3.2 as this was pointless) ---
 --- (restarted bisecting with 3.2 to 3.5) ---
 3.3.0-06109 = working, gmbus warnings just like with 3.0, render errors
 (see below)
 3.4.0-07487 = working, gmbus warnings, hang errors (see below)
 ...

 I've done more steps, but have not yet finished bisecting, so stay tuned.
 All those render errors look like that:

 [drm] capturing error event; look for more information in
 /debug/dri/0/i915_error_state
 render error detected, EIR: 0x0010
   IPEIR: 0x
   IPEHR: 0x0200
   INSTDONE: 0x
   INSTPS: 0x8001e025
   INSTDONE1: 0xbfbb
   ACTHD: 0x00a4203c
 page table error
   PGTBL_ER: 0x0010
 [drm:i915_report_and_clear_eir] *ERROR* EIR stuck: 0x0010, masking

 I'll finish bisecting (and hope, that my guess was right, concerning the
 varaiant I wasn't able to build) and will post the bisect log when done.

 Meanwhile: at least for 3.0.2 and even older versions, gmbus must have been
 enabled as I'm pretty sure I always saw those errors when booting (just
 confirmed via logs for 3.0.0, 26.38.6, 2.6.39). Doesn't come up with 2.6.34,
 2.6.36.1, 3.1-..., 3.2-... though.
 Yeah, we've enabled gmbus a few times and then disabled it again due
 to bugs. Also, the usual debug messsage says gmbus even when gmbus
 isn't on ... yeah, slightly confusing, but that should be fixed, too.

Hm, OK.

Well, I'm done now.

bisect log:

git bisect start
# good: [805a6af8dba5dfdd35ec35dc52ec0122400b2610] Linux 3.2
git bisect good 805a6af8dba5dfdd35ec35dc52ec0122400b2610
# bad: [28a33cbc24e4256c143dce96c7d93bf423229f92] Linux 3.5
git bisect bad 28a33cbc24e4256c143dce96c7d93bf423229f92
# good: [49d99a2f9c4d033cc3965958a1397b1fad573dd3] Merge branch 'for-linus' of
git://oss.sgi.com/xfs/xfs
git bisect good 49d99a2f9c4d033cc3965958a1397b1fad573dd3
# good: [813a95e5b4fa936bbde10ef89188932745dcd7f4] Merge tag 'pinctrl' of
git://git.kernel.org/pub/scm/linux/kernel/git/arm/arm-soc
git bisect good 813a95e5b4fa936bbde10ef89188932745dcd7f4
# bad: [9978306e31a8f89bd81fbc4c49fd9aefb1d30d10] Merge branch 'for-linus' of
git://oss.sgi.com/xfs/xfs
git bisect bad 9978306e31a8f89bd81fbc4c49fd9aefb1d30d10
# good: [927ad551031798d4cba49766549600bbb33872d7] Merge tag
'ktest-v3.5-spelling' of
git://git.kernel.org/pub/scm/linux/kernel/git/rostedt/linux-ktest
git bisect good 927ad551031798d4cba49766549600bbb33872d7
# good: [2c01e7bc46f10e9190818437e564f7e0db875ae9] Merge branch 'for-linus' of
git://git.kernel.org/pub/scm/linux/kernel/git/dtor/input
git bisect good 2c01e7bc46f10e9190818437e564f7e0db875ae9
# bad: [5f54d29ee9dace1e2ef4e8c9873ad4dd7a06d11a] drm/nva3/pm: make pll-pll
mode work
git bisect bad 5f54d29ee9dace1e2ef4e8c9873ad4dd7a06d11a
# 

Re: null pointer dereference while loading i915

2012-08-10 Thread Mihai Moldovan
* On 10.08.2012 07:44 PM, Mihai Moldovan wrote:
 Hm, OK.

 Well, I'm done now.

 bisect log:

 git bisect start
 # good: [805a6af8dba5dfdd35ec35dc52ec0122400b2610] Linux 3.2
 git bisect good 805a6af8dba5dfdd35ec35dc52ec0122400b2610
 # bad: [28a33cbc24e4256c143dce96c7d93bf423229f92] Linux 3.5
 git bisect bad 28a33cbc24e4256c143dce96c7d93bf423229f92
 # good: [49d99a2f9c4d033cc3965958a1397b1fad573dd3] Merge branch 'for-linus' of
 git://oss.sgi.com/xfs/xfs
 git bisect good 49d99a2f9c4d033cc3965958a1397b1fad573dd3
 # good: [813a95e5b4fa936bbde10ef89188932745dcd7f4] Merge tag 'pinctrl' of
 git://git.kernel.org/pub/scm/linux/kernel/git/arm/arm-soc
 git bisect good 813a95e5b4fa936bbde10ef89188932745dcd7f4
 # bad: [9978306e31a8f89bd81fbc4c49fd9aefb1d30d10] Merge branch 'for-linus' of
 git://oss.sgi.com/xfs/xfs
 git bisect bad 9978306e31a8f89bd81fbc4c49fd9aefb1d30d10
 # good: [927ad551031798d4cba49766549600bbb33872d7] Merge tag
 'ktest-v3.5-spelling' of
 git://git.kernel.org/pub/scm/linux/kernel/git/rostedt/linux-ktest
 git bisect good 927ad551031798d4cba49766549600bbb33872d7
 # good: [2c01e7bc46f10e9190818437e564f7e0db875ae9] Merge branch 'for-linus' of
 git://git.kernel.org/pub/scm/linux/kernel/git/dtor/input
 git bisect good 2c01e7bc46f10e9190818437e564f7e0db875ae9
 # bad: [5f54d29ee9dace1e2ef4e8c9873ad4dd7a06d11a] drm/nva3/pm: make pll-pll
 mode work
 git bisect bad 5f54d29ee9dace1e2ef4e8c9873ad4dd7a06d11a
 # bad: [8b2e326dc7c5aa6952c88656d04d0d81fd85a6f8] drm/i915: Unconditionally
 initialise the interrupt workers
 git bisect bad 8b2e326dc7c5aa6952c88656d04d0d81fd85a6f8
 # bad: [f637fde434c9e3687798730c7ddd367e93666013] drm/i915: inline
 enable/disable_irq into ring-get/put_irq
 git bisect bad f637fde434c9e3687798730c7ddd367e93666013
 # bad: [23e3f9b37e7368ee8530ba99907508363feebc14] drm/i915: check for disabled
 interrupts on ValleyView
 git bisect bad 23e3f9b37e7368ee8530ba99907508363feebc14
 # good: [8489731c9bd22c27ab17a2190cd7444604abf95f] drm/i915: move clflushing
 into shmem_pread
 git bisect good 8489731c9bd22c27ab17a2190cd7444604abf95f
 # good: [3bd7d90938f1fe77de5991dc4b727843c4980b2a] drm/i915/intel_i2c: 
 refactor
 using intel_gmbus_get_adapter
 git bisect good 3bd7d90938f1fe77de5991dc4b727843c4980b2a
 # bad: [57f350b6722f9569f407872f6ead56e2d221d98a] drm/i915: add DPIO support
 git bisect bad 57f350b6722f9569f407872f6ead56e2d221d98a
 # bad: [93e537a10f2c8c0f2e74409b6cb473fc221758fa] drm/i915: split LVDS update
 code out of i9xx_crtc_mode_set
 git bisect bad 93e537a10f2c8c0f2e74409b6cb473fc221758fa
 # bad: [f2c9677be3158c31ba19f527e2be0f7a519e19d1] drm/i915/intel_i2c: allocate
 gmbus array as part of drm_i915_private
 git bisect bad f2c9677be3158c31ba19f527e2be0f7a519e19d1
 # bad: [2ed06c93a1fce057808894d73167aae03c76deaf] drm/i915/intel_i2c: gmbus
 disabled and reserved ports are invalid
 git bisect bad 2ed06c93a1fce057808894d73167aae03c76deaf

Just to be safe, I also tested git HEAD (3.6.0-rc1-00209-gf62bf17), no dice 
either.

Best regards,


Mihai



smime.p7s
Description: S/MIME Cryptographic Signature


null pointer dereference while loading i915

2012-08-07 Thread Mihai Moldovan
Hi Daniel, hi list

ever since version 3.2.0 (maybe even earlier, but 3.0.2 is still working fine),
my box is crashing when loading the i915 driver (mode-setting enabled.)

The current version I'm testing with is 3.5.0.

I was able to get the BUG output (please forgive any errors/flips in the output,
I have had to transcribe the messages from the screen/images), however, I'm not
able to find out what's wrong.

If I see it correctly, there's a null pointer dereference in a printk called
from inside gmbus_xfer. The only printk calls I can see in
drivers/gpu/drm/i915/intel_i2c.c gmbus_xfer() however are issued by the
DRM_DEBUG_KMS() and DRM_INFO() macros.
Neither call looks wrong to me, I even tried to swap adapter->name with
bus->adapter.name and make *sure* i < num is true, but haven't had any success.

I'd really like to see this bug fixed, as it's preventing me from updating the
kernel for over a year now.

Also, while 3.0.2 works, it *does* spew error/warning messages related to gmbus
and I've had corrupted VTs in the past (albeit after a long uptime with multiple
X restarting and DVI cable unplugging/reattaching events), so maybe there's a
lot more broken than "expected".

PCI-IDs:

00:02.0 VGA compatible controller [0300]: Intel Corporation 4 Series Chipset
Integrated Graphics Controller [8086:2e12] (rev 03) (prog-if 00 [VGA 
controller])
Subsystem: Intel Corporation Device [8086:1003]
00:02.1 Display controller [0380]: Intel Corporation 4 Series Chipset Integrated
Graphics Controller [8086:2e13] (rev 03)
Subsystem: Intel Corporation Device [8086:1003]

Messages are attached.

Any help is appreciated, thanks. :)

Best regards,


Mihai


i915_kernel_BUG_gmbus_nullptrderef.txt.bz2
Description: BZip2 compressed data


i915_3.0.2_warning_messages.txt.bz2
Description: BZip2 compressed data


smime.p7s
Description: S/MIME Cryptographic Signature


null pointer dereference while loading i915

2012-08-07 Thread Mihai Moldovan
Hi Daniel, hi list

ever since version 3.2.0 (maybe even earlier, but 3.0.2 is still working fine),
my box is crashing when loading the i915 driver (mode-setting enabled.)

The current version I'm testing with is 3.5.0.

I was able to get the BUG output (please forgive any errors/flips in the output,
I have had to transcribe the messages from the screen/images), however, I'm not
able to find out what's wrong.

If I see it correctly, there's a null pointer dereference in a printk called
from inside gmbus_xfer. The only printk calls I can see in
drivers/gpu/drm/i915/intel_i2c.c gmbus_xfer() however are issued by the
DRM_DEBUG_KMS() and DRM_INFO() macros.
Neither call looks wrong to me, I even tried to swap adapter-name with
bus-adapter.name and make *sure* i  num is true, but haven't had any success.

I'd really like to see this bug fixed, as it's preventing me from updating the
kernel for over a year now.

Also, while 3.0.2 works, it *does* spew error/warning messages related to gmbus
and I've had corrupted VTs in the past (albeit after a long uptime with multiple
X restarting and DVI cable unplugging/reattaching events), so maybe there's a
lot more broken than expected.

PCI-IDs:

00:02.0 VGA compatible controller [0300]: Intel Corporation 4 Series Chipset
Integrated Graphics Controller [8086:2e12] (rev 03) (prog-if 00 [VGA 
controller])
Subsystem: Intel Corporation Device [8086:1003]
00:02.1 Display controller [0380]: Intel Corporation 4 Series Chipset Integrated
Graphics Controller [8086:2e13] (rev 03)
Subsystem: Intel Corporation Device [8086:1003]

Messages are attached.

Any help is appreciated, thanks. :)

Best regards,


Mihai


i915_kernel_BUG_gmbus_nullptrderef.txt.bz2
Description: BZip2 compressed data


i915_3.0.2_warning_messages.txt.bz2
Description: BZip2 compressed data


smime.p7s
Description: S/MIME Cryptographic Signature