Bug#464962: immediate crash on boot on TM5800

2008-02-13 Thread maximilian attems
On Tue, 12 Feb 2008, H. Peter Anvin wrote:

 http://www.kernel.org/pub/linux/kernel/people/hpa/0001-x86-do-not-promote-TM3x00-TM5x00-to-i686-class.patch

   -hpa

thanks!

hmm, no mention of the boot crash in the description.
please also add a pointer to the bug report:
http://bugs.debian.org/464962

on our side we'll have to add a release notes entry.




-- 
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of unsubscribe. Trouble? Contact [EMAIL PROTECTED]



Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread H. Peter Anvin

maximilian attems wrote:


> sure, ack.
> so i'll circumvent bugzilla and add the new x86 maintainers
> on cc to let them know about the 2.6.24 and 2.6.25-rc1 boot error
> on shiny fujitsu p700 lifebook, with a Crusoe processor.
> http://bugs.debian.org/464962
> 686 config attached.



INT 6 is #UD, undefined instruction.

If you could send me a copy of your vmlinux file (not bzImage), it would 
speed things up.


I happen to have an old TM5800-based machine sitting around, so I can 
probably reproduce it.


-hpa






Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread H. Peter Anvin
Thought some more about this. Since this probably means gcc will
generate long NOPs for userspace code as well nowadays, the TM5800 should
probably be downgraded to a 586-class machine.  Hence the Linux policy
of promoting it to a 686-class machine for having CMOV is actually
incorrect: it doesn't have all the userspace-visible features of a
686-class machine, since it lacks long NOP.


-hpa

diff --git a/arch/x86/kernel/cpu/transmeta.c b/arch/x86/kernel/cpu/transmeta.c
index 200fb3f..e8b422c 100644
--- a/arch/x86/kernel/cpu/transmeta.c
+++ b/arch/x86/kernel/cpu/transmeta.c
@@ -76,13 +76,6 @@ static void __cpuinit init_transmeta(struct cpuinfo_x86 *c)
        /* All Transmeta CPUs have a constant TSC */
        set_bit(X86_FEATURE_CONSTANT_TSC, c->x86_capability);
 
-       /* If we can run i686 user-space code, call us an i686 */
-#define USER686 ((1 << X86_FEATURE_TSC)|\
-                (1 << X86_FEATURE_CX8)|\
-                (1 << X86_FEATURE_CMOV))
-       if (c->x86 == 5 && (c->x86_capability[0] & USER686) == USER686)
-               c->x86 = 6;
-
 #ifdef CONFIG_SYSCTL
        /* randomize_va_space slows us down enormously;
           it probably triggers retranslation of x86->native bytecode */
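
For readers following along, the check removed by the patch can be sketched in isolation roughly as below. This is only an illustration, not kernel code: the helper names `family_before_patch` and `family_after_patch` are hypothetical, and the bit positions are the CPUID leaf 1 EDX positions for TSC, CX8 and CMOV, which is what the kernel feature constants correspond to for capability word 0.

```c
#include <stdint.h>

/* CPUID leaf 1 EDX bit positions for the three features that the
 * removed USER686 macro tested. */
enum {
    FEATURE_TSC  = 4,
    FEATURE_CX8  = 8,
    FEATURE_CMOV = 15
};

#define USER686_MASK ((UINT32_C(1) << FEATURE_TSC) | \
                      (UINT32_C(1) << FEATURE_CX8) | \
                      (UINT32_C(1) << FEATURE_CMOV))

/* Hypothetical helper: the family the old code would have reported.
 * A family-5 CPU with TSC, CX8 and CMOV all present was promoted to 6. */
static int family_before_patch(int family, uint32_t edx_features)
{
    if (family == 5 && (edx_features & USER686_MASK) == USER686_MASK)
        return 6;
    return family;
}

/* Hypothetical helper: with the patch applied the family is left alone,
 * whatever the feature bits say. */
static int family_after_patch(int family, uint32_t edx_features)
{
    (void)edx_features;
    return family;
}
```

Feeding in family 5 with all three bits set gives 6 from the first helper and 5 from the second, which is the behaviour change the patch is after: CMOV alone no longer implies the full 686 instruction set.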


Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread H. Peter Anvin

maximilian attems wrote:

> On Tue, Feb 12, 2008 at 01:14:04PM -0800, H. Peter Anvin wrote:
>> Are you sure that build matches the bug report?
>
> urrgs right sorry, the posted vmlinux is a newer
> 2.6.24-git22 and not Version: 2.6.24-3
>
>> The EIP given falls inside the .data segment of that kernel,
>> specifically inside the symbol init_task.
>>
>> -hpa
>
> will rebuild the above.


Okay, the faulting instruction is the following:

c0383360:   0f 1f 40 00 nopl   0x0(%eax)

The Crusoe code morphing software apparently doesn't recognize these 
long noops, and (presumably) the rest of the hinting NOOP group.  gcc 
didn't use to generate them, and Crusoe/Efficeon generally do not 
benefit from code alignment anyway.  I suspect the best thing to do is 
to use either a 586 kernel or build a dedicated Crusoe kernel without 
code alignment.
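
As a rough aid for anyone wanting to check an image for this encoding, a naive byte scan for the `0f 1f /0` pattern might look like the sketch below. The function name `find_long_nop` is made up for illustration, and this is not a disassembler: it can match bytes that sit in data or in the middle of another instruction, so a hit is only a hint to look closer with objdump.

```c
#include <stddef.h>

/* Return the offset of the first 0F 1F /0 byte pattern in buf, or -1
 * if none is found.  The third byte is the ModRM byte; its reg field
 * (bits 5..3) must be 0 for the multi-byte NOP encoding. */
static long find_long_nop(const unsigned char *buf, size_t len)
{
    for (size_t i = 0; i + 2 < len; i++) {
        if (buf[i] == 0x0f && buf[i + 1] == 0x1f &&
            ((buf[i + 2] >> 3) & 7) == 0)
            return (long)i;
    }
    return -1;
}
```

Run over the four bytes quoted above (`0f 1f 40 00`) it reports offset 0; run over plain single-byte `90` NOPs it reports -1.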


-hpa







Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread Joey Hess
H. Peter Anvin wrote:
 maximilian attems wrote:
 On Tue, Feb 12, 2008 at 12:32:27PM -0800, H. Peter Anvin wrote:
 INT 6 is #UD, undefined instruction.

 If you could send me a copy of your vmlinux file (not bzImage), it 
 would speed things up.

 cp -l src/linux-2.6-2.6.24/debian/build/build_i386_none_686/vmlinux 
 ~/public_html/

 http://charm.itp.tuwien.ac.at/~mattems/
  

 Are you sure that build matches the bug report?

 The EIP given falls inside the .data segment of that kernel,  
 specifically inside the symbol init_task.

The stack trace I copied down originally was for a 2.6.24 kernel. I can
copy down the one from .25-pre if that'd be useful, but it sounds like
you've reproduced it on your own.

(I'm raising the priority of this bug report since it sounds like it
affects more than just my hardware.)

-- 
see shy jo


signature.asc
Description: Digital signature


Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread Bastian Blank
On Tue, Feb 12, 2008 at 01:42:50PM -0800, H. Peter Anvin wrote:
 Okay, the faulting instruction is the following:
 c0383360:   0f 1f 40 00 nopl   0x0(%eax)

include/asm-x86/nops.h:
| /* P6 nops */
| /* uses eax dependencies (Intel-recommended choice) */
[...]
| #define P6_NOP4 ".byte 0x0f,0x1f,0x40,0\n"

Bastian

-- 
Leave bigotry in your quarters; there's no room for it on the bridge.
-- Kirk, Balance of Terror, stardate 1709.2






Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread H. Peter Anvin

Bastian Blank wrote:

> On Tue, Feb 12, 2008 at 01:42:50PM -0800, H. Peter Anvin wrote:
>> The Crusoe code morphing software apparently doesn't recognize these
>> long noops, and (presumably) the rest of the hinting NOOP group.  gcc
>> didn't use to generate them, and Crusoe/Efficeon generally do not
>> benefit from code alignment anyway.  I suspect the best thing to do is
>> to use either a 586 kernel or build a dedicated Crusoe kernel without
>> code alignment.
>
> Crusoe is explicitly marked as 586 with TSC in the kernel config.



Yes, but the kernel in question wasn't compiled as a Crusoe kernel, but 
a generic 686 kernel.


-hpa






Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread H. Peter Anvin

Bastian Blank wrote:

> On Tue, Feb 12, 2008 at 01:42:50PM -0800, H. Peter Anvin wrote:
>> Okay, the faulting instruction is the following:
>> c0383360:   0f 1f 40 00 nopl   0x0(%eax)
>
> include/asm-x86/nops.h:
> | /* P6 nops */
> | /* uses eax dependencies (Intel-recommended choice) */
> [...]
> | #define P6_NOP4 ".byte 0x0f,0x1f,0x40,0\n"



Yes, which requires a 686-class machine; gcc is also free to use these
for -march=i686, so there is a userspace-visible difference, and thus the
hack to call Crusoe a 686-class machine is not correct.


Unfortunately there is no CPUID flag for these, only the CPU level.
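
Since there is no feature bit to test, about the only direct way to find out from userspace is to execute the instruction and see whether it traps. A minimal sketch of such a probe follows, assuming a POSIX system and GCC or Clang inline assembly; the function name `long_nop_supported` is hypothetical and nothing like it is part of the patch.

```c
#ifndef _POSIX_C_SOURCE
#define _POSIX_C_SOURCE 200809L
#endif
#include <setjmp.h>
#include <signal.h>
#include <string.h>

static sigjmp_buf probe_env;

/* SIGILL handler: jump back to the probe instead of returning to the
 * faulting instruction. */
static void on_sigill(int sig)
{
    (void)sig;
    siglongjmp(probe_env, 1);
}

/* Returns 1 if the 4-byte long NOP (0f 1f 40 00) executes, 0 if it
 * raises SIGILL, and -1 if this is not an x86 build or the handler
 * could not be installed. */
static int long_nop_supported(void)
{
#if defined(__i386__) || defined(__x86_64__)
    struct sigaction sa, old;
    int ok;

    memset(&sa, 0, sizeof sa);
    sa.sa_handler = on_sigill;
    sigemptyset(&sa.sa_mask);
    if (sigaction(SIGILL, &sa, &old) != 0)
        return -1;

    if (sigsetjmp(probe_env, 1) == 0) {
        /* Same bytes as the faulting instruction in the report. */
        __asm__ volatile(".byte 0x0f,0x1f,0x40,0x00");
        ok = 1;
    } else {
        ok = 0;   /* we got here via the SIGILL handler */
    }

    sigaction(SIGILL, &old, NULL);   /* restore the previous handler */
    return ok;
#else
    return -1;
#endif
}
```

A probe like this only helps code that can choose at run time; it does nothing for NOPs the compiler or assembler has already placed as alignment padding, which is the case in this bug.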

-hpa






Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread H. Peter Anvin

http://www.kernel.org/pub/linux/kernel/people/hpa/0001-x86-do-not-promote-TM3x00-TM5x00-to-i686-class.patch

-hpa






Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread Bastian Blank
On Tue, Feb 12, 2008 at 01:42:50PM -0800, H. Peter Anvin wrote:
 The Crusoe code morphing software apparently doesn't recognize these 
 long noops, and (presumably) the rest of the hinting NOOP group.  gcc 
 didn't use to generate them, and Crusoe/Efficeon generally do not 
 benefit from code alignment anyway.  I suspect the best thing to do is 
 to use either a 586 kernel or build a dedicated Crusoe kernel without 
 code alignment.

Crusoe is explicitly marked as 586 with TSC in the kernel config.

Bastian

-- 
Either one of us, by himself, is expendable.  Both of us are not.
-- Kirk, The Devil in the Dark, stardate 3196.1






Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread Joey Hess
H. Peter Anvin wrote:
 Okay, the faulting instruction is the following:

 c0383360:   0f 1f 40 00 nopl   0x0(%eax)

 The Crusoe code morphing software apparently doesn't recognize these  
 long noops, and (presumably) the rest of the hinting NOOP group.  gcc  
 didn't use to generate them, and Crusoe/Efficeon generally do not  
 benefit from code alignment anyway.  I suspect the best thing to do is  
 to use either a 586 kernel or build a dedicated Crusoe kernel without  
 code alignment.

If the debian kernel team concurs with this let me know and I'll arrange
for d-i to install the -486 build for crusoe.

(Will suck for upgraders tho..)
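
The kind of selection rule being proposed could be sketched as below. This is purely illustrative: the function `pick_kernel_flavour` is hypothetical, the real d-i logic is not shown anywhere in this thread, and treating every `GenuineTMx86` CPU (Efficeon included) as a -486 candidate is an assumption, not something the thread establishes.

```c
#include <string.h>

/* Hypothetical helper: choose a Debian i386 kernel flavour suffix.
 * vendor_id is the "vendor_id" field from /proc/cpuinfo and cpu_family
 * the "cpu family" field. */
static const char *pick_kernel_flavour(const char *vendor_id, int cpu_family)
{
    /* Transmeta parts report GenuineTMx86.  Ignore the family here,
     * because an unpatched kernel reports family 6 for the TM5800. */
    if (strcmp(vendor_id, "GenuineTMx86") == 0)
        return "486";
    return cpu_family >= 6 ? "686" : "486";
}
```

Keying on the vendor string rather than the family matters because the cpuinfo in the original report shows `cpu family : 6` for the TM5800 under the older, promoting kernel.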

-- 
see shy jo




Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread maximilian attems
On Tue, Feb 12, 2008 at 01:14:04PM -0800, H. Peter Anvin wrote:
 
 Are you sure that build matches the bug report?

urrgs right sorry, the posted vmlinux is a newer 
2.6.24-git22 and not  Version: 2.6.24-3
 
 The EIP given falls inside the .data segment of that kernel, 
 specifically inside the symbol init_task.
 
   -hpa

will rebuild the above.






Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread H. Peter Anvin

maximilian attems wrote:

> On Tue, Feb 12, 2008 at 12:32:27PM -0800, H. Peter Anvin wrote:
>> INT 6 is #UD, undefined instruction.
>>
>> If you could send me a copy of your vmlinux file (not bzImage), it would
>> speed things up.
>
> cp -l src/linux-2.6-2.6.24/debian/build/build_i386_none_686/vmlinux ~/public_html/
>
> http://charm.itp.tuwien.ac.at/~mattems/
 


Are you sure that build matches the bug report?

The EIP given falls inside the .data segment of that kernel, 
specifically inside the symbol init_task.


-hpa






Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread maximilian attems
On Tue, Feb 12, 2008 at 12:32:27PM -0800, H. Peter Anvin wrote:
 
 INT 6 is #UD, undefined instruction.
 
 If you could send me a copy of your vmlinux file (not bzImage), it would 
 speed things up.

cp -l src/linux-2.6-2.6.24/debian/build/build_i386_none_686/vmlinux 
~/public_html/

http://charm.itp.tuwien.ac.at/~mattems/
 
 I happen to have an old TM5800-based machine sitting around, so I can 
 probably reproduce it.
 
   -hpa

cool






Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread maximilian attems
On Tue, Feb 12, 2008 at 02:52:54PM -0500, Joey Hess wrote:
 maximilian attems wrote:
  thanks for quick feedback, have prebuild 2.6.25-rc1-git2
  (they contain the security fix, but don't seem to suspend here)
  anyway please test for boot
  - http://charm.itp.tuwien.ac.at/~mattems/2.6.25-rc1-git2/
 
 Still fails the same.
 
  if those again don't boot please file info upstream on
  bugzilla.kernel.org so that relevant new x86 arch maintainer
  fix that up.
 
 Um, could the kernel team forward this bug?

well we have no automated way to do so
as we are underpowered in terms of bug nrs, it is
routine to ask to file upstream. the debian kernel
does not carry any x86 specific patch.
also it is often much easier if upstream speaks directly
with the person that has the hardware at hand.
 
 * I don't have an account on this bugzilla, or actually, on any bugzilla
   on earth.
 * I don't know exactly what versions you've had me test.
 * I guess they'll want to know things about compile options that I don't
   know.

sure, ack.
so i'll circumvent bugzilla and add the new x86 maintainers
on cc to let them know about the 2.6.24 and 2.6.25-rc1 boot error
on shiny fujitsu p700 lifebook, with a Crusoe processor.
http://bugs.debian.org/464962
686 config attached.



config-2.6.24-1-686.bz2
Description: Binary data


Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread Joey Hess
maximilian attems wrote:
 thanks for quick feedback, have prebuild 2.6.25-rc1-git2
 (they contain the security fix, but don't seem to suspend here)
 anyway please test for boot
 - http://charm.itp.tuwien.ac.at/~mattems/2.6.25-rc1-git2/

Still fails the same.

 if those again don't boot please file info upstream on
 bugzilla.kernel.org so that relevant new x86 arch maintainer
 fix that up.

Um, could the kernel team forward this bug?

* I don't have an account on this bugzilla, or actually, on any bugzilla
  on earth.
* I don't know exactly what versions you've had me test.
* I guess they'll want to know things about compile options that I don't
  know.

-- 
see shy jo




Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread H. Peter Anvin

maximilian attems wrote:

> On Tue, Feb 12, 2008 at 01:14:04PM -0800, H. Peter Anvin wrote:
>> Are you sure that build matches the bug report?
>
> urrgs right sorry, the posted vmlinux is a newer
> 2.6.24-git22 and not Version: 2.6.24-3
>
>> The EIP given falls inside the .data segment of that kernel,
>> specifically inside the symbol init_task.
>>
>> -hpa
>
> will rebuild the above.


Don't worry about it, already have reproduced it.

-hpa






Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread Joey Hess
maximilian attems wrote:
 on shiny fujitsu p700 lifebook, with a Crusoe processor.

FWIW, I misremembered the model number. It's a not-so-shiny P2110.

-- 
see shy jo




Bug#464962: immediate crash on boot on TM5800

2008-02-12 Thread maximilian attems
On Sun, 10 Feb 2008, Joey Hess wrote:

 maximilian attems wrote:
  i haven't yet compiled latest git21 (will do later today), in the
  meantime i have i386 snapshots of git15, can you try there the -686
  http://charm.itp.tuwien.ac.at/~mattems/git15/
  if upstream fixed this bug in between?
 
 Nope, still fails.

thanks for the quick feedback, i have prebuilt 2.6.25-rc1-git2
(they contain the security fix, but don't seem to suspend here).
anyway, please test for boot:
- http://charm.itp.tuwien.ac.at/~mattems/2.6.25-rc1-git2/

if those again don't boot, please file the info upstream on
bugzilla.kernel.org so that the relevant new x86 arch maintainers
can fix that up.






Bug#464962: immediate crash on boot on TM5800

2008-02-10 Thread maximilian attems
On Sat, 09 Feb 2008, Joey Hess wrote:

 As soon as the kernel is booted from grub, before it prints any normal
 messages, it crashes as follows:
 
 BUG: Int 6: CR2 
 EDI   ESI 1000  EBP 0020 ESP c0373f54
 EBX c03e5140  EDX 000C  ECX  EAX c034f3e0
 err   EIP c0389390   CS 0060 flg 00010082
 Stack:  0010 1dfc c039a9ec c03b934c c0373fe8 c03800c8 c039a9ec
c037d0b8 0083  c0304c2f  0071 c039a9ec c03b934c
c0373fe8 0071 c039a9ec c03b934c c0373fd8 c037d4ef c0304a76 00ef

 ÉÉ
 
 (The ÉÉ has a grey background and is flashing; it appears in the
 lower-left corner of the screen.)
 
 2.6.22-3-686 works fine.

that is before the x86 merge, so probably trouble due to it.
 
 This system is a fujitsu p700 lifebook, with a Crusoe processor.

i haven't yet compiled latest git21 (will do later today), in the
meantime i have i386 snapshots of git15, can you try there the -686
http://charm.itp.tuwien.ac.at/~mattems/git15/
if upstream fixed this bug in between?

thanks




Bug#464962: immediate crash on boot on TM5800

2008-02-10 Thread Joey Hess
maximilian attems wrote:
 i haven't yet compiled latest git21 (will do later today), in the
 meantime i have i386 snapshots of git15, can you try there the -686
 http://charm.itp.tuwien.ac.at/~mattems/git15/
 if upstream fixed this bug in between?

Nope, still fails.

-- 
see shy jo




Bug#464962: immediate crash on boot on TM5800

2008-02-09 Thread Joey Hess
Package: linux-image-2.6.24-1-686
Version: 2.6.24-3
Severity: important

As soon as the kernel is booted from grub, before it prints any normal
messages, it crashes as follows:

BUG: Int 6: CR2 
EDI   ESI 1000  EBP 0020 ESP c0373f54
EBX c03e5140  EDX 000C  ECX  EAX c034f3e0
err   EIP c0389390   CS 0060 flg 00010082
Stack:  0010 1dfc c039a9ec c03b934c c0373fe8 c03800c8 c039a9ec
   c037d0b8 0083  c0304c2f  0071 c039a9ec c03b934c
   c0373fe8 0071 c039a9ec c03b934c c0373fd8 c037d4ef c0304a76 00ef
   
ÉÉ

(The ÉÉ has a grey background and is flashing; it appears in the
lower-left corner of the screen.)

2.6.22-3-686 works fine.

This system is a fujitsu p700 lifebook, with a Crusoe processor.

[EMAIL PROTECTED]:~cat /proc/cpuinfo
processor       : 0
vendor_id       : GenuineTMx86
cpu family      : 6
model           : 4
model name      : Transmeta(tm) Crusoe(tm) Processor TM5800
stepping        : 3
cpu MHz         : 859.310
cache size      : 512 KB
fdiv_bug        : no
hlt_bug         : no
f00f_bug        : no
coma_bug        : no
fpu             : yes
fpu_exception   : yes
cpuid level     : 1
wp              : yes
flags           : fpu vme de pse tsc msr cx8 sep cmov mmx longrun lrti constant_tsc up
bogomips        : 1750.29
clflush size    : 32

-- Package-specific info:

-- System Information:
Debian Release: lenny/sid
  APT prefers unstable
  APT policy: (500, 'unstable')
Architecture: i386 (i686)

Kernel: Linux 2.6.22-3-686 (SMP w/1 CPU core)
Locale: LANG=en_US.UTF-8, LC_CTYPE=en_US.UTF-8 (charmap=UTF-8)
Shell: /bin/sh linked to /bin/bash

Versions of packages linux-image-2.6.24-1-686 depends on:
ii  debconf [debconf-2.0]         1.5.19       Debian configuration management sy
ii  initramfs-tools [linux-initr  0.91d        tools for generating an initramfs
ii  module-init-tools             3.3-pre11-4  tools for managing Linux kernel mo

Versions of packages linux-image-2.6.24-1-686 recommends:
ii  libc6-i686                    2.7-6        GNU C Library: Shared libraries [i

-- debconf information excluded

-- 
see shy jo