Re: [uml-devel] Fwd: Re: UML kernel crash of v2.6.36-rcX kernel

2010-10-16 Thread Toralf Förster
Hello,

I'm currently in the progress of bisecting the issue again- with a fully 
autoamted scripts which doesn't alck a "make clean" - *shame*

However I get with v2.6.35-rc3-173-g3638632 and the attached .config this :

nitializing software serial port version 1
console [mc-1] enabled
 ubda:
EIP: 0073:[<081bfec3>] CPU: 0 Not tainted ESP: 007b:1845aa08 EFLAGS: 00010297
Not tainted
EAX:  EBX: 184da000 ECX: 0001 EDX: 
ESI: 185fb930 EDI: 185f6900 EBP: 1845aa20 DS: 007b ES: 007b
082fab68:  [<08075f5d>] show_regs+0xed/0x120
082fab84:  [<0805e50e>] segv+0x25e/0x2b0
082fac3c:  [<0805e5a1>] segv_handler+0x41/0x60
082fac5c:  [<0806eb64>] sig_handler_common+0x54/0xa0
082facd8:  [<0806ed07>] sig_handler+0x27/0x50
082face8:  [<0806ef5c>] handle_signal+0x5c/0xa0
082fad0c:  [<080712d8>] hard_handler+0x18/0x20
082fad1c:  [] 0xb77de400

Kernel panic - not syncing: Segfault with no mm
082fab50:  [<08277d5b>] dump_stack+0x22/0x24
082fab68:  [<08277dbe>] panic+0x61/0xdb
082fab84:  [<0805e51a>] segv+0x26a/0x2b0
082fac3c:  [<0805e5a1>] segv_handler+0x41/0x60
082fac5c:  [<0806eb64>] sig_handler_common+0x54/0xa0
082facd8:  [<0806ed07>] sig_handler+0x27/0x50
082face8:  [<0806ef5c>] handle_signal+0x5c/0xa0
082fad0c:  [<080712d8>] hard_handler+0x18/0x20
082fad1c:  [] 0xb77de400


EIP: 0073:[] CPU: 0 Not tainted ESP: 007b:bf83a460 EFLAGS: 0246
Not tainted
EAX:  EBX: 6c83 ECX: 0013 EDX: 6c83
ESI: 6c7f EDI: bf83a4f4 EBP: bf83a494 DS: 007b ES: 007b
082fab04:  [<08075f5d>] show_regs+0xed/0x120
082fab20:  [<0805e7dc>] panic_exit+0x2c/0x50
082fab30:  [<0809928e>] notifier_call_chain+0x3e/0x70
082fab58:  [<08099313>] atomic_notifier_call_chain+0x23/0x30
082fab68:  [<08277de6>] panic+0x89/0xdb
082fab84:  [<0805e51a>] segv+0x26a/0x2b0
082fac3c:  [<0805e5a1>] segv_handler+0x41/0x60
082fac5c:  [<0806eb64>] sig_handler_common+0x54/0xa0
082facd8:  [<0806ed07>] sig_handler+0x27/0x50
082face8:  [<0806ef5c>] handle_signal+0x5c/0xa0
082fad0c:  [<080712d8>] hard_handler+0x18/0x20
082fad1c:  [] 0xb77de400

/home/tfoerste/workspace/bin/start_uml.sh: line 64: 27775 Segmentation fault
  
(core dumped) $LINUX ubda=$ROOT_FS ubdb=$SWAP_FS $CD eth0=tuntap,,,$TAP 
mem=256M umid=$USER $TTY $*


Richard Weinberger wrote at 19:46:53
> Hi!
> 
> Can you please post more details.
> What exactly causes the crash?
> phpsysinfo does a lot of stuff.
> On my test bed it works fine.
> 
> Am Freitag 15 Oktober 2010, 11:33:37 schrieben Sie:
> > Hello,
> > 
> > git bisect gave :
> > 
> > tfoer...@n22 ~/devel/linux-2.6 $ git bisect bad
> > 365b18189789bfa1acd9939e6312b8a4b4577b28 is the first bad commit
> > commit 365b18189789bfa1acd9939e6312b8a4b4577b28
> > Author: Christoph Hellwig 
> > Date:   Wed Jul 7 18:53:25 2010 +0200
> > 
> > add f_flags to struct statfs(64)
> > 
> > Add a flags field to help glibc implementing statvfs(3) efficiently.
> > 
> > We copy the flag values from glibc, and add a new ST_VALID flag to
> > denote that f_flags is implemented.
> > 
> > Signed-off-by: Christoph Hellwig 
> > Signed-off-by: Al Viro 
> > :
> > :04 04 92f04bf96f0c6bb2df3ddea83a4a27c35985bdcc
> > 
> > e702d8dd95a3897e085bf3fe4919e2dd30a57b47 M  arch
> > 
> > :04 04 5705bba0e9a2a61112281f7bb1a1030ef9eb337e
> > 
> > 34fc3ea912581ae7923a15246291387bfc354e47 M  fs
> > 
> > :04 04 578a5bb0743aa85adfec7a25f82d440b2ea4db27
> > 
> > 7e96f95b6f1d6ccc9e004b560670638db66b8a93 M  include
> > 
> > 
> > BTW is UML dead ?
> > (I'm asking b/c there're so many different issues - compile errors or the
> > UML kernel crashes during start or during shutdown, independent of the
> > result of the wget issue I was investigating - there are 120 skip'ed
> > commits within the git bisect log beside 46 good and 15 bad commits and I
> > didn't saw any chance to fully automate the bisect)
> 
> Hmm, I did a lot git bisects on UML 2.6.35 to 2.6.36.
> 2.6.36-rc1 does not always compile. You need no apply f3c072a first.
> I've not seen any crashes during bootup nor shutdown. Can you provide some
> more details?
> 
> Thanks,
> //richard


-- 
MfG/Kind regards
Toralf Förster

pgp finger print: 7B1A 07F4 EC82 0F90 D4C2 8936 872A E508 7DB6 9DA3

#
# Automatically generated make config: don't edit
# Linux kernel version: 2.6.35-rc3
# Sat Oct 16 10:32:28 2010
#
CONFIG_DEFCONFIG_LIST="arch/$ARCH/defconfig"
CONFIG_GENERIC_HARDIRQS=y
CONFIG_UML=y
CONFIG_MMU=y
CONFIG_NO_IOMEM=y
# CONFIG_TRACE_IRQFLAGS_SUPPORT is not set
CONFIG_LOCKDEP_SUPPORT=y
# CONFIG_STACKTRACE_SUPPORT is not set
CONFIG_GENERIC_CALIBRATE_DELAY=y
CONFIG_GENERIC_BUG=y
CONFIG_GENERIC_TIME=y
CONFIG_GENERIC_CLOCKEVENTS=y
CONFIG_IRQ_RELEASE_METHOD=y
CONFIG_HZ=100

#
# UML-specific options
#

#
# Host processor type and features
#
# CONFIG_M386 is not set
# CONFIG_M486 is not set
# CONFIG_M586 is not set
# CONFIG_M586TSC is not set
# CONFIG_M586MMX is not set
# CONFIG_M686 is not set
# CONFIG_MPENTIUMII is not set
# CONFIG_MPENTIUMI

[uml-devel] [PATCH 1/1] um: fix global timer issue when using CONFIG_NO_HZ

2010-10-16 Thread Richard Weinberger
This fixes a issue which was introduced by fe2cc53e.
timeval_to_ns() returns long long and not int.
Due to that UML's timer did not work proper and caused timer freezes.

LKML-Reference: 
Signed-off-by: Richard Weinberger 
---
 arch/um/os-Linux/time.c |2 +-
 1 files changed, 1 insertions(+), 1 deletions(-)

diff --git a/arch/um/os-Linux/time.c b/arch/um/os-Linux/time.c
index dec5678..6e3359d 100644
--- a/arch/um/os-Linux/time.c
+++ b/arch/um/os-Linux/time.c
@@ -60,7 +60,7 @@ static inline long long timeval_to_ns(const struct timeval 
*tv)
 long long disable_timer(void)
 {
struct itimerval time = ((struct itimerval) { { 0, 0 }, { 0, 0 } });
-   int remain, max = UM_NSEC_PER_SEC / UM_HZ;
+   long long remain, max = UM_NSEC_PER_SEC / UM_HZ;
 
if (setitimer(ITIMER_VIRTUAL, &time, &time) < 0)
printk(UM_KERN_ERR "disable_timer - setitimer failed, "
-- 
1.6.6.1


--
Download new Adobe(R) Flash(R) Builder(TM) 4
The new Adobe(R) Flex(R) 4 and Flash(R) Builder(TM) 4 (formerly 
Flex(R) Builder(TM)) enable the development of rich applications that run
across multiple browsers and platforms. Download your free trials today!
http://p.sf.net/sfu/adobe-dev2dev
___
User-mode-linux-devel mailing list
User-mode-linux-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/user-mode-linux-devel


Re: [uml-devel] [REGRESSION] um: rcu_sched_state detected stall on CPU 0

2010-10-16 Thread richard -rw- weinberger
On Fri, Oct 15, 2010 at 9:48 AM, Peter Zijlstra  wrote:
> On Fri, 2010-10-15 at 10:02 +0300, Pekka Enberg wrote:
>> On Fri, Oct 15, 2010 at 2:44 AM, richard -rw- weinberger
>>  wrote:
>> > On Thu, Oct 14, 2010 at 9:50 PM, Arjan van de Ven
>>  wrote:
>> >>  On 10/14/2010 11:27 AM, richard -rw- weinberger wrote:
>> >>>
>> >>> Hi Arjan!
>> >>>
>> >>> This commit causes some problems on UML.
>> >>>
>> >> that is extremely weird.
>> >>>
>> >>> The kernel freezes after a few seconds until it gets some input.
>> >>> e.g: When I run top it stops refreshing the process list until i
>> press a
>> >>> button.
>> >>
>> >> a slab timer change (to not be as critical) causing global timer
>> issues
>> >> that's very obviously not a problem with this patch.
>> >> has this been seem anywhere except UML ?
>> >
>> > A small update:
>> > It seems that CONFIG_NO_HZ is broken on UML. :-(
>> >
>> > CONFIG_NO_HZ + CONFIG_SLAB: works
>> > CONFIG_NO_HZ + CONFIG_SLAB + your patch: broken
>> > CONFIG_NO_HZ + CONFIG_SLUB: broken
>> >
>> > CONFIG_SLAB + your patch: works
>> > CONFIG_SLAB: works
>> > CONFIG_SLUB: works
>>
>> Thanks for testing! Thomas, Ingo, Peter, I'm not sure who maintains
>> CONFIG_NO_HZ so I CC'd you. The problem here is that Arjan's
>> deferrable timers patch in SLAB triggered something that looks like a
>> latent bug with UML and NOHZ.
>
> Thomas does mostly, but if its UML specific, I guess its Jeff Dike
> you'll be wanting to talk to, since he's the arch maintainer.

After reviewing the code for hours I've found the bug.
It's a int/long long issue within arch/um/os-Linux/time.c.
A patch is on the way!

-- 
Thanks,
//richard

--
Download new Adobe(R) Flash(R) Builder(TM) 4
The new Adobe(R) Flex(R) 4 and Flash(R) Builder(TM) 4 (formerly 
Flex(R) Builder(TM)) enable the development of rich applications that run
across multiple browsers and platforms. Download your free trials today!
http://p.sf.net/sfu/adobe-dev2dev
___
User-mode-linux-devel mailing list
User-mode-linux-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/user-mode-linux-devel