RE: [patch 32/36] XFS: Make xfsbufd threads freezable

2007-12-13 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Greg KH
> Envoyé : 13 décembre 2007 01:35
> 
> 2.6.22-stable review patch.  If anyone has any objections, 
> please let us know.
> 
> --
> From: Rafael J. Wysocki <[EMAIL PROTECTED]>
> 
> patch 978c7b2ff49597ab76ff7529a933bd366941ac25 in mainline
> 
> Fix breakage caused by commit 831441862956fffa17b9801db37e6ea1650b0f69
> that did not introduce the necessary call to set_freezable() 
> in xfs/linux-2.6/xfs_buf.c .
> 
> SGI-PV: 974224
> SGI-Modid: xfs-linux-melb:xfs-kern:30203a
> 
> Signed-off-by: Rafael J. Wysocki <[EMAIL PROTECTED]>
> Signed-off-by: David Chinner <[EMAIL PROTECTED]>
> Signed-off-by: Lachlan McIlroy <[EMAIL PROTECTED]>
> Cc: Oliver Pintr <[EMAIL PROTECTED]>
> Signed-off-by: Greg Kroah-Hartman <[EMAIL PROTECTED]>
> 
> ---

Hi Greg,

Don't know if it is related but I got this while building on Debian Etch 4.0:
  Building modules, stage 2.
  MODPOST 1882 modules
ERROR: "set_freezable" [fs/xfs/xfs.ko] undefined!
make[2]: *** [__modpost] Error 1
make[1]: *** [modules] Error 2
make[1]: Leaving directory 
`/usr/src/linux-headers-2.6.22.15-rc1-cfs-etch-686-envcan'
make: *** [debian/stamp-build-kernel] Error 2

- vin
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch 32/36] XFS: Make xfsbufd threads freezable

2007-12-13 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Greg KH
 Envoyé : 13 décembre 2007 01:35
 
 2.6.22-stable review patch.  If anyone has any objections, 
 please let us know.
 
 --
 From: Rafael J. Wysocki [EMAIL PROTECTED]
 
 patch 978c7b2ff49597ab76ff7529a933bd366941ac25 in mainline
 
 Fix breakage caused by commit 831441862956fffa17b9801db37e6ea1650b0f69
 that did not introduce the necessary call to set_freezable() 
 in xfs/linux-2.6/xfs_buf.c .
 
 SGI-PV: 974224
 SGI-Modid: xfs-linux-melb:xfs-kern:30203a
 
 Signed-off-by: Rafael J. Wysocki [EMAIL PROTECTED]
 Signed-off-by: David Chinner [EMAIL PROTECTED]
 Signed-off-by: Lachlan McIlroy [EMAIL PROTECTED]
 Cc: Oliver Pintr [EMAIL PROTECTED]
 Signed-off-by: Greg Kroah-Hartman [EMAIL PROTECTED]
 
 ---

Hi Greg,

Don't know if it is related but I got this while building on Debian Etch 4.0:
  Building modules, stage 2.
  MODPOST 1882 modules
ERROR: set_freezable [fs/xfs/xfs.ko] undefined!
make[2]: *** [__modpost] Error 1
make[1]: *** [modules] Error 2
make[1]: Leaving directory 
`/usr/src/linux-headers-2.6.22.15-rc1-cfs-etch-686-envcan'
make: *** [debian/stamp-build-kernel] Error 2

- vin
--
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-12-12 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Dhaval Giani
> 
> On Wed, Dec 12, 2007 at 07:57:33AM -0500, Fortier,Vincent 
> [Montreal] wrote:
> > > -Message d'origine-
> > > De : Dhaval Giani [mailto:[EMAIL PROTECTED]
> > > 
> > > On Tue, Dec 11, 2007 at 10:06:53PM +0100, Ingo Molnar wrote:
> > > > 
> > > > * Fortier,Vincent [Montreal] <[EMAIL PROTECTED]> wrote:
> > > > 
> > > > > > That has changed from /sys/kernel/uids//cpu_share
> > > > > 
> > > > > Here is my config.
> > > > > 
> > > > > Maybie I should give it a shot without CFS at all and see what

> > > > > happends ?
> > 
> > It got triggerred also using a 2.6.22.14:

Here are my preliminary test results:
   2.6.21.7: OK
   2.6.22.13/14: Failure
   2.6.23.9: OK
2.6.24-rc5-git2: OK

It seems to only hang using a 2.6.22 kernel.

> 
> No, not any more. Would it be possible for you to do a 
> git-bisect? I am not too well versed with sysfs, so it is not 
> apparent to me what is causing this oops. It seems to be 
> easily reproducible. I don't still have a reliable method to 
> reproduce it without the CFS patch. Could sysfs experts 
> please help debugging?
> 

I seriously doubt I have the time to do a git-bisect at the moment

- vin
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-12-12 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de 
> Fortier,Vincent [Montreal]
> 
> > -Message d'origine-
> > De : Dhaval Giani [mailto:[EMAIL PROTECTED]
> > 
> > On Tue, Dec 11, 2007 at 10:06:53PM +0100, Ingo Molnar wrote:
> > > 
> > > * Fortier,Vincent [Montreal] <[EMAIL PROTECTED]> wrote:
> > > 
> > > > > That has changed from /sys/kernel/uids//cpu_share
> > > > 
> > > > Here is my config.
> > > > 
> > > > Maybie I should give it a shot without CFS at all and see what 
> > > > happends ?
> 
> It got triggerred also using a 2.6.22.14:

Just to clarify... this is a non CFS kernel oops...

> [57560.396000] BUG: unable to handle kernel paging request at 
> virtual address 8000 [57560.396000]  printing eip:
> [57560.396000] c01d6c56
> [57560.396000] *pdpt = 08d02001
> [57560.396000] *pde = 
> [57560.396000] Oops:  [#34]
> [57560.396000] SMP
> [57560.396000] last sysfs file: 
> /devices/platform/floppy.0/uevent [57560.396000] Modules 
> linked in: xfs drbd cn nfs nfsd exportfs lockd nfs_acl sunrpc 
> ppdev parport_pc lp parport button ac battery ipv6 fuse 
> ide_cd ide_generic usbkbd usbmouse tsdev iTCO_wdt 
> iTCO_vendor_support psmouse e752x_edac edac_mc serio_raw 
> evdev pcspkr sg floppy shpchp pci_hotplug sr_mod cdrom ext3 
> jbd mbcache dm_mirror dm_snapshot dm_mod generic piix 
> ide_core tg3 ata_piix ehci_hcd uhci_hcd usbcore thermal 
> processor fan mptscsih mptbase megaraid_sas megaraid_mbox 
> megaraid_mm cciss aacraid
> [57560.396000] CPU:2
> [57560.396000] EIP:0060:[]Not tainted VLI
> [57560.396000] EFLAGS: 00010297   (2.6.22.14-etch-686-envcan #1)
> [57560.396000] EIP is at vsnprintf+0x2af/0x48c
> [57560.396000] eax: 8000   ebx:    ecx: 8000   edx:
> fffe
> [57560.396000] esi: edf37017   edi: edf09eac   ebp:    esp:
> edf09e4c
> [57560.396000] ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
> [57560.396000] Process clBackup (pid: 31421, ti=edf08000 task=f7d36530
> task.ti=edf08000)
> [57560.396000] Stack: c852b000 1000 c0338c78 f895b56c 
> c0233bf5 c852b000 120c8fe8 edf37017
> [57560.396000]00c3bd08    
> c03354eb 0003 0017
> [57560.396000]c0376dc0 c852b000 c01d6eb4 edf09eac edf09eac
> c0233170 edf37017 c03354ea
> [57560.396000] Call Trace:
> [57560.396000]  [] dev_uevent+0x189/0x1e0 
> [57560.396000]  [] sprintf+0x20/0x23 [57560.396000] 
>  [] show_uevent+0xad/0xd5 [57560.396000]  
> [] get_page_from_freelist+0x296/0x32d
> [57560.396000]  [] group_send_sig_info+0x12/0x56 
> [57560.396000]  [] __alloc_pages+0x52/0x294 
> [57560.396000]  [] show_uevent+0x0/0xd5 
> [57560.396000]  [] dev_attr_show+0x15/0x18 
> [57560.396000]  [] sysfs_read_file+0x87/0xd8 
> [57560.396000]  [] sys_getxattr+0x46/0x4e 
> [57560.396000]  [] sysfs_read_file+0x0/0xd8 
> [57560.396000]  [] vfs_read+0xa6/0x128 
> [57560.396000]  [] sys_read+0x41/0x67 
> [57560.396000]  [] syscall_call+0x7/0xb 
> [57560.396000]  === [57560.396000] Code: 
> 74 24 28 73 03 c6 06 20 4d 46 85 ed 7f f1 e9 b9 00 00 00 8b 
> 0f b8 79 e0 32 c0 8b 54 24 2c 81 f9 ff 0f 00 00 0f 46 c8 89 
> c8 eb 06 <80> 38 00 74 07 40 4a 83 fa ff 75 f4 29 c8 f6 44 24 
> 30 10 89 c3 [57560.396000] EIP: [] 
> vsnprintf+0x2af/0x48c SS:ESP 0068:edf09e4c
> 
> > > 
> > > and also with CFS but without CONFIG_FAIR_GROUP_SCHED.
> > > 
> 
> Is it still required since it now does not seems to be CFS related?
> 
> > 
> > Hi Ingo,
> > 
> > I am able to reproduce the oops here on my system with
> > 2.6.22.14 + CFS backport. I am not able to reproduce it with
> > 2.6.22.13 + CFS backport. I believe the CFS backport is 
> just exposing 
> > the bug. Can't find an obvious culprit and am looking into 
> this issue.
> > 
> > Vincent, could you please confirm if you are able to reproduce this 
> > with
> > 2.6.22.13 + CFS?
> 
> Using 2.6.13 + CFS v24 I was also able to reproduce the bug 
> (I already had one built in my depot without the 
> display_most-recently-opened_sysfs_file_name_when_oopsing.patc
> h).  So it looks like it is at least related to >= 2.6.22.13 
> and probably not directly CFS related.  Note that to get a 
> oops on a 2.6.13 it seems to need a full backup since it 
> usually works with incremental.  The backup does start 
> properly then, in this case, at around 70% it oopsed.  Using
> 2.6.22.14 it seems to oops right at startup.  Here is the 
> 2.6.22.13 CFS v24 oops:

Again, just to clari

RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-12-12 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : Dhaval Giani [mailto:[EMAIL PROTECTED] 
> 
> On Tue, Dec 11, 2007 at 10:06:53PM +0100, Ingo Molnar wrote:
> > 
> > * Fortier,Vincent [Montreal] <[EMAIL PROTECTED]> wrote:
> > 
> > > > That has changed from /sys/kernel/uids//cpu_share
> > > 
> > > Here is my config.
> > > 
> > > Maybie I should give it a shot without CFS at all and see what 
> > > happends ?

It got triggerred also using a 2.6.22.14:
[57560.396000] BUG: unable to handle kernel paging request at virtual
address 8000
[57560.396000]  printing eip:
[57560.396000] c01d6c56
[57560.396000] *pdpt = 08d02001
[57560.396000] *pde = 
[57560.396000] Oops:  [#34]
[57560.396000] SMP
[57560.396000] last sysfs file: /devices/platform/floppy.0/uevent
[57560.396000] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd
nfs_acl sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse
ide_cd ide_generic usbkbd usbmouse tsdev iTCO_wdt iTCO_vendor_support
psmouse e752x_edac edac_mc serio_raw evdev pcspkr sg floppy shpchp
pci_hotplug sr_mod cdrom ext3 jbd mbcache dm_mirror dm_snapshot dm_mod
generic piix ide_core tg3 ata_piix ehci_hcd uhci_hcd usbcore thermal
processor fan mptscsih mptbase megaraid_sas megaraid_mbox megaraid_mm
cciss aacraid
[57560.396000] CPU:2
[57560.396000] EIP:0060:[]Not tainted VLI
[57560.396000] EFLAGS: 00010297   (2.6.22.14-etch-686-envcan #1)
[57560.396000] EIP is at vsnprintf+0x2af/0x48c
[57560.396000] eax: 8000   ebx:    ecx: 8000   edx:
fffe
[57560.396000] esi: edf37017   edi: edf09eac   ebp:    esp:
edf09e4c
[57560.396000] ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
[57560.396000] Process clBackup (pid: 31421, ti=edf08000 task=f7d36530
task.ti=edf08000)
[57560.396000] Stack: c852b000 1000 c0338c78 f895b56c c0233bf5
c852b000 120c8fe8 edf37017
[57560.396000]00c3bd08    
c03354eb 0003 0017
[57560.396000]c0376dc0 c852b000 c01d6eb4 edf09eac edf09eac
c0233170 edf37017 c03354ea
[57560.396000] Call Trace:
[57560.396000]  [] dev_uevent+0x189/0x1e0
[57560.396000]  [] sprintf+0x20/0x23
[57560.396000]  [] show_uevent+0xad/0xd5
[57560.396000]  [] get_page_from_freelist+0x296/0x32d
[57560.396000]  [] group_send_sig_info+0x12/0x56
[57560.396000]  [] __alloc_pages+0x52/0x294
[57560.396000]  [] show_uevent+0x0/0xd5
[57560.396000]  [] dev_attr_show+0x15/0x18
[57560.396000]  [] sysfs_read_file+0x87/0xd8
[57560.396000]  [] sys_getxattr+0x46/0x4e
[57560.396000]  [] sysfs_read_file+0x0/0xd8
[57560.396000]  [] vfs_read+0xa6/0x128
[57560.396000]  [] sys_read+0x41/0x67
[57560.396000]  [] syscall_call+0x7/0xb
[57560.396000]  ===
[57560.396000] Code: 74 24 28 73 03 c6 06 20 4d 46 85 ed 7f f1 e9 b9 00
00 00 8b 0f b8 79 e0 32 c0 8b 54 24 2c 81 f9 ff 0f 00 00 0f 46 c8 89 c8
eb 06 <80> 38 00 74 07 40 4a 83 fa ff 75 f4 29 c8 f6 44 24 30 10 89 c3
[57560.396000] EIP: [] vsnprintf+0x2af/0x48c SS:ESP
0068:edf09e4c

> > 
> > and also with CFS but without CONFIG_FAIR_GROUP_SCHED.
> > 

Is it still required since it now does not seems to be CFS related?

> 
> Hi Ingo,
> 
> I am able to reproduce the oops here on my system with 
> 2.6.22.14 + CFS backport. I am not able to reproduce it with 
> 2.6.22.13 + CFS backport. I believe the CFS backport is just 
> exposing the bug. Can't find an obvious culprit and am 
> looking into this issue.
> 
> Vincent, could you please confirm if you are able to 
> reproduce this with
> 2.6.22.13 + CFS?

Using 2.6.13 + CFS v24 I was also able to reproduce the bug (I already
had one built in my depot without the
display_most-recently-opened_sysfs_file_name_when_oopsing.patch).  So it
looks like it is at least related to >= 2.6.22.13 and probably not
directly CFS related.  Note that to get a oops on a 2.6.13 it seems to
need a full backup since it usually works with incremental.  The backup
does start properly then, in this case, at around 70% it oopsed.  Using
2.6.22.14 it seems to oops right at startup.  Here is the 2.6.22.13 CFS
v24 oops:

[  170.152908] SGI XFS Quota Management subsystem
[  170.168443] Filesystem "drbd0": Disabling barriers, not supported by
the underlying device
[  170.174964] XFS mounting filesystem drbd0
[  170.232455] Ending clean XFS mount for filesystem: drbd0
[  170.318614] Filesystem "drbd1": Disabling barriers, not supported by
the underlying device
[  170.327708] XFS mounting filesystem drbd1
[  170.380481] Ending clean XFS mount for filesystem: drbd1
[  947.493764] BUG: unable to handle kernel NULL pointer dereference at
virtual address 00c8
[  947.493797]  printing eip:
[  947.493810] c01a922c
[  947.493823] *pdpt = 2a97a001
[  947.493837] *pde = 
[  947.493852] Oops:  [#1]
[  947.493865] SMP
[  947.493881]

RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-12-12 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : Dhaval Giani [mailto:[EMAIL PROTECTED] 
 
 On Tue, Dec 11, 2007 at 10:06:53PM +0100, Ingo Molnar wrote:
  
  * Fortier,Vincent [Montreal] [EMAIL PROTECTED] wrote:
  
That has changed from /sys/kernel/uids/uid/cpu_share
   
   Here is my config.
   
   Maybie I should give it a shot without CFS at all and see what 
   happends ?

It got triggerred also using a 2.6.22.14:
[57560.396000] BUG: unable to handle kernel paging request at virtual
address 8000
[57560.396000]  printing eip:
[57560.396000] c01d6c56
[57560.396000] *pdpt = 08d02001
[57560.396000] *pde = 
[57560.396000] Oops:  [#34]
[57560.396000] SMP
[57560.396000] last sysfs file: /devices/platform/floppy.0/uevent
[57560.396000] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd
nfs_acl sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse
ide_cd ide_generic usbkbd usbmouse tsdev iTCO_wdt iTCO_vendor_support
psmouse e752x_edac edac_mc serio_raw evdev pcspkr sg floppy shpchp
pci_hotplug sr_mod cdrom ext3 jbd mbcache dm_mirror dm_snapshot dm_mod
generic piix ide_core tg3 ata_piix ehci_hcd uhci_hcd usbcore thermal
processor fan mptscsih mptbase megaraid_sas megaraid_mbox megaraid_mm
cciss aacraid
[57560.396000] CPU:2
[57560.396000] EIP:0060:[c01d6c56]Not tainted VLI
[57560.396000] EFLAGS: 00010297   (2.6.22.14-etch-686-envcan #1)
[57560.396000] EIP is at vsnprintf+0x2af/0x48c
[57560.396000] eax: 8000   ebx:    ecx: 8000   edx:
fffe
[57560.396000] esi: edf37017   edi: edf09eac   ebp:    esp:
edf09e4c
[57560.396000] ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
[57560.396000] Process clBackup (pid: 31421, ti=edf08000 task=f7d36530
task.ti=edf08000)
[57560.396000] Stack: c852b000 1000 c0338c78 f895b56c c0233bf5
c852b000 120c8fe8 edf37017
[57560.396000]00c3bd08    
c03354eb 0003 0017
[57560.396000]c0376dc0 c852b000 c01d6eb4 edf09eac edf09eac
c0233170 edf37017 c03354ea
[57560.396000] Call Trace:
[57560.396000]  [c0233bf5] dev_uevent+0x189/0x1e0
[57560.396000]  [c01d6eb4] sprintf+0x20/0x23
[57560.396000]  [c0233170] show_uevent+0xad/0xd5
[57560.396000]  [c0154f48] get_page_from_freelist+0x296/0x32d
[57560.396000]  [c012e6f0] group_send_sig_info+0x12/0x56
[57560.396000]  [c0155031] __alloc_pages+0x52/0x294
[57560.396000]  [c02330c3] show_uevent+0x0/0xd5
[57560.396000]  [c0232c82] dev_attr_show+0x15/0x18
[57560.396000]  [c01a6979] sysfs_read_file+0x87/0xd8
[57560.396000]  [c0185f04] sys_getxattr+0x46/0x4e
[57560.396000]  [c01a68f2] sysfs_read_file+0x0/0xd8
[57560.396000]  [c016fe03] vfs_read+0xa6/0x128
[57560.396000]  [c01701ff] sys_read+0x41/0x67
[57560.396000]  [c0103d8a] syscall_call+0x7/0xb
[57560.396000]  ===
[57560.396000] Code: 74 24 28 73 03 c6 06 20 4d 46 85 ed 7f f1 e9 b9 00
00 00 8b 0f b8 79 e0 32 c0 8b 54 24 2c 81 f9 ff 0f 00 00 0f 46 c8 89 c8
eb 06 80 38 00 74 07 40 4a 83 fa ff 75 f4 29 c8 f6 44 24 30 10 89 c3
[57560.396000] EIP: [c01d6c56] vsnprintf+0x2af/0x48c SS:ESP
0068:edf09e4c

  
  and also with CFS but without CONFIG_FAIR_GROUP_SCHED.
  

Is it still required since it now does not seems to be CFS related?

 
 Hi Ingo,
 
 I am able to reproduce the oops here on my system with 
 2.6.22.14 + CFS backport. I am not able to reproduce it with 
 2.6.22.13 + CFS backport. I believe the CFS backport is just 
 exposing the bug. Can't find an obvious culprit and am 
 looking into this issue.
 
 Vincent, could you please confirm if you are able to 
 reproduce this with
 2.6.22.13 + CFS?

Using 2.6.13 + CFS v24 I was also able to reproduce the bug (I already
had one built in my depot without the
display_most-recently-opened_sysfs_file_name_when_oopsing.patch).  So it
looks like it is at least related to = 2.6.22.13 and probably not
directly CFS related.  Note that to get a oops on a 2.6.13 it seems to
need a full backup since it usually works with incremental.  The backup
does start properly then, in this case, at around 70% it oopsed.  Using
2.6.22.14 it seems to oops right at startup.  Here is the 2.6.22.13 CFS
v24 oops:

[  170.152908] SGI XFS Quota Management subsystem
[  170.168443] Filesystem drbd0: Disabling barriers, not supported by
the underlying device
[  170.174964] XFS mounting filesystem drbd0
[  170.232455] Ending clean XFS mount for filesystem: drbd0
[  170.318614] Filesystem drbd1: Disabling barriers, not supported by
the underlying device
[  170.327708] XFS mounting filesystem drbd1
[  170.380481] Ending clean XFS mount for filesystem: drbd1
[  947.493764] BUG: unable to handle kernel NULL pointer dereference at
virtual address 00c8
[  947.493797]  printing eip:
[  947.493810] c01a922c
[  947.493823] *pdpt = 2a97a001
[  947.493837] *pde = 
[  947.493852] Oops:  [#1]
[  947.493865] SMP
[  947.493881] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd
nfs_acl sunrpc ppdev parport_pc lp parport

RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-12-12 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de 
 Fortier,Vincent [Montreal]
 
  -Message d'origine-
  De : Dhaval Giani [mailto:[EMAIL PROTECTED]
  
  On Tue, Dec 11, 2007 at 10:06:53PM +0100, Ingo Molnar wrote:
   
   * Fortier,Vincent [Montreal] [EMAIL PROTECTED] wrote:
   
 That has changed from /sys/kernel/uids/uid/cpu_share

Here is my config.

Maybie I should give it a shot without CFS at all and see what 
happends ?
 
 It got triggerred also using a 2.6.22.14:

Just to clarify... this is a non CFS kernel oops...

 [57560.396000] BUG: unable to handle kernel paging request at 
 virtual address 8000 [57560.396000]  printing eip:
 [57560.396000] c01d6c56
 [57560.396000] *pdpt = 08d02001
 [57560.396000] *pde = 
 [57560.396000] Oops:  [#34]
 [57560.396000] SMP
 [57560.396000] last sysfs file: 
 /devices/platform/floppy.0/uevent [57560.396000] Modules 
 linked in: xfs drbd cn nfs nfsd exportfs lockd nfs_acl sunrpc 
 ppdev parport_pc lp parport button ac battery ipv6 fuse 
 ide_cd ide_generic usbkbd usbmouse tsdev iTCO_wdt 
 iTCO_vendor_support psmouse e752x_edac edac_mc serio_raw 
 evdev pcspkr sg floppy shpchp pci_hotplug sr_mod cdrom ext3 
 jbd mbcache dm_mirror dm_snapshot dm_mod generic piix 
 ide_core tg3 ata_piix ehci_hcd uhci_hcd usbcore thermal 
 processor fan mptscsih mptbase megaraid_sas megaraid_mbox 
 megaraid_mm cciss aacraid
 [57560.396000] CPU:2
 [57560.396000] EIP:0060:[c01d6c56]Not tainted VLI
 [57560.396000] EFLAGS: 00010297   (2.6.22.14-etch-686-envcan #1)
 [57560.396000] EIP is at vsnprintf+0x2af/0x48c
 [57560.396000] eax: 8000   ebx:    ecx: 8000   edx:
 fffe
 [57560.396000] esi: edf37017   edi: edf09eac   ebp:    esp:
 edf09e4c
 [57560.396000] ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
 [57560.396000] Process clBackup (pid: 31421, ti=edf08000 task=f7d36530
 task.ti=edf08000)
 [57560.396000] Stack: c852b000 1000 c0338c78 f895b56c 
 c0233bf5 c852b000 120c8fe8 edf37017
 [57560.396000]00c3bd08    
 c03354eb 0003 0017
 [57560.396000]c0376dc0 c852b000 c01d6eb4 edf09eac edf09eac
 c0233170 edf37017 c03354ea
 [57560.396000] Call Trace:
 [57560.396000]  [c0233bf5] dev_uevent+0x189/0x1e0 
 [57560.396000]  [c01d6eb4] sprintf+0x20/0x23 [57560.396000] 
  [c0233170] show_uevent+0xad/0xd5 [57560.396000]  
 [c0154f48] get_page_from_freelist+0x296/0x32d
 [57560.396000]  [c012e6f0] group_send_sig_info+0x12/0x56 
 [57560.396000]  [c0155031] __alloc_pages+0x52/0x294 
 [57560.396000]  [c02330c3] show_uevent+0x0/0xd5 
 [57560.396000]  [c0232c82] dev_attr_show+0x15/0x18 
 [57560.396000]  [c01a6979] sysfs_read_file+0x87/0xd8 
 [57560.396000]  [c0185f04] sys_getxattr+0x46/0x4e 
 [57560.396000]  [c01a68f2] sysfs_read_file+0x0/0xd8 
 [57560.396000]  [c016fe03] vfs_read+0xa6/0x128 
 [57560.396000]  [c01701ff] sys_read+0x41/0x67 
 [57560.396000]  [c0103d8a] syscall_call+0x7/0xb 
 [57560.396000]  === [57560.396000] Code: 
 74 24 28 73 03 c6 06 20 4d 46 85 ed 7f f1 e9 b9 00 00 00 8b 
 0f b8 79 e0 32 c0 8b 54 24 2c 81 f9 ff 0f 00 00 0f 46 c8 89 
 c8 eb 06 80 38 00 74 07 40 4a 83 fa ff 75 f4 29 c8 f6 44 24 
 30 10 89 c3 [57560.396000] EIP: [c01d6c56] 
 vsnprintf+0x2af/0x48c SS:ESP 0068:edf09e4c
 
   
   and also with CFS but without CONFIG_FAIR_GROUP_SCHED.
   
 
 Is it still required since it now does not seems to be CFS related?
 
  
  Hi Ingo,
  
  I am able to reproduce the oops here on my system with
  2.6.22.14 + CFS backport. I am not able to reproduce it with
  2.6.22.13 + CFS backport. I believe the CFS backport is 
 just exposing 
  the bug. Can't find an obvious culprit and am looking into 
 this issue.
  
  Vincent, could you please confirm if you are able to reproduce this 
  with
  2.6.22.13 + CFS?
 
 Using 2.6.13 + CFS v24 I was also able to reproduce the bug 
 (I already had one built in my depot without the 
 display_most-recently-opened_sysfs_file_name_when_oopsing.patc
 h).  So it looks like it is at least related to = 2.6.22.13 
 and probably not directly CFS related.  Note that to get a 
 oops on a 2.6.13 it seems to need a full backup since it 
 usually works with incremental.  The backup does start 
 properly then, in this case, at around 70% it oopsed.  Using
 2.6.22.14 it seems to oops right at startup.  Here is the 
 2.6.22.13 CFS v24 oops:

Again, just to clarify, I'm not even sure the backup worked at all using
a 2.6.22.13 CFS v24 since I already had a previous pending full backup
at 70% ... so it may simply had tried to finalize that one and crash
right at startup?

 [  170.152908] SGI XFS Quota Management subsystem [  
 170.168443] Filesystem drbd0: Disabling barriers, not 
 supported by the underlying device [  170.174964] XFS 
 mounting filesystem drbd0 [  170.232455] Ending clean XFS 
 mount for filesystem: drbd0 [  170.318614] Filesystem 
 drbd1

RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-12-12 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Dhaval Giani
 
 On Wed, Dec 12, 2007 at 07:57:33AM -0500, Fortier,Vincent 
 [Montreal] wrote:
   -Message d'origine-
   De : Dhaval Giani [mailto:[EMAIL PROTECTED]
   
   On Tue, Dec 11, 2007 at 10:06:53PM +0100, Ingo Molnar wrote:

* Fortier,Vincent [Montreal] [EMAIL PROTECTED] wrote:

  That has changed from /sys/kernel/uids/uid/cpu_share
 
 Here is my config.
 
 Maybie I should give it a shot without CFS at all and see what

 happends ?
  
  It got triggerred also using a 2.6.22.14:

Here are my preliminary test results:
   2.6.21.7: OK
   2.6.22.13/14: Failure
   2.6.23.9: OK
2.6.24-rc5-git2: OK

It seems to only hang using a 2.6.22 kernel.

 
 No, not any more. Would it be possible for you to do a 
 git-bisect? I am not too well versed with sysfs, so it is not 
 apparent to me what is causing this oops. It seems to be 
 easily reproducible. I don't still have a reliable method to 
 reproduce it without the CFS patch. Could sysfs experts 
 please help debugging?
 

I seriously doubt I have the time to do a git-bisect at the moment

- vin
--
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-12-11 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Dhaval Giani
> 
> On Tue, Dec 11, 2007 at 09:04:00AM -0800, Greg KH wrote:
> > On Tue, Dec 11, 2007 at 10:13:19PM +0530, Dhaval Giani wrote:
> > > On Tue, Dec 11, 2007 at 08:24:37PM +0530, Dhaval Giani wrote:
> > > > On Mon, Dec 10, 2007 at 09:15:01AM -0800, Randy Dunlap wrote:
> > > > > On Mon, 10 Dec 2007 09:03:17 -0500 Fortier,Vincent wrote:
> > > > > 
> > > > > Ingo, can you look at this, please?
> > > > > Vincent is getting oopses on 2.6.22.14-cfs-etch.
> > > > > 
> > > > 
> > > > Hi,
> > > > 
> > > > We are looking into this bug now. I believe that the patch at
> > > > http://marc.info/?l=linux-kernel=119404922603293 should help.
> > > > 
> > > > I am working with Kay to get this ported.
> > > > 
> > > 
> > > Hi Vincent,
> > > 
> > > Does the following patch help?
> > > 
> > > Kay/Greg, could you please review and add your Signed-off-by(s) as

> > > required?
> > 
> > Um, why?  What is this patch for?  Where is it to be sent, to Linus 
> > for 2.6.24-final?  Or to the -stable tree?
> > 
> 
> Hi Greg,
> 
> This is for 2.26.24-final, since Fair User scheduling is not 
> yet there in stable.
> 
> > > This is basically a port of the patch at
> > > http://marc.info/?l=linux-kernel=119404922603293
> > 
> > Yeah, but that patch needs some other core kobject changes, right?
> > 
> 
> Yep, there are some other changes that patch needed. We have 
> worked around them by using the existing functions in the 
> current Linus tree.
> 
> > What exactly are you trying to fix here, the fact that this code
never 
> > even worked?
> > 
> 
> The code was not using the kobject API. Its been cleaned up now (I
> hope!)

It refused to apply cleanly on a 2.6.22.14 + CFS v24, only one failure
occured.  So I resolved it manually and attached the resulting diff.

My tests with Galaxy 5.9 shows that it still does not work.  Although,
the error seems to have changed a bit (see attached dmesg)

> > And, please, we need some documentation for Documenatation/ABI/ on 
> > exactly what these sysfs files and tree is for.  Please add that now

> > for Linus's tree.
> 
> On to it, will send the patch asap.
> 
> > confused,
> 
> hope i helped (in clearing it :) )

Should this patch eventually be included in?
2.6.25  ?
2.6.24  ?
(-stable 2.6.23 & 2.6.22) || backport CFS v24 -> v25?

Thnx,

- vin


dmesg.2.6.22.14-CFSv24-FairUserInterfaceBUGfix
Description: dmesg.2.6.22.14-CFSv24-FairUserInterfaceBUGfix


FairUserInterface-BugFix.patch
Description: FairUserInterface-BugFix.patch


RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-12-11 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Dhaval Giani
 
 On Tue, Dec 11, 2007 at 09:04:00AM -0800, Greg KH wrote:
  On Tue, Dec 11, 2007 at 10:13:19PM +0530, Dhaval Giani wrote:
   On Tue, Dec 11, 2007 at 08:24:37PM +0530, Dhaval Giani wrote:
On Mon, Dec 10, 2007 at 09:15:01AM -0800, Randy Dunlap wrote:
 On Mon, 10 Dec 2007 09:03:17 -0500 Fortier,Vincent wrote:
 
 Ingo, can you look at this, please?
 Vincent is getting oopses on 2.6.22.14-cfs-etch.
 

Hi,

We are looking into this bug now. I believe that the patch at
http://marc.info/?l=linux-kernelm=119404922603293 should help.

I am working with Kay to get this ported.

   
   Hi Vincent,
   
   Does the following patch help?
   
   Kay/Greg, could you please review and add your Signed-off-by(s) as

   required?
  
  Um, why?  What is this patch for?  Where is it to be sent, to Linus 
  for 2.6.24-final?  Or to the -stable tree?
  
 
 Hi Greg,
 
 This is for 2.26.24-final, since Fair User scheduling is not 
 yet there in stable.
 
   This is basically a port of the patch at
   http://marc.info/?l=linux-kernelm=119404922603293
  
  Yeah, but that patch needs some other core kobject changes, right?
  
 
 Yep, there are some other changes that patch needed. We have 
 worked around them by using the existing functions in the 
 current Linus tree.
 
  What exactly are you trying to fix here, the fact that this code
never 
  even worked?
  
 
 The code was not using the kobject API. Its been cleaned up now (I
 hope!)

It refused to apply cleanly on a 2.6.22.14 + CFS v24, only one failure
occured.  So I resolved it manually and attached the resulting diff.

My tests with Galaxy 5.9 shows that it still does not work.  Although,
the error seems to have changed a bit (see attached dmesg)

  And, please, we need some documentation for Documenatation/ABI/ on 
  exactly what these sysfs files and tree is for.  Please add that now

  for Linus's tree.
 
 On to it, will send the patch asap.
 
  confused,
 
 hope i helped (in clearing it :) )

Should this patch eventually be included in?
2.6.25  ?
2.6.24  ?
(-stable 2.6.23  2.6.22) || backport CFS v24 - v25?

Thnx,

- vin


dmesg.2.6.22.14-CFSv24-FairUserInterfaceBUGfix
Description: dmesg.2.6.22.14-CFSv24-FairUserInterfaceBUGfix


FairUserInterface-BugFix.patch
Description: FairUserInterface-BugFix.patch


RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-12-10 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : Randy Dunlap [mailto:[EMAIL PROTECTED] 
> Envoyé : 10 décembre 2007 12:15
> 
> On Mon, 10 Dec 2007 09:03:17 -0500 Fortier,Vincent [Montreal] wrote:
> 
> Ingo, can you look at this, please?
> Vincent is getting oopses on 2.6.22.14-cfs-etch.
> 
> Vincent, did you apply the cfs patch or did Debian etch provide that?

I did. http://linux-dev.qc.ec.gc.ca/

> If you applied it, did you use
> http://people.redhat.com/mingo/cfs-scheduler/sched-cfs-v2.6.22.13-v24.patch
> or a different patch?
> 

I applied exactly that one.. and had already sent that info to ingo this 
morning since I presumed the CFS patchset could be involved in this by 
reagarding the more detailed output.

Also note that CFS v24 on 2.6.21 does not produce the oops and I can run galaxy 
backups on the system without any problems.

- vin
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-12-10 Thread Fortier,Vincent [Montreal]
 

> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de 
> Fortier,Vincent [Montreal]
> Envoyé : 10 décembre 2007 08:21
> À : Randy Dunlap; Andrew Morton
> Cc : linux-kernel@vger.kernel.org
> Objet : RE: 2.6.22.14 oops msg with commvault galaxy ?
> 
> > -Message d'origine-
> > De : Randy Dunlap [mailto:[EMAIL PROTECTED] Envoyé : 
> 7 décembre 
> > 2007 20:15
> > 
> > On Fri, 7 Dec 2007 15:11:13 -0800 Andrew Morton wrote:
> > 
> > > On Fri, 7 Dec 2007 14:15:36 -0800
> > > Randy Dunlap <[EMAIL PROTECTED]> wrote:
> > > 
> > > > > Help would really be appreciated.
> > > > 
> > > > Let's try the last_sysfs_file (name) patch.
> > > > I've attempted to update it for 2.6.22.14.
> > > > Andrew, does this change in fs/sysfs/file.c look OK?
> > > 
> > > umm, yup.
> > > 
> > > 
> > 
> ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.21-r
> > > 
> > 
> c6/2.6.21-rc6-mm1/broken-out/gregkh-driver-sysfs-crash-debugging.patch
> > > 
> > > should work.
> > 
> > Thanks.  
> > I produced a cleanly applying version of it for 2.6.22.14.
> > 
> > Vincent, please apply this patch so we can know which file in sysfs 
> > these oopses are happening with.
> > 
> 
> It did not applied cleanly on a 2.6.22.14... copy/paste might 
> be the issue here... Anyhow, I corrected the patch failure to 
> apply and here is my version of it... Hoping I got this 
> (attached patch).
> 
> Compiling at the moment... will try this out with commvault 
> 5.9 probably in the morning and get back with the results.
> 
> Let me know I got the patch wrong.

Here is the resulting trace... hoping this helps...:

[  942.107304] BUG: unable to handle kernel NULL pointer dereference at virtual 
address 00c8
[  942.107339]  printing eip:
[  942.107354] c01a924c
[  942.107368] *pdpt = 2d6b4001
[  942.107383] *pde = 
[  942.107401] Oops:  [#1]
[  942.107414] SMP
[  942.107431] last sysfs file: /kernel/uids/104/cpu_share
[  942.107449] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd nfs_acl 
sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse ide_cd 
ide_generic usbkbd usbmouse tsdev sg iTCO_wdt iTCO_vendor_support e752x_edac 
edac_mc psmouse floppy shpchp pci_hotplug serio_raw sr_mod pcspkr evdev cdrom 
ext3 jbd mbcache dm_mirror dm_snapshot dm_mod generic piix ide_core ehci_hcd 
uhci_hcd usbcore ata_piix tg3 thermal processor fan mptscsih mptbase 
megaraid_sas megaraid_mbox megaraid_mm cciss aacraid
[  942.107675] CPU:0
[  942.107676] EIP:0060:[]Not tainted VLI
[  942.107678] EFLAGS: 00010202   (2.6.22.14-cfs-etch-686-envcan #1)
[  942.107730] EIP is at sysfs_open_file+0xae/0x21e
[  942.107749] eax:    ebx: f77783b8   ecx: dfb0b280   edx: 00c8
[  942.107769] esi: f7e0ce8c   edi: c03fd5c0   ebp: c01a919e   esp: f1257ed8
[  942.107789] ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
[  942.107810] Process clBackup (pid: 5191, ti=f1256000 task=f71b3290 
task.ti=f1256000)
[  942.107831] Stack: 1000 f295d240 ed4f4ac0 f7e0ce48 f295d240 ed4f4ac0 
f1257f30 c01a919e
[  942.107878]c0170464 dfe76100 ed4f3880 f295d240 8000 f1257f30 
0010 c0170595
[  942.107921]f295d240   c01705d6  f1257f30 
ed4f3880 dfe76100
[  942.107968] Call Trace:
[  942.107998]  [] sysfs_open_file+0x0/0x21e
[  942.108017]  [] __dentry_open+0xc1/0x178
[  942.108039]  [] nameidata_to_filp+0x24/0x33
[  942.108063]  [] do_filp_open+0x32/0x39
[  942.108088]  [] get_unused_fd+0x4a/0xaa
[  942.108112]  [] do_sys_open+0x42/0xc3
[  942.108134]  [] sys_open+0x1c/0x1e
[  942.108155]  [] syscall_call+0x7/0xb
[  942.108179]  ===
[  942.108194] Code: b8 c0 c5 3f c0 41 e8 e8 06 03 00 83 7c 24 0c 00 0f 84 72 
01 00 00 85 f6 0f 84 6a 01 00 00 8b 56 04 85 d2 74 19 64 a1 08 50 3d c0 <83> 3a 
02 0f 84 44 01 00 00 c1 e0 05 ff 84 10 20 01 00 00 8b 54
[  942.108364] EIP: [] sysfs_open_file+0xae/0x21e SS:ESP 0068:f1257ed8

- vin
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-12-10 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : Randy Dunlap [mailto:[EMAIL PROTECTED] 
> Envoyé : 7 décembre 2007 20:15
> 
> On Fri, 7 Dec 2007 15:11:13 -0800 Andrew Morton wrote:
> 
> > On Fri, 7 Dec 2007 14:15:36 -0800
> > Randy Dunlap <[EMAIL PROTECTED]> wrote:
> > 
> > > > Help would really be appreciated.
> > > 
> > > Let's try the last_sysfs_file (name) patch.
> > > I've attempted to update it for 2.6.22.14.
> > > Andrew, does this change in fs/sysfs/file.c look OK?
> > 
> > umm, yup.
> > 
> > 
> ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.21-r
> > 
> c6/2.6.21-rc6-mm1/broken-out/gregkh-driver-sysfs-crash-debugging.patch
> > 
> > should work.
> 
> Thanks.  
> I produced a cleanly applying version of it for 2.6.22.14.
> 
> Vincent, please apply this patch so we can know which file in 
> sysfs these oopses are happening with.
> 

It did not applied cleanly on a 2.6.22.14... copy/paste might be the issue 
here... Anyhow, I corrected the patch failure to apply and here is my version 
of it... Hoping I got this (attached patch).

Compiling at the moment... will try this out with commvault 5.9 probably in the 
morning and get back with the results.

Let me know I got the patch wrong.

- vin


display_most-recently-opened_sysfs_file_name_when_oopsing.patch
Description: display_most-recently-opened_sysfs_file_name_when_oopsing.patch


RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-12-10 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : Randy Dunlap [mailto:[EMAIL PROTECTED] 
 Envoyé : 7 décembre 2007 20:15
 
 On Fri, 7 Dec 2007 15:11:13 -0800 Andrew Morton wrote:
 
  On Fri, 7 Dec 2007 14:15:36 -0800
  Randy Dunlap [EMAIL PROTECTED] wrote:
  
Help would really be appreciated.
   
   Let's try the last_sysfs_file (name) patch.
   I've attempted to update it for 2.6.22.14.
   Andrew, does this change in fs/sysfs/file.c look OK?
  
  umm, yup.
  
  
 ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.21-r
  
 c6/2.6.21-rc6-mm1/broken-out/gregkh-driver-sysfs-crash-debugging.patch
  
  should work.
 
 Thanks.  
 I produced a cleanly applying version of it for 2.6.22.14.
 
 Vincent, please apply this patch so we can know which file in 
 sysfs these oopses are happening with.
 

It did not applied cleanly on a 2.6.22.14... copy/paste might be the issue 
here... Anyhow, I corrected the patch failure to apply and here is my version 
of it... Hoping I got this (attached patch).

Compiling at the moment... will try this out with commvault 5.9 probably in the 
morning and get back with the results.

Let me know I got the patch wrong.

- vin


display_most-recently-opened_sysfs_file_name_when_oopsing.patch
Description: display_most-recently-opened_sysfs_file_name_when_oopsing.patch


RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-12-10 Thread Fortier,Vincent [Montreal]
 

 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de 
 Fortier,Vincent [Montreal]
 Envoyé : 10 décembre 2007 08:21
 À : Randy Dunlap; Andrew Morton
 Cc : linux-kernel@vger.kernel.org
 Objet : RE: 2.6.22.14 oops msg with commvault galaxy ?
 
  -Message d'origine-
  De : Randy Dunlap [mailto:[EMAIL PROTECTED] Envoyé : 
 7 décembre 
  2007 20:15
  
  On Fri, 7 Dec 2007 15:11:13 -0800 Andrew Morton wrote:
  
   On Fri, 7 Dec 2007 14:15:36 -0800
   Randy Dunlap [EMAIL PROTECTED] wrote:
   
 Help would really be appreciated.

Let's try the last_sysfs_file (name) patch.
I've attempted to update it for 2.6.22.14.
Andrew, does this change in fs/sysfs/file.c look OK?
   
   umm, yup.
   
   
  
 ftp://ftp.kernel.org/pub/linux/kernel/people/akpm/patches/2.6/2.6.21-r
   
  
 c6/2.6.21-rc6-mm1/broken-out/gregkh-driver-sysfs-crash-debugging.patch
   
   should work.
  
  Thanks.  
  I produced a cleanly applying version of it for 2.6.22.14.
  
  Vincent, please apply this patch so we can know which file in sysfs 
  these oopses are happening with.
  
 
 It did not applied cleanly on a 2.6.22.14... copy/paste might 
 be the issue here... Anyhow, I corrected the patch failure to 
 apply and here is my version of it... Hoping I got this 
 (attached patch).
 
 Compiling at the moment... will try this out with commvault 
 5.9 probably in the morning and get back with the results.
 
 Let me know I got the patch wrong.

Here is the resulting trace... hoping this helps...:

[  942.107304] BUG: unable to handle kernel NULL pointer dereference at virtual 
address 00c8
[  942.107339]  printing eip:
[  942.107354] c01a924c
[  942.107368] *pdpt = 2d6b4001
[  942.107383] *pde = 
[  942.107401] Oops:  [#1]
[  942.107414] SMP
[  942.107431] last sysfs file: /kernel/uids/104/cpu_share
[  942.107449] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd nfs_acl 
sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse ide_cd 
ide_generic usbkbd usbmouse tsdev sg iTCO_wdt iTCO_vendor_support e752x_edac 
edac_mc psmouse floppy shpchp pci_hotplug serio_raw sr_mod pcspkr evdev cdrom 
ext3 jbd mbcache dm_mirror dm_snapshot dm_mod generic piix ide_core ehci_hcd 
uhci_hcd usbcore ata_piix tg3 thermal processor fan mptscsih mptbase 
megaraid_sas megaraid_mbox megaraid_mm cciss aacraid
[  942.107675] CPU:0
[  942.107676] EIP:0060:[c01a924c]Not tainted VLI
[  942.107678] EFLAGS: 00010202   (2.6.22.14-cfs-etch-686-envcan #1)
[  942.107730] EIP is at sysfs_open_file+0xae/0x21e
[  942.107749] eax:    ebx: f77783b8   ecx: dfb0b280   edx: 00c8
[  942.107769] esi: f7e0ce8c   edi: c03fd5c0   ebp: c01a919e   esp: f1257ed8
[  942.107789] ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
[  942.107810] Process clBackup (pid: 5191, ti=f1256000 task=f71b3290 
task.ti=f1256000)
[  942.107831] Stack: 1000 f295d240 ed4f4ac0 f7e0ce48 f295d240 ed4f4ac0 
f1257f30 c01a919e
[  942.107878]c0170464 dfe76100 ed4f3880 f295d240 8000 f1257f30 
0010 c0170595
[  942.107921]f295d240   c01705d6  f1257f30 
ed4f3880 dfe76100
[  942.107968] Call Trace:
[  942.107998]  [c01a919e] sysfs_open_file+0x0/0x21e
[  942.108017]  [c0170464] __dentry_open+0xc1/0x178
[  942.108039]  [c0170595] nameidata_to_filp+0x24/0x33
[  942.108063]  [c01705d6] do_filp_open+0x32/0x39
[  942.108088]  [c0170343] get_unused_fd+0x4a/0xaa
[  942.108112]  [c017061f] do_sys_open+0x42/0xc3
[  942.108134]  [c01706d9] sys_open+0x1c/0x1e
[  942.108155]  [c0103d8a] syscall_call+0x7/0xb
[  942.108179]  ===
[  942.108194] Code: b8 c0 c5 3f c0 41 e8 e8 06 03 00 83 7c 24 0c 00 0f 84 72 
01 00 00 85 f6 0f 84 6a 01 00 00 8b 56 04 85 d2 74 19 64 a1 08 50 3d c0 83 3a 
02 0f 84 44 01 00 00 c1 e0 05 ff 84 10 20 01 00 00 8b 54
[  942.108364] EIP: [c01a924c] sysfs_open_file+0xae/0x21e SS:ESP 0068:f1257ed8

- vin
--
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-12-10 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : Randy Dunlap [mailto:[EMAIL PROTECTED] 
 Envoyé : 10 décembre 2007 12:15
 
 On Mon, 10 Dec 2007 09:03:17 -0500 Fortier,Vincent [Montreal] wrote:
 
 Ingo, can you look at this, please?
 Vincent is getting oopses on 2.6.22.14-cfs-etch.
 
 Vincent, did you apply the cfs patch or did Debian etch provide that?

I did. http://linux-dev.qc.ec.gc.ca/

 If you applied it, did you use
 http://people.redhat.com/mingo/cfs-scheduler/sched-cfs-v2.6.22.13-v24.patch
 or a different patch?
 

I applied exactly that one.. and had already sent that info to ingo this 
morning since I presumed the CFS patchset could be involved in this by 
reagarding the more detailed output.

Also note that CFS v24 on 2.6.21 does not produce the oops and I can run galaxy 
backups on the system without any problems.

- vin
--
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-11-30 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : Randy Dunlap [mailto:[EMAIL PROTECTED] 
> Envoyé : 30 novembre 2007 12:13
> 
> On Fri, 30 Nov 2007 13:02:54 + Vincent Fortier wrote:
> 
> > Hi all,
> > 
> > I'm using a 2.6.22.14 + CFS v24 and I got theses errors 
> when starting 
> > up my commvault galaxy client...  Do anybody know what this could mean?
> 
> Can you provide a few lines of syslog before the Oops: line, 
> which should contain some info about what happened, e.g.:
> 
> Unable to handle kernel paging request at virtual address 
> e4a85017 printing eip:
> c01d915a
> *pde = 37d0d067
> *pte = 

Would this be better?
[766535.379600] BUG: unable to handle kernel NULL pointer dereference at 
virtual address 00c8
[766535.379636]  printing eip:
[766535.379652] c01a920c
[766535.379665] *pdpt = 1cc2c001
[766535.379681] *pde = 
[766535.379698] Oops:  [#1]
[766535.379713] SMP
[766535.379729] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd nfs_acl 
sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse ide_cd 
ide_generic usbkbd usbmouse tsdev sg iTCO_wdt iTCO_vendor_support psmouse 
e752x_edac shpchp serio_raw edac_mc pcspkr evdev sr_mod pci_hotplug floppy 
cdrom ext3 jbd mbcache dm_mirror dm_snapshot dm_mod generic piix ide_core 
ehci_hcd uhci_hcd ata_piix tg3 usbcore thermal processor fan mptscsih mptbase 
megaraid_sas megaraid_mbox megaraid_mm cciss aacraid
[766535.379956] CPU:0
[766535.379957] EIP:0060:[]Not tainted VLI
[766535.379959] EFLAGS: 00010202   (2.6.22.14-cfs-etch-686-envcan #1)
[766535.380011] EIP is at sysfs_open_file+0x78/0x1e4
[766535.380028] eax:    ebx: f7f02e58   ecx: 000d   edx: 00c8
[766535.380049] esi: f7e7ec8c   edi: defadf30   ebp: c01a9194   esp: defadedc
[766535.380070] ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
[766535.380091] Process clBackup (pid: 22085, ti=defac000 task=de01ac60 
task.ti=defac000)
[766535.380110] Stack: de093300 dd2ce408 f7e7ec48 de093300 dd2ce408 defadf30 
c01a9194 c017048c
[766535.380155]dfe8a180 dd2cbd48 de093300 8000 defadf30 000d 
c01705bd de093300
[766535.380202]  c01705fe  defadf30 dd2cbd48 
dfe8a180 e0bd6f00
[766535.380246] Call Trace:
[766535.380276]  [] sysfs_open_file+0x0/0x1e4
[766535.380296]  [] __dentry_open+0xc1/0x178
[766535.380321]  [] nameidata_to_filp+0x24/0x33
[766535.380343]  [] do_filp_open+0x32/0x39
[766535.380367]  [] get_unused_fd+0x4a/0xaa
[766535.380390]  [] do_sys_open+0x42/0xc3
[766535.380413]  [] sys_open+0x1c/0x1e
[766535.380434]  [] syscall_call+0x7/0xb
[766535.380460]  ===
[766535.380476] Code: 14 24 83 7c 24 08 00 8b 42 0c 8b 40 54 8b 70 14 0f 84 70 
01 00 00 85 f6 0f 84 68 01 00 00 8b 56 04 85 d2 74 19 64 a1 08 50 3d c0 <83> 3a 
02 0f 84 42 01 00 00 c1 e0 05 ff 84 10 20 01 00 00 8b 54
[766535.380644] EIP: [] sysfs_open_file+0x78/0x1e4 SS:ESP 
0068:defadedc

Again,

> > 
> > thnx very much!
> 

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.22.14 oops msg with commvault galaxy ?

2007-11-30 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : Randy Dunlap [mailto:[EMAIL PROTECTED] 
 Envoyé : 30 novembre 2007 12:13
 
 On Fri, 30 Nov 2007 13:02:54 + Vincent Fortier wrote:
 
  Hi all,
  
  I'm using a 2.6.22.14 + CFS v24 and I got theses errors 
 when starting 
  up my commvault galaxy client...  Do anybody know what this could mean?
 
 Can you provide a few lines of syslog before the Oops: line, 
 which should contain some info about what happened, e.g.:
 
 Unable to handle kernel paging request at virtual address 
 e4a85017 printing eip:
 c01d915a
 *pde = 37d0d067
 *pte = 

Would this be better?
[766535.379600] BUG: unable to handle kernel NULL pointer dereference at 
virtual address 00c8
[766535.379636]  printing eip:
[766535.379652] c01a920c
[766535.379665] *pdpt = 1cc2c001
[766535.379681] *pde = 
[766535.379698] Oops:  [#1]
[766535.379713] SMP
[766535.379729] Modules linked in: xfs drbd cn nfs nfsd exportfs lockd nfs_acl 
sunrpc ppdev parport_pc lp parport button ac battery ipv6 fuse ide_cd 
ide_generic usbkbd usbmouse tsdev sg iTCO_wdt iTCO_vendor_support psmouse 
e752x_edac shpchp serio_raw edac_mc pcspkr evdev sr_mod pci_hotplug floppy 
cdrom ext3 jbd mbcache dm_mirror dm_snapshot dm_mod generic piix ide_core 
ehci_hcd uhci_hcd ata_piix tg3 usbcore thermal processor fan mptscsih mptbase 
megaraid_sas megaraid_mbox megaraid_mm cciss aacraid
[766535.379956] CPU:0
[766535.379957] EIP:0060:[c01a920c]Not tainted VLI
[766535.379959] EFLAGS: 00010202   (2.6.22.14-cfs-etch-686-envcan #1)
[766535.380011] EIP is at sysfs_open_file+0x78/0x1e4
[766535.380028] eax:    ebx: f7f02e58   ecx: 000d   edx: 00c8
[766535.380049] esi: f7e7ec8c   edi: defadf30   ebp: c01a9194   esp: defadedc
[766535.380070] ds: 007b   es: 007b   fs: 00d8  gs: 0033  ss: 0068
[766535.380091] Process clBackup (pid: 22085, ti=defac000 task=de01ac60 
task.ti=defac000)
[766535.380110] Stack: de093300 dd2ce408 f7e7ec48 de093300 dd2ce408 defadf30 
c01a9194 c017048c
[766535.380155]dfe8a180 dd2cbd48 de093300 8000 defadf30 000d 
c01705bd de093300
[766535.380202]  c01705fe  defadf30 dd2cbd48 
dfe8a180 e0bd6f00
[766535.380246] Call Trace:
[766535.380276]  [c01a9194] sysfs_open_file+0x0/0x1e4
[766535.380296]  [c017048c] __dentry_open+0xc1/0x178
[766535.380321]  [c01705bd] nameidata_to_filp+0x24/0x33
[766535.380343]  [c01705fe] do_filp_open+0x32/0x39
[766535.380367]  [c017036b] get_unused_fd+0x4a/0xaa
[766535.380390]  [c0170647] do_sys_open+0x42/0xc3
[766535.380413]  [c0170701] sys_open+0x1c/0x1e
[766535.380434]  [c0103d8a] syscall_call+0x7/0xb
[766535.380460]  ===
[766535.380476] Code: 14 24 83 7c 24 08 00 8b 42 0c 8b 40 54 8b 70 14 0f 84 70 
01 00 00 85 f6 0f 84 68 01 00 00 8b 56 04 85 d2 74 19 64 a1 08 50 3d c0 83 3a 
02 0f 84 42 01 00 00 c1 e0 05 ff 84 10 20 01 00 00 8b 54
[766535.380644] EIP: [c01a920c] sysfs_open_file+0x78/0x1e4 SS:ESP 
0068:defadedc

Again,

  
  thnx very much!
 

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch/backport] CFS scheduler, -v24, for v2.6.24-rc3,v2.6.23.8, v2.6.22.13, v2.6.21.7

2007-11-19 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Ingo Molnar
> 
> 
> * Erik Andersen <[EMAIL PROTECTED]> wrote:
> 
> > On Mon Nov 19, 2007 at 04:17:56PM +0100, Ingo Molnar wrote:
> > > 
> > > By popular demand, here is release -v24 of the CFS 
> scheduler patch.
> > > 
> > > It is a full backport of the latest & greatest scheduler code to 
> > > v2.6.24-rc3, v2.6.23.8, v2.6.22.13, v2.6.21.7. The patches can be 
> > > downloaded from the usual place:
> > > 
> > > http://people.redhat.com/mingo/cfs-scheduler/
> > 
> > Doesn't compile with 2.6.21.7:
> 
> does the patch below help?
> 

Looks good.  Compiling.

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch/backport] CFS scheduler, -v24, for v2.6.24-rc3, v2.6.23.8,v2.6.22.13, v2.6.21.7

2007-11-19 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Ingo Molnar
> 
> By popular demand, here is release -v24 of the CFS scheduler patch.
> 
> It is a full backport of the latest & greatest scheduler code 
> to v2.6.24-rc3, v2.6.23.8, v2.6.22.13, v2.6.21.7. The patches 
> can be downloaded from the usual place:
> 
> http://people.redhat.com/mingo/cfs-scheduler/
> 
> There's tons of changes since v22 was released:
> 
> 36 files changed, 2359 insertions(+), 1082 deletions(-)
> 
> that's 187 individual commits from 32 authors.
> 
> So even if CFS v22 worked well for you, please try this 
> release too and report regressions (if any).
> 

Hi Ingo,

Thnx a lot for theses backports...

Ran into this while compiling a 2.6.22.13 with CFS v24

  CC  kernel/sched.o
kernel/sched.c: In function 'cpu_to_core_group':
kernel/sched.c:6056: error: 'per_cpu__cpu_sibling_map' undeclared (first
use in this function)
kernel/sched.c:6056: error: (Each undeclared identifier is reported only
once
kernel/sched.c:6056: error: for each function it appears in.)
kernel/sched.c:6056: warning: type defaults to 'int' in declaration of
'type name'
kernel/sched.c:6056: error: invalid type argument of 'unary *'
kernel/sched.c: In function 'build_sched_domains':
kernel/sched.c:6319: error: 'per_cpu__cpu_sibling_map' undeclared (first
use in this function)
kernel/sched.c:6319: warning: type defaults to 'int' in declaration of
'type name'
kernel/sched.c:6319: error: invalid type argument of 'unary *'
kernel/sched.c:6330: warning: type defaults to 'int' in declaration of
'type name'
kernel/sched.c:6330: error: invalid type argument of 'unary *'
make[2]: *** [kernel/sched.o] Error 1
make[1]: *** [kernel] Error 2
make[1]: Leaving directory
`/usr/src/linux-headers-2.6.22.13-cfs-sarge-686-envcan'
make: *** [debian/stamp-build-kernel] Error 2

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch/backport] CFS scheduler, -v24, for v2.6.24-rc3, v2.6.23.8,v2.6.22.13, v2.6.21.7

2007-11-19 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Ingo Molnar
 
 By popular demand, here is release -v24 of the CFS scheduler patch.
 
 It is a full backport of the latest  greatest scheduler code 
 to v2.6.24-rc3, v2.6.23.8, v2.6.22.13, v2.6.21.7. The patches 
 can be downloaded from the usual place:
 
 http://people.redhat.com/mingo/cfs-scheduler/
 
 There's tons of changes since v22 was released:
 
 36 files changed, 2359 insertions(+), 1082 deletions(-)
 
 that's 187 individual commits from 32 authors.
 
 So even if CFS v22 worked well for you, please try this 
 release too and report regressions (if any).
 

Hi Ingo,

Thnx a lot for theses backports...

Ran into this while compiling a 2.6.22.13 with CFS v24

  CC  kernel/sched.o
kernel/sched.c: In function 'cpu_to_core_group':
kernel/sched.c:6056: error: 'per_cpu__cpu_sibling_map' undeclared (first
use in this function)
kernel/sched.c:6056: error: (Each undeclared identifier is reported only
once
kernel/sched.c:6056: error: for each function it appears in.)
kernel/sched.c:6056: warning: type defaults to 'int' in declaration of
'type name'
kernel/sched.c:6056: error: invalid type argument of 'unary *'
kernel/sched.c: In function 'build_sched_domains':
kernel/sched.c:6319: error: 'per_cpu__cpu_sibling_map' undeclared (first
use in this function)
kernel/sched.c:6319: warning: type defaults to 'int' in declaration of
'type name'
kernel/sched.c:6319: error: invalid type argument of 'unary *'
kernel/sched.c:6330: warning: type defaults to 'int' in declaration of
'type name'
kernel/sched.c:6330: error: invalid type argument of 'unary *'
make[2]: *** [kernel/sched.o] Error 1
make[1]: *** [kernel] Error 2
make[1]: Leaving directory
`/usr/src/linux-headers-2.6.22.13-cfs-sarge-686-envcan'
make: *** [debian/stamp-build-kernel] Error 2

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch/backport] CFS scheduler, -v24, for v2.6.24-rc3,v2.6.23.8, v2.6.22.13, v2.6.21.7

2007-11-19 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Ingo Molnar
 
 
 * Erik Andersen [EMAIL PROTECTED] wrote:
 
  On Mon Nov 19, 2007 at 04:17:56PM +0100, Ingo Molnar wrote:
   
   By popular demand, here is release -v24 of the CFS 
 scheduler patch.
   
   It is a full backport of the latest  greatest scheduler code to 
   v2.6.24-rc3, v2.6.23.8, v2.6.22.13, v2.6.21.7. The patches can be 
   downloaded from the usual place:
   
   http://people.redhat.com/mingo/cfs-scheduler/
  
  Doesn't compile with 2.6.21.7:
 
 does the patch below help?
 

Looks good.  Compiling.

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: BUG: usb Mouse continual disconnect (2.6.22.10) leading to eventual crash?

2007-11-07 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : Andrew Morton [mailto:[EMAIL PROTECTED] 
> Envoyé : 7 novembre 2007 15:33
> 
> (+linux-usb-devel)
> 
> > On Wed, 7 Nov 2007 13:19:47 -0500 "Fortier,Vincent 
> [Montreal]" <[EMAIL PROTECTED]> wrote:
> > 
> > I'm having a preblem with a few workstations that are hard freezing 
> > from time to time since about a week.  Curiously since then we 
> > upgraded from
> > 2.6.20.6 to 2.6.20.10...
> > 

[.]

> > 
> > Looking at the changes between 2.6.20.6 and 2.6.20.10 I only found 
> > this USB related:
> > 
> http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.22.y.git;a=commit;h=0e22438a5adfdf32b3bb1c75c81c01a29fba9770
> > 
> > I'm presuming that maybie eventually it might hit a input max barrier 
> > (like a max of 1024) and then the kernel would be crashing?
> > 
> > Help very much appreciated!
> 
> People are going to want to know if this is still happening 
> in 2.6.23 or 2.6.24-rc2, please.

I andrew,

(logged-in at wrok to make proper reply)

It might be tricky to test properly on 2.6.23.1 since theses are all 
operationnal aviation forecaster workstations 24/7  And sadly the problem 
only occur on a few of theses workstations AFAIK randomly ???  Actually many of 
them did not had any problems using the exact same FAI reinstallation using the 
same hardware..?!?!.

If someone thinks that the problem might effectively be the usb patch that I 
mentionned before (or any other patch that might make suddently thoses stations 
to freeze dur to a change that occur between 2.6.20.6 and .10) I would then be 
able to push a new kernel on thoses workstations without the offending 
patch(es) and have a clear answer within a week.

I also need to mention that our 2.6.20.6 build contained CFS v20.5 and the new 
2.6.20.10 build contains CFS v22 although I doubt it is the cause of the 
freezing problem but hey, never know.  Although I could probably rebuild and 
push a new 2.6.20.10 without CFS to be sure.

I'm also currently following the problem occurring on a station and I hope it 
will freeze at an evident input entry.  Here where it is now:
Nov  7 22:04:44 mammatus kernel: [100926.387652] usb 2-1: USB disconnect, 
address 118
Nov  7 22:04:44 mammatus kernel: [100926.627502] usb 2-1: new low speed USB 
device using uhci_hcd and address 119
Nov  7 22:04:45 mammatus kernel: [100926.804006] usb 2-1: configuration #1 
chosen from 1 choice
Nov  7 22:04:45 mammatus kernel: [100926.820213] input: Logitech USB-PS/2 
Optical Mouse as /class/input/input729
Nov  7 22:04:45 mammatus kernel: [100926.820304] input: USB HID v1.10 Mouse 
[Logitech USB-PS/2 Optical Mouse] on usb-:00:1d.0-1
Nov  7 22:11:55 mammatus kernel: [101356.846746] usb 2-1: USB disconnect, 
address 119
Nov  7 22:11:56 mammatus kernel: [101357.590408] usb 2-1: new low speed USB 
device using uhci_hcd and address 120
Nov  7 22:11:56 mammatus kernel: [101357.766609] usb 2-1: configuration #1 
chosen from 1 choice
Nov  7 22:11:56 mammatus kernel: [101357.782803] input: Logitech USB-PS/2 
Optical Mouse as /class/input/input730
Nov  7 22:11:56 mammatus kernel: [101357.782897] input: USB HID v1.10 Mouse 
[Logitech USB-PS/2 Optical Mouse] on usb-:00:1d.0-1

vin

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


BUG: usb Mouse continual disconnect (2.6.22.10) leading to eventual crash?

2007-11-07 Thread Fortier,Vincent [Montreal]
Hi all,

I'm having a preblem with a few workstations that are hard freezing from
time to time since about a week.  Curiously since then we upgraded from
2.6.20.6 to 2.6.20.10...

While investigating I found out that there seems to be a problem,
occuring randomly on some workstations after a certain amount time, that
the mouse gets disconnected continously (see full log attached):
Nov  7 00:09:36 mammatus kernel: [22055.357104] usb 2-1: USB disconnect,
address 2
Nov  7 00:09:37 mammatus kernel: [22055.596974] usb 2-1: new low speed
USB device using uhci_hcd and address 3
Nov  7 00:09:37 mammatus kernel: [22055.777831] usb 2-1: configuration
#1 chosen from 1 choice
Nov  7 00:09:37 mammatus kernel: [22055.793905] input: Logitech USB-PS/2
Optical Mouse as /class/input/input5
Nov  7 00:09:37 mammatus kernel: [22055.793994] input: USB HID v1.10
Mouse [Logitech USB-PS/2 Optical Mouse] on usb-:00:1d.0-1
Nov  7 00:10:11 mammatus kernel: [22090.368722] usb 2-1: USB disconnect,
address 3
Nov  7 00:10:12 mammatus kernel: [22090.608588] usb 2-1: new low speed
USB device using uhci_hcd and address 4
Nov  7 00:10:12 mammatus kernel: [22090.784671] usb 2-1: configuration
#1 chosen from 1 choice
Nov  7 00:10:12 mammatus kernel: [22090.800881] input: Logitech USB-PS/2
Optical Mouse as /class/input/input6
Nov  7 00:10:12 mammatus kernel: [22090.800982] input: USB HID v1.10
Mouse [Logitech USB-PS/2 Optical Mouse] on usb-:00:1d.0-1
[.]
Nov  7 17:57:28 mammatus kernel: [86097.543960] input: Logitech USB-PS/2
Optical Mouse as /class/input/input596
Nov  7 17:57:28 mammatus kernel: [86097.544050] input: USB HID v1.10
Mouse [Logitech USB-PS/2 Optical Mouse] on usb-:00:1d.0-1
Nov  7 17:59:07 mammatus kernel: [86196.161272] usb 2-1: USB disconnect,
address 112
Nov  7 17:59:07 mammatus kernel: [86196.401142] usb 2-1: new low speed
USB device using uhci_hcd and address 113
Nov  7 17:59:08 mammatus kernel: [86196.577674] usb 2-1: configuration
#1 chosen from 1 choice
Nov  7 17:59:08 mammatus kernel: [86196.593834] input: Logitech USB-PS/2
Optical Mouse as /class/input/input597
Nov  7 17:59:08 mammatus kernel: [86196.593939] input: USB HID v1.10
Mouse [Logitech USB-PS/2 Optical Mouse] on usb-:00:1d.0-1
Nov  7 17:59:13 mammatus kernel: [86202.206449] usb 2-1: USB disconnect,
address 113
Nov  7 17:59:13 mammatus kernel: [86202.446319] usb 2-1: new low speed
USB device using uhci_hcd and address 114
Nov  7 17:59:14 mammatus kernel: [86202.623309] usb 2-1: configuration
#1 chosen from 1 choice
Nov  7 17:59:14 mammatus kernel: [86202.638402] input: Logitech USB-PS/2
Optical Mouse as /class/input/input598
Nov  7 17:59:14 mammatus kernel: [86202.638493] input: USB HID v1.10
Mouse [Logitech USB-PS/2 Optical Mouse] on usb-:00:1d.0-1

There are some occasionnal connect/disconnect due to the fact that we
recently bought an ergonomic mouse (Evoluent Vertical Mouse 3 Rev. 2)
that some people plug-in the workstation they are working for the
duration of their day/night shift... but nothing like 600 disconnect /
day / workstations?  And actually, the source workstation of this log as
not yet got the mouse connected to it since it's last reboot.

Looking at the changes between 2.6.20.6 and 2.6.20.10 I only found this
USB related:
http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.22.y.git;a=co
mmit;h=0e22438a5adfdf32b3bb1c75c81c01a29fba9770

I'm presuming that maybie eventually it might hit a input max barrier
(like a max of 1024) and then the kernel would be crashing?

Help very much appreciated!

Regards,

Vincent Fortier
Informatique
Environnement Canada




kern.log.bz2
Description: kern.log.bz2


BUG: usb Mouse continual disconnect (2.6.22.10) leading to eventual crash?

2007-11-07 Thread Fortier,Vincent [Montreal]
Hi all,

I'm having a preblem with a few workstations that are hard freezing from
time to time since about a week.  Curiously since then we upgraded from
2.6.20.6 to 2.6.20.10...

While investigating I found out that there seems to be a problem,
occuring randomly on some workstations after a certain amount time, that
the mouse gets disconnected continously (see full log attached):
Nov  7 00:09:36 mammatus kernel: [22055.357104] usb 2-1: USB disconnect,
address 2
Nov  7 00:09:37 mammatus kernel: [22055.596974] usb 2-1: new low speed
USB device using uhci_hcd and address 3
Nov  7 00:09:37 mammatus kernel: [22055.777831] usb 2-1: configuration
#1 chosen from 1 choice
Nov  7 00:09:37 mammatus kernel: [22055.793905] input: Logitech USB-PS/2
Optical Mouse as /class/input/input5
Nov  7 00:09:37 mammatus kernel: [22055.793994] input: USB HID v1.10
Mouse [Logitech USB-PS/2 Optical Mouse] on usb-:00:1d.0-1
Nov  7 00:10:11 mammatus kernel: [22090.368722] usb 2-1: USB disconnect,
address 3
Nov  7 00:10:12 mammatus kernel: [22090.608588] usb 2-1: new low speed
USB device using uhci_hcd and address 4
Nov  7 00:10:12 mammatus kernel: [22090.784671] usb 2-1: configuration
#1 chosen from 1 choice
Nov  7 00:10:12 mammatus kernel: [22090.800881] input: Logitech USB-PS/2
Optical Mouse as /class/input/input6
Nov  7 00:10:12 mammatus kernel: [22090.800982] input: USB HID v1.10
Mouse [Logitech USB-PS/2 Optical Mouse] on usb-:00:1d.0-1
[.]
Nov  7 17:57:28 mammatus kernel: [86097.543960] input: Logitech USB-PS/2
Optical Mouse as /class/input/input596
Nov  7 17:57:28 mammatus kernel: [86097.544050] input: USB HID v1.10
Mouse [Logitech USB-PS/2 Optical Mouse] on usb-:00:1d.0-1
Nov  7 17:59:07 mammatus kernel: [86196.161272] usb 2-1: USB disconnect,
address 112
Nov  7 17:59:07 mammatus kernel: [86196.401142] usb 2-1: new low speed
USB device using uhci_hcd and address 113
Nov  7 17:59:08 mammatus kernel: [86196.577674] usb 2-1: configuration
#1 chosen from 1 choice
Nov  7 17:59:08 mammatus kernel: [86196.593834] input: Logitech USB-PS/2
Optical Mouse as /class/input/input597
Nov  7 17:59:08 mammatus kernel: [86196.593939] input: USB HID v1.10
Mouse [Logitech USB-PS/2 Optical Mouse] on usb-:00:1d.0-1
Nov  7 17:59:13 mammatus kernel: [86202.206449] usb 2-1: USB disconnect,
address 113
Nov  7 17:59:13 mammatus kernel: [86202.446319] usb 2-1: new low speed
USB device using uhci_hcd and address 114
Nov  7 17:59:14 mammatus kernel: [86202.623309] usb 2-1: configuration
#1 chosen from 1 choice
Nov  7 17:59:14 mammatus kernel: [86202.638402] input: Logitech USB-PS/2
Optical Mouse as /class/input/input598
Nov  7 17:59:14 mammatus kernel: [86202.638493] input: USB HID v1.10
Mouse [Logitech USB-PS/2 Optical Mouse] on usb-:00:1d.0-1

There are some occasionnal connect/disconnect due to the fact that we
recently bought an ergonomic mouse (Evoluent Vertical Mouse 3 Rev. 2)
that some people plug-in the workstation they are working for the
duration of their day/night shift... but nothing like 600 disconnect /
day / workstations?  And actually, the source workstation of this log as
not yet got the mouse connected to it since it's last reboot.

Looking at the changes between 2.6.20.6 and 2.6.20.10 I only found this
USB related:
http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.22.y.git;a=co
mmit;h=0e22438a5adfdf32b3bb1c75c81c01a29fba9770

I'm presuming that maybie eventually it might hit a input max barrier
(like a max of 1024) and then the kernel would be crashing?

Help very much appreciated!

Regards,

Vincent Fortier
Informatique
Environnement Canada




kern.log.bz2
Description: kern.log.bz2


RE: BUG: usb Mouse continual disconnect (2.6.22.10) leading to eventual crash?

2007-11-07 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : Andrew Morton [mailto:[EMAIL PROTECTED] 
 Envoyé : 7 novembre 2007 15:33
 
 (+linux-usb-devel)
 
  On Wed, 7 Nov 2007 13:19:47 -0500 Fortier,Vincent 
 [Montreal] [EMAIL PROTECTED] wrote:
  
  I'm having a preblem with a few workstations that are hard freezing 
  from time to time since about a week.  Curiously since then we 
  upgraded from
  2.6.20.6 to 2.6.20.10...
  

[.]

  
  Looking at the changes between 2.6.20.6 and 2.6.20.10 I only found 
  this USB related:
  
 http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.22.y.git;a=commit;h=0e22438a5adfdf32b3bb1c75c81c01a29fba9770
  
  I'm presuming that maybie eventually it might hit a input max barrier 
  (like a max of 1024) and then the kernel would be crashing?
  
  Help very much appreciated!
 
 People are going to want to know if this is still happening 
 in 2.6.23 or 2.6.24-rc2, please.

I andrew,

(logged-in at wrok to make proper reply)

It might be tricky to test properly on 2.6.23.1 since theses are all 
operationnal aviation forecaster workstations 24/7  And sadly the problem 
only occur on a few of theses workstations AFAIK randomly ???  Actually many of 
them did not had any problems using the exact same FAI reinstallation using the 
same hardware..?!?!.

If someone thinks that the problem might effectively be the usb patch that I 
mentionned before (or any other patch that might make suddently thoses stations 
to freeze dur to a change that occur between 2.6.20.6 and .10) I would then be 
able to push a new kernel on thoses workstations without the offending 
patch(es) and have a clear answer within a week.

I also need to mention that our 2.6.20.6 build contained CFS v20.5 and the new 
2.6.20.10 build contains CFS v22 although I doubt it is the cause of the 
freezing problem but hey, never know.  Although I could probably rebuild and 
push a new 2.6.20.10 without CFS to be sure.

I'm also currently following the problem occurring on a station and I hope it 
will freeze at an evident input entry.  Here where it is now:
Nov  7 22:04:44 mammatus kernel: [100926.387652] usb 2-1: USB disconnect, 
address 118
Nov  7 22:04:44 mammatus kernel: [100926.627502] usb 2-1: new low speed USB 
device using uhci_hcd and address 119
Nov  7 22:04:45 mammatus kernel: [100926.804006] usb 2-1: configuration #1 
chosen from 1 choice
Nov  7 22:04:45 mammatus kernel: [100926.820213] input: Logitech USB-PS/2 
Optical Mouse as /class/input/input729
Nov  7 22:04:45 mammatus kernel: [100926.820304] input: USB HID v1.10 Mouse 
[Logitech USB-PS/2 Optical Mouse] on usb-:00:1d.0-1
Nov  7 22:11:55 mammatus kernel: [101356.846746] usb 2-1: USB disconnect, 
address 119
Nov  7 22:11:56 mammatus kernel: [101357.590408] usb 2-1: new low speed USB 
device using uhci_hcd and address 120
Nov  7 22:11:56 mammatus kernel: [101357.766609] usb 2-1: configuration #1 
chosen from 1 choice
Nov  7 22:11:56 mammatus kernel: [101357.782803] input: Logitech USB-PS/2 
Optical Mouse as /class/input/input730
Nov  7 22:11:56 mammatus kernel: [101357.782897] input: USB HID v1.10 Mouse 
[Logitech USB-PS/2 Optical Mouse] on usb-:00:1d.0-1

vin

-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch/backport] CFS scheduler, -v22, for v2.6.23-rc8, v2.6.22.8,v2.6.21.7, v2.6.20.20

2007-10-26 Thread Fortier,Vincent [Montreal]
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Ingo Molnar
> Envoyé : 26 septembre 2007 07:14
> 
> By popular demand, here is release -v22 of the CFS scheduler. 
> It is a full backport of the latest & greatest 
> sched-devel.git code to v2.6.23-rc8, v2.6.22.8, v2.6.21.7 and 
> v2.6.20.20. The patches can be downloaded from the usual place:
> 
> http://people.redhat.com/mingo/cfs-scheduler/
> 
> This is the first time the development version of the 
> scheduler has been fed back into the stable backport series, 
> so there's many changes since
> v20.5:
> 
>  15 files changed, 1103 insertions(+), 840 deletions(-)
> 
> Even if CFS v20.5 worked well for you, please try this 
> release too, with a good focus on interactivity testing - 
> because, unless some major showstopper is found, this 
> codebase is intended for a v2.6.24 upstream merge.
> 
> ( Even a quick, subjective report of: "checked this patch, it didnt
>   crash and it feels like v20.5" or "laggier than v20.5" or "feels
>   better than v20.5" is useful to us and enables us to judge 
> the general
>   direction of interactivity. )
> 
> The changes in v22 consist of lots of mostly small 
> enhancements, speedups, interactivity improvements, debug 
> enhancements and tidy-ups - many of which can be 
> user-visible. (These enhancements have been contributed by 
> many people - see the changelog below and the git tree for 
> detailed credits.)
> 

Better late than never...

Hi Ingo & cie.,

I tested CFS v22 on a 2.6.20.10 and it is working like a charm.  I used to have 
a few lags once in a while with v20.5 on a 2.6.20.6 kernel but they simply 
vanished with the v22 (although I'm currenctly mostly testing on etch with xorg 
7.1 instead of sarge with xfree 4.3 so this might also have an effect...)

It seems stable enough to put it in operation by next week on all our 
workstations.

This is great work, thx to all CFS contributors.

- vin



-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch/backport] CFS scheduler, -v22, for v2.6.23-rc8, v2.6.22.8,v2.6.21.7, v2.6.20.20

2007-10-26 Thread Fortier,Vincent [Montreal]
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Ingo Molnar
 Envoyé : 26 septembre 2007 07:14
 
 By popular demand, here is release -v22 of the CFS scheduler. 
 It is a full backport of the latest  greatest 
 sched-devel.git code to v2.6.23-rc8, v2.6.22.8, v2.6.21.7 and 
 v2.6.20.20. The patches can be downloaded from the usual place:
 
 http://people.redhat.com/mingo/cfs-scheduler/
 
 This is the first time the development version of the 
 scheduler has been fed back into the stable backport series, 
 so there's many changes since
 v20.5:
 
  15 files changed, 1103 insertions(+), 840 deletions(-)
 
 Even if CFS v20.5 worked well for you, please try this 
 release too, with a good focus on interactivity testing - 
 because, unless some major showstopper is found, this 
 codebase is intended for a v2.6.24 upstream merge.
 
 ( Even a quick, subjective report of: checked this patch, it didnt
   crash and it feels like v20.5 or laggier than v20.5 or feels
   better than v20.5 is useful to us and enables us to judge 
 the general
   direction of interactivity. )
 
 The changes in v22 consist of lots of mostly small 
 enhancements, speedups, interactivity improvements, debug 
 enhancements and tidy-ups - many of which can be 
 user-visible. (These enhancements have been contributed by 
 many people - see the changelog below and the git tree for 
 detailed credits.)
 

Better late than never...

Hi Ingo  cie.,

I tested CFS v22 on a 2.6.20.10 and it is working like a charm.  I used to have 
a few lags once in a while with v20.5 on a 2.6.20.6 kernel but they simply 
vanished with the v22 (although I'm currenctly mostly testing on etch with xorg 
7.1 instead of sarge with xfree 4.3 so this might also have an effect...)

It seems stable enough to put it in operation by next week on all our 
workstations.

This is great work, thx to all CFS contributors.

- vin



-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: Patch for Apani Nortel VPN Client to build against kernel 2.6.22 help/review

2007-09-11 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Nick Bowler
> 
> I haven't read too much into the patch, but some quick 
> comments.  Most applies
> throughout:
> 
> > +#if (LINUX_VERSION_CODE >= 0x020616)
> KERNEL_VERSION(2,6,22) is much more readable than 0x020616.
> 
> > +  return (struct iphdr*) skb->network_header;
> Should be return ip_hdr(skb);
> 
> > +skb->network_header = skb->data;
> Should be skb_reset_network_header(skb);
> 
> > +iph = skb->network_header.iph;
> Probably meant skb->nh.iph.
> 
> > +  if ( iph->protocol == 17 )/* if UDP, */
> (snip)
> > +  if ( iph->protocol == 6 ) /* if TCP, */
> Could use IPPROTO_UDP and IPPROTO_TCP instead of 17 and 6, 
> respectively.
> 
> Instead of littering the code with #if blah #else blah, you 
> could also simply provide implementations for the 2.6.22 
> functions #if LINUX_VERSION_CODE < KERNEL_VERSION(2,6,22).
> 

(Forwarding info by Samuel to lkml...)

Thanks for your comments, here is a new patch (bowler.patch) reflecting the 
changes you suggested. Unfortunately, and unsurprisingly for such cosmetic 
changes, the code still hangs.

I have added a great many debugging statements in order to pinpoint the problem 
(backtrace.patch); surprisingly, my custom backtraces didn't show up in the 
console, but a kernel call trace did (call_trace).

Here is the contents of the offending function. The {enter,exit}_func() calls 
are my debugging statements, and I have verified that only the else branch was 
compiled. As you can see, there are only function calls here, no memory 
manipulation nor dereferencing at all. What could possibly fail in this 
function? Shouldn`t the backtrace include one of the subcalls?

  void nl_spin_lock_irqsave(spinlock_t *lockptr, DWORD *flagptr)
  {
  enter_func(__LINE__);
  #if (LINUX_VERSION_CODE < KERNEL_VERSION(2,6,7))
  assert(lockptr);
  assert(flagptr);
  spin_lock_irqsave((spinlock_t *)lockptr, (*flagptr));
  #else
  spin_lock_irq((spinlock_t *)lockptr);
  #endif
  exit_func(__LINE__);
  }

- Samuel Gélineau


call_trace
Description: call_trace


backtrace.patch
Description: backtrace.patch


bowler.patch
Description: bowler.patch


RE: Patch for Apani Nortel VPN Client to build against kernel 2.6.22 help/review

2007-09-11 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Nick Bowler
 
 I haven't read too much into the patch, but some quick 
 comments.  Most applies
 throughout:
 
  +#if (LINUX_VERSION_CODE = 0x020616)
 KERNEL_VERSION(2,6,22) is much more readable than 0x020616.
 
  +  return (struct iphdr*) skb-network_header;
 Should be return ip_hdr(skb);
 
  +skb-network_header = skb-data;
 Should be skb_reset_network_header(skb);
 
  +iph = skb-network_header.iph;
 Probably meant skb-nh.iph.
 
  +  if ( iph-protocol == 17 )/* if UDP, */
 (snip)
  +  if ( iph-protocol == 6 ) /* if TCP, */
 Could use IPPROTO_UDP and IPPROTO_TCP instead of 17 and 6, 
 respectively.
 
 Instead of littering the code with #if blah #else blah, you 
 could also simply provide implementations for the 2.6.22 
 functions #if LINUX_VERSION_CODE  KERNEL_VERSION(2,6,22).
 

(Forwarding info by Samuel to lkml...)

Thanks for your comments, here is a new patch (bowler.patch) reflecting the 
changes you suggested. Unfortunately, and unsurprisingly for such cosmetic 
changes, the code still hangs.

I have added a great many debugging statements in order to pinpoint the problem 
(backtrace.patch); surprisingly, my custom backtraces didn't show up in the 
console, but a kernel call trace did (call_trace).

Here is the contents of the offending function. The {enter,exit}_func() calls 
are my debugging statements, and I have verified that only the else branch was 
compiled. As you can see, there are only function calls here, no memory 
manipulation nor dereferencing at all. What could possibly fail in this 
function? Shouldn`t the backtrace include one of the subcalls?

  void nl_spin_lock_irqsave(spinlock_t *lockptr, DWORD *flagptr)
  {
  enter_func(__LINE__);
  #if (LINUX_VERSION_CODE  KERNEL_VERSION(2,6,7))
  assert(lockptr);
  assert(flagptr);
  spin_lock_irqsave((spinlock_t *)lockptr, (*flagptr));
  #else
  spin_lock_irq((spinlock_t *)lockptr);
  #endif
  exit_func(__LINE__);
  }

- Samuel Gélineau


call_trace
Description: call_trace


backtrace.patch
Description: backtrace.patch


bowler.patch
Description: bowler.patch


RE: [question] IPC queue filling-up problem?

2007-09-05 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Eric Dumazet
> 
> On Wed, 5 Sep 2007 09:37:50 -0400
> "Fortier,Vincent [Montreal]" <[EMAIL PROTECTED]> wrote:
> 
> > 
> > Hi all,
> > 
> > We are testing new hardware and planning a switch from our old
redhat
> > 7.3 to a Debian Etch 4.0 for our radar forecast analysis systems.
We 
> > found out that our main IPC dispatcher software module would use
100% 
> > of a CPU all the time and that the IPC queues would fill up quickly
on a
> > 2.6 kernel.  We first tought that it would be a problem of 
> > compatibility between a 2.4 vs 2.6 IPC calls vs our radar analysis 
> > software but after a lot of work we have been able to test a 2.4 
> > kernel on that same hardware and got the exact same problem.
> > 
> > So curiously, on our actual systems (see SYSTEM 1 below) our IPC 
> > dispatcher module works like a charm and queues gets near 0.  On our

> > test system which are way more powerfull systems (see SYSTEM 2) our 
> > IPC dispatcher module queue fills up rapidly (depending of the
msgmnb 
> > queue size it will wimply take a bit longer to fill).
> > 
> > We have tested both our already compiled binaries from rh73 using
gcc
> > 2.9 and a recompiled version of the modules on a debian sarge system

> > and got the exact same problem on either a Debian Sarge 3.1 (running
a 
> > 2.4 or 2.6 kernel) and on a Etch 64bit system (using 32bit compat 
> > layer) with a 2.6 kernel.  In all cases the queues would simply
fill-up.
> > 
> > After strac'ing the module I noticed that the time needed to handle 
> > the signal & ipc calls are way lower on the new system hence I don't

> > see why the dispatcher queue does fill-up like that?!?!?!
> > 
> > Does anyone experienced something similar?  Could this be a kernel 
> > issue vs material, kernel option?  Might this be related to libc?
> > 
> > Help / Clues very much appreciated.
> 
> Hi Vincent
> 
> top shows that something is eating cpu cycles in User mode on 
> your new platform, while old platform consumes cycles both in 
> User and System land.
> 
> This might be related to some programing error, maybe some 
> spinlock in user mode or bad multi-threading synchronization, 
> or scheduling assumptions, that break because of the quad 
> core cpus of your new machine.

Actually, could this be worth trying (adding Ingo in CC):
http://lkml.org/lkml/2007/9/5/75

> So the thread that is supposed to consume IPC messages is not 
> scheduled in time, because CPU starves. (beware the four 
> cores of each CPU compete for ressources)
> 
> You could issue "ps auxm" to check which threads are spining 
> in User mode and try to trace them ?

Effectively in this specific test case I had 5 stuck process using each
100% of a CPU... although 3 other cores where still available so there
is (I believe) no reason why it should had starved that much.

Anyhow, that did not happend on all the other testing I made during the
past 3 weeks (except this one).

I restarted the this test (again using a 2.4.35.1 kernel), mde sure no
process where stuck :), and grabbed the ps aux + ipcs -q info (attached)

Again, the ipcs -q shows that the queue is getting full comparing to
SYSTEM 1 which always has a queue of 0.

Note: Also attached a top.txt file showing that the dispatcher uses 100%
of a CPU on SYSTEM 2.  This never occurs on SYSTEM 1.

> Eric

PS, thnx for replying.

- vin

> > SYSTEM INFORMATION:
> > 
> > SYSTEM 1:
> > -
> > HPDL580 G2
> > Quad Intel Xeon 1.90GHz
> > 4G ram
> > DRBD disks on dual-gigabit adapter
> > OS: RedHat 7.3 / kernel: 2.4.33 / libc: 2.2.5 / gcc 2.96
> > 
> > SYSTEM 2:
> > -
> > Dell PE2950
> > Dual Intel Quad-Core 2.66GHz
> > 16G ram
> > local 300G 15000 RPM SCSI.
> > OS1: Debian Etch 4.0 / kernels 2.6.18 -> 2.6.22 / libc 2.3.6 / gcc
4.1.2
> > OS2: Debian Sarge 3.1 / kernels 2.4.35, 2.6.18 -> 2.6.22 / libc
2.3.2 / gcc 3.3.5

-- Message Queues 
keymsqid  owner  perms  used-bytes   messages
0x41018018 0  urp66060784748 
0x41018024 32769  urp66000   
0x4101800b 65538  urp66000   
0x41018011 98307  urp66000   
0x4101800e 131076 urp66000   
0x4101803f 163845 urp66000   
0x41018019 196614 urp66000   
0x41018026 229383 urp6600 

[question] IPC queue filling-up problem?

2007-09-05 Thread Fortier,Vincent [Montreal]

Hi all,

We are testing new hardware and planning a switch from our old redhat
7.3 to a Debian Etch 4.0 for our radar forecast analysis systems.  We
found out that our main IPC dispatcher software module would use 100% of
a CPU all the time and that the IPC queues would fill up quickly on a
2.6 kernel.  We first tought that it would be a problem of compatibility
between a 2.4 vs 2.6 IPC calls vs our radar analysis software but after
a lot of work we have been able to test a 2.4 kernel on that same
hardware and got the exact same problem.

So curiously, on our actual systems (see SYSTEM 1 below) our IPC
dispatcher module works like a charm and queues gets near 0.  On our
test system which are way more powerfull systems (see SYSTEM 2) our IPC
dispatcher module queue fills up rapidly (depending of the msgmnb queue
size it will wimply take a bit longer to fill).

We have tested both our already compiled binaries from rh73 using gcc
2.9 and a recompiled version of the modules on a debian sarge system and
got the exact same problem on either a Debian Sarge 3.1 (running a 2.4
or 2.6 kernel) and on a Etch 64bit system (using 32bit compat layer)
with a 2.6 kernel.  In all cases the queues would simply fill-up.

After strac'ing the module I noticed that the time needed to handle the
signal & ipc calls are way lower on the new system hence I don't see why
the dispatcher queue does fill-up like that?!?!?!

Does anyone experienced something similar?  Could this be a kernel issue
vs material, kernel option?  Might this be related to libc?

Help / Clues very much appreciated.

Thnx

- vin


Debian Sarge 3.1 kernel 2.4.35 notes:
-
Pre-built 2.4.35.1 kernel + backported mpt fusion & megasas drivers for
Debian Sarge 3.1 available at:
http://linux-dev.qc.ec.gc.ca/kernel/debian/sarge/i386/2.4.35/
Megaraid SAS backport patch for a 2.4.35.1 kernel available at:
http://linux-dev.qc.ec.gc.ca/kernel/patches/megaraid_sas-linux_2.4.35.1-
v00.00.03.09.patch
2.4.35.1 config file:
http://linux-dev.qc.ec.gc.ca/kernel/debian/CONFIG-i686-2.4.35.1-008


RedHat 7.3 kernel 2.4.33 notes:
---
Pre-built 2.4.33 kernel for RH73:
http://linux-dev.qc.ec.gc.ca/kernel/redhat/rh73/
2.4.33 config file:
http://linux-dev.qc.ec.gc.ca/kernel/redhat/rh73/config-2.4.33-01.rh73.en
vcanbigmem


SYSTEM INFORMATION:

SYSTEM 1:
-
HPDL580 G2
Quad Intel Xeon 1.90GHz
4G ram
DRBD disks on dual-gigabit adapter
OS: RedHat 7.3 / kernel: 2.4.33 / libc: 2.2.5 / gcc 2.96

SYSTEM 2:
-
Dell PE2950
Dual Intel Quad-Core 2.66GHz
16G ram
local 300G 15000 RPM SCSI.
OS1: Debian Etch 4.0 / kernels 2.6.18 -> 2.6.22 / libc 2.3.6 / gcc 4.1.2
OS2: Debian Sarge 3.1 / kernels 2.4.35, 2.6.18 -> 2.6.22 / libc 2.3.2 /
gcc 3.3.5



=
SYSTEM 1
=

top
---
 12:29pm  up 147 days, 15:47,  4 users,  load average: 3.00, 1.65, 1.56
229 processes: 227 sleeping, 2 running, 0 zombie, 0 stopped
CPU0 states: 48.0% user, 28.0% system,  0.0% nice, 23.0% idle
CPU1 states:  7.0% user,  3.0% system,  0.0% nice, 88.0% idle
CPU2 states: 71.0% user, 13.0% system,  0.0% nice, 15.0% idle
CPU3 states:  6.0% user,  7.0% system,  0.0% nice, 85.0% idle
CPU4 states: 35.0% user,  3.0% system,  0.0% nice, 60.0% idle
CPU5 states: 18.0% user,  3.0% system, 15.0% nice, 77.0% idle
CPU6 states: 11.0% user, 19.0% system,  0.0% nice, 68.0% idle
CPU7 states: 15.0% user,  9.0% system,  0.0% nice, 75.0% idle
Mem:  5952612K av, 5772232K used,  180380K free,   0K shrd,  145816K
buff
Swap: 2097112K av,7560K used, 2089552K free 4820588K
cached

  PID USER PRI  NI  SIZE  RSS SHARE STAT %CPU %MEM   TIME COMMAND
29308 urp   14   5 10920  10M   496 S N  15.0  0.1   0:46
URPDispatcher


ipcs -q
---
[EMAIL PROTECTED] ipcs -q

-- Message Queues 
keymsqid  owner  perms  used-bytes   messages
0x41008050 1130594304 urp66000
0x370041d6 1132494849 urp66000
0x140041d8 1132396546 urp66000
0xfa0041d6 1133084675 urp66000
...


strace -ss -tt -T -v -e trace=ipc
-
11:51:45.238031 msgsnd(1104543768, {1, ""...}, 91, IPC_NOWAIT) = 0
<0.29>
11:51:45.260221 msgrcv(1100054529, {1, ""...}, 2565, 0, 0) = 82
<0.92>
11:51:45.280938 msgsnd(1103790090, {1, ""...}, 82, IPC_NOWAIT) = 0
<0.34>
11:51:45.368730 msgsnd(1104543768, {1, ""...}, 82, IPC_NOWAIT) = 0
<0.023403>
11:51:45.414185 msgrcv(1100054529, {1, ""...}, 2565, 0, 0) = 84
<0.20>
11:51:45.483523 msgrcv(1100054529, {1, ""...}, 2565, 0, 0) = 84
<0.42>
11:51:45.543332 msgrcv(1100054529, {1, ""...}, 2565, 0, 0) = 89
<0.27>
11:51:45.578211 msgsnd(1104543768, {1, ""...}, 89, IPC_NOWAIT) = 0
<0.57>
11:51:45.592104 msgrcv(1100054529, {1, ""...}, 2565, 0, 0) = 91
<0.32>
11:51:45.596233 msgsnd(1103790090, {1, ""...}, 91, IPC_NOWAIT) = 0

[question] IPC queue filling-up problem?

2007-09-05 Thread Fortier,Vincent [Montreal]

Hi all,

We are testing new hardware and planning a switch from our old redhat
7.3 to a Debian Etch 4.0 for our radar forecast analysis systems.  We
found out that our main IPC dispatcher software module would use 100% of
a CPU all the time and that the IPC queues would fill up quickly on a
2.6 kernel.  We first tought that it would be a problem of compatibility
between a 2.4 vs 2.6 IPC calls vs our radar analysis software but after
a lot of work we have been able to test a 2.4 kernel on that same
hardware and got the exact same problem.

So curiously, on our actual systems (see SYSTEM 1 below) our IPC
dispatcher module works like a charm and queues gets near 0.  On our
test system which are way more powerfull systems (see SYSTEM 2) our IPC
dispatcher module queue fills up rapidly (depending of the msgmnb queue
size it will wimply take a bit longer to fill).

We have tested both our already compiled binaries from rh73 using gcc
2.9 and a recompiled version of the modules on a debian sarge system and
got the exact same problem on either a Debian Sarge 3.1 (running a 2.4
or 2.6 kernel) and on a Etch 64bit system (using 32bit compat layer)
with a 2.6 kernel.  In all cases the queues would simply fill-up.

After strac'ing the module I noticed that the time needed to handle the
signal  ipc calls are way lower on the new system hence I don't see why
the dispatcher queue does fill-up like that?!?!?!

Does anyone experienced something similar?  Could this be a kernel issue
vs material, kernel option?  Might this be related to libc?

Help / Clues very much appreciated.

Thnx

- vin


Debian Sarge 3.1 kernel 2.4.35 notes:
-
Pre-built 2.4.35.1 kernel + backported mpt fusion  megasas drivers for
Debian Sarge 3.1 available at:
http://linux-dev.qc.ec.gc.ca/kernel/debian/sarge/i386/2.4.35/
Megaraid SAS backport patch for a 2.4.35.1 kernel available at:
http://linux-dev.qc.ec.gc.ca/kernel/patches/megaraid_sas-linux_2.4.35.1-
v00.00.03.09.patch
2.4.35.1 config file:
http://linux-dev.qc.ec.gc.ca/kernel/debian/CONFIG-i686-2.4.35.1-008


RedHat 7.3 kernel 2.4.33 notes:
---
Pre-built 2.4.33 kernel for RH73:
http://linux-dev.qc.ec.gc.ca/kernel/redhat/rh73/
2.4.33 config file:
http://linux-dev.qc.ec.gc.ca/kernel/redhat/rh73/config-2.4.33-01.rh73.en
vcanbigmem


SYSTEM INFORMATION:

SYSTEM 1:
-
HPDL580 G2
Quad Intel Xeon 1.90GHz
4G ram
DRBD disks on dual-gigabit adapter
OS: RedHat 7.3 / kernel: 2.4.33 / libc: 2.2.5 / gcc 2.96

SYSTEM 2:
-
Dell PE2950
Dual Intel Quad-Core 2.66GHz
16G ram
local 300G 15000 RPM SCSI.
OS1: Debian Etch 4.0 / kernels 2.6.18 - 2.6.22 / libc 2.3.6 / gcc 4.1.2
OS2: Debian Sarge 3.1 / kernels 2.4.35, 2.6.18 - 2.6.22 / libc 2.3.2 /
gcc 3.3.5



=
SYSTEM 1
=

top
---
 12:29pm  up 147 days, 15:47,  4 users,  load average: 3.00, 1.65, 1.56
229 processes: 227 sleeping, 2 running, 0 zombie, 0 stopped
CPU0 states: 48.0% user, 28.0% system,  0.0% nice, 23.0% idle
CPU1 states:  7.0% user,  3.0% system,  0.0% nice, 88.0% idle
CPU2 states: 71.0% user, 13.0% system,  0.0% nice, 15.0% idle
CPU3 states:  6.0% user,  7.0% system,  0.0% nice, 85.0% idle
CPU4 states: 35.0% user,  3.0% system,  0.0% nice, 60.0% idle
CPU5 states: 18.0% user,  3.0% system, 15.0% nice, 77.0% idle
CPU6 states: 11.0% user, 19.0% system,  0.0% nice, 68.0% idle
CPU7 states: 15.0% user,  9.0% system,  0.0% nice, 75.0% idle
Mem:  5952612K av, 5772232K used,  180380K free,   0K shrd,  145816K
buff
Swap: 2097112K av,7560K used, 2089552K free 4820588K
cached

  PID USER PRI  NI  SIZE  RSS SHARE STAT %CPU %MEM   TIME COMMAND
29308 urp   14   5 10920  10M   496 S N  15.0  0.1   0:46
URPDispatcher


ipcs -q
---
[EMAIL PROTECTED] ipcs -q

-- Message Queues 
keymsqid  owner  perms  used-bytes   messages
0x41008050 1130594304 urp66000
0x370041d6 1132494849 urp66000
0x140041d8 1132396546 urp66000
0xfa0041d6 1133084675 urp66000
...


strace -ss -tt -T -v -e trace=ipc
-
11:51:45.238031 msgsnd(1104543768, {1, ...}, 91, IPC_NOWAIT) = 0
0.29
11:51:45.260221 msgrcv(1100054529, {1, ...}, 2565, 0, 0) = 82
0.92
11:51:45.280938 msgsnd(1103790090, {1, ...}, 82, IPC_NOWAIT) = 0
0.34
11:51:45.368730 msgsnd(1104543768, {1, ...}, 82, IPC_NOWAIT) = 0
0.023403
11:51:45.414185 msgrcv(1100054529, {1, ...}, 2565, 0, 0) = 84
0.20
11:51:45.483523 msgrcv(1100054529, {1, ...}, 2565, 0, 0) = 84
0.42
11:51:45.543332 msgrcv(1100054529, {1, ...}, 2565, 0, 0) = 89
0.27
11:51:45.578211 msgsnd(1104543768, {1, ...}, 89, IPC_NOWAIT) = 0
0.57
11:51:45.592104 msgrcv(1100054529, {1, ...}, 2565, 0, 0) = 91
0.32
11:51:45.596233 msgsnd(1103790090, {1, ...}, 91, IPC_NOWAIT) = 0
0.37
11:51:45.660060 msgrcv(1100054529, {1, 

RE: [question] IPC queue filling-up problem?

2007-09-05 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Eric Dumazet
 
 On Wed, 5 Sep 2007 09:37:50 -0400
 Fortier,Vincent [Montreal] [EMAIL PROTECTED] wrote:
 
  
  Hi all,
  
  We are testing new hardware and planning a switch from our old
redhat
  7.3 to a Debian Etch 4.0 for our radar forecast analysis systems.
We 
  found out that our main IPC dispatcher software module would use
100% 
  of a CPU all the time and that the IPC queues would fill up quickly
on a
  2.6 kernel.  We first tought that it would be a problem of 
  compatibility between a 2.4 vs 2.6 IPC calls vs our radar analysis 
  software but after a lot of work we have been able to test a 2.4 
  kernel on that same hardware and got the exact same problem.
  
  So curiously, on our actual systems (see SYSTEM 1 below) our IPC 
  dispatcher module works like a charm and queues gets near 0.  On our

  test system which are way more powerfull systems (see SYSTEM 2) our 
  IPC dispatcher module queue fills up rapidly (depending of the
msgmnb 
  queue size it will wimply take a bit longer to fill).
  
  We have tested both our already compiled binaries from rh73 using
gcc
  2.9 and a recompiled version of the modules on a debian sarge system

  and got the exact same problem on either a Debian Sarge 3.1 (running
a 
  2.4 or 2.6 kernel) and on a Etch 64bit system (using 32bit compat 
  layer) with a 2.6 kernel.  In all cases the queues would simply
fill-up.
  
  After strac'ing the module I noticed that the time needed to handle 
  the signal  ipc calls are way lower on the new system hence I don't

  see why the dispatcher queue does fill-up like that?!?!?!
  
  Does anyone experienced something similar?  Could this be a kernel 
  issue vs material, kernel option?  Might this be related to libc?
  
  Help / Clues very much appreciated.
 
 Hi Vincent
 
 top shows that something is eating cpu cycles in User mode on 
 your new platform, while old platform consumes cycles both in 
 User and System land.
 
 This might be related to some programing error, maybe some 
 spinlock in user mode or bad multi-threading synchronization, 
 or scheduling assumptions, that break because of the quad 
 core cpus of your new machine.

Actually, could this be worth trying (adding Ingo in CC):
http://lkml.org/lkml/2007/9/5/75

 So the thread that is supposed to consume IPC messages is not 
 scheduled in time, because CPU starves. (beware the four 
 cores of each CPU compete for ressources)
 
 You could issue ps auxm to check which threads are spining 
 in User mode and try to trace them ?

Effectively in this specific test case I had 5 stuck process using each
100% of a CPU... although 3 other cores where still available so there
is (I believe) no reason why it should had starved that much.

Anyhow, that did not happend on all the other testing I made during the
past 3 weeks (except this one).

I restarted the this test (again using a 2.4.35.1 kernel), mde sure no
process where stuck :), and grabbed the ps aux + ipcs -q info (attached)

Again, the ipcs -q shows that the queue is getting full comparing to
SYSTEM 1 which always has a queue of 0.

Note: Also attached a top.txt file showing that the dispatcher uses 100%
of a CPU on SYSTEM 2.  This never occurs on SYSTEM 1.

 Eric

PS, thnx for replying.

- vin

  SYSTEM INFORMATION:
  
  SYSTEM 1:
  -
  HPDL580 G2
  Quad Intel Xeon 1.90GHz
  4G ram
  DRBD disks on dual-gigabit adapter
  OS: RedHat 7.3 / kernel: 2.4.33 / libc: 2.2.5 / gcc 2.96
  
  SYSTEM 2:
  -
  Dell PE2950
  Dual Intel Quad-Core 2.66GHz
  16G ram
  local 300G 15000 RPM SCSI.
  OS1: Debian Etch 4.0 / kernels 2.6.18 - 2.6.22 / libc 2.3.6 / gcc
4.1.2
  OS2: Debian Sarge 3.1 / kernels 2.4.35, 2.6.18 - 2.6.22 / libc
2.3.2 / gcc 3.3.5

-- Message Queues 
keymsqid  owner  perms  used-bytes   messages
0x41018018 0  urp66060784748 
0x41018024 32769  urp66000   
0x4101800b 65538  urp66000   
0x41018011 98307  urp66000   
0x4101800e 131076 urp66000   
0x4101803f 163845 urp66000   
0x41018019 196614 urp66000   
0x41018026 229383 urp66000   
0x4101802f 262152 urp66000   
0x4101803a 294921 urp66000   
0x41018036 327690 urp66000   
0x4101802b 360459 urp66000   
0x41018040 393228 urp66000   
0x41018027 425997 urp66000   
0x41018028 458766 urp66000   
0x41018020 491535 urp660

Patch for Apani Nortel VPN Client to build against kernel 2.6.22 help/review

2007-08-22 Thread Fortier,Vincent [Montreal]
Hi all,

Sadly I have to use the Apani/Nortel VPN linux client to connect to my office.  
The latest version (3.5) stopped being compatible with the latest stable 2.6.22 
kernel (how suprising :().

Since
1) I'm stuck using that closed source/proprietary software
2) there will most probably not be a new release from apani until year 2013
3) support from apani is near zero
we got used with previous version (aka 3.3) to hack the linux_wrapper.c file to 
try to keep the module compatible 
(http://www.fedoraforum.org/forum/showpost.php?p=742619=26)

Based on the horrors that showed up while compiling against a 2.6.22 kernel 
(see below), I was wondering if somebody would be kind enough to review our 
not-yet-working patch that converts dev_base/skb calls to the new API.  A 
collegue of mine made all the required changes but when the netlock service 
gets fired-up the machine hang.

Are there any ways to debug this black box?

Help very much appreciated.

--

Here is the not-yet-working patch against the linux_wrapper.c file:

--- cvc_linux-rh-gcc3-3.5.orig/src/linux_wrapper.c  2007-02-07 
19:10:41.0 +
+++ cvc_linux-rh-gcc3-3.5/src/linux_wrapper.c   2007-08-21 17:42:30.0 
+
@@ -136,6 +136,8 @@ int nl_open(struct inode *inode, struct 
 /* some necessary globals... */
 #if ((LINUX_VERSION_CODE >= 0x020200) && (LINUX_VERSION_CODE < 0x020300))
 typedef struct device net_device_t;
+#elif (LINUX_VERSION_CODE >= 0x020616)
+typedef struct list_head net_device_t;
 #else
 typedef struct net_device net_device_t;
 #endif
@@ -306,8 +308,11 @@ nl_copy_to_user(void *to, const void *fr
 
 void init_misc(void)
 {
-
+#if (LINUX_VERSION_CODE >= 0x020616)
+nl_dev_base = _base_head;
+#else
 nl_dev_base = dev_base;
+#endif
 }
 
 #if ((LINUX_VERSION_CODE >= 0x020200) && (LINUX_VERSION_CODE < 0x020300))
@@ -430,8 +435,13 @@ void nl_skb_put(struct sk_buff *skb, int
 int nl_ip_rcv(struct sk_buff *skb, struct packet_type *pt)
 {
 #if (LINUX_VERSION_CODE > 0x02060c)
+#if (LINUX_VERSION_CODE >= 0x020616)
+struct net_device *dev = skb->dev;
+struct iphdr *iph = (struct iphdr*) skb->network_header;
+#else
 struct net_device *dev = skb->dev;
 struct iphdr *iph = skb->nh.iph;
+#endif
 
 if (skb->dst == NULL)
 {
@@ -469,28 +479,52 @@ int nl_unregister_netdevice_notifier(str
 
 char *dev_name (net_device_t *dev)
 {
+#if (LINUX_VERSION_CODE >= 0x020616)
+  return (net_device_entry(dev)->name);
+#else
   return (dev->name);
+#endif
 }
 
 int dev_name_len (net_device_t *dev)
 {
+#if (LINUX_VERSION_CODE >= 0x020616)
+  return (strlen(net_device_entry(dev)->name));
+#else
   return (strlen(dev->name));
+#endif
 }
 unsigned *dev_mtu_ptr (net_device_t *dev)
 {
-  return (>mtu);
+#if (LINUX_VERSION_CODE >= 0x020616)
+  return &(net_device_entry(dev)->mtu);
+#else
+  return &(dev->mtu);
+#endif
 }
 unsigned dev_mtu (net_device_t *dev)
 {
+#if (LINUX_VERSION_CODE >= 0x020616)
+  return (net_device_entry(dev)->mtu);
+#else
   return (dev->mtu);
+#endif
 }
 int dev_ifindex (net_device_t *dev)
 {
+#if (LINUX_VERSION_CODE >= 0x020616)
+  return (net_device_entry(dev)->ifindex);
+#else
   return (dev->ifindex);
+#endif
 }
 void *dev_ip_ptr (net_device_t *dev)
 {
+#if (LINUX_VERSION_CODE >= 0x020616)
+  return (net_device_entry(dev)->ip_ptr);
+#else
   return (dev->ip_ptr);
+#endif
 }
 net_device_t *dev_next (net_device_t *dev)
 {
@@ -546,9 +580,15 @@ void nl_skb_dup (struct sk_buff *new_skb
 head_offset = new_skb->head - skb->head;
 new_skb->dev = skb->dev;
 new_skb->dst = dst_clone(skb->dst);
+#if (LINUX_VERSION_CODE >= 0x020616)
+new_skb->transport_header = skb->transport_header+head_offset;
+new_skb->network_header = skb->network_header+head_offset;
+new_skb->mac_header = skb->mac_header+head_offset;
+#else
 new_skb->h.raw = skb->h.raw+head_offset;
 new_skb->nh.raw = skb->nh.raw+head_offset;
 new_skb->mac.raw = skb->mac.raw+head_offset;
+#endif
 memcpy(new_skb->cb, skb->cb, sizeof(skb->cb));
 new_skb->priority = skb->priority;
 new_skb->protocol = skb->protocol;
@@ -573,9 +613,15 @@ void nl_skb_hdr_copy (struct sk_buff *sk
 {
   skb_to->dev = skb_from->dev;
   skb_to->dst = skb_from->dst;
+#if (LINUX_VERSION_CODE >= 0x020616)
+  skb_to->transport_header = skb_from->transport_header;
+  skb_to->network_header = skb_from->network_header;
+  skb_to->mac_header = skb_from->mac_header;
+#else
   skb_to->h.raw = skb_from->h.raw;
   skb_to->nh.raw = skb_from->nh.raw;
   skb_to->mac.raw = skb_from->mac.raw;
+#endif
   memcpy(skb_to->cb, skb_from->cb, sizeof(skb_from->cb));
   skb_to->priority = skb_from->priority;
   skb_to->protocol = skb_from->protocol;
@@ -592,7 +638,11 @@ void nl_skb_hdr_copy (struct sk_buff *sk
 
 struct iphdr * nl_skb_iph (struct sk_buff *skb)
 {
+#if (LINUX_VERSION_CODE >= 0x020616)
+  return (struct iphdr*) skb->network_header;
+#else
   return skb->nh.iph;
+#endif
 }
 
 net_device_t * nl_skb_dev 

Patch for Apani Nortel VPN Client to build against kernel 2.6.22 help/review

2007-08-22 Thread Fortier,Vincent [Montreal]
Hi all,

Sadly I have to use the Apani/Nortel VPN linux client to connect to my office.  
The latest version (3.5) stopped being compatible with the latest stable 2.6.22 
kernel (how suprising :().

Since
1) I'm stuck using that closed source/proprietary software
2) there will most probably not be a new release from apani until year 2013
3) support from apani is near zero
we got used with previous version (aka 3.3) to hack the linux_wrapper.c file to 
try to keep the module compatible 
(http://www.fedoraforum.org/forum/showpost.php?p=742619postcount=26)

Based on the horrors that showed up while compiling against a 2.6.22 kernel 
(see below), I was wondering if somebody would be kind enough to review our 
not-yet-working patch that converts dev_base/skb calls to the new API.  A 
collegue of mine made all the required changes but when the netlock service 
gets fired-up the machine hang.

Are there any ways to debug this black box?

Help very much appreciated.

--

Here is the not-yet-working patch against the linux_wrapper.c file:

--- cvc_linux-rh-gcc3-3.5.orig/src/linux_wrapper.c  2007-02-07 
19:10:41.0 +
+++ cvc_linux-rh-gcc3-3.5/src/linux_wrapper.c   2007-08-21 17:42:30.0 
+
@@ -136,6 +136,8 @@ int nl_open(struct inode *inode, struct 
 /* some necessary globals... */
 #if ((LINUX_VERSION_CODE = 0x020200)  (LINUX_VERSION_CODE  0x020300))
 typedef struct device net_device_t;
+#elif (LINUX_VERSION_CODE = 0x020616)
+typedef struct list_head net_device_t;
 #else
 typedef struct net_device net_device_t;
 #endif
@@ -306,8 +308,11 @@ nl_copy_to_user(void *to, const void *fr
 
 void init_misc(void)
 {
-
+#if (LINUX_VERSION_CODE = 0x020616)
+nl_dev_base = dev_base_head;
+#else
 nl_dev_base = dev_base;
+#endif
 }
 
 #if ((LINUX_VERSION_CODE = 0x020200)  (LINUX_VERSION_CODE  0x020300))
@@ -430,8 +435,13 @@ void nl_skb_put(struct sk_buff *skb, int
 int nl_ip_rcv(struct sk_buff *skb, struct packet_type *pt)
 {
 #if (LINUX_VERSION_CODE  0x02060c)
+#if (LINUX_VERSION_CODE = 0x020616)
+struct net_device *dev = skb-dev;
+struct iphdr *iph = (struct iphdr*) skb-network_header;
+#else
 struct net_device *dev = skb-dev;
 struct iphdr *iph = skb-nh.iph;
+#endif
 
 if (skb-dst == NULL)
 {
@@ -469,28 +479,52 @@ int nl_unregister_netdevice_notifier(str
 
 char *dev_name (net_device_t *dev)
 {
+#if (LINUX_VERSION_CODE = 0x020616)
+  return (net_device_entry(dev)-name);
+#else
   return (dev-name);
+#endif
 }
 
 int dev_name_len (net_device_t *dev)
 {
+#if (LINUX_VERSION_CODE = 0x020616)
+  return (strlen(net_device_entry(dev)-name));
+#else
   return (strlen(dev-name));
+#endif
 }
 unsigned *dev_mtu_ptr (net_device_t *dev)
 {
-  return (dev-mtu);
+#if (LINUX_VERSION_CODE = 0x020616)
+  return (net_device_entry(dev)-mtu);
+#else
+  return (dev-mtu);
+#endif
 }
 unsigned dev_mtu (net_device_t *dev)
 {
+#if (LINUX_VERSION_CODE = 0x020616)
+  return (net_device_entry(dev)-mtu);
+#else
   return (dev-mtu);
+#endif
 }
 int dev_ifindex (net_device_t *dev)
 {
+#if (LINUX_VERSION_CODE = 0x020616)
+  return (net_device_entry(dev)-ifindex);
+#else
   return (dev-ifindex);
+#endif
 }
 void *dev_ip_ptr (net_device_t *dev)
 {
+#if (LINUX_VERSION_CODE = 0x020616)
+  return (net_device_entry(dev)-ip_ptr);
+#else
   return (dev-ip_ptr);
+#endif
 }
 net_device_t *dev_next (net_device_t *dev)
 {
@@ -546,9 +580,15 @@ void nl_skb_dup (struct sk_buff *new_skb
 head_offset = new_skb-head - skb-head;
 new_skb-dev = skb-dev;
 new_skb-dst = dst_clone(skb-dst);
+#if (LINUX_VERSION_CODE = 0x020616)
+new_skb-transport_header = skb-transport_header+head_offset;
+new_skb-network_header = skb-network_header+head_offset;
+new_skb-mac_header = skb-mac_header+head_offset;
+#else
 new_skb-h.raw = skb-h.raw+head_offset;
 new_skb-nh.raw = skb-nh.raw+head_offset;
 new_skb-mac.raw = skb-mac.raw+head_offset;
+#endif
 memcpy(new_skb-cb, skb-cb, sizeof(skb-cb));
 new_skb-priority = skb-priority;
 new_skb-protocol = skb-protocol;
@@ -573,9 +613,15 @@ void nl_skb_hdr_copy (struct sk_buff *sk
 {
   skb_to-dev = skb_from-dev;
   skb_to-dst = skb_from-dst;
+#if (LINUX_VERSION_CODE = 0x020616)
+  skb_to-transport_header = skb_from-transport_header;
+  skb_to-network_header = skb_from-network_header;
+  skb_to-mac_header = skb_from-mac_header;
+#else
   skb_to-h.raw = skb_from-h.raw;
   skb_to-nh.raw = skb_from-nh.raw;
   skb_to-mac.raw = skb_from-mac.raw;
+#endif
   memcpy(skb_to-cb, skb_from-cb, sizeof(skb_from-cb));
   skb_to-priority = skb_from-priority;
   skb_to-protocol = skb_from-protocol;
@@ -592,7 +638,11 @@ void nl_skb_hdr_copy (struct sk_buff *sk
 
 struct iphdr * nl_skb_iph (struct sk_buff *skb)
 {
+#if (LINUX_VERSION_CODE = 0x020616)
+  return (struct iphdr*) skb-network_header;
+#else
   return skb-nh.iph;
+#endif
 }
 
 net_device_t * nl_skb_dev (struct sk_buff *skb)
@@ -613,10 +663,14 @@ void nl_send_skb (struct sk_buff 

RE: 2.6.21.5 june 30th to july 1st date hang?

2007-07-03 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Chuck Ebbert
> Envoyé : 3 juillet 2007 17:03
> 
> On 07/03/2007 03:28 PM, Chris Friesen wrote:
> > Arne Georg Gleditsch wrote:
> > 
> >> An interesting exercise might be to
> >> code up a small program to call adjtimex with timex.status |= 
> >> STA_INS, to see if this can trigger the problem.
> > 
> > Setting the date to just before midnight June 30 UTC and then running 
> > the following as root triggered the crash on a modified 2.6.10.  
> > Anyone see anything wrong with the code below, or is this a valid 
> > indication of a bug in the leap second code?
> > 
> 
> Fixed:
> http://git.kernel.org/git/?p=linux/kernel/git/torvalds/linux-2
> .6.git;a=commitdiff;h=746976a301ac9c9aa10d7d42454f8d6cdad8ff2b

Thanx a lot!  This was fast! (beat that closed source!)

- vin

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.21.5 june 30th to july 1st date hang?

2007-07-03 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de 
> Chris Friesen
> 
> Fortier,Vincent [Montreal] wrote:
> > Hi all,
> > 
> > All my servers and workstations running a 2.6.21.5 kernel hanged 
> > exactly when the date shift from june 30th to july 1st.
> 
> Interesting, I just sent out an email for a similar issue, 
> but with a pair of 2.6.10 machines.
> 
> I'm wondering if its related to a spurious leap second event...
> 

Just wondering, what is your distribution?

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.21.5 june 30th to july 1st date hang?

2007-07-03 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de 
> Arne Georg Gleditsch
> 
> Florian Attenberger <[EMAIL PROTECTED]> writes:
> > yep, controlled by ntpd.
> > You're right according to
> > ftp://hpiers.obspm.fr/iers/bul/bulc/bulletinc.33
> > that event shouldn't have been there.
> 
> I'm not all that versed in ntp-ish, but it appears that the 
> leap second insertion should be propagated through the ntp protocol.
> Whether the leap second in question came from a ntp server 
> giving out wrong data or from a misinterpretation or bug in 
> ntpd is of course hard to say, but either way turning the 
> clock back is unlikely to reconstruct the circumstances.  An 
> interesting exercise might be to code up a small program to 
> call adjtimex with timex.status |= STA_INS, to see if this 
> can trigger the problem.  (The bogus leap second might be a 
> red herring entirely, of course...)

You are probably right, I did tried to reproduce the problem without
success...

Although it is wierd that it happend only on 2.6.21 kernels... It did
not happend on any of my workstations/servers running either 2.6.18 or
2.6.20.  

Could dynticks be involved?

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.21.5 june 30th to july 1st date hang?

2007-07-03 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : Clemens Koller [mailto:[EMAIL PROTECTED] 
> Envoyé : 3 juillet 2007 09:05
> 
> Hi, Vincent!
> 
> Fortier,Vincent [Montreal] schrieb:
> > Hi all,
> > 
> > All my servers and workstations running a 2.6.21.5 kernel hanged 
> > exactly when the date shift from june 30th to july 1st.
> > 
> > I could not get anything on any of the 20+ consoles...  All the 
> > systems hanged at around the exact same time... When the date shifted 
> > from June 30th to July 1st in UTC ...?
> > 
> > Any clue any one?
> 
> No problems over here with plain 2.6.21.5 on x686
> 
> You could just reset the date back on one of these machines 
> and check the transition again... and see if it was really 
> the kernel who crashed and check your cron configuration.

I tried reverting to 23:50 June 30th and it did not hanged when switching to 
July 1st.  I've tried it a few times without any problems.  So I deactivated 
all NTP related jobs, switched the date back to June 30th 23:10 and reboot the 
system.  Wait and see.

> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Uli Luckas
> 
> Same thing here on two machines with plain vanilla 
> 2.6.21.(3/4), on debian testing & debian unstable.
> 

I am also using Debian... But Sarge 3.1.  There might be a relation there 
unless somebody comes up having the same problem using another dist.  At least 
it's not CFS related.

> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de 
> Arne Georg Gleditsch
> 
> Florian Attenberger <[EMAIL PROTECTED]> writes:
> > there was one 'special' event at that date:
> > syslog.2.gz:Jul  1 01:59:59 master kernel: Clock: inserting leap 
> > second 23:59:60 UTC
> 
> As far as I can tell, no leap second was due to be inserted 
> at 1. of July this year.  Is the year set correctly for this box?
> 

All my server/workstations are in sync using ntp... And yes, the year is set 
properly on all of them.

Regards,

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.21.5 june 30th to july 1st date hang?

2007-07-03 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de 
> Fortier,Vincent [Montreal]
> Envoyé : 3 juillet 2007 08:44
> 
> Hi all,
> 
> All my servers and workstations running a 2.6.21.5 kernel 
> hanged exactly when the date shift from june 30th to july 1st.
> 
> On my monitoring system every single station running a 
> 2.6.21.5 kernel stoped responding exactly after midnight on 
> the date shift from June 30th to July 1st.  Although, 
> stations still running 2.6.18 to 2.6.20.11 worked flawlessly.
> 
> I first tought there had been an electricity outage but two 
> of my servers (dell PE 2950 dual-quad core) on UPS in our 
> server room also
> hanged:
> Jun 30 23:55:01 urpdev1 /USR/SBIN/CRON[31298]: (root) CMD ([ -x
> /usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" 
> ; [ "$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1; }) 
> Jul  3 11:54:03 urpdev1 syslogd 1.4.1#17: restart.
> 
> I could not get anything on any of the 20+ consoles...  All 
> the systems hanged at around the exact same time... When the 
> date shifted from June 30th to July 1st in UTC ...?
> 
> Any clue any one?

Forgot to mention:

- All stations that failed where running a 2.6.21 kernel + CFS v18 (I don't 
have any stations running a plain 2.6.21 kernel so can't tell)
- Config file can be found at: 
http://linux-dev.qc.ec.gc.ca/kernel/debian/CONFIG-i686-2.6.21-005
- kernels can be found at: 
http://linux-dev.qc.ec.gc.ca/kernel/debian/sarge/i686/2.6.21/

> 
> - vin
> 
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


2.6.21.5 june 30th to july 1st date hang?

2007-07-03 Thread Fortier,Vincent [Montreal]
Hi all,

All my servers and workstations running a 2.6.21.5 kernel hanged exactly
when the date shift from june 30th to july 1st.

On my monitoring system every single station running a 2.6.21.5 kernel
stoped responding exactly after midnight on the date shift from June
30th to July 1st.  Although, stations still running 2.6.18 to 2.6.20.11
worked flawlessly.

I first tought there had been an electricity outage but two of my
servers (dell PE 2950 dual-quad core) on UPS in our server room also
hanged:
Jun 30 23:55:01 urpdev1 /USR/SBIN/CRON[31298]: (root) CMD ([ -x
/usr/lib/sysstat/sa1 ] && { [ -r "$DEFAULT" ] && . "$DEFAULT" ; [
"$ENABLED" = "true" ] && exec /usr/lib/sysstat/sa1; })
Jul  3 11:54:03 urpdev1 syslogd 1.4.1#17: restart.

I could not get anything on any of the 20+ consoles...  All the systems
hanged at around the exact same time... When the date shifted from June
30th to July 1st in UTC ...?

Any clue any one?

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


2.6.21.5 june 30th to july 1st date hang?

2007-07-03 Thread Fortier,Vincent [Montreal]
Hi all,

All my servers and workstations running a 2.6.21.5 kernel hanged exactly
when the date shift from june 30th to july 1st.

On my monitoring system every single station running a 2.6.21.5 kernel
stoped responding exactly after midnight on the date shift from June
30th to July 1st.  Although, stations still running 2.6.18 to 2.6.20.11
worked flawlessly.

I first tought there had been an electricity outage but two of my
servers (dell PE 2950 dual-quad core) on UPS in our server room also
hanged:
Jun 30 23:55:01 urpdev1 /USR/SBIN/CRON[31298]: (root) CMD ([ -x
/usr/lib/sysstat/sa1 ]  { [ -r $DEFAULT ]  . $DEFAULT ; [
$ENABLED = true ]  exec /usr/lib/sysstat/sa1; })
Jul  3 11:54:03 urpdev1 syslogd 1.4.1#17: restart.

I could not get anything on any of the 20+ consoles...  All the systems
hanged at around the exact same time... When the date shifted from June
30th to July 1st in UTC ...?

Any clue any one?

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.21.5 june 30th to july 1st date hang?

2007-07-03 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de 
 Fortier,Vincent [Montreal]
 Envoyé : 3 juillet 2007 08:44
 
 Hi all,
 
 All my servers and workstations running a 2.6.21.5 kernel 
 hanged exactly when the date shift from june 30th to july 1st.
 
 On my monitoring system every single station running a 
 2.6.21.5 kernel stoped responding exactly after midnight on 
 the date shift from June 30th to July 1st.  Although, 
 stations still running 2.6.18 to 2.6.20.11 worked flawlessly.
 
 I first tought there had been an electricity outage but two 
 of my servers (dell PE 2950 dual-quad core) on UPS in our 
 server room also
 hanged:
 Jun 30 23:55:01 urpdev1 /USR/SBIN/CRON[31298]: (root) CMD ([ -x
 /usr/lib/sysstat/sa1 ]  { [ -r $DEFAULT ]  . $DEFAULT 
 ; [ $ENABLED = true ]  exec /usr/lib/sysstat/sa1; }) 
 Jul  3 11:54:03 urpdev1 syslogd 1.4.1#17: restart.
 
 I could not get anything on any of the 20+ consoles...  All 
 the systems hanged at around the exact same time... When the 
 date shifted from June 30th to July 1st in UTC ...?
 
 Any clue any one?

Forgot to mention:

- All stations that failed where running a 2.6.21 kernel + CFS v18 (I don't 
have any stations running a plain 2.6.21 kernel so can't tell)
- Config file can be found at: 
http://linux-dev.qc.ec.gc.ca/kernel/debian/CONFIG-i686-2.6.21-005
- kernels can be found at: 
http://linux-dev.qc.ec.gc.ca/kernel/debian/sarge/i686/2.6.21/

 
 - vin
 
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.21.5 june 30th to july 1st date hang?

2007-07-03 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : Clemens Koller [mailto:[EMAIL PROTECTED] 
 Envoyé : 3 juillet 2007 09:05
 
 Hi, Vincent!
 
 Fortier,Vincent [Montreal] schrieb:
  Hi all,
  
  All my servers and workstations running a 2.6.21.5 kernel hanged 
  exactly when the date shift from june 30th to july 1st.
  
  I could not get anything on any of the 20+ consoles...  All the 
  systems hanged at around the exact same time... When the date shifted 
  from June 30th to July 1st in UTC ...?
  
  Any clue any one?
 
 No problems over here with plain 2.6.21.5 on x686
 
 You could just reset the date back on one of these machines 
 and check the transition again... and see if it was really 
 the kernel who crashed and check your cron configuration.

I tried reverting to 23:50 June 30th and it did not hanged when switching to 
July 1st.  I've tried it a few times without any problems.  So I deactivated 
all NTP related jobs, switched the date back to June 30th 23:10 and reboot the 
system.  Wait and see.

 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Uli Luckas
 
 Same thing here on two machines with plain vanilla 
 2.6.21.(3/4), on debian testing  debian unstable.
 

I am also using Debian... But Sarge 3.1.  There might be a relation there 
unless somebody comes up having the same problem using another dist.  At least 
it's not CFS related.

 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de 
 Arne Georg Gleditsch
 
 Florian Attenberger [EMAIL PROTECTED] writes:
  there was one 'special' event at that date:
  syslog.2.gz:Jul  1 01:59:59 master kernel: Clock: inserting leap 
  second 23:59:60 UTC
 
 As far as I can tell, no leap second was due to be inserted 
 at 1. of July this year.  Is the year set correctly for this box?
 

All my server/workstations are in sync using ntp... And yes, the year is set 
properly on all of them.

Regards,

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.21.5 june 30th to july 1st date hang?

2007-07-03 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de 
 Arne Georg Gleditsch
 
 Florian Attenberger [EMAIL PROTECTED] writes:
  yep, controlled by ntpd.
  You're right according to
  ftp://hpiers.obspm.fr/iers/bul/bulc/bulletinc.33
  that event shouldn't have been there.
 
 I'm not all that versed in ntp-ish, but it appears that the 
 leap second insertion should be propagated through the ntp protocol.
 Whether the leap second in question came from a ntp server 
 giving out wrong data or from a misinterpretation or bug in 
 ntpd is of course hard to say, but either way turning the 
 clock back is unlikely to reconstruct the circumstances.  An 
 interesting exercise might be to code up a small program to 
 call adjtimex with timex.status |= STA_INS, to see if this 
 can trigger the problem.  (The bogus leap second might be a 
 red herring entirely, of course...)

You are probably right, I did tried to reproduce the problem without
success...

Although it is wierd that it happend only on 2.6.21 kernels... It did
not happend on any of my workstations/servers running either 2.6.18 or
2.6.20.  

Could dynticks be involved?

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.21.5 june 30th to july 1st date hang?

2007-07-03 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de 
 Chris Friesen
 
 Fortier,Vincent [Montreal] wrote:
  Hi all,
  
  All my servers and workstations running a 2.6.21.5 kernel hanged 
  exactly when the date shift from june 30th to july 1st.
 
 Interesting, I just sent out an email for a similar issue, 
 but with a pair of 2.6.10 machines.
 
 I'm wondering if its related to a spurious leap second event...
 

Just wondering, what is your distribution?

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.21.5 june 30th to july 1st date hang?

2007-07-03 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Chuck Ebbert
 Envoyé : 3 juillet 2007 17:03
 
 On 07/03/2007 03:28 PM, Chris Friesen wrote:
  Arne Georg Gleditsch wrote:
  
  An interesting exercise might be to
  code up a small program to call adjtimex with timex.status |= 
  STA_INS, to see if this can trigger the problem.
  
  Setting the date to just before midnight June 30 UTC and then running 
  the following as root triggered the crash on a modified 2.6.10.  
  Anyone see anything wrong with the code below, or is this a valid 
  indication of a bug in the leap second code?
  
 
 Fixed:
 http://git.kernel.org/git/?p=linux/kernel/git/torvalds/linux-2
 .6.git;a=commitdiff;h=746976a301ac9c9aa10d7d42454f8d6cdad8ff2b

Thanx a lot!  This was fast! (beat that closed source!)

- vin

-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch] CFS scheduler, -v18

2007-06-26 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Ingo Molnar
> Envoyé : 22 juin 2007 18:02
> 
> i'm pleased to announce release -v18 of the CFS scheduler patchset.
> 
> The rolled-up CFS patch against today's -git kernel, 
> v2.6.22-rc5, v2.6.22-rc4-mm2, v2.6.21.5 or v2.6.20.14 can be 
> downloaded from the usual place:
> 
>http://people.redhat.com/mingo/cfs-scheduler/
> 
> ...

Pre-built kernels for Fedora 7 & Fedora Core 6 (yum reporsitory available) and 
Debian Sarge/Etch using latest CFS v18 available at http://linux-dev.qc.ec.gc.ca

> 
> As usual, any sort of feedback, bugreport, fix and suggestion 
> is more than welcome!
>
>   Ingo

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch] CFS scheduler, -v18

2007-06-26 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Ingo Molnar
 Envoyé : 22 juin 2007 18:02
 
 i'm pleased to announce release -v18 of the CFS scheduler patchset.
 
 The rolled-up CFS patch against today's -git kernel, 
 v2.6.22-rc5, v2.6.22-rc4-mm2, v2.6.21.5 or v2.6.20.14 can be 
 downloaded from the usual place:
 
http://people.redhat.com/mingo/cfs-scheduler/
 
 ...

Pre-built kernels for Fedora 7  Fedora Core 6 (yum reporsitory available) and 
Debian Sarge/Etch using latest CFS v18 available at http://linux-dev.qc.ec.gc.ca

 
 As usual, any sort of feedback, bugreport, fix and suggestion 
 is more than welcome!

   Ingo

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: Broadcom (bnx2) on PE1950/2950 failure

2007-06-22 Thread Fortier,Vincent [Montreal]
> Huh? eth0 and eth1 are the e1000 adapters, eth3 (and 
> presumably eth2) are the bnx2 adapters..

Got it... Almost...

There was a confusion between
/etc/udev/rules.d/z25_persistent-net.rules, the fact that bnx2 was in
the initrd image but not e1000.

So I edited /etc/udev/rules.d/z25_persistent-net.rules to set the order
I wanted using directly the PCI ID as provided by the lspci -vvv:
[EMAIL PROTECTED] /etc]# lspci -vvv | grep Ethernet
05:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708
Gigabit Ethernet (rev 12)
09:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708
Gigabit Ethernet (rev 12)
0e:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet
Controller (rev 06)
0e:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet
Controller (rev 06)


[EMAIL PROTECTED] rules.d]# cat /etc/udev/rules.d/z25_persistent-net.rules
# This file was automatically generated by the /lib/udev/write_net_rules
# program, probably run by the persistent-net-generator.rules rules
file.
#
# You can modify it, as long as you keep each rule on a single line.
# MAC addresses must be written in lowercase.

SUBSYSTEM=="net", DRIVERS=="?*", KERNELS==":05:00.0", NAME="eth0"
SUBSYSTEM=="net", DRIVERS=="?*", KERNELS==":09:00.0", NAME="eth1"
SUBSYSTEM=="net", DRIVERS=="?*", KERNELS==":0e:00.0", NAME="eth2"
SUBSYSTEM=="net", DRIVERS=="?*", KERNELS==":0e:00.1", NAME="eth3"


And recreated the initrd image after having set which module not to
forget... And it's order:
[EMAIL PROTECTED] /etc]# cat /etc/initramfs-tools/modules
# List of modules that you want to include in your initramfs.
#
# Syntax:  module_name [args ...]
#
# You must run update-initramfs(8) to effect this change.
#
# Examples:
#
# raid1
# sd_mod
bnx2
e1000


Here is the output of the dmesg with now the appropriate order:
[EMAIL PROTECTED] /etc]# dmesg | grep -i eth
[  120.685696] Broadcom NetXtreme II Gigabit Ethernet Driver bnx2
v1.5.8.1 (May 7, 2007)
[  120.703846] eth0: Broadcom NetXtreme II BCM5708 1000Base-T (B2) PCI-X
64-bit 133MHz found at mem f800, IRQ 17, node addr 0019b9c8eedc
[  120.735350] eth1: Broadcom NetXtreme II BCM5708 1000Base-T (B2) PCI-X
64-bit 133MHz found at mem f400, IRQ 16, node addr 0019b9c8eede
[  120.921403] e1000: eth2: e1000_probe: Intel(R) PRO/1000 Network
Connection
[  121.069025] e1000: eth3: e1000_probe: Intel(R) PRO/1000 Network
Connection
[  267.720475] bnx2: eth0: using MSI
[  270.832235] bnx2: eth0 NIC Link is Up, 1000 Mbps full duplex, receive
& transmit flow control ON


But still, there is a problem with the second Broadcom port (eth1):
[EMAIL PROTECTED] /etc]# mii-tool
eth0: negotiated 100baseTx-FD flow-control, link ok
SIOCGMIIPHY on 'eth1' failed: Resource temporarily unavailable
eth2: negotiated 100baseTx-FD flow-control, link ok
eth3: negotiated 100baseTx-FD flow-control, link ok

I don't get it?

Help appreciated.

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: Broadcom (bnx2) on PE1950/2950 failure

2007-06-22 Thread Fortier,Vincent [Montreal]
 Huh? eth0 and eth1 are the e1000 adapters, eth3 (and 
 presumably eth2) are the bnx2 adapters..

Got it... Almost...

There was a confusion between
/etc/udev/rules.d/z25_persistent-net.rules, the fact that bnx2 was in
the initrd image but not e1000.

So I edited /etc/udev/rules.d/z25_persistent-net.rules to set the order
I wanted using directly the PCI ID as provided by the lspci -vvv:
[EMAIL PROTECTED] /etc]# lspci -vvv | grep Ethernet
05:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708
Gigabit Ethernet (rev 12)
09:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708
Gigabit Ethernet (rev 12)
0e:00.0 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet
Controller (rev 06)
0e:00.1 Ethernet controller: Intel Corporation 82571EB Gigabit Ethernet
Controller (rev 06)


[EMAIL PROTECTED] rules.d]# cat /etc/udev/rules.d/z25_persistent-net.rules
# This file was automatically generated by the /lib/udev/write_net_rules
# program, probably run by the persistent-net-generator.rules rules
file.
#
# You can modify it, as long as you keep each rule on a single line.
# MAC addresses must be written in lowercase.

SUBSYSTEM==net, DRIVERS==?*, KERNELS==:05:00.0, NAME=eth0
SUBSYSTEM==net, DRIVERS==?*, KERNELS==:09:00.0, NAME=eth1
SUBSYSTEM==net, DRIVERS==?*, KERNELS==:0e:00.0, NAME=eth2
SUBSYSTEM==net, DRIVERS==?*, KERNELS==:0e:00.1, NAME=eth3


And recreated the initrd image after having set which module not to
forget... And it's order:
[EMAIL PROTECTED] /etc]# cat /etc/initramfs-tools/modules
# List of modules that you want to include in your initramfs.
#
# Syntax:  module_name [args ...]
#
# You must run update-initramfs(8) to effect this change.
#
# Examples:
#
# raid1
# sd_mod
bnx2
e1000


Here is the output of the dmesg with now the appropriate order:
[EMAIL PROTECTED] /etc]# dmesg | grep -i eth
[  120.685696] Broadcom NetXtreme II Gigabit Ethernet Driver bnx2
v1.5.8.1 (May 7, 2007)
[  120.703846] eth0: Broadcom NetXtreme II BCM5708 1000Base-T (B2) PCI-X
64-bit 133MHz found at mem f800, IRQ 17, node addr 0019b9c8eedc
[  120.735350] eth1: Broadcom NetXtreme II BCM5708 1000Base-T (B2) PCI-X
64-bit 133MHz found at mem f400, IRQ 16, node addr 0019b9c8eede
[  120.921403] e1000: eth2: e1000_probe: Intel(R) PRO/1000 Network
Connection
[  121.069025] e1000: eth3: e1000_probe: Intel(R) PRO/1000 Network
Connection
[  267.720475] bnx2: eth0: using MSI
[  270.832235] bnx2: eth0 NIC Link is Up, 1000 Mbps full duplex, receive
 transmit flow control ON


But still, there is a problem with the second Broadcom port (eth1):
[EMAIL PROTECTED] /etc]# mii-tool
eth0: negotiated 100baseTx-FD flow-control, link ok
SIOCGMIIPHY on 'eth1' failed: Resource temporarily unavailable
eth2: negotiated 100baseTx-FD flow-control, link ok
eth3: negotiated 100baseTx-FD flow-control, link ok

I don't get it?

Help appreciated.

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Broadcom (bnx2) on PE1950/2950 failure

2007-06-21 Thread Fortier,Vincent [Montreal]

When trying manually to get the on-board broadcom adapter working I get
this:
Jun 21 16:50:47 urpdev1 kernel: [  184.528092] Broadcom NetXtreme II
Gigabit Ethernet Driver bnx2 v1.4.45 (September 29, 2006)
Jun 21 16:50:47 urpdev1 kernel: [  184.539831] ACPI: PCI Interrupt
:05:00.0[A] -> GSI 16 (level, low) -> IRQ 17
Jun 21 16:50:48 urpdev1 kernel: [  184.551805] eth0: Broadcom NetXtreme
II BCM5708 1000Base-T (B2) PCI-X 64-bit 133MHz found at mem f800,
IRQ 17, node ad dr 0019b9c8eedc
Jun 21 16:50:48 urpdev1 kernel: [  184.576672] ACPI: PCI Interrupt
:09:00.0[A] -> GSI 17 (level, low) -> IRQ 16
Jun 21 16:50:48 urpdev1 kernel: [  184.590359] eth0: Broadcom NetXtreme
II BCM5708 1000Base-T (B2) PCI-X 64-bit 133MHz found at mem f400,
IRQ 16, node ad dr 0019b9c8eede

Jun 21 16:50:52 urpdev1 dhclient: Internet Systems Consortium DHCP
Client V3.0.1
Jun 21 16:50:52 urpdev1 dhclient: Copyright 2004 Internet Systems
Consortium.
Jun 21 16:50:52 urpdev1 dhclient: All rights reserved.
Jun 21 16:50:52 urpdev1 dhclient: For info, please visit
http://www.isc.org/products/DHCP
Jun 21 16:50:52 urpdev1 dhclient:
Jun 21 16:50:53 urpdev1 dhclient: Bind socket to interface: No such
device

When I invoke "modprobe bnx2" it looks like both adapter fails to
initialise properly reason I think both are trying to bind on eth0
instead of eth0 & eth1.

I've removed the TOE jumper (has read on a gentoo list) with no effect
(except that it really is disabled in the BIOS now).

I've also tried loading the driver using disable_msi=1 option without
any effect.

The problem occurs on all tested kernels (2.6.18.8, 2.6.19.7, 2.6.20.14
& 2.6.21.5)

Here is part of the dmesg:
[EMAIL PROTECTED] /root]# dmesg | grep -i eth
[  119.196375] Broadcom NetXtreme II Gigabit Ethernet Driver bnx2
v1.5.8.1 (May 7, 2007)
[  119.215023] eth0: Broadcom NetXtreme II BCM5708 1000Base-T (B2) PCI-X
64-bit 133MHz found at mem f800, IRQ 17, node addr 0019b9c8eedc
[  119.246853] eth1: Broadcom NetXtreme II BCM5708 1000Base-T (B2) PCI-X
64-bit 133MHz found at mem f400, IRQ 16, node addr 0019b9c8eede
[  119.458598] e1000: eth0: e1000_probe: Intel(R) PRO/1000 Network
Connection
[  119.654095] e1000: eth1: e1000_probe: Intel(R) PRO/1000 Network
Connection
[  266.513144] bnx2: eth3: using MSI
[  269.638237] bnx2: eth3 NIC Link is Up, 1000 Mbps full duplex, receive
& transmit flow control ON
[  290.242533] eth3: no IPv6 routers present

As you can notice it tried to use eth0 & eth1 for both broadcom adapters
but it did not worked properly since e1000 used them instead and finally
bnx2 got eth3 working using MSI ?
Here is the mii-tool output:
[EMAIL PROTECTED] /root]# mii-tool
eth0: no link
eth1: no link
SIOCGMIIPHY on 'eth2' failed: Resource temporarily unavailable
eth3: negotiated 100baseTx-FD flow-control, link ok

So by switching the network config from eth0 to eth3 made the on-board
port 1 working ?

Here is the lspci output:
05:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708
Gigabit Ethernet (rev 12)
Subsystem: Dell Unknown device 01b2
Control: I/O- Mem+ BusMaster+ SpecCycle+ MemWINV+ VGASnoop-
ParErr+ Stepping- SERR+ FastB2B-
Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort-
SERR- TAbort-
SERR- http://linux-dev.qc.ec.gc.ca/kernel/debian/dmesg-2.6.21.5-bnx2-error.txt
Full lspci -vvv:
http://linux-dev.qc.ec.gc.ca/kernel/debian/lspci-2.6.21.5-bnx2-error.txt
Config (2.6.21.5):
http://linux-dev.qc.ec.gc.ca/kernel/debian/CONFIG-i686-2.6.21-005

Help would greatly be appreciated!

- vin


-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Broadcom (bnx2) on PE1950/2950 failure

2007-06-21 Thread Fortier,Vincent [Montreal]

When trying manually to get the on-board broadcom adapter working I get
this:
Jun 21 16:50:47 urpdev1 kernel: [  184.528092] Broadcom NetXtreme II
Gigabit Ethernet Driver bnx2 v1.4.45 (September 29, 2006)
Jun 21 16:50:47 urpdev1 kernel: [  184.539831] ACPI: PCI Interrupt
:05:00.0[A] - GSI 16 (level, low) - IRQ 17
Jun 21 16:50:48 urpdev1 kernel: [  184.551805] eth0: Broadcom NetXtreme
II BCM5708 1000Base-T (B2) PCI-X 64-bit 133MHz found at mem f800,
IRQ 17, node ad dr 0019b9c8eedc
Jun 21 16:50:48 urpdev1 kernel: [  184.576672] ACPI: PCI Interrupt
:09:00.0[A] - GSI 17 (level, low) - IRQ 16
Jun 21 16:50:48 urpdev1 kernel: [  184.590359] eth0: Broadcom NetXtreme
II BCM5708 1000Base-T (B2) PCI-X 64-bit 133MHz found at mem f400,
IRQ 16, node ad dr 0019b9c8eede

Jun 21 16:50:52 urpdev1 dhclient: Internet Systems Consortium DHCP
Client V3.0.1
Jun 21 16:50:52 urpdev1 dhclient: Copyright 2004 Internet Systems
Consortium.
Jun 21 16:50:52 urpdev1 dhclient: All rights reserved.
Jun 21 16:50:52 urpdev1 dhclient: For info, please visit
http://www.isc.org/products/DHCP
Jun 21 16:50:52 urpdev1 dhclient:
Jun 21 16:50:53 urpdev1 dhclient: Bind socket to interface: No such
device

When I invoke modprobe bnx2 it looks like both adapter fails to
initialise properly reason I think both are trying to bind on eth0
instead of eth0  eth1.

I've removed the TOE jumper (has read on a gentoo list) with no effect
(except that it really is disabled in the BIOS now).

I've also tried loading the driver using disable_msi=1 option without
any effect.

The problem occurs on all tested kernels (2.6.18.8, 2.6.19.7, 2.6.20.14
 2.6.21.5)

Here is part of the dmesg:
[EMAIL PROTECTED] /root]# dmesg | grep -i eth
[  119.196375] Broadcom NetXtreme II Gigabit Ethernet Driver bnx2
v1.5.8.1 (May 7, 2007)
[  119.215023] eth0: Broadcom NetXtreme II BCM5708 1000Base-T (B2) PCI-X
64-bit 133MHz found at mem f800, IRQ 17, node addr 0019b9c8eedc
[  119.246853] eth1: Broadcom NetXtreme II BCM5708 1000Base-T (B2) PCI-X
64-bit 133MHz found at mem f400, IRQ 16, node addr 0019b9c8eede
[  119.458598] e1000: eth0: e1000_probe: Intel(R) PRO/1000 Network
Connection
[  119.654095] e1000: eth1: e1000_probe: Intel(R) PRO/1000 Network
Connection
[  266.513144] bnx2: eth3: using MSI
[  269.638237] bnx2: eth3 NIC Link is Up, 1000 Mbps full duplex, receive
 transmit flow control ON
[  290.242533] eth3: no IPv6 routers present

As you can notice it tried to use eth0  eth1 for both broadcom adapters
but it did not worked properly since e1000 used them instead and finally
bnx2 got eth3 working using MSI ?
Here is the mii-tool output:
[EMAIL PROTECTED] /root]# mii-tool
eth0: no link
eth1: no link
SIOCGMIIPHY on 'eth2' failed: Resource temporarily unavailable
eth3: negotiated 100baseTx-FD flow-control, link ok

So by switching the network config from eth0 to eth3 made the on-board
port 1 working ?

Here is the lspci output:
05:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708
Gigabit Ethernet (rev 12)
Subsystem: Dell Unknown device 01b2
Control: I/O- Mem+ BusMaster+ SpecCycle+ MemWINV+ VGASnoop-
ParErr+ Stepping- SERR+ FastB2B-
Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium TAbort-
TAbort- MAbort- SERR- PERR-
Latency: 32 (16000ns min), Cache Line Size: 64 bytes
Interrupt: pin A routed to IRQ 214
Region 0: Memory at f800 (64-bit, non-prefetchable)
[size=32M]
Capabilities: [40] PCI-X non-bridge device
Command: DPERE- ERO- RBC=512 OST=8
Status: Dev=05:00.0 64bit+ 133MHz+ SCD- USC- DC=simple
DMMRBC=512 DMOST=8 DMCRS=32 RSCEM- 266MHz- 533MHz-
Capabilities: [48] Power Management version 2
Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA
PME(D0-,D1-,D2-,D3hot+,D3cold+)
Status: D0 PME-Enable- DSel=0 DScale=1 PME-
Capabilities: [50] Vital Product Data
Capabilities: [58] Message Signalled Interrupts: Mask- 64bit+
Queue=0/0 Enable+
Address: feeff00c  Data: 416a


09:00.0 Ethernet controller: Broadcom Corporation NetXtreme II BCM5708
Gigabit Ethernet (rev 12)
Subsystem: Dell Unknown device 01b2
Control: I/O- Mem+ BusMaster+ SpecCycle+ MemWINV+ VGASnoop-
ParErr+ Stepping- SERR+ FastB2B-
Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium TAbort-
TAbort- MAbort- SERR- PERR-
Latency: 64 (16000ns min), Cache Line Size: 64 bytes
Interrupt: pin A routed to IRQ 16
Region 0: Memory at f400 (64-bit, non-prefetchable)
[size=32M]
Capabilities: [40] PCI-X non-bridge device
Command: DPERE- ERO- RBC=512 OST=8
Status: Dev=09:00.0 64bit+ 133MHz+ SCD- USC- DC=simple
DMMRBC=512 DMOST=8 DMCRS=32 RSCEM- 266MHz- 533MHz-
Capabilities: [48] Power Management version 2
Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA
PME(D0-,D1-,D2-,D3hot+,D3cold+)
  

RE: How to improve the quality of the kernel?

2007-06-18 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : Natalie Protasevich [mailto:[EMAIL PROTECTED] 
> Envoyé : 18 juin 2007 18:56
> 
> On 6/18/07, Martin Bligh <[EMAIL PROTECTED]> wrote:
> > >> > So if you make changes to random-driver.c you can do `git-log 
> > >> > random-driver.c|grep Tested-by" to find people who can test your 
> > >> > changes for you.
> > >>
> > >> You would'nt even need to search in GIT.  Maybie even when ever a 
> > >> patchset is being proposed a mail could be sent to appropriate 
> > >> hardware/or feature pseudo-auto-generated mailing-list?
> > >>
> > >> On lkml I mostly try to follow patches/bugs associated with 
> > >> hardware I use.  Why not try to automate the process and get more 
> > >> testers in?
> > >>
> > >
> > > I think this is an excellent point. One data point could be a field 
> > > in bugzilla to input the hardware information. Simple query can 
> > > select common hardware and platform. So far it's not working when 
> > > hardware is just mentioned in the text part.
> >
> > if it's free text it'll be useless for search ... I suppose we could 
> > do drop-downs for architecture at least? Not sure much beyond that 
> > would work ... *possibly* the common drivers, but I don't think we'd 
> > get enough coverage for it to be of use.
> >
> How about several buckets for model/BIOS version/chipset 
> etc., at least optional, and some will be relevant some not 
> for particular cases. But at least people will make an 
> attempt to collect such data from their system, boards, etc.

How about an easy way to send multiple hardware profiles to your bugzilla user 
account simultaniously linked to an online pciutils database and/or an hardware 
list database similar to overclocking web sites and why not even with a link to 
the git repository when possible?

A some sort of really usefull "send your profile" of RHN that would link the 
driver with the discovered hardware and add you to appropriate mailing lists to 
test patches/help reproducing & solving problems/etc.

In the end plenty of statistics and hardware compatibility list could be made.  
For example, that would make my life easier knowing what level of compatibility 
Linux can offer for old HP9000 K-boxes that we still have running at the office 
and presumably get people to contact to get help?

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: How to improve the quality of the kernel?

2007-06-18 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de 
> Andrew Morton
> 
> On Mon, 18 Jun 2007 01:15:15 +0200 Stefan Richter 
> <[EMAIL PROTECTED]> wrote:
> 
> > Tested-by
> 
> Tested-by would be good too.  Because over time, we will 
> generate a list of people who own the relevant hardware and 
> who are prepared to test changes. 

Why not include a user-space tool that, when invoked, if you agree to
send personnal info, sends your hardware vs driver info to a web
database + your email address (maybie even you .config, etc..) ... In
case of help for testing new patches/finding a bug/etc.. your email
could be used by maintainers to ask for help...

> So if you make changes to random-driver.c you can do `git-log 
> random-driver.c|grep Tested-by" to find people who can test 
> your changes for you.

You would'nt even need to search in GIT.  Maybie even when ever a
patchset is being proposed a mail could be sent to appropriate
hardware/or feature pseudo-auto-generated mailing-list?

On lkml I mostly try to follow patches/bugs associated with hardware I
use.  Why not try to automate the process and get more testers in?

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: How to improve the quality of the kernel?

2007-06-18 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de 
 Andrew Morton
 
 On Mon, 18 Jun 2007 01:15:15 +0200 Stefan Richter 
 [EMAIL PROTECTED] wrote:
 
  Tested-by
 
 Tested-by would be good too.  Because over time, we will 
 generate a list of people who own the relevant hardware and 
 who are prepared to test changes. 

Why not include a user-space tool that, when invoked, if you agree to
send personnal info, sends your hardware vs driver info to a web
database + your email address (maybie even you .config, etc..) ... In
case of help for testing new patches/finding a bug/etc.. your email
could be used by maintainers to ask for help...

 So if you make changes to random-driver.c you can do `git-log 
 random-driver.c|grep Tested-by to find people who can test 
 your changes for you.

You would'nt even need to search in GIT.  Maybie even when ever a
patchset is being proposed a mail could be sent to appropriate
hardware/or feature pseudo-auto-generated mailing-list?

On lkml I mostly try to follow patches/bugs associated with hardware I
use.  Why not try to automate the process and get more testers in?

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: How to improve the quality of the kernel?

2007-06-18 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : Natalie Protasevich [mailto:[EMAIL PROTECTED] 
 Envoyé : 18 juin 2007 18:56
 
 On 6/18/07, Martin Bligh [EMAIL PROTECTED] wrote:
So if you make changes to random-driver.c you can do `git-log 
random-driver.c|grep Tested-by to find people who can test your 
changes for you.
  
   You would'nt even need to search in GIT.  Maybie even when ever a 
   patchset is being proposed a mail could be sent to appropriate 
   hardware/or feature pseudo-auto-generated mailing-list?
  
   On lkml I mostly try to follow patches/bugs associated with 
   hardware I use.  Why not try to automate the process and get more 
   testers in?
  
  
   I think this is an excellent point. One data point could be a field 
   in bugzilla to input the hardware information. Simple query can 
   select common hardware and platform. So far it's not working when 
   hardware is just mentioned in the text part.
 
  if it's free text it'll be useless for search ... I suppose we could 
  do drop-downs for architecture at least? Not sure much beyond that 
  would work ... *possibly* the common drivers, but I don't think we'd 
  get enough coverage for it to be of use.
 
 How about several buckets for model/BIOS version/chipset 
 etc., at least optional, and some will be relevant some not 
 for particular cases. But at least people will make an 
 attempt to collect such data from their system, boards, etc.

How about an easy way to send multiple hardware profiles to your bugzilla user 
account simultaniously linked to an online pciutils database and/or an hardware 
list database similar to overclocking web sites and why not even with a link to 
the git repository when possible?

A some sort of really usefull send your profile of RHN that would link the 
driver with the discovered hardware and add you to appropriate mailing lists to 
test patches/help reproducing  solving problems/etc.

In the end plenty of statistics and hardware compatibility list could be made.  
For example, that would make my life easier knowing what level of compatibility 
Linux can offer for old HP9000 K-boxes that we still have running at the office 
and presumably get people to contact to get help?

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch] CFS scheduler, -v17

2007-06-15 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Ingo Molnar
> Envoyé : 14 juin 2007 18:49
> 
> i'm pleased to announce release -v17 of the CFS scheduler patchset.
>  
> The rolled-up CFS patch against v2.6.22-rc4, v2.6.22-rc4-mm2, 
> v2.6.21.5 or v2.6.20.13 can be downloaded from the usual place:
> 
>http://people.redhat.com/mingo/cfs-scheduler/
>  
> -v17 includes a bigger change: the CFS-core changes in 
> preparation of the group-scheduling feature, written Srivatsa 
> Vaddagiri. Dmitry Adamushko provided cleanups and further 
> generalizations to this code and the modularization of CFS 
> has been further enhanced as a result. To users, these 
> changes are mostly invisible.
> 
> Changes since -v16:
> 
>  - lots of core updates to support group scheduling, and related 
>cleanups. (Srivatsa Vaddagiri, Dmitry Adamushko)
> 
>  - tuned the runtime-limit up a bit, based on relentless 
> testing done by
>Tobias Gerschner.
> 
>  - the new, precise load-calculation method for SMP balancing has been
>further enhanced, and is now active by default. (Dmitry Adamushko)
> 
>  - fix SCHED_IDLEPRIO support (based on feedback from Thomas Sattler)
> 
>  - further updates to /proc/sched_debug and /proc/PID/sched
> 
>  - more cleanups
> 
> As usual, any sort of feedback, bugreport, fix and suggestion 
> is more than welcome!

Fedora 7 & Debian Sarge 3.1 kernels using latest CFS v17 available at 
http://linux-dev.qc.ec.gc.ca/
YUM users: http://linux-dev.qc.ec.gc.ca/kernel/fedora/alt-sched.repo

Thnx Ingo & cie.

>   Ingo

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch] CFS scheduler, -v17

2007-06-15 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Ingo Molnar
 Envoyé : 14 juin 2007 18:49
 
 i'm pleased to announce release -v17 of the CFS scheduler patchset.
  
 The rolled-up CFS patch against v2.6.22-rc4, v2.6.22-rc4-mm2, 
 v2.6.21.5 or v2.6.20.13 can be downloaded from the usual place:
 
http://people.redhat.com/mingo/cfs-scheduler/
  
 -v17 includes a bigger change: the CFS-core changes in 
 preparation of the group-scheduling feature, written Srivatsa 
 Vaddagiri. Dmitry Adamushko provided cleanups and further 
 generalizations to this code and the modularization of CFS 
 has been further enhanced as a result. To users, these 
 changes are mostly invisible.
 
 Changes since -v16:
 
  - lots of core updates to support group scheduling, and related 
cleanups. (Srivatsa Vaddagiri, Dmitry Adamushko)
 
  - tuned the runtime-limit up a bit, based on relentless 
 testing done by
Tobias Gerschner.
 
  - the new, precise load-calculation method for SMP balancing has been
further enhanced, and is now active by default. (Dmitry Adamushko)
 
  - fix SCHED_IDLEPRIO support (based on feedback from Thomas Sattler)
 
  - further updates to /proc/sched_debug and /proc/PID/sched
 
  - more cleanups
 
 As usual, any sort of feedback, bugreport, fix and suggestion 
 is more than welcome!

Fedora 7  Debian Sarge 3.1 kernels using latest CFS v17 available at 
http://linux-dev.qc.ec.gc.ca/
YUM users: http://linux-dev.qc.ec.gc.ca/kernel/fedora/alt-sched.repo

Thnx Ingo  cie.

   Ingo

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: call for more SD versus CFS comparisons (was: Re: [ck] Mainline plans)

2007-06-12 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de 
> Fortier,Vincent [Montreal]
> Envoyé : 12 juin 2007 21:36
> À : Miguel Figueiredo; linux kernel mailing list; [EMAIL PROTECTED]
> Cc : Con Kolivas; Ingo Molnar
> Objet : RE: call for more SD versus CFS comparisons (was: Re: 
> [ck] Mainline plans)
> 
> > -Message d'origine-
> > De : [EMAIL PROTECTED]
> > [mailto:[EMAIL PROTECTED] De la part de Miguel 
> > Figueiredo Envoyé : 11 juin 2007 20:30
> > 
> > Hi all,
> > 
> > some results based on massing_intr.c by Satoru, can be found on 
> > http://people.redhat.com/mingo/cfs-scheduler/tools/massive_intr.c
> > 
> > 
> > I know that other people, who read lkml, also tested the 
> same way, it 
> > would be nice if they also post their data.
> > 
> 
> For the pleasure of comparing CPU schedulers, both CFS v16 & 
> full CK2 patched pre-built kernels for Fedora 7 using latest 
> build 3194 are now available at http://linux-dev.qc.ec.gc.ca
> 
> There is also a not yet fully tuned YUM repository 
> configuration file available at 
> http://linux-dev.qc.ec.gc.ca/kernel/fedora/alt-sched.repo
> Although I suggest you download the rpm's directly from the 
> web page & install both manually using rpm -ivh --force
> 
> Have fun.
> 

Also, now that I think of it, here are my results:

Test:
./massive_intr 20 6000

Hardware:
- Athlon 64 4200+
- 2GB ram
- nVidia 6600GT using latest Beta driver NVIDIA-Linux-x86_64-100.14.06-pkg2.run
- SATA drive
- On-board audio (00:04.0 Multimedia audio controller: nVidia Corporation CK804 
AC'97 Audio Controller (rev a2))

Kernels:
CFS v16 2.6.21 FC7 build 3194
CK2 2.6.21 FC7 build 3194

Here is what the TOP output was looking like for both case:
top - 13:36:22 up 38 min,  4 users,  load average: 20.41, 13.43, 7.49
Tasks: 172 total,  21 running, 150 sleeping,   0 stopped,   1 zombie
Cpu0  : 99.7%us,  0.3%sy,  0.0%ni,  0.0%id,  0.0%wa,  0.0%hi,  0.0%si,  0.0%st
Cpu1  : 94.7%us,  4.6%sy,  0.0%ni,  0.0%id,  0.0%wa,  0.3%hi,  0.3%si,  0.0%st
Mem:   2058860k total,  1217132k used,   841728k free,56456k buffers
Swap:  1959888k total,0k used,  1959888k free,   781636k cached

  PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEMTIME+  COMMAND
 6099 megalout  20   0  7860  224  136 R   10  0.0   0:26.12 massive_intr
 6102 megalout  20   0  7860  224  136 R   10  0.0   0:26.27 massive_intr
 6104 megalout  20   0  7860  224  136 R   10  0.0   0:26.51 massive_intr
 6105 megalout  20   0  7860  224  136 R   10  0.0   0:26.32 massive_intr
 6107 megalout  20   0  7860  224  136 R   10  0.0   0:26.62 massive_intr
 6112 megalout  20   0  7860  224  136 R   10  0.0   0:26.32 massive_intr
 6116 megalout  20   0  7860  224  136 R   10  0.0   0:26.26 massive_intr
 6109 megalout  20   0  7860  224  136 R   10  0.0   0:26.32 massive_intr
 6115 megalout  20   0  7860  224  136 R   10  0.0   0:26.50 massive_intr
 6118 megalout  20   0  7860  224  136 R9  0.0   0:26.37 massive_intr
 6100 megalout  20   0  7860  224  136 R9  0.0   0:26.51 massive_intr
 6103 megalout  20   0  7860  224  136 R9  0.0   0:26.24 massive_intr
 6106 megalout  20   0  7860  224  136 R9  0.0   0:26.32 massive_intr
 6108 megalout  20   0  7860  224  136 R9  0.0   0:26.32 massive_intr
 6110 megalout  20   0  7860  224  136 R9  0.0   0:26.54 massive_intr
 6111 megalout  20   0  7860  224  136 R9  0.0   0:26.37 massive_intr
 6113 megalout  20   0  7860  224  136 R9  0.0   0:26.56 massive_intr
 6114 megalout  20   0  7860  224  136 R9  0.0   0:26.30 massive_intr
 6117 megalout  20   0  7860  224  136 R9  0.0   0:26.34 massive_intr
 6101 megalout  20   0  7860  224  136 R9  0.0   0:26.31 massive_intr
 4681 root  20   0  196m  67m  12m S8  3.4   4:06.00 X
 6036 megalout  20   0  694m  53m  32m S2  2.6   0:07.82 amarokapp
 4904 megalout  20   0  271m 9.9m 6948 S1  0.5   0:18.16 gkrellm
 5054 megalout  20   0  160m  45m  20m S1  2.2   0:10.90 firefox-bin
 5201 megalout  20   0  160m 9304 5480 S1  0.5   0:01.42 emerald
 5210 megalout  20   0  240m  12m 6284 S1  0.6   0:33.29 beryl


CFS v16:
-
beryl interractivity way too unresponsive..
- window decoration highlight taking around 5-10 secs to switch between windows 
focus
- window movement either impossible or animation laggy enough to not being seen 
at all.
- Cube rotation still possible using mouse scroll although really really really 
laggy

Amarok MP3 music:
- No audio skips at all.  Playing really well!


CK2 patchset:
-
Beryl interractivity almost totally unresponsive... In fact mouse movement was 
near-impossible to control.
- window movement totally impossible
- Cube rotation still possible using mouse scroll although there was no 
animation at all

Amarok MP3 music:
- 

RE: call for more SD versus CFS comparisons (was: Re: [ck] Mainline plans)

2007-06-12 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de 
> Miguel Figueiredo
> Envoyé : 11 juin 2007 20:30
> 
> Hi all,
> 
> some results based on massing_intr.c by Satoru, can be found 
> on http://people.redhat.com/mingo/cfs-scheduler/tools/massive_intr.c
> 
> 
> I know that other people, who read lkml, also tested the same 
> way, it would be nice if they also post their data.
> 

For the pleasure of comparing CPU schedulers, both CFS v16 & full CK2 patched 
pre-built kernels for Fedora 7 using latest build 3194 are now available at 
http://linux-dev.qc.ec.gc.ca

There is also a not yet fully tuned YUM repository configuration file available 
at http://linux-dev.qc.ec.gc.ca/kernel/fedora/alt-sched.repo
Although I suggest you download the rpm's directly from the web page & install 
both manually using rpm -ivh --force

Have fun.

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: call for more SD versus CFS comparisons (was: Re: [ck] Mainline plans)

2007-06-12 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de 
 Miguel Figueiredo
 Envoyé : 11 juin 2007 20:30
 
 Hi all,
 
 some results based on massing_intr.c by Satoru, can be found 
 on http://people.redhat.com/mingo/cfs-scheduler/tools/massive_intr.c
 
 
 I know that other people, who read lkml, also tested the same 
 way, it would be nice if they also post their data.
 

For the pleasure of comparing CPU schedulers, both CFS v16  full CK2 patched 
pre-built kernels for Fedora 7 using latest build 3194 are now available at 
http://linux-dev.qc.ec.gc.ca

There is also a not yet fully tuned YUM repository configuration file available 
at http://linux-dev.qc.ec.gc.ca/kernel/fedora/alt-sched.repo
Although I suggest you download the rpm's directly from the web page  install 
both manually using rpm -ivh --force

Have fun.

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: call for more SD versus CFS comparisons (was: Re: [ck] Mainline plans)

2007-06-12 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de 
 Fortier,Vincent [Montreal]
 Envoyé : 12 juin 2007 21:36
 À : Miguel Figueiredo; linux kernel mailing list; [EMAIL PROTECTED]
 Cc : Con Kolivas; Ingo Molnar
 Objet : RE: call for more SD versus CFS comparisons (was: Re: 
 [ck] Mainline plans)
 
  -Message d'origine-
  De : [EMAIL PROTECTED]
  [mailto:[EMAIL PROTECTED] De la part de Miguel 
  Figueiredo Envoyé : 11 juin 2007 20:30
  
  Hi all,
  
  some results based on massing_intr.c by Satoru, can be found on 
  http://people.redhat.com/mingo/cfs-scheduler/tools/massive_intr.c
  
  
  I know that other people, who read lkml, also tested the 
 same way, it 
  would be nice if they also post their data.
  
 
 For the pleasure of comparing CPU schedulers, both CFS v16  
 full CK2 patched pre-built kernels for Fedora 7 using latest 
 build 3194 are now available at http://linux-dev.qc.ec.gc.ca
 
 There is also a not yet fully tuned YUM repository 
 configuration file available at 
 http://linux-dev.qc.ec.gc.ca/kernel/fedora/alt-sched.repo
 Although I suggest you download the rpm's directly from the 
 web page  install both manually using rpm -ivh --force
 
 Have fun.
 

Also, now that I think of it, here are my results:

Test:
./massive_intr 20 6000

Hardware:
- Athlon 64 4200+
- 2GB ram
- nVidia 6600GT using latest Beta driver NVIDIA-Linux-x86_64-100.14.06-pkg2.run
- SATA drive
- On-board audio (00:04.0 Multimedia audio controller: nVidia Corporation CK804 
AC'97 Audio Controller (rev a2))

Kernels:
CFS v16 2.6.21 FC7 build 3194
CK2 2.6.21 FC7 build 3194

Here is what the TOP output was looking like for both case:
top - 13:36:22 up 38 min,  4 users,  load average: 20.41, 13.43, 7.49
Tasks: 172 total,  21 running, 150 sleeping,   0 stopped,   1 zombie
Cpu0  : 99.7%us,  0.3%sy,  0.0%ni,  0.0%id,  0.0%wa,  0.0%hi,  0.0%si,  0.0%st
Cpu1  : 94.7%us,  4.6%sy,  0.0%ni,  0.0%id,  0.0%wa,  0.3%hi,  0.3%si,  0.0%st
Mem:   2058860k total,  1217132k used,   841728k free,56456k buffers
Swap:  1959888k total,0k used,  1959888k free,   781636k cached

  PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEMTIME+  COMMAND
 6099 megalout  20   0  7860  224  136 R   10  0.0   0:26.12 massive_intr
 6102 megalout  20   0  7860  224  136 R   10  0.0   0:26.27 massive_intr
 6104 megalout  20   0  7860  224  136 R   10  0.0   0:26.51 massive_intr
 6105 megalout  20   0  7860  224  136 R   10  0.0   0:26.32 massive_intr
 6107 megalout  20   0  7860  224  136 R   10  0.0   0:26.62 massive_intr
 6112 megalout  20   0  7860  224  136 R   10  0.0   0:26.32 massive_intr
 6116 megalout  20   0  7860  224  136 R   10  0.0   0:26.26 massive_intr
 6109 megalout  20   0  7860  224  136 R   10  0.0   0:26.32 massive_intr
 6115 megalout  20   0  7860  224  136 R   10  0.0   0:26.50 massive_intr
 6118 megalout  20   0  7860  224  136 R9  0.0   0:26.37 massive_intr
 6100 megalout  20   0  7860  224  136 R9  0.0   0:26.51 massive_intr
 6103 megalout  20   0  7860  224  136 R9  0.0   0:26.24 massive_intr
 6106 megalout  20   0  7860  224  136 R9  0.0   0:26.32 massive_intr
 6108 megalout  20   0  7860  224  136 R9  0.0   0:26.32 massive_intr
 6110 megalout  20   0  7860  224  136 R9  0.0   0:26.54 massive_intr
 6111 megalout  20   0  7860  224  136 R9  0.0   0:26.37 massive_intr
 6113 megalout  20   0  7860  224  136 R9  0.0   0:26.56 massive_intr
 6114 megalout  20   0  7860  224  136 R9  0.0   0:26.30 massive_intr
 6117 megalout  20   0  7860  224  136 R9  0.0   0:26.34 massive_intr
 6101 megalout  20   0  7860  224  136 R9  0.0   0:26.31 massive_intr
 4681 root  20   0  196m  67m  12m S8  3.4   4:06.00 X
 6036 megalout  20   0  694m  53m  32m S2  2.6   0:07.82 amarokapp
 4904 megalout  20   0  271m 9.9m 6948 S1  0.5   0:18.16 gkrellm
 5054 megalout  20   0  160m  45m  20m S1  2.2   0:10.90 firefox-bin
 5201 megalout  20   0  160m 9304 5480 S1  0.5   0:01.42 emerald
 5210 megalout  20   0  240m  12m 6284 S1  0.6   0:33.29 beryl


CFS v16:
-
beryl interractivity way too unresponsive..
- window decoration highlight taking around 5-10 secs to switch between windows 
focus
- window movement either impossible or animation laggy enough to not being seen 
at all.
- Cube rotation still possible using mouse scroll although really really really 
laggy

Amarok MP3 music:
- No audio skips at all.  Playing really well!


CK2 patchset:
-
Beryl interractivity almost totally unresponsive... In fact mouse movement was 
near-impossible to control.
- window movement totally impossible
- Cube rotation still possible using mouse scroll although there was no 
animation at all

Amarok MP3 music:
- Same as CFS, no skips at all.  Playing really well!


I'll quote Martin on the Ck mailing list:
 According to Ingo most of the interactivity issues should be fixed by now. 
 Still I do not see how that translates to CFS

RE: [patch 00/32] 2.6.20-stable review

2007-06-08 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Chris Wright
> Envoyé : 8 juin 2007 03:15
> 
> This is the start of the stable review cycle for the 
> 2.6.20.14 release.
> There are 32 patches in this series, all will be posted as a 
> response to this one.  If anyone has any issues with these 
> being applied, please let us know.  If anyone is a maintainer 
> of the proper subsystem, and wants to add a Signed-off-by: 
> line to the patch, please respond with it.
> 
> These patches are sent out with a number of different people on the
> Cc: line.  If you wish to be a reviewer, please email 
> [EMAIL PROTECTED] to add your name to the list.  If you want 
> to be off the reviewer list, also email us.
> 
> Responses should be made by Sun Jun 10 07:15 UTC.
> Anything received after that time might be too late.

Hi Chris,

I have encoutered a bug using fixdep when compiling the Apani VPN contivity 
client v3.3 (v3.5 seems fine).  The bug started happening with the release of 
the 2948 build of FC6 kernel (corresponding to a post 2.6.20.7) and caused 
fixdep to segfault.  This has been fixed in 2.6.21.  I tried to trigger the bug 
using the 2.6.21 patch (thnx to the raw ouput option in git.kernel.org) and it 
really seems to resolve the problem.

The original commit:
http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.20.y.git;a=commit;h=3dedd29b0ef5c7a36ac835ac5524a6a8e22c22ab

The fix:
http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.21.y.git;a=commit;h=1ea6b2418cd2c6a6f31067ed6ece5cd12c69351e

Could it be included in this new stable release of 2.6.20 ?

thnx

> thanks,
> 
> the -stable release team
> --
> -
> To unsubscribe from this list: send the line "unsubscribe 
> linux-kernel" in the body of a message to 
> [EMAIL PROTECTED] More majordomo info at  
> http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at  http://www.tux.org/lkml/
> 
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch 00/32] 2.6.20-stable review

2007-06-08 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Chris Wright
 Envoyé : 8 juin 2007 03:15
 
 This is the start of the stable review cycle for the 
 2.6.20.14 release.
 There are 32 patches in this series, all will be posted as a 
 response to this one.  If anyone has any issues with these 
 being applied, please let us know.  If anyone is a maintainer 
 of the proper subsystem, and wants to add a Signed-off-by: 
 line to the patch, please respond with it.
 
 These patches are sent out with a number of different people on the
 Cc: line.  If you wish to be a reviewer, please email 
 [EMAIL PROTECTED] to add your name to the list.  If you want 
 to be off the reviewer list, also email us.
 
 Responses should be made by Sun Jun 10 07:15 UTC.
 Anything received after that time might be too late.

Hi Chris,

I have encoutered a bug using fixdep when compiling the Apani VPN contivity 
client v3.3 (v3.5 seems fine).  The bug started happening with the release of 
the 2948 build of FC6 kernel (corresponding to a post 2.6.20.7) and caused 
fixdep to segfault.  This has been fixed in 2.6.21.  I tried to trigger the bug 
using the 2.6.21 patch (thnx to the raw ouput option in git.kernel.org) and it 
really seems to resolve the problem.

The original commit:
http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.20.y.git;a=commit;h=3dedd29b0ef5c7a36ac835ac5524a6a8e22c22ab

The fix:
http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.21.y.git;a=commit;h=1ea6b2418cd2c6a6f31067ed6ece5cd12c69351e

Could it be included in this new stable release of 2.6.20 ?

thnx

 thanks,
 
 the -stable release team
 --
 -
 To unsubscribe from this list: send the line unsubscribe 
 linux-kernel in the body of a message to 
 [EMAIL PROTECTED] More majordomo info at  
 http://vger.kernel.org/majordomo-info.html
 Please read the FAQ at  http://www.tux.org/lkml/
 
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [stable] [patch 00/69] -stable review

2007-05-23 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : Greg KH [mailto:[EMAIL PROTECTED] 
> Envoyé : 22 mai 2007 23:37
> À : Fortier,Vincent [Montreal]
> 
> On Tue, May 22, 2007 at 09:28:34PM -0400, Fortier,Vincent 
> [Montreal] wrote:
> > > -Message d'origine-
> > > De : [EMAIL PROTECTED]
> > > [mailto:[EMAIL PROTECTED] De la part de Chuck 
> > > Ebbert Envoy? : 21 mai 2007 18:24
> > > 
> > > Adrian Bunk wrote:
> > > > On Mon, May 21, 2007 at 12:31:48PM -0700, Andrew Morton wrote:
> > > >> On Mon, 21 May 2007 12:16:12 -0700 Chris Wright 
> > > >> <[EMAIL PROTECTED]> wrote:
> > > >>
> > > >>> This is the start of the stable review cycle for the 2.6.21.2 release.
> > > >> Gee a lot of these are fixing recently-added regressions :( ...
> > > > 
> > > > If only it would fix all of them...
> > > > 
> > > > Michal's list [1] currently contains 51 different regressions in
> > > > 2.6.21 compared to 2.6.20 (12 of them were already reported before
> > > > 2.6.21 was released).
> > > 
> > > Yeah, 2.6.21 seems awfully buggy, and patches to fix many of the 
> > > bugs haven't appeared.
> > > 
> > > Another 2.6.20 update seems to be in order...
> > 
> > Hi,
> > 
> > If there is a new stable 2.6.20 / 2.6.21, would it be 
> possible that they both contain also this patch to keep the 
> same behaviour between 2.6.20 -> 2.6.22 kernels regarding 
> paravirt_ops GPL export?
> > 
> http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=com
> > mit;h=21564fd2a3deb48200b595332f9ed4c9f311f2a7
> 
> Why?  Is there an in-kernel use for that export?

I sincerely dont know... Although:
- * NOTE: CONFIG_PARAVIRT is experimental and the paravirt_ops
- * semantics are subject to change. Hence we only do this
- * internal-only export of this, until it gets sorted out and
- * all lowlevel CPU ops used by modules are separately exported.

Makes me think there was a "good enough" reason to export instead of export GPL 
and that it "could" be usefull to keep the same behaviour for that same 
experimental feature under 2.6.20 / 2.6.21.  Although, the "until it gets 
sorted out" makes me think it might planned to get removed before the end of 
2.6.22 

Thnx.

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [stable] [patch 00/69] -stable review

2007-05-23 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : Greg KH [mailto:[EMAIL PROTECTED] 
 Envoyé : 22 mai 2007 23:37
 À : Fortier,Vincent [Montreal]
 
 On Tue, May 22, 2007 at 09:28:34PM -0400, Fortier,Vincent 
 [Montreal] wrote:
   -Message d'origine-
   De : [EMAIL PROTECTED]
   [mailto:[EMAIL PROTECTED] De la part de Chuck 
   Ebbert Envoy? : 21 mai 2007 18:24
   
   Adrian Bunk wrote:
On Mon, May 21, 2007 at 12:31:48PM -0700, Andrew Morton wrote:
On Mon, 21 May 2007 12:16:12 -0700 Chris Wright 
[EMAIL PROTECTED] wrote:
   
This is the start of the stable review cycle for the 2.6.21.2 release.
Gee a lot of these are fixing recently-added regressions :( ...

If only it would fix all of them...

Michal's list [1] currently contains 51 different regressions in
2.6.21 compared to 2.6.20 (12 of them were already reported before
2.6.21 was released).
   
   Yeah, 2.6.21 seems awfully buggy, and patches to fix many of the 
   bugs haven't appeared.
   
   Another 2.6.20 update seems to be in order...
  
  Hi,
  
  If there is a new stable 2.6.20 / 2.6.21, would it be 
 possible that they both contain also this patch to keep the 
 same behaviour between 2.6.20 - 2.6.22 kernels regarding 
 paravirt_ops GPL export?
  
 http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=com
  mit;h=21564fd2a3deb48200b595332f9ed4c9f311f2a7
 
 Why?  Is there an in-kernel use for that export?

I sincerely dont know... Although:
- * NOTE: CONFIG_PARAVIRT is experimental and the paravirt_ops
- * semantics are subject to change. Hence we only do this
- * internal-only export of this, until it gets sorted out and
- * all lowlevel CPU ops used by modules are separately exported.

Makes me think there was a good enough reason to export instead of export GPL 
and that it could be usefull to keep the same behaviour for that same 
experimental feature under 2.6.20 / 2.6.21.  Although, the until it gets 
sorted out makes me think it might planned to get removed before the end of 
2.6.22 

Thnx.

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch 00/69] -stable review

2007-05-22 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Chuck Ebbert
> Envoyé : 21 mai 2007 18:24
> 
> Adrian Bunk wrote:
> > On Mon, May 21, 2007 at 12:31:48PM -0700, Andrew Morton wrote:
> >> On Mon, 21 May 2007 12:16:12 -0700
> >> Chris Wright <[EMAIL PROTECTED]> wrote:
> >>
> >>> This is the start of the stable review cycle for the 2.6.21.2 release.
> >> Gee a lot of these are fixing recently-added regressions :( ...
> > 
> > If only it would fix all of them...
> > 
> > Michal's list [1] currently contains 51 different regressions in
> > 2.6.21 compared to 2.6.20 (12 of them were already reported before
> > 2.6.21 was released).
> 
> Yeah, 2.6.21 seems awfully buggy, and patches to fix many of 
> the bugs haven't appeared.
> 
> Another 2.6.20 update seems to be in order...

Hi,

If there is a new stable 2.6.20 / 2.6.21, would it be possible that they both 
contain also this patch to keep the same behaviour between 2.6.20 -> 2.6.22 
kernels regarding paravirt_ops GPL export?
http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=21564fd2a3deb48200b595332f9ed4c9f311f2a7

Thnx!

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch 00/69] -stable review

2007-05-22 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Chuck Ebbert
 Envoyé : 21 mai 2007 18:24
 
 Adrian Bunk wrote:
  On Mon, May 21, 2007 at 12:31:48PM -0700, Andrew Morton wrote:
  On Mon, 21 May 2007 12:16:12 -0700
  Chris Wright [EMAIL PROTECTED] wrote:
 
  This is the start of the stable review cycle for the 2.6.21.2 release.
  Gee a lot of these are fixing recently-added regressions :( ...
  
  If only it would fix all of them...
  
  Michal's list [1] currently contains 51 different regressions in
  2.6.21 compared to 2.6.20 (12 of them were already reported before
  2.6.21 was released).
 
 Yeah, 2.6.21 seems awfully buggy, and patches to fix many of 
 the bugs haven't appeared.
 
 Another 2.6.20 update seems to be in order...

Hi,

If there is a new stable 2.6.20 / 2.6.21, would it be possible that they both 
contain also this patch to keep the same behaviour between 2.6.20 - 2.6.22 
kernels regarding paravirt_ops GPL export?
http://git.kernel.org/?p=linux/kernel/git/torvalds/linux-2.6.git;a=commit;h=21564fd2a3deb48200b595332f9ed4c9f311f2a7

Thnx!

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch] CFS scheduler, -v10

2007-05-07 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Ingo Molnar
> Envoyé : 7 mai 2007 12:34
> 
> 
> i'm pleased to announce release -v10 of the CFS scheduler patchset.
> (The main goal of CFS is to implement "desktop scheduling" 
> with as  high quality as technically possible.)
> 
> The CFS patch against v2.6.21.1 (or against v2.6.20.10) can 
> be downloaded from the usual place:
>  
> http://people.redhat.com/mingo/cfs-scheduler/
> 
> -v10 got a bit bigger than usual. One reason for that is that 
> i couldnt resist the fine enhancements contributed by Peter 
> Williams and Mike Galbraith. Peter fixed and improved nice 
> level support and SMP load-balancing, while Mike applied his 
> "no compromises" interactivity tuning skills to CFS:
> 
> 6 files changed, 247 insertions(+), 110 deletions(-)

CFS v10 & SD 0.48 Fedora Core 6 kernels 2.6.20 build 2948 arch x86_64 & i686 
are available at http://linux-dev.qc.ec.gc.ca.

>   Ingo

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [patch] CFS scheduler, -v10

2007-05-07 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Ingo Molnar
 Envoyé : 7 mai 2007 12:34
 
 
 i'm pleased to announce release -v10 of the CFS scheduler patchset.
 (The main goal of CFS is to implement desktop scheduling 
 with as  high quality as technically possible.)
 
 The CFS patch against v2.6.21.1 (or against v2.6.20.10) can 
 be downloaded from the usual place:
  
 http://people.redhat.com/mingo/cfs-scheduler/
 
 -v10 got a bit bigger than usual. One reason for that is that 
 i couldnt resist the fine enhancements contributed by Peter 
 Williams and Mike Galbraith. Peter fixed and improved nice 
 level support and SMP load-balancing, while Mike applied his 
 no compromises interactivity tuning skills to CFS:
 
 6 files changed, 247 insertions(+), 110 deletions(-)

CFS v10  SD 0.48 Fedora Core 6 kernels 2.6.20 build 2948 arch x86_64  i686 
are available at http://linux-dev.qc.ec.gc.ca.

   Ingo

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [ck] Staircase Deadline cpu scheduler version 0.48 - 2.6.18 to 2.6.20 backports patches

2007-05-03 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Con Kolivas
> Envoyé : 3 mai 2007 02:30
> 
> I've done some minor cleanups and microoptimisations to the 
> code since version
> 0.46 in preparation for releasing -ck with SD as its base.
> 
> http://www.kernel.org/pub/linux/kernel/people/ck/patches/stair
> case-deadline/2.6.21/2.6.21-sd-0.48.patch

Hi all,

I made backport patches of SD 0.48 for 2.6.18.8, 2.6.19.7 & 2.6.20.11 kernels 
and also diff patches 0.46 -> 0.47 -> 0.48.

All info & download available at http://linux-dev.qc.ec.gc.ca

There are also various versions of SD & CFS kernels for FC6 & Debian Sarge/Etch

> --
> -ck

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [ck] Staircase Deadline cpu scheduler version 0.48 - 2.6.18 to 2.6.20 backports patches

2007-05-03 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Con Kolivas
 Envoyé : 3 mai 2007 02:30
 
 I've done some minor cleanups and microoptimisations to the 
 code since version
 0.46 in preparation for releasing -ck with SD as its base.
 
 http://www.kernel.org/pub/linux/kernel/people/ck/patches/stair
 case-deadline/2.6.21/2.6.21-sd-0.48.patch

Hi all,

I made backport patches of SD 0.48 for 2.6.18.8, 2.6.19.7  2.6.20.11 kernels 
and also diff patches 0.46 - 0.47 - 0.48.

All info  download available at http://linux-dev.qc.ec.gc.ca

There are also various versions of SD  CFS kernels for FC6  Debian Sarge/Etch

 --
 -ck

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [ck] [ANNOUNCE] Staircase Deadline cpu scheduler version 0.44

2007-04-21 Thread Fortier,Vincent [Montreal]
> -Message d'origine-
> De : [EMAIL PROTECTED] 
> [mailto:[EMAIL PROTECTED] De la part de Con Kolivas
> Envoyé : 21 avril 2007 03:57
> 
> A significant bugfix for forking tasks was just posted, so 
> here is an updated version of the staircase deadline cpu 
> scheduler. This may cause noticeable behavioural improvements 
> under certain workloads (such as compiling software with make).
> 
> Thanks to Al Boldi for making me check the fork code!
> 
> http://ck.kolivas.org/patches/staircase-deadline/2.6.20.7-sd-0.44.patch
> http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc7-sd-0.44.patch
> 
> Incrementals in
> http://ck.kolivas.org/patches/staircase-deadline/2.6.20.7/
> http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc7/
> 
> Renicing X to -10, while not essential, is preferable.
> 
> See the patch just posted called 'sched: implement staircase 
> scheduler yaf fix' for full changelog.

Kernels for FC6 x86_64 & i686 using SD 0.44 and latest source (2944) now 
available at http://linux-dev.qc.ec.gc.ca

Note that backport patches for 2.6.18 & 2.6.19 kernels are also available.

> 
> --
> -ck
> 

Again, nice work CK!

-vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [ck] [ANNOUNCE] Staircase Deadline cpu scheduler version 0.44

2007-04-21 Thread Fortier,Vincent [Montreal]
 -Message d'origine-
 De : [EMAIL PROTECTED] 
 [mailto:[EMAIL PROTECTED] De la part de Con Kolivas
 Envoyé : 21 avril 2007 03:57
 
 A significant bugfix for forking tasks was just posted, so 
 here is an updated version of the staircase deadline cpu 
 scheduler. This may cause noticeable behavioural improvements 
 under certain workloads (such as compiling software with make).
 
 Thanks to Al Boldi for making me check the fork code!
 
 http://ck.kolivas.org/patches/staircase-deadline/2.6.20.7-sd-0.44.patch
 http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc7-sd-0.44.patch
 
 Incrementals in
 http://ck.kolivas.org/patches/staircase-deadline/2.6.20.7/
 http://ck.kolivas.org/patches/staircase-deadline/2.6.21-rc7/
 
 Renicing X to -10, while not essential, is preferable.
 
 See the patch just posted called 'sched: implement staircase 
 scheduler yaf fix' for full changelog.

Kernels for FC6 x86_64  i686 using SD 0.44 and latest source (2944) now 
available at http://linux-dev.qc.ec.gc.ca

Note that backport patches for 2.6.18  2.6.19 kernels are also available.

 
 --
 -ck
 

Again, nice work CK!

-vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 4GB Physical. Less than 3GB in Linux.

2007-04-16 Thread Fortier,Vincent [Montreal]
> 
> I've noticed that with 4GB physical ram, I'm only see 3098000 
> MB on the Dell 745, and 3107056MB on the IBM X60s.
> 
> I've tested with CONFIG_HIGHMEM4G=y, CONFIG_HIGHMEM64G=y, 
> CONFIG_X86_PAE=y, but nothing makes any difference.
> 
> Tested on Linux 2.6.20, Linux 2.6.21-rc7.

Got similar problem on a HP XW6200:
http://bugzilla.kernel.org/show_bug.cgi?id=7111

Tought I was the only one getting this!

> # dmesg
> copy_e820_map() start:  size: 0009f000 end:
> 0009f000 type: 1
> copy_e820_map() type is E820_RAM
> copy_e820_map() start: 0009f000 size: 1000 end:
> 000a type: 2
> copy_e820_map() start: 000dc000 size: 00024000 end:
> 0010 type: 2
> copy_e820_map() start: 0010 size: bf5d end:
> bf6d type: 1
> copy_e820_map() type is E820_RAM
> copy_e820_map() start: bf6d size: f000 end:
> bf6df000 type: 3
> copy_e820_map() start: bf6df000 size: 00021000 end:
> bf70 type: 4
> copy_e820_map() start: bf70 size: 0090 end:
> c000 type: 2
> copy_e820_map() start: f000 size: 0400 end:
> f400 type: 2
> copy_e820_map() start: fec0 size: 0001 end:
> fec1 type: 2
> copy_e820_map() start: fed0 size: 0400 end:
> fed00400 type: 2
> copy_e820_map() start: fed14000 size: 6000 end:
> fed1a000 type: 2
> copy_e820_map() start: fed1c000 size: 00074000 end:
> fed9 type: 2
> copy_e820_map() start: fee0 size: 1000 end:
> fee01000 type: 2
> copy_e820_map() start: ff80 size: 0080 end:
> 0001 type: 2
>  BIOS-e820:  - 0009f000 (usable)
>  BIOS-e820: 0009f000 - 000a (reserved)
>  BIOS-e820: 000dc000 - 0010 (reserved)
>  BIOS-e820: 0010 - bf6d (usable)
>  BIOS-e820: bf6d - bf6df000 (ACPI data)
>  BIOS-e820: bf6df000 - bf70 (ACPI NVS)
>  BIOS-e820: bf70 - c000 (reserved)
>  BIOS-e820: f000 - f400 (reserved)
>  BIOS-e820: fec0 - fec1 (reserved)
>  BIOS-e820: fed0 - fed00400 (reserved)
>  BIOS-e820: fed14000 - fed1a000 (reserved)
>  BIOS-e820: fed1c000 - fed9 (reserved)
>  BIOS-e820: fee0 - fee01000 (reserved)
>  BIOS-e820: ff80 - 0001 (reserved) 
> 2166MB HIGHMEM available.
> 896MB LOWMEM available.
> 
> Memory: 3106140k/3136320k available (2225k kernel code, 
> 29072k reserved, 910k data, 196k init, 2218816k highmem) 
> virtual kernel memory layout:
> fixmap  : 0xfff5 - 0xf000   ( 700 kB)
> pkmap   : 0xff80 - 0xffc0   (4096 kB)
> vmalloc : 0xf880 - 0xff7fe000   ( 111 MB)
> lowmem  : 0xc000 - 0xf800   ( 896 MB)
>   .init : 0xc0416000 - 0xc0447000   ( 196 kB)
>   .data : 0xc032c513 - 0xc040ff34   ( 910 kB)
>   .text : 0xc010 - 0xc032c513   (2225 kB)
> 
> 
> Thanks,
> Jeff.

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 4GB Physical. Less than 3GB in Linux.

2007-04-16 Thread Fortier,Vincent [Montreal]
 
 I've noticed that with 4GB physical ram, I'm only see 3098000 
 MB on the Dell 745, and 3107056MB on the IBM X60s.
 
 I've tested with CONFIG_HIGHMEM4G=y, CONFIG_HIGHMEM64G=y, 
 CONFIG_X86_PAE=y, but nothing makes any difference.
 
 Tested on Linux 2.6.20, Linux 2.6.21-rc7.

Got similar problem on a HP XW6200:
http://bugzilla.kernel.org/show_bug.cgi?id=7111

Tought I was the only one getting this!

 # dmesg
 copy_e820_map() start:  size: 0009f000 end:
 0009f000 type: 1
 copy_e820_map() type is E820_RAM
 copy_e820_map() start: 0009f000 size: 1000 end:
 000a type: 2
 copy_e820_map() start: 000dc000 size: 00024000 end:
 0010 type: 2
 copy_e820_map() start: 0010 size: bf5d end:
 bf6d type: 1
 copy_e820_map() type is E820_RAM
 copy_e820_map() start: bf6d size: f000 end:
 bf6df000 type: 3
 copy_e820_map() start: bf6df000 size: 00021000 end:
 bf70 type: 4
 copy_e820_map() start: bf70 size: 0090 end:
 c000 type: 2
 copy_e820_map() start: f000 size: 0400 end:
 f400 type: 2
 copy_e820_map() start: fec0 size: 0001 end:
 fec1 type: 2
 copy_e820_map() start: fed0 size: 0400 end:
 fed00400 type: 2
 copy_e820_map() start: fed14000 size: 6000 end:
 fed1a000 type: 2
 copy_e820_map() start: fed1c000 size: 00074000 end:
 fed9 type: 2
 copy_e820_map() start: fee0 size: 1000 end:
 fee01000 type: 2
 copy_e820_map() start: ff80 size: 0080 end:
 0001 type: 2
  BIOS-e820:  - 0009f000 (usable)
  BIOS-e820: 0009f000 - 000a (reserved)
  BIOS-e820: 000dc000 - 0010 (reserved)
  BIOS-e820: 0010 - bf6d (usable)
  BIOS-e820: bf6d - bf6df000 (ACPI data)
  BIOS-e820: bf6df000 - bf70 (ACPI NVS)
  BIOS-e820: bf70 - c000 (reserved)
  BIOS-e820: f000 - f400 (reserved)
  BIOS-e820: fec0 - fec1 (reserved)
  BIOS-e820: fed0 - fed00400 (reserved)
  BIOS-e820: fed14000 - fed1a000 (reserved)
  BIOS-e820: fed1c000 - fed9 (reserved)
  BIOS-e820: fee0 - fee01000 (reserved)
  BIOS-e820: ff80 - 0001 (reserved) 
 2166MB HIGHMEM available.
 896MB LOWMEM available.
 
 Memory: 3106140k/3136320k available (2225k kernel code, 
 29072k reserved, 910k data, 196k init, 2218816k highmem) 
 virtual kernel memory layout:
 fixmap  : 0xfff5 - 0xf000   ( 700 kB)
 pkmap   : 0xff80 - 0xffc0   (4096 kB)
 vmalloc : 0xf880 - 0xff7fe000   ( 111 MB)
 lowmem  : 0xc000 - 0xf800   ( 896 MB)
   .init : 0xc0416000 - 0xc0447000   ( 196 kB)
   .data : 0xc032c513 - 0xc040ff34   ( 910 kB)
   .text : 0xc010 - 0xc032c513   (2225 kB)
 
 
 Thanks,
 Jeff.

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Questions regarding ACPI vs FADT dump on HP XW6200 using 2.6.16 - 2.6.20 kernels?

2007-04-02 Thread Fortier,Vincent [Montreal]
Hi all,

In reference to this Bugzilla entry:
http://bugzilla.kernel.org/show_bug.cgi?id=7110

I'm getting ACPI errors in the dmesg:
[7.443482] ACPI Error (evgpe-0711): No handler or method for GPE[
0], disabling event [20060707]
[7.449881] ACPI Error (evgpe-0711): No handler or method for GPE[
1], disabling event [20060707]
[7.456145] ACPI Error (evgpe-0711): No handler or method for GPE[
2], disabling event [20060707]
[7.462271] ACPI Error (evgpe-0711): No handler or method for GPE[
5], disabling event [20060707]
[7.468310] ACPI Error (evgpe-0711): No handler or method for GPE[
6], disabling event [20060707]
[7.474260] ACPI Error (evgpe-0711): No handler or method for GPE[
7], disabling event [20060707]
[7.480124] ACPI Error (evgpe-0711): No handler or method for GPE[
9], disabling event [20060707]
[7.485893] ACPI Error (evgpe-0711): No handler or method for GPE[
A], disabling event [20060707]
[7.491473] ACPI Error (evgpe-0711): No handler or method for GPE[
F], disabling event [20060707]
[7.491484] ACPI Error (evgpe-0711): No handler or method for
GPE[10], disabling event [20060707]
[7.491489] ACPI Error (evgpe-0711): No handler or method for
GPE[11], disabling event [20060707]
[7.491494] ACPI Error (evgpe-0711): No handler or method for
GPE[12], disabling event [20060707]
[7.491499] ACPI Error (evgpe-0711): No handler or method for
GPE[13], disabling event [20060707]
[7.491504] ACPI Error (evgpe-0711): No handler or method for
GPE[14], disabling event [20060707]
[7.491509] ACPI Error (evgpe-0711): No handler or method for
GPE[15], disabling event [20060707]
[7.491514] ACPI Error (evgpe-0711): No handler or method for
GPE[16], disabling event [20060707]
[7.491519] ACPI Error (evgpe-0711): No handler or method for
GPE[17], disabling event [20060707]
[7.491527] ACPI Error (evgpe-0711): No handler or method for
GPE[19], disabling event [20060707]
[7.491532] ACPI Error (evgpe-0711): No handler or method for
GPE[1A], disabling event [20060707]
[7.491537] ACPI Error (evgpe-0711): No handler or method for
GPE[1B], disabling event [20060707]
[7.491542] ACPI Error (evgpe-0711): No handler or method for
GPE[1C], disabling event [20060707]
[7.491547] ACPI Error (evgpe-0711): No handler or method for
GPE[1D], disabling event [20060707]
[7.491552] ACPI Error (evgpe-0711): No handler or method for
GPE[1E], disabling event [20060707]
[7.491557] ACPI Error (evgpe-0711): No handler or method for
GPE[1F], disabling event [20060707]

The ACPI: FADT dmesg entry is:
[0.00] ACPI: RSDP (v002 COMPAQ)
@ 0x000e8e10
[0.00] ACPI: XSDT (v001 COMPAQ CPQ0063  0x20051222  0x)
@ 0xbfff50ec
[0.00] ACPI: FADT (v003 COMPAQ TUMWATER 0x0001  0x)
@ 0xbfff52ac
[0.00] ACPI: SSDT (v001 COMPAQ  PROJECT 0x0001 MSFT
0x010e) @ 0xbfff657a
[0.00] ACPI: MADT (v001 COMPAQ TUMWATER 0x0001  0x)
@ 0xbfff53b8
[0.00] ACPI: ASF! (v032 COMPAQ TUMWATER 0x0001  0x)
@ 0xbfff543c
[0.00] ACPI: MCFG (v001 COMPAQ TUMWATER 0x0001  0x)
@ 0xbfff54a6
[0.00] ACPI: DSDT (v001 COMPAQ DSDT 0x0001 MSFT
0x010e) @ 0x

I've been told to use this command to dump this memory segment:
dd if=/dev/mem skip=$[0xbfff52ac] bs=1c count=244 2> /dev/null >
fadt.bin
But I'm always getting an empty file?

My questions are:
1) How can I dump the FADT ?
2) Could this be related to this memory detection bug
http://bugzilla.kernel.org/show_bug.cgi?id=7111 ?

Thnx!

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Questions regarding ACPI vs FADT dump on HP XW6200 using 2.6.16 - 2.6.20 kernels?

2007-04-02 Thread Fortier,Vincent [Montreal]
Hi all,

In reference to this Bugzilla entry:
http://bugzilla.kernel.org/show_bug.cgi?id=7110

I'm getting ACPI errors in the dmesg:
[7.443482] ACPI Error (evgpe-0711): No handler or method for GPE[
0], disabling event [20060707]
[7.449881] ACPI Error (evgpe-0711): No handler or method for GPE[
1], disabling event [20060707]
[7.456145] ACPI Error (evgpe-0711): No handler or method for GPE[
2], disabling event [20060707]
[7.462271] ACPI Error (evgpe-0711): No handler or method for GPE[
5], disabling event [20060707]
[7.468310] ACPI Error (evgpe-0711): No handler or method for GPE[
6], disabling event [20060707]
[7.474260] ACPI Error (evgpe-0711): No handler or method for GPE[
7], disabling event [20060707]
[7.480124] ACPI Error (evgpe-0711): No handler or method for GPE[
9], disabling event [20060707]
[7.485893] ACPI Error (evgpe-0711): No handler or method for GPE[
A], disabling event [20060707]
[7.491473] ACPI Error (evgpe-0711): No handler or method for GPE[
F], disabling event [20060707]
[7.491484] ACPI Error (evgpe-0711): No handler or method for
GPE[10], disabling event [20060707]
[7.491489] ACPI Error (evgpe-0711): No handler or method for
GPE[11], disabling event [20060707]
[7.491494] ACPI Error (evgpe-0711): No handler or method for
GPE[12], disabling event [20060707]
[7.491499] ACPI Error (evgpe-0711): No handler or method for
GPE[13], disabling event [20060707]
[7.491504] ACPI Error (evgpe-0711): No handler or method for
GPE[14], disabling event [20060707]
[7.491509] ACPI Error (evgpe-0711): No handler or method for
GPE[15], disabling event [20060707]
[7.491514] ACPI Error (evgpe-0711): No handler or method for
GPE[16], disabling event [20060707]
[7.491519] ACPI Error (evgpe-0711): No handler or method for
GPE[17], disabling event [20060707]
[7.491527] ACPI Error (evgpe-0711): No handler or method for
GPE[19], disabling event [20060707]
[7.491532] ACPI Error (evgpe-0711): No handler or method for
GPE[1A], disabling event [20060707]
[7.491537] ACPI Error (evgpe-0711): No handler or method for
GPE[1B], disabling event [20060707]
[7.491542] ACPI Error (evgpe-0711): No handler or method for
GPE[1C], disabling event [20060707]
[7.491547] ACPI Error (evgpe-0711): No handler or method for
GPE[1D], disabling event [20060707]
[7.491552] ACPI Error (evgpe-0711): No handler or method for
GPE[1E], disabling event [20060707]
[7.491557] ACPI Error (evgpe-0711): No handler or method for
GPE[1F], disabling event [20060707]

The ACPI: FADT dmesg entry is:
[0.00] ACPI: RSDP (v002 COMPAQ)
@ 0x000e8e10
[0.00] ACPI: XSDT (v001 COMPAQ CPQ0063  0x20051222  0x)
@ 0xbfff50ec
[0.00] ACPI: FADT (v003 COMPAQ TUMWATER 0x0001  0x)
@ 0xbfff52ac
[0.00] ACPI: SSDT (v001 COMPAQ  PROJECT 0x0001 MSFT
0x010e) @ 0xbfff657a
[0.00] ACPI: MADT (v001 COMPAQ TUMWATER 0x0001  0x)
@ 0xbfff53b8
[0.00] ACPI: ASF! (v032 COMPAQ TUMWATER 0x0001  0x)
@ 0xbfff543c
[0.00] ACPI: MCFG (v001 COMPAQ TUMWATER 0x0001  0x)
@ 0xbfff54a6
[0.00] ACPI: DSDT (v001 COMPAQ DSDT 0x0001 MSFT
0x010e) @ 0x

I've been told to use this command to dump this memory segment:
dd if=/dev/mem skip=$[0xbfff52ac] bs=1c count=244 2 /dev/null 
fadt.bin
But I'm always getting an empty file?

My questions are:
1) How can I dump the FADT ?
2) Could this be related to this memory detection bug
http://bugzilla.kernel.org/show_bug.cgi?id=7111 ?

Thnx!

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


staircase deadline 0.37 backport 2.6.18.8 and 2.6.19.7 & debian sarge/etch kernel availability

2007-03-30 Thread Fortier,Vincent [Montreal]
 
Hi all,
 
Backports of the staircase deadline cpu scheduler version 0.37 for
2.6.18.8 and 2.6.19.7 kernels are available at
http://linux-dev.qc.ec.gc.ca/.

Also, Debian Etch 4.0 and Sarge 3.1 kernels for i686 are available (Deb
etch amd64 is currently compiling and FC6 will follow).

Vincent Fortier
Informatique
Environnement Canada

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


staircase deadline 0.37 backport 2.6.18.8 and 2.6.19.7 debian sarge/etch kernel availability

2007-03-30 Thread Fortier,Vincent [Montreal]
 
Hi all,
 
Backports of the staircase deadline cpu scheduler version 0.37 for
2.6.18.8 and 2.6.19.7 kernels are available at
http://linux-dev.qc.ec.gc.ca/.

Also, Debian Etch 4.0 and Sarge 3.1 kernels for i686 are available (Deb
etch amd64 is currently compiling and FC6 will follow).

Vincent Fortier
Informatique
Environnement Canada

-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


staircase deadline 0.36 backport 2.6.18.8 and 2.6.19.7

2007-03-29 Thread Fortier,Vincent [Montreal]
Hi all,

Backports of the staircase deadline cpu scheduler for 2.6.18.8 and
2.6.19.7 kernels are available at http://linux-dev.qc.ec.gc.ca/.

Also, Debian Etch 4.0 kernels for both i686 and x86_64 are available.
Fedora Core 6 kernels might also be available later today/tomorrow.

Both patches looks quite stable.  I've been able to fix the 2.6.19
backport by reversing this patch: "[PATCH] move_task_off_dead_cpu()
should be called with disabled ints" (see
http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.20.y.git;a=co
mmit;h=054b9108e01ef27e2e6b32b4226abb6024626f06).  I don't know if this
bug (see
http://linux-dev.qc.ec.gc.ca/kernel/debian/etch/x86_64/2.6.19/bug-2.6.19
.7-rsdl-033.jpg) can also affect a Vanilla 2.6.19.7 kernel or even the
actual mainline?  Duno if it would be worth investigating a bit further?

Although, here is a top of a 2.6.19.7 + SD 0.36 patch in action while
building 3 kernels at once and running a glxgears in foreground:
top - 12:39:39 up 17 min,  2 users,  load average: 5.84, 4.37, 2.20
Tasks: 162 total,   5 running, 157 sleeping,   0 stopped,   0 zombie
Cpu0  : 87.0%us, 13.0%sy,  0.0%ni,  0.0%id,  0.0%wa,  0.0%hi,  0.0%si,
0.0%st
Cpu1  : 88.4%us, 11.6%sy,  0.0%ni,  0.0%id,  0.0%wa,  0.0%hi,  0.0%si,
0.0%st
Mem:   2059940k total,  1249476k used,   810464k free,   137112k buffers
Swap:  1959888k total,0k used,  1959888k free,   620552k cached

  PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEMTIME+  COMMAND
15970 root  33   0 45016  36m 4520 R   18  1.8   0:00.54 cc1
16025 root   1   0 40904  32m  R   18  1.6   0:00.53 cc1
16015 root  33   0 40620  32m 4496 R   17  1.6   0:00.52 cc1
13401 th0ma7 1   0 42356 9420 6700 R6  0.5   1:16.13 glxgears
16589 root   1   0  3756  400  192 S1  0.0   0:01.04 faked-sysv
 5012 root   5   0 91208  46m 7200 S0  2.3   0:11.57 Xorg

Note that my KDE session/firefox/kontact/etc. was still pretty
responsive!

Vincent Fortier
Informatique
Environnement Canada



-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


staircase deadline 0.36 backport 2.6.18.8 and 2.6.19.7

2007-03-29 Thread Fortier,Vincent [Montreal]
Hi all,

Backports of the staircase deadline cpu scheduler for 2.6.18.8 and
2.6.19.7 kernels are available at http://linux-dev.qc.ec.gc.ca/.

Also, Debian Etch 4.0 kernels for both i686 and x86_64 are available.
Fedora Core 6 kernels might also be available later today/tomorrow.

Both patches looks quite stable.  I've been able to fix the 2.6.19
backport by reversing this patch: [PATCH] move_task_off_dead_cpu()
should be called with disabled ints (see
http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.20.y.git;a=co
mmit;h=054b9108e01ef27e2e6b32b4226abb6024626f06).  I don't know if this
bug (see
http://linux-dev.qc.ec.gc.ca/kernel/debian/etch/x86_64/2.6.19/bug-2.6.19
.7-rsdl-033.jpg) can also affect a Vanilla 2.6.19.7 kernel or even the
actual mainline?  Duno if it would be worth investigating a bit further?

Although, here is a top of a 2.6.19.7 + SD 0.36 patch in action while
building 3 kernels at once and running a glxgears in foreground:
top - 12:39:39 up 17 min,  2 users,  load average: 5.84, 4.37, 2.20
Tasks: 162 total,   5 running, 157 sleeping,   0 stopped,   0 zombie
Cpu0  : 87.0%us, 13.0%sy,  0.0%ni,  0.0%id,  0.0%wa,  0.0%hi,  0.0%si,
0.0%st
Cpu1  : 88.4%us, 11.6%sy,  0.0%ni,  0.0%id,  0.0%wa,  0.0%hi,  0.0%si,
0.0%st
Mem:   2059940k total,  1249476k used,   810464k free,   137112k buffers
Swap:  1959888k total,0k used,  1959888k free,   620552k cached

  PID USER  PR  NI  VIRT  RES  SHR S %CPU %MEMTIME+  COMMAND
15970 root  33   0 45016  36m 4520 R   18  1.8   0:00.54 cc1
16025 root   1   0 40904  32m  R   18  1.6   0:00.53 cc1
16015 root  33   0 40620  32m 4496 R   17  1.6   0:00.52 cc1
13401 th0ma7 1   0 42356 9420 6700 R6  0.5   1:16.13 glxgears
16589 root   1   0  3756  400  192 S1  0.0   0:01.04 faked-sysv
 5012 root   5   0 91208  46m 7200 S0  2.3   0:11.57 Xorg

Note that my KDE session/firefox/kontact/etc. was still pretty
responsive!

Vincent Fortier
Informatique
Environnement Canada



-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


rsdl 0.33 backport to 2.6.18.8 available & pre-built DebianEtch and Fedora 6 kernels

2007-03-23 Thread Fortier,Vincent [Montreal]
Hi all,

A backported version of the RSDL 0.33 patch is now available at
http://linux-dev.qc.ec.gc.ca for a 2.6.18.8 kernel.

There are also a few 2.6.18/2.6.20 pre-built kernels available for both
Debian Etch (x86_64) and Fedora 6 (x86_64, i686 will follow).

Vincent Fortier
Informatique
Environnement Canada

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.21-rc4-mm1

2007-03-23 Thread Fortier,Vincent [Montreal]
> 
> Andy Whitcroft wrote:
> > Con Kolivas wrote:
> >> On Friday 23 March 2007 05:17, Andy Whitcroft wrote:
> >>> Ok, I have yet a third x86_64 machine is is blowing up with the 
> >>> latest
> >>> 2.6.21-rc4-mm1+hotfixes+rsdl-0.32 but working with 
> >>> 2.6.21-rc4-mm1+hotfixes-RSDL.  I have results on various hotfix 
> >>> levels so I have just fired off a set of tests across the affected

> >>> machines on that latest hotfix stack plus the RSDL backout and the

> >>> results should be in in the next hour or two.
> >>>
> >>> I think there is a strong correlation between RSDL and these
hangs.  
> >>> Any suggestions as to the next step.
> >> Found a nasty in requeue_task
> >> +  if (list_empty(old_array->queue + old_prio))
> >> +  __clear_bit(old_prio, p->array->prio_bitmap);
> >>
> >> see anything wrong there? I do :P
> >>
> >> I'll queue that up with the other changes pending and hopefully
that 
> >> will fix your bug.
> > 
> > Tests queued with your rdsl-0.33 patch (I am assuming its in there).
> > Will let you know how it looks.
> 
> Hmmm, this is good for the original machine (as was 0.32) but 
> not for either of the other two.  I am seeing panics as below 
> on those two.
> 
> -apw
> 

I don't know if this might help out or even if it is related but I get a
"similar" crash every time using my backported rsdl patch on a 2.6.19.7
kernel.  "Maybie" this type of BUG might be easier to trigger under that
specific kernel?  Here is a picture of the output:
http://linux-dev.qc.ec.gc.ca/kernel/debian/etch/x86_64/2.6.19/bug-2.6.19
.7-rsdl-033.jpg.

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: 2.6.21-rc4-mm1

2007-03-23 Thread Fortier,Vincent [Montreal]
 
 Andy Whitcroft wrote:
  Con Kolivas wrote:
  On Friday 23 March 2007 05:17, Andy Whitcroft wrote:
  Ok, I have yet a third x86_64 machine is is blowing up with the 
  latest
  2.6.21-rc4-mm1+hotfixes+rsdl-0.32 but working with 
  2.6.21-rc4-mm1+hotfixes-RSDL.  I have results on various hotfix 
  levels so I have just fired off a set of tests across the affected

  machines on that latest hotfix stack plus the RSDL backout and the

  results should be in in the next hour or two.
 
  I think there is a strong correlation between RSDL and these
hangs.  
  Any suggestions as to the next step.
  Found a nasty in requeue_task
  +  if (list_empty(old_array-queue + old_prio))
  +  __clear_bit(old_prio, p-array-prio_bitmap);
 
  see anything wrong there? I do :P
 
  I'll queue that up with the other changes pending and hopefully
that 
  will fix your bug.
  
  Tests queued with your rdsl-0.33 patch (I am assuming its in there).
  Will let you know how it looks.
 
 Hmmm, this is good for the original machine (as was 0.32) but 
 not for either of the other two.  I am seeing panics as below 
 on those two.
 
 -apw
 

I don't know if this might help out or even if it is related but I get a
similar crash every time using my backported rsdl patch on a 2.6.19.7
kernel.  Maybie this type of BUG might be easier to trigger under that
specific kernel?  Here is a picture of the output:
http://linux-dev.qc.ec.gc.ca/kernel/debian/etch/x86_64/2.6.19/bug-2.6.19
.7-rsdl-033.jpg.

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


rsdl 0.33 backport to 2.6.18.8 available pre-built DebianEtch and Fedora 6 kernels

2007-03-23 Thread Fortier,Vincent [Montreal]
Hi all,

A backported version of the RSDL 0.33 patch is now available at
http://linux-dev.qc.ec.gc.ca for a 2.6.18.8 kernel.

There are also a few 2.6.18/2.6.20 pre-built kernels available for both
Debian Etch (x86_64) and Fedora 6 (x86_64, i686 will follow).

Vincent Fortier
Informatique
Environnement Canada

-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


rsdl 0.32 backport to 2.6.18.8 available & pre-built Debian Etch and Fedora 6 kernels

2007-03-22 Thread Fortier,Vincent [Montreal]
Hi all,

A backported version of the RSDL 0.32 patch is now available at
http://linux-dev.qc.ec.gc.ca for a 2.6.18.8 kernel.

There are also a few 2.6.18/2.6.20 pre-built kernels available for both
Debian Etch (x86_64) and Fedora 6 (x86_64).

Vincent Fortier
Informatique
Environnement Canada


-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


rsdl 0.32 backport to 2.6.18.8 available pre-built Debian Etch and Fedora 6 kernels

2007-03-22 Thread Fortier,Vincent [Montreal]
Hi all,

A backported version of the RSDL 0.32 patch is now available at
http://linux-dev.qc.ec.gc.ca for a 2.6.18.8 kernel.

There are also a few 2.6.18/2.6.20 pre-built kernels available for both
Debian Etch (x86_64) and Fedora 6 (x86_64).

Vincent Fortier
Informatique
Environnement Canada


-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: [ck] Re: RSDL v0.30 cpu scheduler for ... 2.6.18.8 kernel

2007-03-12 Thread Fortier,Vincent [Montreal]
> On Monday 12 March 2007 22:21, Michael Gerdau wrote:
> > > > And here are the backported RSDL 0.30 patches in case any of you

> > > > would still be running an older 2.6.18.8 kernel ...
> >
> > The original mail doesn't seem to have made it to the ck-list.
> >
> > Where would I find the backported RSDL 0.30 patches for 2.6.18.8 ?
> 
> It might have been filtered due to containing attachments, or 
> because the author wasn't a member of the mailing list, sorry.
> 

I've found a place to put theses patches to:
http://linux-dev.qc.ec.gc.ca/kernel/rsdl/2.6.18.8-rsdl-0.30.patch
http://linux-dev.qc.ec.gc.ca/kernel/rsdl/2.6.18.8-rsdl-0.29-0.30.patch


There is also a Debian Etch x86_64 2.6.18-rsdl-0.30 kernel available at:
http://linux-dev.qc.ec.gc.ca/kernel/debian/etch/x86_64/linux-headers-2.6
.18.8-rsdl-0.30-amd64-envcan_2.6.18.8-rsdl-0.30-003_amd64.deb
http://linux-dev.qc.ec.gc.ca/kernel/debian/etch/x86_64/linux-image-2.6.1
8.8-rsdl-0.30-amd64-envcan_2.6.18.8-rsdl-0.30-003_amd64.deb


Hope this helps!

-vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: RSDL v0.30 cpu scheduler for ... 2.6.18.8 kernel

2007-03-12 Thread Fortier,Vincent [Montreal]
> 
> Hello Vincent,
> 
> As it seems the 5s net gain is during the early phase of the 
> boot process, could you please post longer dmesgs, to let us 
> know where the gain occurs ?

Hi Paul,

Here is an sdiff between the two dmesg:
[0.00] Linux version 2.6.18.8-amd64-envcan-003 (root@ | [
0.00] Linux version 2.6.18.8-rsdl-0.30-amd64-envcan
[0.00] time.c: Detected 2210.220 MHz processor.   | [
0.00] time.c: Detected 2210.215 MHz processor.
[   37.396210] Console: colour dummy device 80x25 | [
32.320335] Console: colour dummy device 80x25
[   37.397484] Dentry cache hash table entries: 262144 (order | [
32.321611] Dentry cache hash table entries: 262144 (order
[   37.399120] Inode-cache hash table entries: 131072 (order: | [
32.323318] Inode-cache hash table entries: 131072 (order:
[   37.399633] Checking aperture...   | [
32.323869] Checking aperture...
[   37.399641] CPU 0: aperture @ 84 size 32 MB| [
32.323877] CPU 0: aperture @ 84 size 32 MB
[   37.399645] Aperture too small (32 MB) | [
32.323881] Aperture too small (32 MB)
[   37.404577] No AGP bridge found| [
32.328813] No AGP bridge found
[   37.421192] Memory: 2054704k/2096000k available (1934k ker | [
32.345833] Memory: 2054704k/2096000k available (1935k ker
[   37.498110] Calibrating delay using timer specific routine | [
32.422230] Calibrating delay using timer specific routine

Here is a bit more detailed info:

Vanilla Version:
[0.00] SMP: Allowing 2 CPUs, 0 hotplug CPUs
[0.00] Built 1 zonelists.  Total pages: 515848
[0.00] Kernel command line: root=LABEL=DebianEtch_64 ro
vga=0x305
[0.00] Initializing CPU#0
[0.00] PID hash table entries: 4096 (order: 12, 32768 bytes)
[0.00] Disabling vsyscall due to use of PM timer
[0.00] time.c: Using 3.579545 MHz WALL PM GTOD PM timer.
[0.00] time.c: Detected 2210.220 MHz processor.
[   37.396210] Console: colour dummy device 80x25
[   37.397484] Dentry cache hash table entries: 262144 (order: 9,
2097152 bytes)
[   37.399120] Inode-cache hash table entries: 131072 (order: 8, 1048576
bytes)
[   37.399633] Checking aperture...
[   37.399641] CPU 0: aperture @ 84 size 32 MB
[   37.399645] Aperture too small (32 MB)
[   37.404577] No AGP bridge found
[   37.421192] Memory: 2054704k/2096000k available (1934k kernel code,
40908k reserved, 868k data, 188k init)


RSDL Version:
[0.00] Initializing CPU#0
[0.00] PID hash table entries: 4096 (order: 12, 32768 bytes)
[0.00] Disabling vsyscall due to use of PM timer
[0.00] time.c: Using 3.579545 MHz WALL PM GTOD PM timer.
[0.00] time.c: Detected 2210.215 MHz processor.
[   32.320335] Console: colour dummy device 80x25
[   32.321611] Dentry cache hash table entries: 262144 (order: 9,
2097152 bytes)
[   32.323318] Inode-cache hash table entries: 131072 (order: 8, 1048576
bytes)
[   32.323869] Checking aperture...
[   32.323877] CPU 0: aperture @ 84 size 32 MB
[   32.323881] Aperture too small (32 MB)
[   32.328813] No AGP bridge found

There seems to be a some sort of lag happening adter the CPU
detection... It might simply be the call to vga=0x305 which takes less
or more time depending of the kernel?  I have attached a gzip full
dmesg.

> 
> Regards,
> Paul
> 

- vin 


dmesg.2.6.18.8-rsdl-0.30-amd64-envcan.gz
Description: dmesg.2.6.18.8-rsdl-0.30-amd64-envcan.gz


RE: RSDL v0.30 cpu scheduler for ... 2.6.18.8 kernel

2007-03-12 Thread Fortier,Vincent [Montreal]
 
 Hello Vincent,
 
 As it seems the 5s net gain is during the early phase of the 
 boot process, could you please post longer dmesgs, to let us 
 know where the gain occurs ?

Hi Paul,

Here is an sdiff between the two dmesg:
[0.00] Linux version 2.6.18.8-amd64-envcan-003 (root@ | [
0.00] Linux version 2.6.18.8-rsdl-0.30-amd64-envcan
[0.00] time.c: Detected 2210.220 MHz processor.   | [
0.00] time.c: Detected 2210.215 MHz processor.
[   37.396210] Console: colour dummy device 80x25 | [
32.320335] Console: colour dummy device 80x25
[   37.397484] Dentry cache hash table entries: 262144 (order | [
32.321611] Dentry cache hash table entries: 262144 (order
[   37.399120] Inode-cache hash table entries: 131072 (order: | [
32.323318] Inode-cache hash table entries: 131072 (order:
[   37.399633] Checking aperture...   | [
32.323869] Checking aperture...
[   37.399641] CPU 0: aperture @ 84 size 32 MB| [
32.323877] CPU 0: aperture @ 84 size 32 MB
[   37.399645] Aperture too small (32 MB) | [
32.323881] Aperture too small (32 MB)
[   37.404577] No AGP bridge found| [
32.328813] No AGP bridge found
[   37.421192] Memory: 2054704k/2096000k available (1934k ker | [
32.345833] Memory: 2054704k/2096000k available (1935k ker
[   37.498110] Calibrating delay using timer specific routine | [
32.422230] Calibrating delay using timer specific routine

Here is a bit more detailed info:

Vanilla Version:
[0.00] SMP: Allowing 2 CPUs, 0 hotplug CPUs
[0.00] Built 1 zonelists.  Total pages: 515848
[0.00] Kernel command line: root=LABEL=DebianEtch_64 ro
vga=0x305
[0.00] Initializing CPU#0
[0.00] PID hash table entries: 4096 (order: 12, 32768 bytes)
[0.00] Disabling vsyscall due to use of PM timer
[0.00] time.c: Using 3.579545 MHz WALL PM GTOD PM timer.
[0.00] time.c: Detected 2210.220 MHz processor.
[   37.396210] Console: colour dummy device 80x25
[   37.397484] Dentry cache hash table entries: 262144 (order: 9,
2097152 bytes)
[   37.399120] Inode-cache hash table entries: 131072 (order: 8, 1048576
bytes)
[   37.399633] Checking aperture...
[   37.399641] CPU 0: aperture @ 84 size 32 MB
[   37.399645] Aperture too small (32 MB)
[   37.404577] No AGP bridge found
[   37.421192] Memory: 2054704k/2096000k available (1934k kernel code,
40908k reserved, 868k data, 188k init)


RSDL Version:
[0.00] Initializing CPU#0
[0.00] PID hash table entries: 4096 (order: 12, 32768 bytes)
[0.00] Disabling vsyscall due to use of PM timer
[0.00] time.c: Using 3.579545 MHz WALL PM GTOD PM timer.
[0.00] time.c: Detected 2210.215 MHz processor.
[   32.320335] Console: colour dummy device 80x25
[   32.321611] Dentry cache hash table entries: 262144 (order: 9,
2097152 bytes)
[   32.323318] Inode-cache hash table entries: 131072 (order: 8, 1048576
bytes)
[   32.323869] Checking aperture...
[   32.323877] CPU 0: aperture @ 84 size 32 MB
[   32.323881] Aperture too small (32 MB)
[   32.328813] No AGP bridge found

There seems to be a some sort of lag happening adter the CPU
detection... It might simply be the call to vga=0x305 which takes less
or more time depending of the kernel?  I have attached a gzip full
dmesg.

 
 Regards,
 Paul
 

- vin 


dmesg.2.6.18.8-rsdl-0.30-amd64-envcan.gz
Description: dmesg.2.6.18.8-rsdl-0.30-amd64-envcan.gz


RE: [ck] Re: RSDL v0.30 cpu scheduler for ... 2.6.18.8 kernel

2007-03-12 Thread Fortier,Vincent [Montreal]
 On Monday 12 March 2007 22:21, Michael Gerdau wrote:
And here are the backported RSDL 0.30 patches in case any of you

would still be running an older 2.6.18.8 kernel ...
 
  The original mail doesn't seem to have made it to the ck-list.
 
  Where would I find the backported RSDL 0.30 patches for 2.6.18.8 ?
 
 It might have been filtered due to containing attachments, or 
 because the author wasn't a member of the mailing list, sorry.
 

I've found a place to put theses patches to:
http://linux-dev.qc.ec.gc.ca/kernel/rsdl/2.6.18.8-rsdl-0.30.patch
http://linux-dev.qc.ec.gc.ca/kernel/rsdl/2.6.18.8-rsdl-0.29-0.30.patch


There is also a Debian Etch x86_64 2.6.18-rsdl-0.30 kernel available at:
http://linux-dev.qc.ec.gc.ca/kernel/debian/etch/x86_64/linux-headers-2.6
.18.8-rsdl-0.30-amd64-envcan_2.6.18.8-rsdl-0.30-003_amd64.deb
http://linux-dev.qc.ec.gc.ca/kernel/debian/etch/x86_64/linux-image-2.6.1
8.8-rsdl-0.30-amd64-envcan_2.6.18.8-rsdl-0.30-003_amd64.deb


Hope this helps!

-vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RSDL v0.28 for 2.6.20 -> backport to 2.6.18.8

2007-03-10 Thread Fortier,Vincent [Montreal]
Hi all,

> 
> Here is an update for RSDL to version 0.28
> 
> Full patch:
> http://ck.kolivas.org/patches/staircase-deadline/2.6.20-sched-
> rsdl-0.28.patch
> 
> Series:
> http://ck.kolivas.org/patches/staircase-deadline/2.6.20/
> 
> The patch to get you from 0.26 to 0.28:
> http://ck.kolivas.org/patches/staircase-deadline/2.6.20/sched-
> rsdl-0.26-0.28.patch
> 
> A similar patch and directories will be made for 2.6.21-rc3 
> without further announcement
> 

First of all, thanx Con for this nice piece of code.

I've been trying in the last few days to backport this new scheduler to
a 2.6.18 kernel.  After a lot of efforts I have finally been able to
compile and run a RSDL patched 2.6.18.8 kernel on a x86_64 arch and
actually my test PC booted 2-3 seconds faster with it compared to a
vanilla 2.6.18.8 kernel.

This patch includes a few other patches from 2.6.19+ kernel:

PATCH 1:
http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.20.y.git;a=co
mmit;h=ece8a684c75df215320b4155944979e3f78c5c93

PATCH 2:
http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.20.y.git;a=co
mmit;h=08c183f31bdbb709f177f6d3110d5f288ea33933

PATCH 3:
The patch to get you from 0.26 to 0.28:
http://ck.kolivas.org/patches/staircase-deadline/2.6.20/sched-rsdl-0.26-
0.28.patch

There might also be other small pieces of code comming from a 2.6.19+
kernel.

Due to the project I'm currently working on, this will, in the next few
weeks, help me out comparing heavy loads on a Debian Sarge/Etch 32/64
platform.  Suggestions on benchmark tools would greatly be appreciated.

Duno if this will be helpfull for anybody but I tought it would be nice
to give it back to the lkml community.

- vin


sched-rsdl-0.26-backport-kernel-2.6.18.patch.gz
Description: sched-rsdl-0.26-backport-kernel-2.6.18.patch.gz


RSDL v0.28 for 2.6.20 - backport to 2.6.18.8

2007-03-10 Thread Fortier,Vincent [Montreal]
Hi all,

 
 Here is an update for RSDL to version 0.28
 
 Full patch:
 http://ck.kolivas.org/patches/staircase-deadline/2.6.20-sched-
 rsdl-0.28.patch
 
 Series:
 http://ck.kolivas.org/patches/staircase-deadline/2.6.20/
 
 The patch to get you from 0.26 to 0.28:
 http://ck.kolivas.org/patches/staircase-deadline/2.6.20/sched-
 rsdl-0.26-0.28.patch
 
 A similar patch and directories will be made for 2.6.21-rc3 
 without further announcement
 

First of all, thanx Con for this nice piece of code.

I've been trying in the last few days to backport this new scheduler to
a 2.6.18 kernel.  After a lot of efforts I have finally been able to
compile and run a RSDL patched 2.6.18.8 kernel on a x86_64 arch and
actually my test PC booted 2-3 seconds faster with it compared to a
vanilla 2.6.18.8 kernel.

This patch includes a few other patches from 2.6.19+ kernel:

PATCH 1:
http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.20.y.git;a=co
mmit;h=ece8a684c75df215320b4155944979e3f78c5c93

PATCH 2:
http://git.kernel.org/?p=linux/kernel/git/stable/linux-2.6.20.y.git;a=co
mmit;h=08c183f31bdbb709f177f6d3110d5f288ea33933

PATCH 3:
The patch to get you from 0.26 to 0.28:
http://ck.kolivas.org/patches/staircase-deadline/2.6.20/sched-rsdl-0.26-
0.28.patch

There might also be other small pieces of code comming from a 2.6.19+
kernel.

Due to the project I'm currently working on, this will, in the next few
weeks, help me out comparing heavy loads on a Debian Sarge/Etch 32/64
platform.  Suggestions on benchmark tools would greatly be appreciated.

Duno if this will be helpfull for anybody but I tought it would be nice
to give it back to the lkml community.

- vin


sched-rsdl-0.26-backport-kernel-2.6.18.patch.gz
Description: sched-rsdl-0.26-backport-kernel-2.6.18.patch.gz


RE: Boot time Bluetooth BUG: warning: (value > m) at hid-core.c:793

2007-02-27 Thread Fortier,Vincent [Montreal]
Hi Jiri,

Actually I don't know... Let me test this.

Here is the scenario:
1) If I plug my USB dongle after my system has booted, everything works fine 
(except that the keyboard repeats a few keys somethimes..)
2) If the dongle is already plugged at boot-time, there is a BUG echo in the 
dmesg when the bluetooth service fires up.  This make both the mouse and 
keyboard unresponsive until I finally unplug / replug the dongle hence going 
back at state 1)

I never run the hid2hci command manually but the bluetooth init script seems to 
do so.

Quick test:
1- unplug/plug back the dongle to get back to state 1)
The keyboard and mouse do work has expected...
2- Now run the hid2hci command has root
[EMAIL PROTECTED] init.d]# hid2hci
Switching device 046d:c70a to HCI mode was successful
Switching device 046d:c70e to HCI mode was successful

DMESG:
input: Logitech Logitech BT Mini-Receiver as /class/input/input15
input,hiddev96: USB HID v1.11 Mouse [Logitech Logitech BT Mini-Receiver] on 
usb-:00:1d.7-6.2.2.1.2.3
BUG: warning: (value > m) at drivers/usb/input/hid-core.c:793/implement() 
(Tainted: P )
 [] dump_trace+0x69/0x1b6
 [] show_trace_log_lvl+0x18/0x2c
 [] show_trace+0xf/0x11
 [] dump_stack+0x15/0x17
 [] hid_output_report+0x23c/0x2e7
 [] hid_submit_ctrl+0x4c/0x1d9
 [] hid_submit_report+0x134/0x15f
 [] hiddev_ioctl+0x327/0x88a
 [] do_ioctl+0x4c/0x62
 [] vfs_ioctl+0x24a/0x25c
 [] sys_ioctl+0x4c/0x66
 [] syscall_call+0x7/0xb
 [<001e8402>] 0x1e8402
 ===
BUG: warning: 


3) Now repeat step 1):
 ===
usb 4-6.2.2.1.2.1: new full speed USB device using ehci_hcd and address 24
usb 4-6.2.2.1.2.1: configuration #1 chosen from 1 choice
usb 4-6.2.2.1.2: USB disconnect, address 21
usb 4-6.2.2.1.2.1: USB disconnect, address 24
usb 4-6.2.2.1.2.2: USB disconnect, address 22
usb 4-6.2.2.1.2.3: USB disconnect, address 23
usb 4-6.2.2.1.2: new full speed USB device using ehci_hcd and address 25
usb 4-6.2.2.1.2: configuration #1 chosen from 1 choice
hub 4-6.2.2.1.2:1.0: USB hub found
hub 4-6.2.2.1.2:1.0: 3 ports detected
usb 4-6.2.2.1.2.1: new full speed USB device using ehci_hcd and address 26
usb 4-6.2.2.1.2.1: configuration #1 chosen from 1 choice
usb 4-6.2.2.1.2.2: new full speed USB device using ehci_hcd and address 27
usb 4-6.2.2.1.2.2: configuration #1 chosen from 1 choice
input: Logitech Logitech BT Mini-Receiver as /class/input/input16
input: USB HID v1.11 Keyboard [Logitech Logitech BT Mini-Receiver] on 
usb-:00:1d.7-6.2.2.1.2.2
usb 4-6.2.2.1.2.3: new full speed USB device using ehci_hcd and address 28
usb 4-6.2.2.1.2.3: configuration #1 chosen from 1 choice
input: Logitech Logitech BT Mini-Receiver as /class/input/input17
input,hiddev96: USB HID v1.11 Mouse [Logitech Logitech BT Mini-Receiver] on 
usb-:00:1d.7-6.2.2.1.2.3

So yes, it looks like a hid2hci BUG or kernel BUG initiated by the hid2hci 
command.

- vin


> -Message d'origine-
> De : Jiri Kosina [mailto:[EMAIL PROTECTED] 
> Envoyé : 27 février 2007 11:15
> À : Fortier,Vincent [Montreal]
> Objet : RE: Boot time Bluetooth BUG: warning: (value > m) at 
> hid-core.c:793
> 
> On Tue, 27 Feb 2007, Fortier,Vincent [Montreal] wrote:
> 
> > Actually it's the first time I'm trying this keyboard and mouse.
> 
> And I guess when you use it in HID mode (i.e. when the 
> hid2hci command is not run), then keyboard and mouse work 
> with no errors reported in the kernel logs, and all the 
> buttons work correctly, am I right?
> 
> --
> Jiri Kosina
> 
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: Boot time Bluetooth BUG: warning: (value > m) at hid-core.c:793

2007-02-27 Thread Fortier,Vincent [Montreal]
> 
> we understand the original CSR HID proxy dongles, but for the Logitech

> ones, it is wild guesses. The current support in hid2hci has been 
> tested on Logitech diNovo first generation and I have no other 
> Logitech hardware to verify it with. We might simply need 
> the full HID report descriptor to see who is at fault.
> 
> As far as I can see from a quick look, the device IDs are 
> unchanged (0x046d/0xc70e for the keyboard), but the report 
> descriptor seems to differ from the one you have at 
> http://www.holtmann.org/linux/bluetooth/logitech.html - that 
> seems pretty sad.
> 
> What puzzles me a bit is a fact that both the bugreporters 
> seem to state pretty clearly that this started happening 
> after some update, am I right?
> 
> Or did the hardware before work all the time in the HID mode, 
> without even being attempted to switch to HCI (which seems to 
> cause the bug)?
> 

Actually it's the first time I'm trying this keyboard and mouse.

- vin
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: Boot time Bluetooth BUG: warning: (value m) at hid-core.c:793

2007-02-27 Thread Fortier,Vincent [Montreal]
 
 we understand the original CSR HID proxy dongles, but for the Logitech

 ones, it is wild guesses. The current support in hid2hci has been 
 tested on Logitech diNovo first generation and I have no other 
 Logitech hardware to verify it with. We might simply need 
 the full HID report descriptor to see who is at fault.
 
 As far as I can see from a quick look, the device IDs are 
 unchanged (0x046d/0xc70e for the keyboard), but the report 
 descriptor seems to differ from the one you have at 
 http://www.holtmann.org/linux/bluetooth/logitech.html - that 
 seems pretty sad.
 
 What puzzles me a bit is a fact that both the bugreporters 
 seem to state pretty clearly that this started happening 
 after some update, am I right?
 
 Or did the hardware before work all the time in the HID mode, 
 without even being attempted to switch to HCI (which seems to 
 cause the bug)?
 

Actually it's the first time I'm trying this keyboard and mouse.

- vin
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


RE: Boot time Bluetooth BUG: warning: (value m) at hid-core.c:793

2007-02-27 Thread Fortier,Vincent [Montreal]
Hi Jiri,

Actually I don't know... Let me test this.

Here is the scenario:
1) If I plug my USB dongle after my system has booted, everything works fine 
(except that the keyboard repeats a few keys somethimes..)
2) If the dongle is already plugged at boot-time, there is a BUG echo in the 
dmesg when the bluetooth service fires up.  This make both the mouse and 
keyboard unresponsive until I finally unplug / replug the dongle hence going 
back at state 1)

I never run the hid2hci command manually but the bluetooth init script seems to 
do so.

Quick test:
1- unplug/plug back the dongle to get back to state 1)
The keyboard and mouse do work has expected...
2- Now run the hid2hci command has root
[EMAIL PROTECTED] init.d]# hid2hci
Switching device 046d:c70a to HCI mode was successful
Switching device 046d:c70e to HCI mode was successful

DMESG:
input: Logitech Logitech BT Mini-Receiver as /class/input/input15
input,hiddev96: USB HID v1.11 Mouse [Logitech Logitech BT Mini-Receiver] on 
usb-:00:1d.7-6.2.2.1.2.3
BUG: warning: (value  m) at drivers/usb/input/hid-core.c:793/implement() 
(Tainted: P )
 [c0405018] dump_trace+0x69/0x1b6
 [c040517d] show_trace_log_lvl+0x18/0x2c
 [c0405778] show_trace+0xf/0x11
 [c0405875] dump_stack+0x15/0x17
 [c05993c0] hid_output_report+0x23c/0x2e7
 [c05994b7] hid_submit_ctrl+0x4c/0x1d9
 [c05997fd] hid_submit_report+0x134/0x15f
 [c059bd09] hiddev_ioctl+0x327/0x88a
 [c04802c8] do_ioctl+0x4c/0x62
 [c0480528] vfs_ioctl+0x24a/0x25c
 [c0480586] sys_ioctl+0x4c/0x66
 [c040404b] syscall_call+0x7/0xb
 [001e8402] 0x1e8402
 ===
BUG: warning: 


3) Now repeat step 1):
 ===
usb 4-6.2.2.1.2.1: new full speed USB device using ehci_hcd and address 24
usb 4-6.2.2.1.2.1: configuration #1 chosen from 1 choice
usb 4-6.2.2.1.2: USB disconnect, address 21
usb 4-6.2.2.1.2.1: USB disconnect, address 24
usb 4-6.2.2.1.2.2: USB disconnect, address 22
usb 4-6.2.2.1.2.3: USB disconnect, address 23
usb 4-6.2.2.1.2: new full speed USB device using ehci_hcd and address 25
usb 4-6.2.2.1.2: configuration #1 chosen from 1 choice
hub 4-6.2.2.1.2:1.0: USB hub found
hub 4-6.2.2.1.2:1.0: 3 ports detected
usb 4-6.2.2.1.2.1: new full speed USB device using ehci_hcd and address 26
usb 4-6.2.2.1.2.1: configuration #1 chosen from 1 choice
usb 4-6.2.2.1.2.2: new full speed USB device using ehci_hcd and address 27
usb 4-6.2.2.1.2.2: configuration #1 chosen from 1 choice
input: Logitech Logitech BT Mini-Receiver as /class/input/input16
input: USB HID v1.11 Keyboard [Logitech Logitech BT Mini-Receiver] on 
usb-:00:1d.7-6.2.2.1.2.2
usb 4-6.2.2.1.2.3: new full speed USB device using ehci_hcd and address 28
usb 4-6.2.2.1.2.3: configuration #1 chosen from 1 choice
input: Logitech Logitech BT Mini-Receiver as /class/input/input17
input,hiddev96: USB HID v1.11 Mouse [Logitech Logitech BT Mini-Receiver] on 
usb-:00:1d.7-6.2.2.1.2.3

So yes, it looks like a hid2hci BUG or kernel BUG initiated by the hid2hci 
command.

- vin


 -Message d'origine-
 De : Jiri Kosina [mailto:[EMAIL PROTECTED] 
 Envoyé : 27 février 2007 11:15
 À : Fortier,Vincent [Montreal]
 Objet : RE: Boot time Bluetooth BUG: warning: (value  m) at 
 hid-core.c:793
 
 On Tue, 27 Feb 2007, Fortier,Vincent [Montreal] wrote:
 
  Actually it's the first time I'm trying this keyboard and mouse.
 
 And I guess when you use it in HID mode (i.e. when the 
 hid2hci command is not run), then keyboard and mouse work 
 with no errors reported in the kernel logs, and all the 
 buttons work correctly, am I right?
 
 --
 Jiri Kosina
 
-
To unsubscribe from this list: send the line unsubscribe linux-kernel in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


  1   2   >