Re: Linux 2.6.22 released

2007-07-11 Thread Stephen Frost
* Linus Torvalds ([EMAIL PROTECTED]) wrote:
> But yeah, if Debian/sid is just using random compiler snapshots of the 
> day, I htink we can just bury this as "pointless".

Err, debian/sid *isn't* defaulting to gcc-4.2 yet, but it is made
available to people who want to install it and play with it.

Additionally, the kernel build is actually tied to a specific compiler
(doesn't default to just using whatever the default compiler is) so even
when gcc-4.2 is eventually made the default compiler the linux images
built for Debian won't use it immediately.  That change will be made
seperately and after additional testing to make sure it doesn't break
the built kernel which is distributed.

Thanks,

Stephen


signature.asc
Description: Digital signature


Re: Linux 2.6.22 released

2007-07-11 Thread Stephen Frost
* Linus Torvalds ([EMAIL PROTECTED]) wrote:
> I'm hoping your Debian/sid gcc version is some very experimental 
> known-buggy one, and not something that people _expect_ to be solid and 
> work well?

No such luck. :(  Debian's close to moving to gcc-4.2 as the default
compiler in sid.  We've rebuilt the archive a number of times (both
since the 4.2 release and during its development) using gcc-4.2 and
thought we'd identified most of the issues with it.

It clearly sounds like we need to open a high-severity bug on this issue
and track it down before we move to it as the default compiler.  I'll
start harassing the appropriate folks also.

Stefano, can you file that bug, including the config, dmesg, and
backtrace from gcc if you can get it?

Thanks,

Stephen


signature.asc
Description: Digital signature


Re: OpenAFS gatekeepers request addition of AFS_SUPER_MAGIC to magic.h

2006-12-29 Thread Stephen Frost
* Adam Megacz ([EMAIL PROTECTED]) wrote:
> --- include/linux/magic.h   2006-12-29 15:48:50.0 -0800
> +++ include/linux/magic.h   2006-11-29 13:57:37.0 -0800
> @@ -3,7 +3,6 @@
>  
>  #define ADFS_SUPER_MAGIC   0xadf5
>  #define AFFS_SUPER_MAGIC   0xadff
> -#define AFS_SUPER_MAGIC0x5346414F
>  #define AUTOFS_SUPER_MAGIC 0x0187
>  #define CODA_SUPER_MAGIC   0x73757245
>  #define EFS_SUPER_MAGIC0x414A53

Wouldn't you want a patch which *adds* it, rather than one which
*removes* it...?

Thanks,

Stephen


signature.asc
Description: Digital signature


Re: [git patches] libata fixes

2006-12-20 Thread Stephen Frost
* Jeff Garzik ([EMAIL PROTECTED]) wrote:
> FWIW the Tejun cleanups are a fix, split into three reviewable pieces.
> 
> Also, my local iomap branch has advanced sufficiently enough that I
> think it's high time to kill those libata warnings that spew on every
> build.  (I hear the crowds roar)

Perhaps I missed it, but, when is PMP support going to be included in
main-stream?  Seems like it's been around for a while now and I doubt
I'm the only one extremely frustrated with having to hunt down an
external patch to get a new toy supported.  I'll probably bug the Debian
kernel folks next but it'd be nicer if it was in upstream.

Thanks!

Stephen
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to [EMAIL PROTECTED]
More majordomo info at  http://vger.kernel.org/majordomo-info.html
Please read the FAQ at  http://www.tux.org/lkml/


Re: NCQ support NVidia NForce4 (CK804) SATAII

2005-08-15 Thread Stephen Frost
* Lion Vollnhals ([EMAIL PROTECTED]) wrote:
> Am Sonntag, den 14.08.2005, 09:29 -0400 schrieb Willem Riede:
> > On Wed, 10 Aug 2005 20:54:35 +, Allen Martin wrote:
> > That is disappointing. I was seriously considering a motherboard with your
> > chipset because of its impressive specifications, but now I have to
> > reconsider (I'm one of those folks that never bought an nVidia graphics
> > board due to the lack of open full-function driver). I _hate_ not being
> > able to use all features.
> 
> I agree.
> 
> I am considering buying an nForce4 board, too.
> But i am discouraged by it's closed source nature.

I also agree and am rather disappointed by this news.  Unfortunately,
I've already bought an A8N-SLI.  I've been considering some nvidia-based
systems for work though and now plan to ask my vendor
(penguincomputing.com in this case) if this will be an issue with their
systems or not.

Thanks,

Stephen


signature.asc
Description: Digital signature


Re: 10 GB in Opteron machine

2005-07-22 Thread Stephen Frost
* Jakob Oestergaard ([EMAIL PROTECTED]) wrote:
> This is really the clever way to run a 64-bit system - 99% of what is
> commonly run on most systems only gains overhead from the 64-bit address
> space - tools like postfix, cron, syslog, apache, ... will not gain from
> being native 64-bit.

For most 64-bit systems, sure.  For amd64 it's a little different
because there are additional changes to the architecture (as compared to
ia32/x86) which can more than make up for the difference for many
applications.  Then there's also things like encryption (postfix/tls,
apache/ssl, etc) which can benefit greatly from better handling of
64bit (and larger) types.

So, basically, it's not nearly so clear-cut as you portray it. :)

> Solaris has done this for ages - maintaining a mostly 32-bit user space,
> a 64-bit kernel, and then allowing for certain memory intensive
> applications to run natively 64-bit.

The differences between a 64bit sparc chip in 32bit and 64bit are quite
a bit less than the differences between an amd64 chip in 32bit and
64bit.  Thus, this makes alot more sense for sparc.

> It's a nice way to run a Linux based system too, IMO.

Perhaps on sparc or mips; it's much less clear-cut on amd64.

Stephen


signature.asc
Description: Digital signature


Re: Kernel header policy

2005-07-11 Thread Stephen Frost
* Marc Aurele La France ([EMAIL PROTECTED]) wrote:
> To that end, I would propose, as a possible technical solution, extending
> the kernel build process to detect these errors during kernel development.

Well, couple stupid comments:

#1: I'm not *entirely* sure Linus reads every mail to lkml..  Dunno if
you also sent it to him directly, but if not then that might explain
the lack of response...

#2: Have you got a patch that does this?  If not, you might want to
write one up and just submit it..  If you're expecting someone else
to write it, well, that might also be why you've not had much
response... :)

Thanks,

Stephen


signature.asc
Description: Digital signature


Re: [PATCH] dynamic tick patch

2005-01-19 Thread Stephen Frost
* Benjamin Herrenschmidt ([EMAIL PROTECTED]) wrote:
> Hrm... reading more of the patch & Martin's previous work, I'm not sure
> I like the idea too much in the end... The main problem is that you are
> just "replaying" the ticks afterward, which I see as a problem for
> things like sched_clock() which returns the real current time, no ?
> 
> I'll toy a bit with my own implementation directly using Martin's work
> and see what kind of improvement I really get on ppc laptops.

I don't know if this is the same thing, or the same issue, but I've
noticed on my Windows machines that the longer my laptop sleeps the
longer it takes for it to wake back up- my guess is that it's doing
exactly this (replaying ticks).  It *really* sucks though because it can
take quite a while for it to come back if it's been asleep for a while.

Stephen


signature.asc
Description: Digital signature


Re: rsync hangs on RedHat 2.4.2 or stock 2.4.4

2001-06-12 Thread Stephen Frost

* David S. Miller ([EMAIL PROTECTED]) wrote:
> 
> Russell King writes:
>  > At the time I suggested it was because of a missing wakeup in 2.4.2 kernels,
>  > but I was shouted down for using 2.2.15pre13.  Since then I've seen these
>  > reports appear on lkml several times, each time without a solution nor
>  > explaination.
>  > 
>  > Oh, and yes, we're still using the same setup here at work, and its running
>  > fine now - no rsync lockups.  I'm not sure why that is. ;(
> 
> Look everyone, it was determined to be a deadlock because of some
> interaction between how rsync sets up it's communication channels
> with the ssh subprocess, readas: userland bug.
> 
> I don't remember if the specific problem was in rsync itself or some
> buggy version of ssh.  One can search the list archives to discover
> Alexey's full analysis of the problem.  I don't have a URL handy.

I have to say I find this likely to be the case for those who are
having issues with rsync over ssh.  I was recently playing with
rsync over ssh (newer openssh to older openssh) and was just using
it as a pass-through to another machine.

When I replaced ssh with rinetd, everything worked fine.  I havn't
had a chance yet (though I'd like to) to try with two recent versions
of ssh but I'm curious what the result will be.  It may be that the
problem has been fixed in later versions of ssh.

Stephen

 PGP signature


'spurious APIC interrupt' - Dell PowerEdge 1400

2001-03-26 Thread Stephen Frost

Running into a problem with one of our Dell PowerEdge 1400 servers.
We see these messages very rarely, but after they show up the machine
goes into a really odd state:

Mar 26 09:37:27 maul kernel: spurious APIC interrupt on CPU#1, should never happen.
Mar 26 09:37:27 maul kernel: unexpected IRQ vector 216 on CPU#1!

Basically things seem to only kinda work.  My guess is that possibly one of
the CPUs has been shut down or similar and so half the processes are getting
kind of 'stuck'.

Any thoughts?  dmesg follows, more information availible upon request,
thanks!


Stephen

---
===# dmesg
Linux version 2.2.18-raid-mosix (bma@black) (gcc version 2.95.2 2220 (Debian 
GNU/Linux)) #2 SMP Tue Jan 2 12:40:13 EST 2001
Intel MultiProcessor Specification v1.4
Virtual Wire compatibility mode.
OEM ID: DELL Product ID: POWEREDGE CE APIC at: 0xFEE0
Processor #1 Pentium(tm) Pro APIC version 17
Processor #0 Pentium(tm) Pro APIC version 17
I/O APIC #2 Version 17 at 0xFEC0.
I/O APIC #3 Version 17 at 0xFEC01000.
Processors: 2
mapped APIC to e000 (fee0)
mapped IOAPIC to d000 (fec0)
mapped IOAPIC to c000 (fec01000)
Detected 794719 kHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 1585.97 BogoMIPS
Memory: 1034844k/1048512k available (1512k kernel code, 424k reserved, 11676k data, 
56k init)
Dentry hash table entries: 131072 (order 8, 1024k)
Buffer cache hash table entries: 524288 (order 9, 2048k)
Page cache hash table entries: 262144 (order 8, 1024k)
256K L2 cache (8 way)
CPU: L2 Cache: 256K
Checking 386/387 coupling... OK, FPU using exception 16 error reporting.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
mtrr: v1.35a (19990819) Richard Gooch ([EMAIL PROTECTED])
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#1.
256K L2 cache (8 way)
CPU: L2 Cache: 256K
per-CPU timeslice cutoff: 50.03 usecs.
CPU1: Intel Pentium III (Coppermine) stepping 06
calibrating APIC timer ... 
. CPU clock speed is 794.6271 MHz.
. system bus clock speed is 132.4377 MHz.
Booting processor 0 eip 2000
Calibrating delay loop... 1585.97 BogoMIPS
Intel machine check reporting enabled on CPU#0.
256K L2 cache (8 way)
CPU: L2 Cache: 256K
OK.
CPU0: Intel Pentium III (Coppermine) stepping 06
Total of 2 processors activated (3171.94 BogoMIPS).
enabling symmetric IO mode... ...done.
ENABLING IO-APIC IRQs
...changing IO-APIC physical APIC ID to 2
...changing IO-APIC physical APIC ID to 3
init IO_APIC IRQs
 IO-APIC (apicid-pin) 2-0, 2-2, 2-5, 2-11, 2-13WARNING: ASSIGN_IRQ_VECTOR wrapped back 
to 52
, 3-13 not connected.
...trying to set up timer as ExtINT... .. (found pin 0) ... works.
number of MP IRQ sources: 39.
number of IO-APIC #2 registers: 16.
number of IO-APIC #3 registers: 16.
testing the IO APIC...

IO APIC #2..
 register #00: 0200
...: physical APIC id: 02
 register #01: 000F0011
... : max redirection entries: 000F
... : IO APIC version: 0011
 register #02: 
... : arbitration: 00
 IRQ redirection table:
 NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect:   
 00 001 01  000   0   00751
 01 000 00  000   0   01159
 02 000 00  100   0   00000
 03 000 00  000   0   01161
 04 000 00  000   0   01169
 05 000 00  100   0   00000
 06 000 00  000   0   01171
 07 000 00  000   0   01179
 08 000 00  000   0   01181
 09 000 00  000   0   01189
 0a 0FF 0F  110   1   01191
 0b 000 00  100   0   00000
 0c 000 00  000   0   01199
 0d 000 00  100   0   00000
 0e 000 00  000   0   011A1
 0f 000 00  000   0   011A9

IO APIC #3..
 register #00: 0300
...: physical APIC id: 03
 register #01: 000F0011
... : max redirection entries: 000F
... : IO APIC version: 0011
 register #02: 0E00
... : arbitration: 0E
 IRQ redirection table:
 NR Log Phy Mask Trig IRR Pol Stat Dest Deli Vect:   
 00 0FF 0F  110   1   011B1
 01 0FF 0F  110   1   011B9
 02 0FF 0F  110   1   011C1
 03 0FF 0F  110   1   011C9
 04 0FF 0F  110   1   011D1
 05 0FF 0F  110   1   011D9
 06 0FF 0F  110   1   011E1
 07 0FF 0F  110   1   011E9
 08 0FF 0F  110   1   011F1
 09 0FF 0F  110   1   011F9
 0a 0FF 0F  110   1   01152
 0b 0FF 0F  110   1   0115A
 0c 0FF 0F  110   1   01162
 0d 000 00  100   0   0

Re: Linux stifles innovation...

2001-02-15 Thread Stephen Frost

* fsnchzjr ([EMAIL PROTECTED]) wrote:
> Watch Microsoft's Jim Allchin go Linux-bashing!!!
> Nice little article on how we're all going to die of herpes from our
> repeated exposition to Linux...
> http://news.cnet.com/investor/news/newsitem/0-9900-1028-4825719-RHAT.html?tag=ltnc

Just remember, the tag is 'ltnc' --> 'long-time, no clue'.

Stephen

 PGP signature


Re: 2.4.x and SMP fails to compile (`current' undefined)

2001-01-30 Thread Stephen Frost

* David Ford ([EMAIL PROTECTED]) wrote:
> A person just brought up a problem in #kernelnewbies, building an SMP
> kernel doesn't work very well, current is undefined.  I don't have more
> time to debug it but I'll strip the config and put it up at
> http://stuph.org/smp-config

They're trying to compile SMP for Athlon/K7 (CONFIG_MK7=y).

Stephen

 PGP signature


Re: Updated zerocopy patches on kernel.org

2001-01-15 Thread Stephen Frost

* David S. Miller ([EMAIL PROTECTED]) wrote:
> 
> Now against 2.4.1-pre2:
> 
> ftp.kernel.org:/pub/linux/kernel/people/davem/zerocopy-2.4.1p2-1.diff.gz

Tried it with 2.4.1-pre3, didn't have any problem applying it, but
when I rebooted the system it pretty much had no interest in talking TCP
to anything.  2.4.1-pre3 alone has no problem.  Should I give 2.4.1p2 a try
with the zerocopy patch?  I think that's my next step, but if it isn't
likely to change anything..

The problem tended to be that a connection could reach the 
'ESTABLISHED' point (in netstat), but then very little data would pass over
the connection.  ie; 'telnet somehost 25' would give me the SMTP HELO
statement, but nothing I typed seemed to make it anywhere.  Inbound and
outbound ssh connections would reach 'ESTABLISHED', but then wouldn't make
it to the point of prompting me for a password.  Nothing apparent in any
log files or anything.

Info attached, more availible upon request.

Stephen


Linux version 2.4.1-pre2 (root@gw2-snowman) (gcc version 2.95.3 20010111 (prerelease)) 
#1 SMP Mon Jan 15 12:59:07 EST 2001
BIOS-provided physical RAM map:
 BIOS-e820: 0009fc00 @  (usable)
 BIOS-e820: 0400 @ 0009fc00 (reserved)
 BIOS-e820: 0001 @ 000f (reserved)
 BIOS-e820: 0feec000 @ 0010 (usable)
 BIOS-e820: 3000 @ 0ffec000 (ACPI data)
 BIOS-e820: 0001 @ 0ffef000 (reserved)
 BIOS-e820: 1000 @ 0000 (ACPI NVS)
 BIOS-e820: 0001 @  (reserved)
Scan SMP from c000 for 1024 bytes.
Scan SMP from c009fc00 for 1024 bytes.
Scan SMP from c00f for 65536 bytes.
Scan SMP from c009fc00 for 4096 bytes.
On node 0 totalpages: 65516
zone(0): 4096 pages.
zone(1): 61420 pages.
zone(2): 0 pages.
mapped APIC to e000 (01444000)
Kernel command line: auto BOOT_IMAGE=Linux ro root=301
Initializing CPU#0
Detected 655.851 MHz processor.
Console: colour VGA+ 80x25
Calibrating delay loop... 1307.44 BogoMIPS
Memory: 255292k/262064k available (1062k kernel code, 6384k reserved, 429k data, 200k 
init, 0k highmem)
Dentry-cache hash table entries: 32768 (order: 6, 262144 bytes)
Buffer-cache hash table entries: 16384 (order: 4, 65536 bytes)
Page-cache hash table entries: 65536 (order: 6, 262144 bytes)
Inode-cache hash table entries: 16384 (order: 5, 131072 bytes)
CPU: Before vendor init, caps: 0183f9ff c1c7f9ff , vendor = 2
CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
CPU: L2 Cache: 64K (64 bytes/line)
CPU: After vendor init, caps: 0183f9ff c1c7f9ff  
CPU: After generic, caps: 0183f9ff c1c7f9ff  
CPU: Common caps: 0183f9ff c1c7f9ff  
Enabling fast FPU save and restore... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
CPU: Before vendor init, caps: 0183f9ff c1c7f9ff , vendor = 2
CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
CPU: L2 Cache: 64K (64 bytes/line)
CPU: After vendor init, caps: 0183f9ff c1c7f9ff  
CPU: After generic, caps: 0183f9ff c1c7f9ff  
CPU: Common caps: 0183f9ff c1c7f9ff  
CPU0: AMD Athlon(tm) Processor stepping 00
per-CPU timeslice cutoff: 182.95 usecs.
SMP motherboard not detected. Using dummy APIC emulation.
Setting commenced=1, go go go
PCI: PCI BIOS revision 2.10 entry at 0xf1010, last bus=1
PCI: Using configuration type 1
PCI: Probing PCI hardware
Unknown bridge resource 0: assuming transparent
Unknown bridge resource 1: assuming transparent
Unknown bridge resource 2: assuming transparent
PCI: Using IRQ router VIA [1106/0686] at 00:04.0
isapnp: Scanning for Pnp cards...
isapnp: No Plug & Play device found
Linux NET4.0 for Linux 2.4
Based upon Swansea University Computer Society NET3.039
Initializing RT netlink socket
DMI 2.3 present.
49 structures occupying 1371 bytes.
DMI table at 0x000F27D0.
BIOS Vendor: Award Software, Inc.
BIOS Version: ASUS A7V ACPI BIOS Revision 1003
BIOS Release: 07/21/2000
System Vendor: System Manufacturer.
Product Name: System Name.
Version System Version.
Serial Number SYS-1234567890.
Board Vendor: ASUSTeK Computer INC..
Board Name: .
Board Version: REV 1.xx.
Asset Tag: Asset-1234567890.
Starting kswapd v1.8
pty: 256 Unix98 ptys configured
RAMDISK driver initialized: 16 RAM disks of 4096K size 1024 blocksize
loop: enabling 8 loop devices
Uniform Multi-Platform E-IDE driver Revision: 6.31
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
VP_IDE: IDE controller on PCI bus 00 dev 21
VP_IDE: chipset revision 16
VP_IDE: not 100% native mode: will probe irqs later
ide0: BM-DMA at 0xd800-0xd807, BIOS settings: hda:DMA, hdb:pio
ide1: BM-DMA at 0xd808-0xd80f, BIOS settings: hdc:pio, hdd:pio
PDC20265: IDE controller on PCI bus 00 dev 88
PCI: Found IRQ 10 for device 00:11.0
PDC20265: chipset revision 2
PDC202

Re: [PLEASE-TESTME] Zerocopy networking patch, 2.4.0-1

2001-01-09 Thread Stephen Frost

* Ingo Molnar ([EMAIL PROTECTED]) wrote:
> 
> On Tue, 9 Jan 2001, Stephen Frost wrote:
> 
> > Now, the interesting bit here is that the processes can grow to be
> > pretty large (200M+, up as high as 500M, higher if we let it ;) ) and what
> > happens with MOSIX is that entire processes get sent over the wire to
> > other machines for work.  MOSIX will also attempt to rebalance the load on
> > all of the machines in the cluster and whatnot so it can often be moving
> > processes back and forth.
> 
> then you'll love the zerocopy patch :-) Just use sendfile() or specify
> MSG_NOCOPY to sendmsg(), and you'll see effective memory-to-card
> DMA-and-checksumming on cards that support it.

Excellent, this patch certainly sounds interesting which is why
I've been following this discussion.  Once the MOSIX patch for 2.4 comes
out I think I'm going to tinker with this and see if I can get MOSIX to
use these methods.

> the discussion with Stephen is about various device-to-device schemes.
> (which Mosix i dont think wants to use. Mosix wants to use memory to
> device zero-copy, right?)

Yes, very much so actually now that I think about it.  Alot of
memory->device and device->memory work going on.  I was mainly replying
to the idea of clustering since that's what MOSIX is all about.


Stephen

 PGP signature


Re: [PLEASE-TESTME] Zerocopy networking patch, 2.4.0-1

2001-01-09 Thread Stephen Frost

* Ingo Molnar ([EMAIL PROTECTED]) wrote:
> 
> On Tue, 9 Jan 2001, Stephen C. Tweedie wrote:
> 
> > but it just doesn't apply when you look at some other applications,
> > such as streaming out video data or performing fileserving in a
> > high-performance compute cluster where you are serving bulk data.
> > The multimedia and HPC worlds typically operate on datasets which are
> > far too large to cache, so you want to keep them in memory as little
> > as possible when you ship them over the wire.
> 
> i'd love to first see these kinds of applications (under Linux) before
> designing for them. Eg. if an IO operation (eg. streaming video webcast)
> does a DMA from a camera card to an outgoing networking card, would it be
> possible to access the packet data in case of a TCP retransmit? Basically
> these applications are limited enough in scope to justify even temporary
> 'hacks' that enable them - and once we *see* things in action, we could
> design for them. Not the other way around.

Well, I know I for one use a system that you might have heard
of called 'MOSIX'.  It's a (kinda large) kernel patch with some user-space
tools but allows for migration of processes between machines without
modifying any code.  There are some limitations (threaded applications and
shared memory and whatnot) but it works very well for the rendering work
we use it for.  We use radiance which in general has pretty little inter-
process communication and what it has is done through the filesystem.

Now, the interesting bit here is that the processes can grow to be
pretty large (200M+, up as high as 500M, higher if we let it ;) ) and what
happens with MOSIX is that entire processes get sent over the wire to 
other machines for work.  MOSIX will also attempt to rebalance the load on
all of the machines in the cluster and whatnot so it can often be moving
processes back and forth.

So, anyhow, this is just an fyi if you weren't aware of it that I
believe more than a few people are using MOSIX these days for similar
appliactions and that it's availible at http://www.mosix.org if you're
curious.

Stephen

 PGP signature


Re: [PLEASE-TESTME] Zerocopy networking patch, 2.4.0-1

2001-01-08 Thread Stephen Frost

* Jes Sorensen ([EMAIL PROTECTED]) wrote:
> > "David" == David S Miller <[EMAIL PROTECTED]> writes:
> 
> I don't question Alexey's skills and I have no intentions of working
> against him. All I am asking is that someone lets me know if they make
> major changes to my code so I can keep track of whats happening. It is
> really hard to maintain code if you work on major changes while
> someone else branches off in a different direction without you
> knowing. It's simply a waste of everybody's time.

Perhaps you missed it, but I believe Dave's intent is for this to
only be a proof-of-concept idea at this time.  These changes are not 
currently up for inclusion into the mainstream kernel.  I can not think
that Dave would ever just step around a maintainer and submit a patch to
Linus for large changes.

If many people test these and things work out well for them 
then I'm sure Dave will go back to the maintainers with the code and 
the api and work with them to get it into the mainstream kernel.  
Soliciting ideas and suggestions on how to improve the api and the code 
paths in the drivers to handle this new method most effectively.

Stephen

 PGP signature


Re: test13-pre1 changelog

2000-12-15 Thread Stephen Frost

* Oliver Xymoron ([EMAIL PROTECTED]) wrote:
> On Thu, 14 Dec 2000, Linus Torvalds wrote:
> 
> > A 100ms delay sounds like some interrupt shut up or similar (and then
> > timer handling makes it limp along).
> 
> Possibly related datapoint: after several days of uptime, my
> 2.4.0-test10pre? machine went into some sort of slow mode after coming
> back from suspend (and doing an /etc/init.d/networking restart). Symptoms
> seemed to be extra second or so setting up a TCP connection. Ping, etc.,
> appeared to work just fine, no packet loss apparent, bandwidth looked good
> too. Sadly I had to do actual work that required zippy web access, so I
> rebooted rather than doing a thorough diagnostic. This is a VAIO with
> compiled in eepro100, no special networking options.

Actually, I figured out what it was and I feel kind of stupid, and
suprised.  I knew I should have tried rebooting before complaining.  It
turns out it actually was something in my firewall rules, it appears that
for every logged packet there is something along the lines of a 100ms
delay that gets added on.

Not sure if that is something that could be easily fixed or not, or
perhaps I'm doing something wrong, but that seems unlikely since all I
changed was if it jumped to the LOG chain or not.

Stephen

 PGP signature


Re: test13-pre1 changelog

2000-12-14 Thread Stephen Frost

* Linus Torvalds ([EMAIL PROTECTED]) wrote:
> 
> 
> On Thu, 14 Dec 2000, Stephen Frost wrote:
> > 
> > This go around I compiled everything into the kernel, actually.
> > If it would be useful I can compile them as modules reboot and then see
> > what happens...
> 
> Even when compiled into the kernel, you might just ifdown/ifup the device.
> That will re-initialize most of the driver state.

I'll give that a shot...  ifdown -a/ifup -a, no change in behaviour.

> Is this ppp over serial.c, or what?

There is a ppp connection, but the slowdown is on *all* interfaces,
of which there are a total of 4; eth0, eth1, eth2, ppp0.

Stephen

 PGP signature


Re: test13-pre1 changelog

2000-12-14 Thread Stephen Frost

* Linus Torvalds ([EMAIL PROTECTED]) wrote:
> 
> 
> On Thu, 14 Dec 2000, Stephen Frost wrote:
> > 
> > Any idea if these issues would cause a general slow-down of a
> > machine?  For no apparent reason after 5 days running 2.4.0test12
> > everything going through my firewall (set up using iptables) I got about
> > 100ms time added on to pings and traceroutes.
> 
> Probably not related to that particular bug - the netfilter issue has
> apparently been around forever, and it was just some changes in IP
> fragmentation that just made it show up as an oops. 
> 
> A 100ms delay sounds like some interrupt shut up or similar (and then
> timer handling makes it limp along).

Hmm, it's happening on all interfaces.  No oops or anything in
the logs/dmesg.  I can check console when I get home, but I doubt there's
anything of interest.  All cards are 3com 3c905's.

Does this info help any?

===# cat /proc/interrupts
   CPU0   CPU1   
  0:   29170703   23315160IO-APIC-edge  timer
  1:  2  0IO-APIC-edge  keyboard
  2:  0  0  XT-PIC  cascade
  3: 258815 247131IO-APIC-edge  serial
  4:101120IO-APIC-edge  serial
  5:17480961692143   IO-APIC-level  usb-uhci, eth0
  8:  1  0IO-APIC-edge  rtc
 10:11992271146776   IO-APIC-level  eth2
 12:23672392389531   IO-APIC-level  eth1
 14: 210804 193050IO-APIC-edge  ide0
 15:   7052   6391IO-APIC-edge  ide1
NMI:   52509191   52509191 
LOC:   52472090   52472489 
ERR:  0
===# sleep 10
===# cat /proc/interrupts
   CPU0   CPU1   
  0:   29171536   23315741IO-APIC-edge  timer
  1:  2  0IO-APIC-edge  keyboard
  2:  0  0  XT-PIC  cascade
  3: 258818 247134IO-APIC-edge  serial
  4:101120IO-APIC-edge  serial
  5:17482951692372   IO-APIC-level  usb-uhci, eth0
  8:  1  0IO-APIC-edge  rtc
 10:11992301146777   IO-APIC-level  eth2
 12:23672442389534   IO-APIC-level  eth1
 14: 210833 193050IO-APIC-edge  ide0
 15:   7052   6391IO-APIC-edge  ide1
NMI:   52510605   52510605 
LOC:   52473504   52473902 
ERR:  0
===# 

Boot log:

Linux version 2.4.0-test12 (root@whitegryphon) (gcc version 2.95.2 2220 (Debian 
GNU/Linux)) #1 SMP Wed Dec 6 01:53:29 EST 2000
BIOS-provided physical RAM map:
 BIOS-e820: 000a @  (usable)
 BIOS-e820: 0001 @ 000f (reserved)
 BIOS-e820: 0fefd000 @ 0010 (usable)
 BIOS-e820: 2000 @ 0fffd000 (ACPI data)
 BIOS-e820: 1000 @ 0000 (ACPI NVS)
 BIOS-e820: 1000 @ fec0 (reserved)
 BIOS-e820: 1000 @ fee0 (reserved)
 BIOS-e820: 0001 @  (reserved)
Scan SMP from c000 for 1024 bytes.
Scan SMP from c009fc00 for 1024 bytes.
Scan SMP from c00f for 65536 bytes.
found SMP MP-table at 000f6e80
hm, page 000f6000 reserved twice.
hm, page 000f7000 reserved twice.
hm, page 000f6000 reserved twice.
hm, page 000f7000 reserved twice.
On node 0 totalpages: 65533
zone(0): 4096 pages.
zone(1): 61437 pages.
zone(2): 0 pages.
Intel MultiProcessor Specification v1.1
Virtual Wire compatibility mode.
OEM ID: OEM0 Product ID: PROD APIC at: 0xFEE0
Processor #1 Pentium(tm) Pro APIC version 17
Floating point unit present.
Machine Exception supported.
64 bit compare & exchange supported.
Internal APIC present.
SEP present.
MTRR  present.
PGE  present.
MCA  present.
CMOV  present.
PAT  present.
PSE  present.
MMX  present.
FXSR  present.
Bootup CPU
Processor #0 Pentium(tm) Pro APIC version 17
Floating point unit present.
Machine Exception supported.
64 bit compare & exchange supported.
Internal APIC present.
SEP present.
MTRR  present.
PGE  present.
MCA  present.
CMOV  present.
PAT  present.
PSE  present.
MMX  present.
FXSR  present.
Bus #0 is PCI   
Bus #1 is ISA   
I/O APIC #2 Version 17 at 0xFEC0.
Int: type 3, pol 0, trig 0, bus 1, IRQ 00, APIC ID 2, APIC INT 00
Int: type 0, pol 0, trig 0, bus 1, IRQ 01, APIC ID 2, APIC INT 01
Int: type 0, pol 0, trig 0, bus 1, IRQ 00, APIC ID 2, APIC INT 02
Int: type 0, pol 0, trig 0, bus 1, IRQ 03, APIC ID 2, APIC INT 03
Int: type 0, pol 0, trig 0, bus 1, IRQ 04, APIC ID 2, APIC INT 04
Int: type 0, pol 0, trig 0, bus 1, IRQ 06, APIC ID 2, APIC INT 06
Int: type 0, pol 0, trig 0, bus 1, IRQ 07, APIC ID 2, APIC INT 07
Int: type 0, pol 0, trig 0, bus 1, IRQ 08, APIC ID 2, APIC INT 08
Int: type 0, pol 0, trig 0, bus 1, IRQ 09, APIC ID 2, APIC INT 09
Int: type 0, pol 0, trig 0, bus 1, IRQ 0e, APIC ID 2, APIC INT 0e

Re: test13-pre1 changelog

2000-12-14 Thread Stephen Frost

* Alan Cox ([EMAIL PROTECTED]) wrote:
> > machine?  For no apparent reason after 5 days running 2.4.0test12
> > everything going through my firewall (set up using iptables) I got about
> > 100ms time added on to pings and traceroutes.  I'll probably reboot the
> > machine tonight and see if that helps.
> 
> Before you do that can you see if ifconfig down, rmmod, insmod, ifconfig up
> fixes it. 

This go around I compiled everything into the kernel, actually.
If it would be useful I can compile them as modules reboot and then see
what happens...

===# cat /proc/modules
ppp_deflate39200   0 (autoclean)
bsd_comp4160   0 (autoclean)
ppp_async   6512   1 (autoclean)
ppp_generic15232   3 (autoclean) [ppp_deflate bsd_comp ppp_async]
slhc4528   0 (autoclean) [ppp_generic]
===#

I can say that cleaning out all my firewall rules and adding them
back didn't change behaviour any.  Also, I'm sure that this was not happening
until today or maybe yesterday.  Earlier in the week the machine was doing
fine and I was getting reasonable response times.  Now, out *every* interface,
I get something close to 100ms additional time.  Also of note, traceroutes
appear to be more lagged than pings, for what that's worth (traceroute using
udp, ping using icmp, dunno if it makes a difference).

Stephen

 PGP signature


Re: test13-pre1 changelog

2000-12-14 Thread Stephen Frost

* Linus Torvalds ([EMAIL PROTECTED]) wrote:
> 
> Especially if we get that netfilter problem sorted out (see the other
> thread about the IP fragmentation issues associated with that one), and
> if we figure out why apparently some people have trouble with external
> modules (at least one person has trouble with loading alsa modules). 

Any idea if these issues would cause a general slow-down of a
machine?  For no apparent reason after 5 days running 2.4.0test12
everything going through my firewall (set up using iptables) I got about
100ms time added on to pings and traceroutes.  I'll probably reboot the
machine tonight and see if that helps.

Stephen

 PGP signature


Re: 2.2.17 & Eepro(10)

2000-11-30 Thread Stephen Frost

* Jeroen Geusebroek ([EMAIL PROTECTED]) wrote:
> 
> I'm having troubles with the eepro driver included in kernel 2.2.17.
> It stops sometimes with no apparent reason. The one thing i noticed
> is that it seems to have a lot of carrier problems(998!)
> 
> This is part of the result from ifconfig:
> 
> eth1  Link encap:Ethernet  HWaddr 00:AA:00:A6:05:01  
>   inet addr:24.132.xx.xxx  Bcast:24.132.xx.xxx  Mask:255.255.254.0
>   UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
>   RX packets:248714 errors:0 dropped:0 overruns:0 frame:0
>   TX packets:64711 errors:1925 dropped:0 overruns:0 carrier:998
>   collisions:832 txqueuelen:100 
>   Interrupt:10 Base address:0x230

I have similar problems, though I don't have any carrier problems:

eth0  Link encap:Ethernet  HWaddr 00:A0:C9:66:12:9B  
  inet addr:xx.xx.xx.xx  Bcast:xx.xx.xx.xx  Mask:255.255.255.0
  UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
  RX packets:214835 errors:0 dropped:0 overruns:0 frame:0
  TX packets:33050 errors:0 dropped:0 overruns:0 carrier:0
  collisions:0 txqueuelen:100 
  Interrupt:9 Base address:0xe0e0 

Seems to just drop out when there hasn't been any activity for
a while for me though.  Give it about 10-15 seconds and it comes back.

> Needless to say i didn't have this problem with previous kernels
> (including 2.2.16).

Linux junior 2.2.17-raid #1 Wed Nov 8 07:48:57 EST 2000 i686 unknown

I didn't run this machine very long w/ 2.2.16 so I don't really
know if previous versions had the same problem.  If I have some time,
perhaps over the weekend, I'll try and find out.

Stephen

 PGP signature


Re: 32-bit pid_t / security

2000-10-01 Thread Stephen Frost

* Andries Brouwer ([EMAIL PROTECTED]) wrote:
> So, in the long run we want a large pid_t. What about the short run?
> For today the disadvantages are negligeable, and for people who
> like security there are definite advantages.

Much more the problem is giving people the *impression* of
actual security advantage.

> David, I already said the same to someone else:
> Security is not a yes/no matter. It is a matter of less or more.
> Thus, "Hoping for security" is meaningless.
> But "Hoping for more security by having more PID's" is quite
> reasonable. If I am local user on your system then I can break in
> using a wraparound. If that takes 2147483647 processes I have to
> wait longer than when that takes 32000 processes.

Yes, it does, but it doesn't increase security signifigantly.
Trying to claim it does just seems completely nutty.  There are
certainly other reasons for a larger pid, and personally I would find
it reasonable for it to be a config option, but if someone came up to
me and said I should be using 32bit pids because it's more secure than
15bit pids I'd laugh at them.

So, how about a patch that adds it in as a config option with
an appropriate default?  My personal feeling is that the default should
be 15bit because it's what is currently used and has no chance of
breaking anything.  Admittedly, nothing *should* break w/ 32bit pids,
but then reality steps in.  Would be nice to see what does break.

Stephen

 PGP signature


Re: Linux kernel modules development in C++

2000-09-28 Thread Stephen Frost

* Igmar Palsenberg ([EMAIL PROTECTED]) wrote:
> 
> Tell my teacher it's a good idea, he is telling otherwise :)

Academics and reality don't tend to equate. :)  Something to do with the
world not exactly being perfect.  The reality is, if you hadn't guessed, Linux
is doing rather well. :)

Stephen

 PGP signature


Re: eepro100 trouble

2000-09-06 Thread Stephen Frost

* Admin Mailing Lists ([EMAIL PROTECTED]) wrote:
> On Tue, 5 Sep 2000, Andrey Savochkin wrote:
> 
> > On Sun, Sep 03, 2000 at 02:57:54PM -0300, Cesar Eduardo Barros wrote:
> > > 
> > > I'm having endless problem with an eepro100 here. After some trying found out
> > > that doing a soft reset (ctrl+alt+del) fixed the problem, and that a power
> > > cycle made it happen again.
> > > 
> > > Kernel version is 2.2.17pre20
> > 
> > The problem you faced is a known one.
> > But I expected 2.2.17pre20 to work, it contains a work-around which helped
> > all other people complaining about the same things.
> 
> is it fixed in 2.2.17 final or any of the 2.2.18 releases? My eepro100 has
> been working fine (i'm in 2.2.15pre18) and i'm planning to upgarde to
> 2.2.17. Wondering if it's a longstanding problem or a NEW one.

In 2.2.17pre20 I started running into *really* annoying issues w/ an
eepro100, I'm going to go back to 2.2.16 and see if they clear up.  Basically
things were constantly seeming to stall for me.  Not everything though, it
seemed like larger things would whereas smaller things wouldn't, but only
sometimes. :)

Examples:
dmesg consistantly would stall
ps awux had no real issues (small process list)
top worked okay
switching between windows under screen quickly would stall it
grabbing kernel source wasn't a problem
scp'ing a couple >1M files wasn't an issue

Very odd.  Never had any problems w/ eepro100's under 2.2.16
or earlier.

Stephen

 PGP signature