In the spring I encountered a problem (system freeze) with the
LM 8.2 enterprise kernel. The problem never happened with the
standard stock kernel of LM 8.2. That it was actually a highmem
problem was verified by compiling the kernel with the standard
options + the highmem option. The problem was solved by using
the stock enterprise kernel from LM 8.1.

>From a discussion on the kernel list that I happened to find
it looks as if the Red Hat 7.3 kernel had a similar problem
for memory above 896MB. This has now been solved in the
Red Hat 7.3 kernel for memory below 4Gb.
But it is not clear to me if RedHat's 7.3 kernel is based
on Rik van Riel's VM or Andrea's VM.
There seems to be some confusion, even on the kernel mailing
list, so could somebody, please, explain the situation for the
various Mandrake kernels. Has the problem been solved in
the latest enterprise kernel in the cooker? Will it be solved
in the new 2.4.19 kernel?

To illustrate the situation, let me finish with a citation.

Martin J. Bligh ([EMAIL PROTECTED]):
>There are definitely good things in both trees for this problem area at
>the moment. If you're interested in fixing this Alan and Andrea, let's
start a
>mergefest. I'm sure I can volunteer some IBM resources to help port
patches,>and test the hell out of it .... if you're willing to consider
taking things,
>I'll drawup a list of what the issues are, what patches are available,
>and which trees they reside in (often none).

So, where are the Mandrake kernels in this discussion?

For easy reference I have attached an abreviated version of the
discussion on the kernel mailing list.

Dr Bjarne Thomsen

Department of Physics and Astronomy
University of Aarhus
Denmark


------------------------------------------------------------------------------
A few months ago, there was a flurry of reports from people having
difficulties with memory management on large machines (ia32 over 4 GB). I've
seen a lot of 2.4.x-yy kernels go by and much VM discussion, but what I'm
*not* seeing is reports of either catastrophic behavior or its absence on
large machines. I haven't had a chance to run my own test cases on the
2.4.18 kernel from Red Hat 7.3 yet, so I can't make any personal
contribution to this discussion.

--
M. Edward Borasky
------------------------------------------------------------------------------
RedHat has fixed the problem in its kernels. There are fixes out there, but
Linus is not applying them. I would venture that this is because they would
fix the problems *for the moment* and take away interest in revamping VM for
real.

It might help if Linus would actually state his intentions. So far the
problem has been that the AA vm was badly documented and a big chunk of
patches. Andrew Morton split them up nicely and documented each patch, so
that is resolved.

Regards,

bert
------------------------------------------------------------------------------
7.3 has some of what is needed but not all. To go past 16Gb you need highmem
mapped page tables which I'm pretty sure did not make it in.

Alan Cox
------------------------------------------------------------------------------
> 7.3 has some of what is needed but not all.

Can you outline the changes in this area? I want to make sure we're
not all fighting the same problems seperately ;-) I know bounce
buffers is one large element of that, though I believe you still
only go up to 4Gb, unless I'm mistaken?

> To go past 16Gb you need highmem mapped page tables which I'm
> pretty sure did not make it in.

You need it earlier than that if you have many large tasks (4Gb or so).

Martin J. Bligh
------------------------------------------------------------------------------
> Can you outline the changes in this area? I want to make sure we're
> not all fighting the same problems seperately ;-) I know bounce
> buffers is one large element of that, though I believe you still
> only go up to 4Gb, unless I'm mistaken?

Bounce buffer handling

> You need it earlier than that if you have many large tasks (4Gb or so).

That I can believe

Alan Cox
------------------------------------------------------------------------------
> Can you outline the changes in this area? I want to make sure we're
> not all fighting the same problems seperately ;-) I know bounce
> buffers is one large element of that, though I believe you still
> only go up to 4Gb, unless I'm mistaken?

Yes, it only goes up to 4Gb. It's because of the error handling code in
the SCSI mid-layer and above, it fails to properly handle the >4gb sg
entries on error conditions. I'm working on that now and should have it
fixed soon.

  Doug Ledford
------------------------------------------------------------------------------
> ... Linus is not applying them

Shouldn't that be "Marcelo is not applying them"? Linus has devolved
all responsibility for 2.4 now, and is concentrating on the 2.5 series
and all its radical changes.

Marcelo objected to Andrea's mega-patch, but if I recall, he hinted that
me might start merging the split-up patches for 2.4.20 - in the
meantime, you can always apply the latest -aa patch yourself to a
2.4.19-pre kernel. Otherwise, the Red Hat patched kernel (which I
believe still doesn't use Andrea's VM at all) ought to work well, with
all their spiffy regression testing etc....

Cheers

Alastair Stevens
------------------------------------------------------------------------------
> 2.4.19-pre kernel. Otherwise, the Red Hat patched kernel (which I
> believe still doesn't use Andrea's VM at all) ought to work well, with
> all their spiffy regression testing etc....

The Red Hat 7.3 kernel uses Rik van Riel's rmap and Andre Hedricks IDE
updates. It did indeed pass our stress testing and seems to perform very
well under memory contention and high shared page counts - the classic
desktop/developer set up.

Alan
------------------------------------------------------------------------------
The catastrophic failures are still happening, in fact, the last
lse-tech conference call a week or two ago was dedicated at least in
part to them. The number of different ways in which these failures
occur is large, so it's taking a while for the iterations of whack-a-mole
game to converge to kernel stability. Andrea has probably been doing the
most visible stuff on this front with the recent bh/inode exhaustion
patches, with due credit to akpm as well for the unconditional bh
stripping patch.

Cheers,
Bill (William Lee Irwin III)
------------------------------------------------------------------------------
> I wouldn't bother using RedHat's kernel for this at the moment,
> Andrea's tree is where the development work for this area has all
> been happening recently. He's working on integrating O(1) sched
> right now, which will get rid of the biggest issue I have with -aa
> at the moment (the issue being that I'm too busy to merge it ;-)).

Oh, of course, I left of the bounce buffer issue, which RedHat *have*
fixed in their tree, I believe. Not sure what the status of this work
is for -aa at the moment.

Martin J. Bligh
------------------------------------------------------------------------------
> I wouldn't bother using RedHat's kernel for this at the moment,
> Andrea's tree is where the development work for this area has all
> been happening recently. He's working on integrating O(1) sched
> right now, which will get rid of the biggest issue I have with -aa

Still ? Its been in the Red Hat 7.3 tree since we released it. Its also
in the -ac tree all nicely merged. I guess your definition of happening
is my definition of "happened" 8)

Alan Cox
------------------------------------------------------------------------------
>Still ? Its been in the Red Hat 7.3 tree since we released it. Its also
>in the -ac tree all nicely merged. I guess your definition of happening
>is my definition of "happened" 8)
>

Huh? RH 7.3 kernel has the O(1) scheduler merged?

If the RH kernel is anything like the 2.4.19-pre8-ac5
I'm currently running, that is sweet indeed.

Joe
------------------------------------------------------------------------------
> Huh? RH 7.3 kernel has the O(1) scheduler merged?
>
> If the RH kernel is anything like the 2.4.19-pre8-ac5
> I'm currently running, that is sweet indeed.

rpm -q --changelog |grep "O(1)"

Alan Cox
------------------------------------------------------------------------------
>
>rpm -q --changelog |grep "O(1)"
>
>
>
Yes, I was being lazy - but now that finals
are done with for now, I'll grab that kernel
source rpm and have a proper look at it.

Thanks,

Joe
------------------------------------------------------------------------------
> Still ? Its been in the Red Hat 7.3 tree since we released it. Its also
> in the -ac tree all nicely merged. I guess your definition of happening
> is my definition of "happened" 8)

There are definitely good things in both trees for this problem area at
the moment. If you're interested in fixing this Alan and Andrea, let's start a
mergefest. I'm sure I can volunteer some IBM resources to help port patches,
and test the hell out of it .... if you're willing to consider taking things, I'll drawup a list of what the issues are, what patches are available, and which
trees they reside in (often none ;-( )

If my spies are correct, 7.3AS kernel is still based off the old 2.4.9 VM, with
no rmap at present ... correct? I presume 7.3 is 2.4.18 or so VM with rmap?

Martin J. Bligh ([EMAIL PROTECTED])
------------------------------------------------------------------------------
> If my spies are correct, 7.3AS kernel is still based off the old
> 2.4.9 VM, with no rmap at present ... correct? I presume 7.3 is
> 2.4.18 or so VM with rmap?

<Red Hat Marketing>There is no such product as 7.3AS</Red Hat Marketing> ;)

The AS 2.1 kernel is 2.4.9 based for enterprise stability with the pre
Linus being hit by cosmic rays VM fixed and tuned for enterprise workloads.

7.3 is the rmap VM

Alan
------------------------------------------------------------------------------

Want to buy your Pack or Services from MandrakeSoft? 
Go to http://www.mandrakestore.com

Reply via email to