Re: Improving Keeps and Breaks

Andreas L Delmelle Thu, 18 Oct 2007 14:51:59 -0700

On Oct 18, 2007, at 19:23, Vincent Hennebert wrote:

<snip />
OTOH, the above is semantically equivalent to (I think we had already
established that there should not be a double page-break here)
<fo:block break-before="page">
  <fo:block>
    <fo:block>

If the LMs would be guaranteed to receive the 'normalized' form, the
break-condition can be tested for internally by the outer LM itself. No
need to look forward or back... The first descendants wouldn't even need
to check for breaks anymore.
I think I see your point. Basically you’re proposing a push method
(an LM notifies its parent LM that it has a break-before) while mine is a pull
method (an LM asks its children LMs if they have break-before).

Yep, although it would not be the LM but rather the FO that pushes the break-before upwards to its parent if it is also the first child. The LMs would largely continue to work as they do now, except that under a certain set of conditions, they don't need to check the outside anymore: each LM only takes into account the forced break on its own FO. If there is none, then there is no need to recursively check for first descendants having forced breaks.
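To make the push idea concrete, here is a minimal sketch, in Python for brevity (FOP itself is Java). The FONode class and its methods are hypothetical stand-ins, not FOP's actual API, and the sketch assumes a single kind of forced break, so a break that reaches a parent already carrying one is simply dropped (no double page-break).

```python
class FONode:
    """Hypothetical stand-in for an FO tree node (not FOP's real class)."""

    def __init__(self, name, break_before=None):
        self.name = name
        self.break_before = break_before  # e.g. "page", or None for no forced break
        self.parent = None
        self.children = []

    def add_child(self, child):
        child.parent = self
        self.children.append(child)
        # Push any break already known on the child up the first-child chain.
        child._push_break_up()

    def _push_break_up(self):
        node = self
        while (node.break_before is not None
               and node.parent is not None
               and node.parent.children[0] is node):
            parent = node.parent
            if parent.break_before is None:
                parent.break_before = node.break_before
            node.break_before = None  # the break now lives on the ancestor only
            node = parent


# Usage: a break on the innermost first child ends up on the outer block.
outer, middle = FONode("outer"), FONode("middle")
inner = FONode("inner", break_before="page")
outer.add_child(middle)
middle.add_child(inner)

# A break on a non-first child stays where it is.
second = FONode("second", break_before="page")
outer.add_child(second)
```

With this in place, an LM for the outer block would only have to look at outer.break_before; it would not need to ask its descendants.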

Currently (sorry if it becomes boring to stress this) the construction of the layout-tree starts only when the end-of-page-sequence event occurs. I still see room for changing this in the future, and so I need to consider the effects on the layout-algorithm as well: the algorithm will, for instance, no longer be able to rely on *all* childLMs being available the first time it enters the loop... The last childLM in an iteration might turn out to be not-the-last-one-after-all. For many following FONodes, the LMs do not exist yet at that point. Not in my head, at least. ;-)

You’re more at the FO tree building stage, I’m more at the layout stage. In
terms of efficiency I think both methods are equivalent as the same
amount of method calls will be performed in either way.

Right, but OTOH... it's more a matter of /when/ (in the process) that happens.

The push method might be slightly more complicated to implement in
special cases like tables: when an fo:table-cell notifies its parent
fo:table-body that it has a break-before, the table-body must figure out
if the cell lies in the first row or not.

Almost everything is /slightly/ more complicated in case of fo:tables, especially those without explicit fo:table-rows or -columns. ;-)

Anyway, I remember that when I implemented implicit column-numbers, I also gave TableBody an instance member to check whether we are adding cells in the first row or not, so this particular case would be easily addressed. (Checking... yep, it's still there.)

Come to think of tables, I'd consider 'propagation' in terms of pushing a forced break on a cell to the first cell in the row. In the table-layout code, at the point where we have a reference to the row or the first cell in a row, we would immediately know whether there is a forced break on a first descendant in any of the following sibling cells without having to request the corresponding childLMs and trigger a tree-traversal of who-knows-how-many levels.
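A rough sketch of that row-level propagation, again in Python rather than FOP's Java. TableCell, TableRow and the row_break_before attribute are hypothetical names invented for the illustration; the sketch also assumes that a break on a cell's first descendant has already been pushed up to the cell itself.

```python
class TableCell:
    """Hypothetical cell; break_before is assumed to be already pushed up
    from the cell's first descendant, if any."""

    def __init__(self, column, break_before=None):
        self.column = column
        self.break_before = break_before
        self.row_break_before = None  # only meaningful on the first cell of a row


class TableRow:
    """Hypothetical row that records forced breaks on its first cell."""

    def __init__(self):
        self.cells = []

    def add_cell(self, cell):
        self.cells.append(cell)
        first = self.cells[0]
        # Push the forced break sideways to the first cell in the row.
        if cell.break_before is not None and first.row_break_before is None:
            first.row_break_before = cell.break_before

    def has_forced_break(self):
        # One attribute lookup, no traversal of the sibling cells' subtrees.
        return bool(self.cells) and self.cells[0].row_break_before is not None


row = TableRow()
row.add_cell(TableCell(1))
row.add_cell(TableCell(2))
row.add_cell(TableCell(3, break_before="page"))

plain_row = TableRow()
plain_row.add_cell(TableCell(1))
```

The table-layout code holding a reference to the row (or its first cell) can then answer the question without requesting any child LMs.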

Keeping in mind the above-mentioned idea of triggering layout sooner, if we can guarantee that the layoutengine always receives complete rows, then the table-layout job should become a bit simpler in the general use-case, while still not adding much complexity in trickier, more exotic cases, like:

//table-cell/block[position() > [EMAIL PROTECTED]'page']

especially where the cell's column-number corresponds to the highest column-number.

Triggering layout sooner is the only way we are ever going to get FOP to accept arbitrarily large tables, without consuming massive amounts of heap. A 'simple' grid of 5 x 500 cells generates +5000 FONodes (table-cells must have at least one block each) that stay in memory until the page-sequence is completely finished. I wonder how many break-possibilities that generates... :/

A matter of taste, probably, but I think I’d prefer the pull method: the
LM performs requests to the appropriate children LMs exactly when and if
needed.

The only thing an LM should initially pull/request from its children, AFAIU, is a list of elements, given a certain LayoutContext. When composing its own element list, an LM should ideally be able to rely on the lists it receives from its children. Then add/delete/update elements and (un)wrap, depending on context that is unknown or irrelevant to the child.
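As an illustration only, the division of labour could look like the Python sketch below. LeafLM, StackingLM and get_next_elements are hypothetical names (not FOP's method signatures), and elements are reduced to plain tuples. The parent takes the child lists as they come and only adds what the children cannot know about, here a zero penalty marking a legal break between two children.

```python
class LeafLM:
    """Hypothetical child LM that produces one box per line height."""

    def __init__(self, heights):
        self.heights = heights

    def get_next_elements(self, context):
        return [("box", h) for h in self.heights]


class StackingLM:
    """Hypothetical parent LM: pulls element lists and composes its own."""

    def __init__(self, children):
        self.children = children

    def get_next_elements(self, context):
        elements = []
        for child in self.children:
            child_elements = child.get_next_elements(context)
            # The parent relies on the child's list as-is and only adds
            # elements for context the child does not know about.
            if elements and child_elements:
                elements.append(("penalty", 0))
            elements.extend(child_elements)
        return elements


parent = StackingLM([LeafLM([10, 12]), LeafLM([8])])
result = parent.get_next_elements(context={})
```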

That may simplify code as well (and improve its readability) as
some form of pull method is necessary anyway (the
mustKeepWithPrevious/WithNext/Together methods).

Keeps are a different story indeed. Big difference is that keeps have strengths, and breaks do not.


Consider:

<fo:block id="b1">
  ...
  <fo:block id="b2">
    <fo:block id="b3" keep-with-previous.within-page="...">
      <fo:block id="b4">
        <fo:block id="b5" break-before="page">

This may be a matter of interpretation: you cannot specify a 'strength' for a break. It is either there or not. I take this to mean that a forced break overrules any keep.

Main advantage to the layoutengine would be that forced breaks are known as early as possible: the break is either there, on the FO, when the LM is initialized -- propagated upwards from a first child, maybe seven or eight levels down -- or it is not. The above can be normalized at parse time, with only a marginal cost, so that the break is propagated upwards to block b2, and the keep is suppressed before any LM is even created.
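For the b1..b5 example, such a normalization pass might be sketched as follows (Python for brevity, not FOP code). The Block class and normalize function are hypothetical, "always" stands in for the keep value elided as "..." in the example, the placeholder block b0 stands in for the content preceding b2, and the rule that a forced break overrules any keep is the interpretation stated above, not something the sketch proves.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class Block:
    """Hypothetical fo:block with only the two properties discussed here."""
    id: str
    break_before: Optional[str] = None
    keep_with_previous: Optional[str] = None
    children: List["Block"] = field(default_factory=list)


def normalize(block: Block) -> None:
    # Post-order: normalize the descendants first.
    for child in block.children:
        normalize(child)
    # Hoist a forced break from the first child onto this block.
    if block.children and block.children[0].break_before is not None:
        first = block.children[0]
        if block.break_before is None:
            block.break_before = first.break_before
        first.break_before = None
    # Assumed rule: a forced break overrules a keep on the same block.
    if block.break_before is not None:
        block.keep_with_previous = None


b5 = Block("b5", break_before="page")
b4 = Block("b4", children=[b5])
b3 = Block("b3", keep_with_previous="always", children=[b4])  # value is a placeholder
b2 = Block("b2", children=[b3])
b1 = Block("b1", children=[Block("b0"), b2])  # b0: placeholder for the preceding content

normalize(b1)
```

After the pass, the break sits on b2 only, b3 no longer carries its keep, and b1 is untouched because b2 is not its first child.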

I believe you already mentioned this idea of normalizing/simplifying the
FO tree in the past. Note that it may exist in parallel as it addresses
a different general issue. One concern I’d have is to make sure that
a simplification leads to a semantically equivalent result.

That is precisely the purpose of normalization: to remove ambiguities at a point where it is still relatively simple. Ambiguities that would otherwise cause a significant amount of checks or tree- or list-traversals later on to get every possible scenario right. (FWIW: XEP also normalizes the input FO, but there it happens by means of an XSLT; IIRC, they normalize tables to always have columns and rows, for example; implicit column-numbers can also quite easily be computed/assigned as part of an XSL Transform)

Given the complexity of the spec that might be difficult to establish. Not sure
also if the overhead is compensated by the gain in the further processes
(layout, area tree generation). But that’s a different topic.

The key advantage in the longer term is that the start of those further processes can be triggered sooner, without adding too much complexity to the related source code.

Agreed with the concerns, but I'm wondering if these portions of code,
instead of extracting them into a separate class, could be centralized
in, say, BlockStackingLM and InlineStackingLM...?
I thought of that, but a separate class looked cleaner to me for some
reasons:
- the LMs classes are already overcrowded with many different concerns


True.

- the code would be about the same for Block- and InlineStackingLM
- we could factorize it into a common super-class



AbstractStackingLM...?

I kind of like the idea. For the really shared portions, AbstractStackingLM could then implement a set of static methods.

but both those classes
  have subclasses to which breaks don’t apply (Flow-, StaticContentLM,
  for example).

I wouldn't really see this as a problem. The related methods will never be called, unless there is a flaw in our logic[*]. To stress the fact that they serve no purpose there, we could add overrides that always return false.


[*] (They won't be called, precisely because breaks don't apply?)
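A sketch of how that could be laid out, in Python for brevity instead of FOP's Java. AbstractStackingLM does not exist yet, so everything here is hypothetical, including the has_forced_break and must_break_before names and the use of a plain dict for the FO's properties. The only fact borrowed from XSL-FO is that break-before defaults to "auto", meaning no forced break.

```python
class AbstractStackingLM:
    """Hypothetical common super-class holding the shared break logic."""

    def __init__(self, fo):
        self.fo = fo  # assumption: the FO's properties as a plain dict

    @staticmethod
    def has_forced_break(fo):
        # Shared portion: any break-before value other than "auto" is a forced break.
        return fo.get("break-before", "auto") != "auto"

    def must_break_before(self):
        return AbstractStackingLM.has_forced_break(self.fo)


class BlockStackingLM(AbstractStackingLM):
    pass


class InlineStackingLM(AbstractStackingLM):
    pass


class FlowLM(BlockStackingLM):
    def must_break_before(self):
        # Breaks don't apply here; the override only makes that explicit.
        return False


class StaticContentLM(BlockStackingLM):
    def must_break_before(self):
        return False


block_lm = BlockStackingLM({"break-before": "page"})
flow_lm = FlowLM({"break-before": "page"})
```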

OTOH keeps apply to AbstractGraphicsLM which doesn’t
  inherit any of those classes.

That's a special case, since in principle a graphic does not itself consist of more layout-objects that need to be stacked. To the layoutengine, a graphic is simply a monolithic box. Graphics are inline by definition nonetheless, so it could be InlineStackingLM with the same reservations as for FlowLM and StaticContentLM, but for other methods (the actual 'inline-stacking' can be considered to be delegated to the producer of the graphic, here).




Cheers

Andreas
