Re: [GSoC] Wiki page for progress informations

Vincent Hennebert Wed, 31 May 2006 06:52:55 -0700

Thanks a lot Luca! This will help me find my way in the code. I keep
your comments in mind for when I better understand the whole issue.


Vincent


Luca Furini a écrit :

Jeremias Maerki wrote:
did you already investigate how footnotes are implemented? Can you say
anything about how similar the problem of footnotes is to before-floats?
Just so you don't have to start from scratch while there may be
something to build upon. After all, the footnotes also contain some
logic to move certain parts to a different page than where anchor is
located.
A few quick comments about the footnote implementation:
1) the FootnoteLM returns only the sequence of elements representing theinline part (not the footnote-body part); it just adds to the last(inline) box a reference to the FootnoteBodyLM.
2) the LineLM, after computing the breaks, adds to each (block) boxrepresenting a line the references to the FootnoteBodyLM whose citationsare in that line
3) during the remaining of the element collection phase, thesereferences are not used (but in the creation of "combined" elementlists, when they should be copied inside the new elements)
4) the PageSequenceLM.PageBreaker.getNextKnuthElements() method, afterreceiving all the (block) elements, scans them looking for footnoteinformation, gets the elements from the referenced FootnoteBodyLM andputs them in a different list (at the moment a list of lists, but thisis sub-optimal), and from the footnote-separator (in a separate list)
5) these lists are looked at inPageBreakingAlgorithm.computeDifference(), where we try to add somefootnote content to the "normal" page content using getFootnoteSplit(),and in computeDemerits(), where some extra demerits are added if webreak a footnote or some footnotes are deferred.
This last point at the moment is performed using manyPageBreakingAlgorithm private variables, which is maybe not the best wayto do it, as we must be very careful about their initialization andtheir use, especially when the algorithm restarts. I think that a"state" object storing these variables could be used to store thesevalues, and explicitly passed along the methods instead of relying onthe class members, but concerning this I'd like to hear the opinions ofthe other committers ...
Insertion of before-floats could be implemented in a similar way, givingthe precedence to the footnote insertion (as it is affected by morestrict constraints).
An important difference between a footnote and a before-float is thatthe latter does not have an "inline part", so (if we want to follow thesame pattern) we need to either store the reference inside apreviously-created box or to add some new elements containing thereference (but we must be sure that these elements cannot be parted fromthe previous ones, see the constraints in section 6.10.2 in the spec).
A crucial point is the demerit function as, if I remember correctly, itgreatly affect the computational complexity of the breaking algorithm(thre should be a M. Plass paper concerning this).
HTH
Another thing that we may need to keep in mind: There was lots of desire
from the user community that FOP supports large documents (long-term
goal, not necessary yours). I wrote that a first-fit algorithm could
help free memory earlier. Obviously, for complex before-float situations
a total-fit approach is probably more interesting as it can come up with
more "creative" solutions. I'm just mentioning it so we keep the bigger
picture in mind and since there could be conflicting goals.
A "first degree" of first-fit algorithm could be achieved quite quicklyby having a BreakingAlgorithm interface which is implemented by aTotalFitBA (the existing implementation) and a FirstFitBA which wouldhave a much simpler considerLegalBreak() method that, instead of thecomplex set of nodes, just keeps in mind a single node.
This would surely decrease the memory footprint, but is not (I think)what we really want, as this simplified algorithm would be performed onthe whole sequence of elements.
In order to start processing the sequence as soon as we receive a fewelements we need to do some deeper changes.
An idea (I just had it now, so I did not fully consider all itsimplications).At the moment, the block-level LM collect elements from their childrenand return just a single sequence (if there are no break conditions); wecould have a parameter requesting them to return after they receive eachchild sub-sequence, and have a canStartComputingBreak() method thatreturns true if the sequence contains enough elements and we are using afirst-fit algorithm, or false otherwise ...
Sorry for the long post ... and for the long absence too, but it seemsthat just after thinking "great, now I've really got some time to spendon FOP" I receive tons of other things to do ... :-(
Regards
    Luca

Re: [GSoC] Wiki page for progress informations

Reply via email to