Re: Memory issue

Jean-François El Fouly Thu, 08 May 2008 03:58:19 -0700

Andreas Delmelle a écrit :

OK. Just curious: Any chance you could test it on another build ormaybe even Java 6?

Probably, if required or useful. Our sys admins are very cooperative ;-)

In my personal experience, optimizing the stylesheet code usually doesnot offer much improvement in terms of global memory usage, but itcould have a noticeable impact on the processing time. One of thethings I've learned about generated XSL-FO stylesheets by Altova isthat they add a lot of fo:inlines to specify, for example,font-properties on the lowest levels in the generated FO while, whencomparing to the font-properties of the fo:inlines' parents nothingreally changes, except for the size, style or weight. From FOP's pointof view, that's somewhat of a waste. Much better to specify a globalfont-size on the page-sequence, and override on the lower levels onlywhat is really necessary. After adapting the stylesheet manually, andremoving the redundant fo:inlines, the stylesheet and the generated FOwere reduced to not even half the original size.

Yes. That is exactly what happened to the stylesheet we use. I'vereduced it drastically.One issue with stylesheets generated by StyleVision is that you must becareful when you tweak them to avoid certain [fo-block inside fo:inline]combinations that make FOP crash with a stack trace and no really usefulinformation about what's happening or where. This bug is mentioned inthe FOP bug tracker, though in a rather raw, loose manner. I removed allsuch constructs and that made the XSLT much simpler and cleaner.

Something else that bothered me, but I don't know if that was alsogenerated by Altova, is that in one of the stylesheets I saw, theentire transformation was contained in one giant template...

With the last version, or our XSLT ? this was no longer the case.

AFAIU, this gives little opportunity for the XSLT processor to cleanup anything. Java 1.5 uses Xalan XSLTC by default, which convertstemplates into Java objects. One giant template would then mean onevery long-living object that may reference numerous others for thewhole duration of the processing run. If you look at the chain, whenusing XML+XSLT input, FOP is always the first one to finish, then theXSLT processor, then the XML parser.If the XSLT processor cannot reclaim anything, this will give FOP lessroom to work with, so it ultimately runs slower. As the heap increasesto reach the maximum, the points where the JVM will launch the GC byitself, will also increase. Since it cannot expand the heap anymore,it will try to clean up more frequently.

Yep, that is why I've tried to be cautious not to accuse FOP publicly ;-)

The problem is in the (Xalan + FOP) subsystem and the profiling couldwell show that the issue is Xalan-related.BTW, we've made the Xalan-FOP coupling a parameter so that we can usetight coupling (with Sax events) or loose coupling (writing theintermediate FO files on disk). We usually use the second option, sincethe possibility to read the FO intermediate code is helpful when youdebug. And I guess without being really sure that not to have Xalan andFOP working at the same time should use less memory. This separationprobably accounts for the long execution time, but that is not an issuesince document generation does not occur often in the target system (youcan generate chapters for proofreading but you generate the wholedocument once-twice a day).



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Re: Memory issue

Reply via email to