Previously I wrote: One other bit of good news is that the combination of these patches and the Common_Content sub-folder work-around are the only required changes in order to use the XSLTPROC and FOP tools to successfully build our documents. I will describe that process in my next post.
...

So this is that next post, but I am replying to Rony's post because I also wanted to address the questions he raised. The process I came up with is very similar to the one Publican uses: run a transform tool (xsltproc, which Publican invokes under the covers) to create an XSL-FO file from our DocBook/XML files and a (modified) DocBook stylesheet; run an ooRexx program written by Erich to remove extra blank lines from the .fo file; then run FOP to create a PDF from the (modified) .fo file. But as always, the devil is in the details.
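To make the three steps concrete, here is a rough command-line sketch. The file names, the stylesheet name, and the name of Erich's program are placeholders of my own invention, not the actual names in our tree:

```shell
# Step 1: transform DocBook XML to XSL-FO
# ("publican-override.xsl" and "rxmath.xml" are illustrative names)
xsltproc --output rxmath.fo publican-override.xsl rxmath.xml

# Step 2: strip the extra blank lines from the .fo file
# (Erich's ooRexx program; "stripblanks.rex" is a placeholder name)
rexx stripblanks.rex rxmath.fo

# Step 3: render the cleaned-up .fo file to PDF, using a FOP
# configuration file for fonts and graphics locations
fop -c fop.xconf -fo rxmath.fo -pdf rxmath.pdf
```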

I chose XSLTPROC because several web sites suggested it, although other tools such as Xalan were mentioned as well. I was attempting to follow some step-by-step directions for building a PDF from DocBook source but, of course, those web sites are never up to date and I had to adapt the directions as I encountered problems. I also wanted to minimize the number of changes to our Publican process, as we are generally happy with the results it produces, so substituting XSLTPROC for Publican as the XSL transform tool seemed a good starting point. Likewise, I kept the Publican stylesheet - an override to the standard DocBook stylesheet - that we had further modified, though I was able to eliminate part of it because DocBook had corrected a problem it was fixing, something to do with footnote spacing. And, of course, I used the most current versions of the tools available, both for XSLTPROC and for FOP (ver. 2.4).

Now I know that some folks are "champing at the bit" to replicate what I have done, but before you run off and start searching for the tools to download, let me give you a list of the "pieces" that are needed. First there is the XSLTPROC transform tool: this is actually 4 packages(!) which need to be downloaded, unzipped, and their executable (bin) folders added to the path. Then of course there is the FOP package, which needs to be downloaded, unzipped, and the appropriate sub-folder added to the path. In order to get the same "look" to the documents as produced by Publican, you need to add some special fonts - 2 packages - to your system. Then there are the two Publican stylesheets, one of which has been modified, and a configuration file for FOP so that it can find the graphic files to be included and use the special fonts that were installed. Finally, you need to retrieve the blank-stripping program by Erich from the SVN repository. Once you have all the "pieces" in place, you need to check out the latest version of the documents from SVN, copy the "common" folder to the working copy for the book you will be building, and add the FOP configuration file to it. Then you can run XSLTPROC, the blank-line stripping program, and finally FOP. Piece of cake!
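As an illustration of the setup, the PATH additions might look something like the following on Windows. The install locations are my own placeholders, and the four packages are assumed to be the usual Windows xsltproc dependencies (libxslt, libxml2, iconv, zlib) - if your download bundles them differently, adjust accordingly:

```shell
REM Windows CMD syntax; all directory names are placeholders
set PATH=%PATH%;C:\tools\libxslt\bin;C:\tools\libxml2\bin
set PATH=%PATH%;C:\tools\iconv\bin;C:\tools\zlib\bin
set PATH=%PATH%;C:\tools\fop-2.4\fop
```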

Because the above might seem overwhelming(!), I have been developing a "package" that simplifies it to a large degree. It contains all the "pieces" and a set of CMD files to execute the process steps. It is designed to be unzipped into a folder that will become the working location for building one or more documents. After installing it, you would need to install the fonts (included) and then you could build a document. The first CMD file to be run is DOCPATH, which takes one argument - the path to the SVN working copy of the documents. That path is saved in an environment variable for use by the remaining steps. Then you run DOCPREP, which also takes one argument - the name of the "book" you want to build, e.g. rxmath. It takes care of creating the "Common_Content" sub-folder and adding the FOP configuration file to it, as well as saving the document name in another environment variable. Next you run DOC2FO, which runs the transform step. And finally, FO2PDF, which runs FOP. The .fo file, the .pdf file, and a .log file containing all the (many) messages from FOP are placed in a sub-directory named e.g. out-rxmath.
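To show the mechanics of the DOCPATH and DOCPREP steps, here is a POSIX-shell approximation. The real scripts are Windows CMD files, and the file names below are stand-ins used only to demonstrate the idea:

```shell
# POSIX-shell approximation of the DOCPATH and DOCPREP steps;
# the real scripts are Windows CMD files. File names are stand-ins.
set -e

# Simulate a checked-out docs tree in a scratch directory
DOCS=$(mktemp -d)
mkdir -p "$DOCS/oorexx/en-US" "$DOCS/rxmath/en-US"
echo "common graphic" > "$DOCS/oorexx/en-US/logo.svg"
echo "<fop/>" > "$DOCS/fop.xconf"   # stand-in FOP configuration file

# DOCPATH: remember where the working copy lives
DOCPATH="$DOCS"

# DOCPREP rxmath: create the Common_Content sub-folder, copy the
# shared files and the FOP configuration file into it
DOCNAME="rxmath"
BOOK="$DOCPATH/$DOCNAME/en-US"
mkdir -p "$BOOK/Common_Content"
cp "$DOCPATH/oorexx/en-US/"* "$BOOK/Common_Content/"
cp "$DOCS/fop.xconf" "$BOOK/Common_Content/"

ls "$BOOK/Common_Content"
```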

The CMD files are written and have been tested on the rxmath "book". I need to put the pieces together and zip them up, which is my next step. Then I will provide a link so anyone interested can download the package and give it a try. Note that I have NOT tried this on any other "books", so I expect there will be issues with some of them. For example, as P.O. noted in a different thread, and as Erich mentioned as well, the Java heap space needs to be increased for some of our documents. I do not know how to do that <blush> but it was not necessary for the rxmath book. Any other issues should be "book-related", not process-related, and can be fixed as they are uncovered. And I am willing to investigate any process issues or enhancements.
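For what it is worth, the usual way to give the JVM more heap is the -Xmx option, and the FOP start scripts are commonly reported to pass a FOP_OPTS environment variable through to Java. I have not verified this myself with our documents, so treat it as a hint only:

```shell
REM Untested suggestion: raise the JVM maximum heap before running FOP.
REM The FOP start scripts are reported to honor FOP_OPTS.
set FOP_OPTS=-Xmx1024m
fop -c fop.xconf -fo rxmath.fo -pdf rxmath.pdf
```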

If it is the consensus that I should run this process on "all" the documents before I release it, i.e. actually do a full test(!), I would be willing to do so.

Your thoughts and comments are welcome.

Gil B.

On 1/7/2020 9:28 AM, Rony G. Flatscher wrote:
Hi Gil,

any chance for your next posting to get an idea of what you have done and come up with? Maybe with a bird's-eye view of how you would now suggest creating the documentation, according to your analysis and tests?

Also, would you already have suggestions for the software to use, e.g. xsltproc (how about using
Apache Xalan [1] for this)? The FOP is probably Apache FOP [2].

Guessing that everyone has been waiting eagerly for your next insights and directions of how to
duplicate your efforts to successfully create the documentation! :)

---rony

[1] Apache Xalan Project:<https://xalan.apache.org/>
[2] Apache FOP: <https://xmlgraphics.apache.org/fop/>


On 06.01.2020 20:07, Gil Barmwater wrote:
This thread is a continuation of the thread titled "Questions ad generating the documentation (publican, pandoc)" with a different Subject since Pandoc is no longer being considered as an
alternative.

To review, the ooRexx documentation is written in DocBook and has been turned into PDF and HTML files using a system called Publican, originally developed by Red Hat. Publican is no longer supported and works only occasionally under Windows 10. Under the covers, Publican transforms the DocBook XML into XSL-FO using xsltproc - probably via the Perl bindings, based on comments by Erich - and modified DocBook stylesheets. It then runs the FOP program to convert the XSL-FO output into a PDF file. In between those two steps, we run a Rexx program written by Erich to remove extra blank lines from the examples.

The new process uses the latest XSLTPROC programs directly along with the latest version of FOP. However, Publican imposes some unique structure to the DocBook XML which must be accounted for. Publican has the concept of a "brand" which lets one define common text and graphics that should appear the same in all of a project's documentation. One denotes those common text/graphic files in the XML by preceding their names with "Common_Content/". As Publican merges the various parts of the document together so that it can be transformed by the stylesheets, it resolves any references to Common_Content so that the correct file is merged into the complete source. As this process is unique to Publican, we must account for it in order to use XSLTPROC instead.

One approach we could take would be to replace Common_Content/ with either a relative or absolute path to the location in our source tree where the files actually are located. For the sake of this discussion, I will assume the working copy of the documentation has been checked out to a directory named docs. Then the main xml file for the rxmath book would be located at docs\rxmath\en-US\rxmath.xml. And the files referenced by Common_Content would be in docs\oorexx\en-US\. The relative path would then be ..\..\oorexx\en-US\. The only problem with this approach is the number of places this would need to be changed. My analysis shows over 140
locations in over 50 files.

A more expedient approach, and the one I would advocate, is to create a "temporary" sub-directory for the purpose of building the documentation and then to copy everything from docs\oorexx\en-US\ into it. So if one were going to build the rxmath book, one would create docs\rxmath\en-US\Common_Content\ and copy those files into it. This allows XSLTPROC to locate the files that need to be merged without having to make any changes to our source. The disadvantage is that one needs to do this for each book being built. It is, however, a simple step that can be done either with File Explorer or automated using the xcopy or robocopy commands.
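For example, either of these one-liners would do the copy, assuming the directory layout described above:

```shell
REM Windows CMD; either command copies the shared files into place
xcopy docs\oorexx\en-US docs\rxmath\en-US\Common_Content\ /E /I
REM or, equivalently:
robocopy docs\oorexx\en-US docs\rxmath\en-US\Common_Content /E
```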

Having gotten past the Common_Content issue, running XSLTPROC reveals another problem, caused by the way Publican merges the Common_Content files, which I will describe in the next posting.

_______________________________________________
Oorexx-devel mailing list
Oorexx-devel@lists.sourceforge.net
https://lists.sourceforge.net/lists/listinfo/oorexx-devel

--
Gil Barmwater


