You need to download and read the OASIS ODF Specification. Since you are
interested specifically in Writer, you might as well start with ODF 1.1 (a
single document). You can get all of the bits at
http://docs.oasis-open.org/office/v1.1/OS/.
An ODT file is a Zip archive that holds the parts of the document as separate
XML files.
You can look inside such a file by renaming it. E.g., rename test.odt to
test.odt.zip and open it as a Zip archive. Then rename it back.
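If you would rather not rename anything, the same inspection can be done from a script. What follows is only a rough sketch, assuming Python 3 and its standard zipfile and xml.etree.ElementTree modules. The function names are made up for illustration and are not part of OpenOffice or of any ODF toolkit. It relies on two things from the ODF specification: the document body is stored in the package member content.xml, and paragraphs are text:p elements in the ODF text namespace.

```python
import zipfile
import xml.etree.ElementTree as ET

# ODF "text" namespace URI, as defined in the ODF specification.
TEXT_NS = "urn:oasis:names:tc:opendocument:xmlns:text:1.0"


def list_odt_parts(odt):
    """Return the names of the files stored inside the ODT package.

    `odt` may be a path such as "test.odt" or an open binary file object.
    """
    with zipfile.ZipFile(odt) as package:
        return package.namelist()


def read_odt_part(odt, name="content.xml"):
    """Return the raw bytes of one member of the package (content.xml by default)."""
    with zipfile.ZipFile(odt) as package:
        return package.read(name)


def paragraph_texts(content_xml):
    """Return the plain text of every text:p element, in document order.

    Hypothetical helper: it flattens spans and other inline markup and
    ignores headings, lists, and tables.
    """
    root = ET.fromstring(content_xml)
    return ["".join(p.itertext()) for p in root.iter(f"{{{TEXT_NS}}}p")]
```

For example, list_odt_parts("test.odt") should show members such as content.xml, styles.xml, and meta.xml, and paragraph_texts(read_odt_part("test.odt")) gives a quick plain-text view of the body. Anything more serious (styles, tables, images) needs the schema from the specification.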
The specification for the OpenOffice 1.0 XML format does not apply to any
version of OpenOffice released after ODF was adopted and implemented.
Also, the special Office Writer Web format is not singled out in the ODF
specification. What I suggest you do is make test documents and save them as
HTML Document (OpenOffice Writer) and see what is produced. Also, open such a
document in Writer to see how it comes up.
The latest specification is ODF 1.2. It is much larger and only Part 1 and
Part 3 apply to Writer. (Part 2 is all about spreadsheet formulas.)
You also might want to take a look at the new Corinthia project, which is
interested in document conversions using HTML as an intermediary.
- Dennis
From: Howard Morris (aka Col Boogie) [mailto:[email protected]]
Sent: Monday, January 5, 2015 20:36
To: [email protected]
Subject: html code generated from Open Office
The reason I joined Open Office was to enhance the HTML code generated from
Writer.
For now, I have constructed HTML code to upload an HTML file generated by Open
Office and PHP code to tweak that code and download a better version.
Everything I have done so far is in the attached zzz.zip file. Explanations are
in Readme.txt.
I would have liked to do this directly, but when I asked how to get there, I
was directed to a site where I could download all the modules one by one. There
seemed to be hundreds of them, with no indication of what any of them
contained and dubious directions on how to put them together. This is not how I
like to do things, so I went the other way for now.
I seem to have run across documentation that Open Office puts its files
internally into an XML format. If I could extract the XML directly from the .odt
file, I could do everything from there. Assuming that is true, is there an
updated copy of https://www.openoffice.org/xml/xml_specification.pdf ? I would
hope that whoever is doing the documentation keeps that up to date. It will
take me at least 20 hours to read that document, but at least I will retain
most of it the first time.
Howard Morris