You need to download and read the OASIS ODF Specification.  Since you are 
interested specifically in Writer, you might as well start with ODF 1.1 (a 
single document).  You can get all of the bits at 
http://docs.oasis-open.org/office/v1.1/OS/.
 
The ODT file is a Zip file that has multiple parts of the document as XML files 
within the Zip.  
 
You can see such a file by renaming it.  E.g., rename test.odt to test.odt.zip 
and open it as a Zip.  Then rename it back.
 
The specification for OpenOffice 1.0 XML format does not apply to any version 
of OpenOffice since ODF was adopted and implemented.
 
Also, the special Office Writer Web format is not singled out in the ODF 
specification.  What I suggest you do is make test documents and save them as 
HTML Document (OpenOffice Writer) and see what is produced.  Also, open such a 
document in Writer to see how it comes up.
 
The latest specification is ODF 1.2.  It is much larger and only Part 1 and 
Part 3 apply to Writer.  (Part 2 is all about spreadsheet formulas.)
 
You also might want to take a look at the new Corinthia project which is 
interested in document conversions using HTML as an intermediary.
 
-   Dennis
 
 
 
From: Howard Morris (aka Col Boogie) [mailto:[email protected]] 
Sent: Monday, January 5, 2015 20:36
To: [email protected]
Subject: html code generated from Open Office
 
The reason I joined Open Office was to enhance the html code generated from 
writer.
 
For now, I have constructed html code to upload an html file generated by Open 
Office and PHP code to tweak that code and download a better version. 
Everything I have doe so far is in the attached zzz.zip file. Explanations are 
in Readme.txt
 
I would have liked to have done this directly, but when I asked how to get 
there, I was directed to a site where I could download all the modules one by 
one, and there seemed to be hundreds of them and no indication what any of them 
contained with dubious directions how to put them together. This is not how I 
like to do things, so I went the other way for now.
 
I seem to have run across documentation that Open Office puts its files 
internally into a XML format. If I could extract the XML directly from the .odt 
file I could do everything from there. Assuming that is true, is there an 
updated copy of https://www.openoffice.org/xml/xml_specification.pdf ? I would 
hope that whoever is doing the documentation keeps that up to date. It will 
take me at least 20 hours to read that document, but at least I will retain 
most of it the first time.
 
Howard Morris

Reply via email to