Having tried various combinations of 'strings' and 'sed', I have concluded that the text cannot be reliably extracted without some more intelligent parsing of the PPT format. OO obviously performs this parsing since all the PPT files open flawlessly in OpenOffice.org Impress.

Is there any way I can, using OpenOffice.org, create a macro to extract the text from all of these files? There must be something better than 1500 copy/paste operations!

Greg,

1) there is not save to text in OOo for presentation files.
2) all the contents is there in the converted OD file, in the xml
3) there was recently an annoucement about an OOo batch conversion utility

with 3) you transform the PPT files to OD format, since 1) you can't use that directly but thanks to 2) and smart XML parsers/conversion tools you can readily access the textual data by removing _all_ the xml tags.

I have never tried that because I never _had_ to dump to text but my feeling is that what you ask, although a little unorthodox is possible with a few tricks.

Jean-Christophe

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to