On Fri, 23 Jan 2004, Norbert Kamenicky wrote:
The OpenOffice file format is zipped XML. So you should just be able to use zgrep, although it will miss any phrases that have extra formatting in them, such as a phrase of which only part is underlined. Also, I'm not sure how that will handle newlines.
Since all the text is on a single line of XML, grep will print the complete text whenever anything matches. Some workaround is needed before grep ...
these steps are needed:

1. unzip "content.xml" from OOo document
2. convert it to text format
3. grep

try this (not tested)

unzip -p your-OOo-document content.xml \
  | sed "s/<[^<>]*>/\n/g" \
  | grep string
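The three steps above can be wrapped in a pair of small shell functions. This is only a sketch: the names strip_tags and ooo_grep are made up for illustration, and it assumes unzip, sed and grep are installed. The sed replacement uses a literal backslash-newline rather than "\n", since "\n" in the replacement text is a GNU sed extension and may not work elsewhere.

```shell
# Hypothetical helpers; the names strip_tags and ooo_grep are not from the thread.

# Step 2: replace every XML tag with a newline so that each run of text
# ends up on its own line. The line break inside the quotes is intentional.
strip_tags() {
    sed 's/<[^<>]*>/\
/g'
}

# Steps 1 and 3: extract content.xml to stdout, strip the tags, then grep.
# Usage: ooo_grep PATTERN DOCUMENT
ooo_grep() {
    unzip -p "$2" content.xml | strip_tags | grep -e "$1"
}
```

Something like `ooo_grep string your-OOo-document` would then print only the text runs that match, not the whole document. As noted earlier in the thread, a phrase that is split by formatting tags will still be missed, because each tag becomes a line break.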
I'd use 'fold -s' instead of the sed line. I've never looked at an OOo file, so I didn't know it was all on a single line. I suppose that saves a little bit of space, but it's a bit inconvenient.
If you look inside, you will see that your answer is a bad joke :-) Which space would you like to save? Those roughly 10 bytes on the command line? (And make it nonfunctional?)
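One possible reading of the "nonfunctional" objection, sketched on a made-up sample line: fold -s leaves the tags in place and breaks lines at blanks according to width alone, so a phrase can end up split across two lines where grep no longer finds it. The sample string, the width of 16 and the function names below are assumptions chosen only to show the effect.

```shell
# Made-up sample standing in for a fragment of content.xml.
sample='<text:p>alpha beta</text:p>'

# Count matching lines after folding at 16 columns: the break falls
# after the blank, so "alpha" and "beta" land on different lines.
count_with_fold() {
    printf '%s\n' "$sample" | fold -s -w 16 | grep -c -e 'alpha beta'
}

# Count matching lines after replacing each tag with a newline:
# the phrase stays together on one line.
count_with_sed() {
    printf '%s\n' "$sample" | sed 's/<[^<>]*>/\
/g' | grep -c -e 'alpha beta'
}
```

With the default width of 80 the split point is different, but the same thing can happen to any phrase that straddles a fold boundary.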
noro
