Marshal Newrock wrote:
On Fri, 23 Jan 2004, Norbert Kamenicky wrote:


The OpenOffice file format is zipped xml.  So you should just be able to
use zgrep, although it will miss any phrases that have extra formatting in
them, such as a phrase in which part of it is underlined.  Also, I'm not
sure how that will handle newlines.


Since all the text is in one xml formated line, grep will print complete text (if matches). Some workaround is needed before grep ...

these steps are needed:

1. unzip "content.xml" from OOo document
2. convert it to text format
3. grep

try this (not tested)

unzip -p your-OOo-document content.xml \
| sed "s/<[^<>]*>/\n/g" \
| grep string


I'd use 'fold -s' instead of the sed line.
I've never looked at an OOo file, so I didn't know it was all on a single
line.  I suppose that saves a little bit of space, but it's a bit
inconvenient.


If u look inside, u learn your answer is a bad joke :-) Which space u like to save ? Those cca 10 Bytes on cmd line ? (And make it unfunctional ? )

noro





--
[EMAIL PROTECTED] mailing list



Reply via email to