Re: [gentoo-user] Searching OpenOffice documents

Norbert Kamenicky Fri, 23 Jan 2004 14:59:20 -0800

Marshal Newrock wrote:

On Fri, 23 Jan 2004, Norbert Kamenicky wrote:

The OpenOffice file format is zipped xml.  So you should just be able to
use zgrep, although it will miss any phrases that have extra formatting in
them, such as a phrase in which part of it is underlined.  Also, I'm not
sure how that will handle newlines.


Since all the text is in one xml formated line, grep will
print complete text (if matches).
Some workaround is needed before grep ...

these steps are needed:

1. unzip "content.xml" from OOo document
2. convert it to text format
3. grep

try this (not tested)

unzip -p your-OOo-document content.xml \
| sed "s/<[^<>]*>/\n/g" \
| grep string

I'd use 'fold -s' instead of the sed line.
I've never looked at an OOo file, so I didn't know it was all on a single
line.  I suppose that saves a little bit of space, but it's a bit
inconvenient.


If u look inside, u learn your answer is a bad joke :-)
Which space u like to save ?
Those cca 10 Bytes on cmd line ? (And make it unfunctional ? )

noro

--
[EMAIL PROTECTED] mailing list

Re: [gentoo-user] Searching OpenOffice documents

Reply via email to