Re: [libreoffice-users] Re: Removing Index Markers from Writer: a How-To

2014-02-05 Thread Tom Davies
Hi :)
CVAlkan (= Frank) reads and posts to the mailing list through Nabble.
The two other ways are GMane and ordinary email.  Of those three, only
Nabble has a system for uploading 'attachments'.

Follow the links in Frank's email to get to the right place in Nabble,
or go through the official LibreOffice website via "Get Help" and
find the correct thread by looking at the subject lines and/or the
date and time.


With scripts, people have often just copied and pasted the code directly
into an email rather than trying to upload a file.  Either way is fine.
Regards from
Tom :)

-- 
To unsubscribe e-mail to: users+unsubscr...@global.libreoffice.org
Problems? http://www.libreoffice.org/get-help/mailing-lists/how-to-unsubscribe/
Posting guidelines + more: http://wiki.documentfoundation.org/Netiquette
List archive: http://listarchives.libreoffice.org/global/users/
All messages sent to this list will be publicly archived and cannot be deleted



[libreoffice-users] Re: Removing Index Markers from Writer: a How-To

2014-02-05 Thread CVAlkan
Hi Peter:

Hmmm. I hadn't noticed that there was no "attachment" button on this forum,
as I haven't ever uploaded anything longer than a few lines.

Under the "More" button on the top of the message box, there is an option to
"upload a file" but I don't know if other users will have easy access to
that.

Maybe someone who is more familiar with this forum can advise. But even if
it only goes to the LibreOffice folks themselves, I think that would be
useful, since perhaps they can use it as a guide for further development of
the indexing feature.

Frank



--
View this message in context: 
http://nabble.documentfoundation.org/Removing-Index-Markers-from-Writer-a-How-To-tp4094327p4095645.html
Sent from the Users mailing list archive at Nabble.com.




Re: [libreoffice-users] Re: Removing Index Markers from Writer: a How-To

2014-02-04 Thread Peter West

D'uh!

Thanks Frank.  I've updated the script with that change.

Attachments aren't accepted by this list, are they? What did you mean by 
"post the shell script"?


Peter

Peter West
"...and a sword will pierce through your own soul also..."



[libreoffice-users] Re: Removing Index Markers from Writer: a How-To

2014-02-04 Thread CVAlkan
Peter:

The actual perl command should be changed slightly to:
perl -pi'orig_*' -e 's///g' Index_Experiment.fodt

After [[:alpha:]]*" the \/> needs to be added to remove the "/>" ending of
the XML tag - otherwise it seems to work fine.
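
To illustrate the point about the trailing \/>, here is a minimal sketch
that uses sed on a single sample line rather than on a real file. The
element name text:alphabetical-index-mark and the sample text are
assumptions made for illustration, not taken from the command above. The
same s/.../.../g expressions could equally be handed to perl -e.

```shell
# Hypothetical sample line standing in for one paragraph of a .fodt file.
# The element name text:alphabetical-index-mark is an assumption.
sample='<text:p>Some <text:alphabetical-index-mark text:string-value="word"/>word here</text:p>'

# Without the trailing \/> the match stops at the closing quote,
# so the "/>" that ends the tag is left behind in the text.
short='s/<text:alphabetical-index-mark text:string-value="[[:alpha:]]*"//g'

# With \/> added, the whole self-closing tag is removed.
full='s/<text:alphabetical-index-mark text:string-value="[[:alpha:]]*"\/>//g'

without_close=$(printf '%s\n' "$sample" | sed -e "$short")
with_close=$(printf '%s\n' "$sample" | sed -e "$full")

printf 'without \\/>: %s\n' "$without_close"
printf 'with \\/>:    %s\n' "$with_close"
```

The first result still contains a stray "/>" in front of "word", which is
the leftover that the extra \/> takes care of.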

The full-blown shell script you sent me (I don't see it here on the forum
for some reason) needs to be modified in the same way, of course.

I used [A-Za-z] instead of the [:alpha:] that you used because some systems
don't respect that substitution syntax (I can't remember what it's called).
That limits things just a bit, but a comment might be added to take care of
it. Then again, the [:alpha:] syntax is probably a little easier to
understand for those not familiar with grep, sed and their relatives.
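
As a rough illustration of the trade-off, the sketch below runs the same
substitution with [[:alpha:]] and with [A-Za-z] on two made-up index
entries. Both forms handle a plain one-word entry, and neither matches an
entry that contains a space; a negated class such as [^"] is shown as one
possible way around that. The element name text:alphabetical-index-mark,
the sample strings and the helper name strip_marks are assumptions for
illustration only.

```shell
# Hypothetical index entries: a single word and a two-word entry.
# The element name text:alphabetical-index-mark is an assumption.
one='<text:alphabetical-index-mark text:string-value="marker"/>marker'
two='<text:alphabetical-index-mark text:string-value="index marker"/>index marker'

# strip_marks CLASS LINE
# Removes index marks whose value consists only of characters in CLASS.
strip_marks() {
    printf '%s\n' "$2" |
        sed -e "s/<text:alphabetical-index-mark text:string-value=\"$1*\"\/>//g"
}

posix_one=$(strip_marks '[[:alpha:]]' "$one")   # POSIX character class
range_one=$(strip_marks '[A-Za-z]' "$one")      # explicit ASCII range
range_two=$(strip_marks '[A-Za-z]' "$two")      # space in the value: no match
any_two=$(strip_marks '[^"]' "$two")            # anything up to the closing quote

printf '%s\n' "$posix_one" "$range_one" "$range_two" "$any_two"
```

Note that inside a regular expression the class has to be written with
doubled brackets, [[:alpha:]].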

You should post the shell script, as it is probably easier to use (?) for
some folks than my simple sed command, since it takes care of hand-holding,
locating the right directories and so forth.

Of course, I hope some of the LibreOffice developers will incorporate the
option and capability to remove old markers when an index is regenerated,
and "fix" the generator so that it doesn't add additional markers to the
same word when updating takes place. (It doesn't always do that, but I
haven't figured out the exact conditions when it does).

So - good work.




--
View this message in context: 
http://nabble.documentfoundation.org/Removing-Index-Markers-from-Writer-a-How-To-tp4094327p4095465.html
Sent from the Users mailing list archive at Nabble.com.




Re: [libreoffice-users] Re: Removing Index Markers from Writer: a How-To

2014-02-04 Thread Peter West

Hi Frank,

This script works on OS X, to the extent that it does the conversions and 
creates new filtered odt files.  The only questions are whether it works 
in Linux, and whether the filter does the right thing with the index 
markers.


I haven't created any files with such index-markers, so maybe you can 
run it against your files to see how it goes.


http://pbw.id.au/src/sh/strip-odt-index-markers

It's a shell script, using soffice and perl.
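
For anyone who cannot fetch the script, the round trip it describes might
look roughly like the sketch below. This is not the script at the URL
above: the function names, the -stripped suffix and the element name
text:alphabetical-index-mark are assumptions, and sed stands in for perl.
It also assumes that soffice is on the PATH.

```shell
#!/bin/sh
# Sketch only, under the assumptions stated above.

# Filter: delete self-closing index marks from flat XML read on stdin.
# The element name text:alphabetical-index-mark is an assumption.
filter_marks() {
    sed -e 's/<text:alphabetical-index-mark [^>]*\/>//g'
}

# strip_index_marks FILE.odt
# Writes FILE-stripped.odt next to the original, which is left untouched.
strip_index_marks() {
    src=$1
    name=$(basename -- "$src" .odt)
    work=$(mktemp -d) || return 1

    # 1. odt -> flat XML (fodt), written into the work directory
    soffice --headless --convert-to fodt --outdir "$work" "$src" || return 1

    # 2. remove the index marks from the flat XML
    filter_marks < "$work/$name.fodt" > "$work/$name-stripped.fodt"

    # 3. flat XML -> odt, written into the directory of the original
    soffice --headless --convert-to odt --outdir "$(dirname -- "$src")" \
        "$work/$name-stripped.fodt"

    rm -rf "$work"
}

# Usage (hypothetical file name):
#   strip_index_marks path/to/book.odt
```

Keeping the filter in its own function means it can be tried on a saved
.fodt file first, before any conversion is involved.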





--
Peter West
"...and a sword will pierce through your own soul also..."



Re: [libreoffice-users] Re: Removing Index Markers from Writer: a How-To

2014-02-03 Thread Peter West

The file name goes at the end of the command, of course.


--
Peter West
"...and a sword will pierce through your own soul also..."



Re: [libreoffice-users] Re: Removing Index Markers from Writer: a How-To

2014-02-03 Thread Peter West

Hi Frank,

Finally got back to this. Can you check this for me against an .fodt 
file, please?


 perl -pi'orig_*' -e 's/text:string-value="[[:alpha:]]*"//g'




--
Peter West
"...and a sword will pierce through your own soul also..."



[libreoffice-users] Re: Removing Index Markers from Writer: a How-To

2014-01-29 Thread CVAlkan
Peter:

I mentioned sed and grep, but don't see any reason why perl couldn't be used
as well. If you test this and it works, please post back to give others
another option.

BUT: my sed command only removed the markers from the fodt. As I mentioned, I
was unable to convert the odt to an fodt (essentially uncompressing the odt
to readable XML) using the unzip capability of my OS (as I'm pretty sure
could be done with earlier OpenOffice documents).

Since LO can easily write and read fodt files, though, it really wasn't
necessary to do any file format conversion, and I didn't bother spending the
time to figure out how to do everything in one shot.

Frank



--
View this message in context: 
http://nabble.documentfoundation.org/Removing-Index-Markers-from-Writer-a-How-To-tp4094327p4094482.html
Sent from the Users mailing list archive at Nabble.com.




[libreoffice-users] Re: Removing Index Markers from Writer: a How-To

2014-01-28 Thread Mareich
Thanks for this workaround.  I think it will also work on Mac OS X, since
sed is available in Terminal.
I will try it as soon as I get a chance.



--
View this message in context: 
http://nabble.documentfoundation.org/Removing-Index-Markers-from-Writer-a-How-To-tp4094327p4094380.html
Sent from the Users mailing list archive at Nabble.com.
