Removing date and null characters within a file

Steve Hodgson Wed, 27 Jun 2007 11:12:46 -0700

I currently use a macro in a Windows text editor to remove datesfrom bookmarks within a PDF file, which started off as a fairlysimple regex but gradually grew arms and legs as Word andAcrobat move to Unicode.

The form of each bookmark within the PDF (i.e. as displayed inBBEdit) is shown below.


/Title(˛ˇ·2·4·/·0·5·/·2·0·0·6· ·M·y· ·B·o·o·k·m·a·r·k)

Where the · character is a null character i.e \x00.

The search and replace expression I used to strip out the dateis as follows:


Find What:
/Title\(˛ˇ(\x00[0-9]){2}\x00/(\x00[0-9]){2}\x00/(\x00[0-9]){4}\x00

Replace with:
/Title\(˛ˇ

Basically just stripping off the date.

With the current combination of Word/Acrobat I now need toremove the null characters from within the variable lengthstring that follows the date. Is there a way to accomplish thiseither with a second search and replace or by enhancing thefirst search?

I can do it by repeatedly running a search and replace removingone null character at a time but this is laborious in theextreme. Would there be anyway to automate this?


The only way I have found to reduce the number of steps is to:

1) Strip out the date as before but now also the BOM character too.

2) Select the section of the file that defines the bookmarks.

3) Zap gremlins within that section only.

Any better ways?
--
Regards,

Steve Hodgson                        <mailto:[EMAIL PROTECTED]>



--
------------------------------------------------------------------
Have a feature request? Not sure the software's working correctly?
If so, please send mail to <[EMAIL PROTECTED]>, not to the list.
List FAQ: <http://www.barebones.com/support/lists/bbedit_talk.shtml>
List archives: <http://www.listsearch.com/BBEditTalk.lasso>
To unsubscribe, send mail to:  <[EMAIL PROTECTED]>

Removing date and null characters within a file

Reply via email to