On Wednesday, 15 August 2012 12:39:21 UTC-3, jcordes wrote:
> I am working on an HTML file generated by WordPerfect's Publish to HTML,
> trying to get clean html but maintaining the format fairly closely. One thing
> I'd like to do is delete empty tag pairs, such as this:
> <SPAN STYLE="text-decoration: underline"></SPAN>
>
> I'm sure this must be a trivial regex problem for some but I'm apparently
> missing some key idea. I'm working under Linux, with
> VIM - Vi IMproved 7.1 (2007 May 12, compiled Oct 17 2008 18:11:28)
> (sorry, not easy to upgrade)
> I have tried search patterns along the following lines
> /<SPAN .\{-}><\/SPAN>
> I have also tried some 'grouping' possibilities. Whatever I do, my patterns
> always go beyond the closing </SPAN> tag and find a later "<". For example if
> a line in the file looks like this:
> <P><SPAN STYLE="font-size: 10pt"> //she age 20, born NS, d/o William and
> Sophia - image 581, pg.187] (<STRONG>note</STRONG>: <EM>marriages in
> </EM></SPAN></P>
>
> (that's all on one line)
>
> my pattern selects
> <SPAN STYLE="font-size: 10pt"> //she age 20, born NS, d/o William and Sophia
> - image 581, pg.187] (<STRONG>note</STRONG>: <EM>marriages in </EM></SPAN>
>
> I seem to need something which says that after finding the first (non-greedy)
> ">" I want the very next "<" character, rather than searching further down
> the line.
>
> Any help most gratefully received!
>
> Thanks,
> John
Sorry, I posted the wrong example: it contains no empty tag pair. I need to
rethink this; maybe the pattern works after all!
John
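
A note on why the quoted pattern overshoots, offered as a sketch rather than
a definitive fix: `.\{-}` is non-greedy, but `.` can still match ">" and "<".
When no empty pair starts at that "<SPAN ", the match simply grows until it
reaches the first "></SPAN>" further along the line (in the quoted example,
the ">" that ends "</EM>", followed by "</SPAN>"). Replacing `.` with `[^>]`
keeps the match inside a single opening tag. The commands below assume
upper-case tags, as in the WordPerfect output quoted above, and that each
empty pair sits on one line; neither assumption has been checked against the
real file.

```vim
" Search only: the attribute part may not contain '>', so the match
" cannot run past the end of the opening tag.
/<SPAN [^>]*><\/SPAN>

" Delete every such empty pair in the buffer, asking for confirmation
" at each match (drop the 'c' flag to skip the prompts).
:%s/<SPAN [^>]*><\/SPAN>//gc
```

On the quoted example line this pattern finds nothing, because that SPAN is
not empty; on <SPAN STYLE="text-decoration: underline"></SPAN> it matches the
whole pair. Deleting one empty pair can leave a new empty pair around it, so
the substitute may need to be run more than once.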
--
You received this message from the "vim_use" maillist.
Do not top-post! Type your reply below the text you are replying to.
For more information, visit http://www.vim.org/maillist.php