On Friday 26 September 2008 01:20:29 pm Rob Dixon wrote:
> Jack Gates wrote:
> > s!(<|</)([^\!][A-Z0-9 ]+>)!$1\L$2\E!g;
> > or
> > s/(<|<\/)([^!][A-Z0-9 ]+>)/$1\L$2\E/g;
> >
> > The RE above captures and replaces all HTML tags with lowercase
> > as desired except for any tag that has only one letter such as
> > <P>, <B> or <I>
> >
> > It will get the </B>, </P> and </I>
> >
> > It properly ignores the <!DOCTYPE> tag
> >
> > What is the correct way to write the above RE?
>
> HTML tag names can't contain spaces, so you want
>
>     s|(</?)([A-Z][A-Z0-9]*)|$1\L$2|g;
>
> Rob

Thanks for the effort. Your RE does not work as well as what I have.

HTML tags can contain spaces.
You forgot about
<p id="something" class="something">

-- 
Silverfox

Please don't CC me. I am subscribed to the list.

-- 
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/


Reply via email to