On Friday 26 September 2008 01:20:29 pm Rob Dixon wrote: > Jack Gates wrote: > > s!(<|</)([^\!][A-Z0-9 ]+>)!$1\L$2\E!g; > > or > > s/(<|<\/)([^!][A-Z0-9 ]+>)/$1\L$2\E/g; > > > > The RE above captures and replaces all HTML tags with lowercase > > as desired except for any tag that has only one letter such as > > <P>, <B> or <I> > > > > It will get the </B>, </P> and </I> > > > > It properly ignores the <!DOCTYPE> tag > > > > What is the correct way to write the above RE? > > HTML tag names can't contain spaces, so you want > > s|(</?)([A-Z][A-Z0-9]*)|$1\L$2|g; > > Rob
Thanks for the effort. Your RE does not work as well as what I have. HTML tags can contain spaces. You forgot about <p id="something" class="something"> -- Silverfox Please don't CC me. I am subscribed to the list. -- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED] http://learn.perl.org/