Watch out for something interesting coming out soon that may help in this kind of scenario - our HTML to XHTML converter. This is a core subsystem that now forms the heart of a whole bunch of techonlogies and solutions I've been working on recently. Provided your starting point is HTML, albeit that it may include JavaScript, CSS, PHP script, COS within <script> tags, it can all be converted to XHTML, and once in that format, it can be transformed in any way you like using DOM API methods.
Rob On Thu, 08 Jul 2004 23:14:59 -0400, Denver Braughler <[EMAIL PROTECTED]> wrote: >> does anybody know how to strip out html tags from a text using regexp > >If Cach� has regular expressions, that is good news to me. > >> (or anything else) > >The DOM parser (Robb Tweed?) might help you depending on your purpose. > >I use an edit utility classmethod that replaces from string1 to string2 with string3 >for every occurrence. >I strip everything from < to >, and replace specific &...; occurrences with a >single character. >You could start by stripping everything from <script> to </script>. >What the PHP code does is less general. > >Where I have had numerous such edits, I have listed them in $text() and iterated >through them at runtime. --- Rob Tweed M/Gateway Developments Ltd Global DOMination with eXtc : http://www.mgateway.com ---
