Watch out for something interesting coming out soon that may help in
this kind of scenario: our HTML to XHTML converter.  This is a core
subsystem that now forms the heart of a whole bunch of technologies
and solutions I've been working on recently.  Provided your starting
point is HTML, even if it includes JavaScript, CSS, PHP script or
COS within <script> tags, it can all be converted to XHTML, and once
in that format, it can be transformed in any way you like using DOM
API methods.
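
As a rough illustration of the "once it is XHTML, use the DOM" step, here
is a minimal sketch in Python using the standard xml.dom.minidom module.
It is not the converter mentioned above. It assumes the input is already
well-formed XHTML with no named entities such as &nbsp; (which a plain XML
parser rejects without a DTD), and the function name text_from_xhtml is
made up for this example.

```python
from xml.dom import minidom


def text_from_xhtml(xhtml: str) -> str:
    """Return the text content of well-formed XHTML, skipping script/style."""
    doc = minidom.parseString(xhtml)
    parts = []

    def walk(node):
        # Ignore the contents of <script> and <style> elements entirely.
        if node.nodeType == node.ELEMENT_NODE and node.tagName.lower() in ("script", "style"):
            return
        # Collect character data from text and CDATA nodes.
        if node.nodeType in (node.TEXT_NODE, node.CDATA_SECTION_NODE):
            parts.append(node.data)
        for child in node.childNodes:
            walk(child)

    walk(doc.documentElement)
    # Concatenate in document order, then collapse runs of whitespace.
    return " ".join("".join(parts).split())


if __name__ == "__main__":
    page = "<html><body><p>Hello <b>world</b></p><script>var x = 1;</script></body></html>"
    print(text_from_xhtml(page))
```

Because the text nodes are concatenated without a separator, adjacent block
elements such as two <p> tags with no whitespace between them will run
together; a real tool would probably insert a space or newline per block.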

Rob

On Thu, 08 Jul 2004 23:14:59 -0400, Denver Braughler wrote:

>> does anybody know how to strip out html tags from a text using regexp
>
>If Caché has regular expressions, that is good news to me.
>
>> (or anything else)
>
>The DOM parser (Robb Tweed?) might help you depending on your purpose.
>
>I use an edit utility classmethod that replaces from string1 to string2 with string3 
>for every occurrence.
>I strip everything from < to >, and replace specific &...; occurrences with a 
>single character.
>You could start by stripping everything from <script> to </script>.
>What the PHP code does is less general.
>
>Where I have had numerous such edits, I have listed them in $text() and iterated 
>through them at runtime.
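
The approach quoted above (replace everything from string1 to string2 with
string3, driven by a list of edits) could be sketched like this in Python.
This is only an illustration, not Denver's classmethod: the names
replace_between, EDITS, ENTITIES and strip_html are invented here, and the
entity table is a small sample rather than a complete list.

```python
import re


def replace_between(text, start, end, replacement=""):
    """Replace every span from `start` through the next `end` with `replacement`."""
    pattern = re.escape(start) + r".*?" + re.escape(end)
    return re.sub(pattern, replacement, text, flags=re.DOTALL | re.IGNORECASE)


# Edits are applied in order: drop whole script blocks first, then any tag.
EDITS = [
    ("<script", "</script>", ""),
    ("<", ">", ""),
]

# "&amp;" comes last so that "&amp;lt;" becomes "&lt;" and is not unescaped twice.
ENTITIES = {
    "&lt;": "<",
    "&gt;": ">",
    "&quot;": '"',
    "&nbsp;": " ",
    "&amp;": "&",
}


def strip_html(text):
    """Apply the listed from/to edits, then map a few entities to single characters."""
    for start, end, replacement in EDITS:
        text = replace_between(text, start, end, replacement)
    for entity, char in ENTITIES.items():
        text = text.replace(entity, char)
    return text


if __name__ == "__main__":
    sample = '<p>a &lt; b</p><script type="x">var t = "<b>";</script> &amp; c'
    print(strip_html(sample))
```

Like any from-< -to-> stripping, this will misfire on a literal ">" inside
an attribute value or on unclosed tags, which is where a real parser is
the safer choice.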

---
Rob Tweed
M/Gateway Developments Ltd 

Global DOMination with eXtc : http://www.mgateway.com
---
