Do you mean stripping the header, from the top of the page (doctype declaration) to the end of the opening <body> tag?
If this is what you have in mind, try
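^.*?<body[^>]*>
This means: starting at the top of the page, match as few characters as possible up to the first '<body', then any number of characters that aren't '>', then a '>'. Use a case-insensitive match (REReplaceNoCase, for example) so that <BODY> is caught as well, and check that '.' matches newlines in your regex engine, since the header spans several lines.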
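If you want to sanity-check a header-stripping pattern before wiring it into your page, here is a rough sketch in Python (not ColdFusion, so treat it only as a test bench). The pattern it uses is an assumption: a non-greedy match from the start of the text through the opening <body ...> tag, with case-insensitive and dot-matches-newline flags. The names HEADER_RE, strip_header and page are made up for the example.

```python
import re

# Assumed pattern: lazily match everything from the very start of the text
# through the end of the opening <body ...> tag.
# IGNORECASE catches <BODY>; DOTALL lets '.' cross line breaks.
HEADER_RE = re.compile(r"\A.*?<body[^>]*>", re.IGNORECASE | re.DOTALL)


def strip_header(html: str) -> str:
    """Return html with everything up to and including <body ...> removed."""
    return HEADER_RE.sub("", html, count=1)


# Sample page modelled on the one in the question (hypothetical content).
page = (
    '<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">\n'
    "<html> <head> <title>Untitled</title> </head>\n"
    '<BODY bgcolor="#ffffff">\n'
    "<p>Hello</p>\n"
    "</body> </html>"
)

print(strip_header(page))
```

If the text has no <body> tag at all, the pattern does not match and the string comes back unchanged.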
HTH
Vik
Steve Onnis wrote:
Does anyone have a regular expression around that will strip out everything up to and including the opening <body> tag?
so
----------------------------------------------------------------
<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html> <head> <title>Untitled</title> </head>
<body>
----------------------------------------------------------------
I could do it with a series of FIND() functions, but a regular expression would be cleaner.
Regards
Steve Onnis Domain Concept Designs +61 422 337 685 +61 3 9444 7504
http://www.domainconceptdesigns.com
("If you think it can't be done, you haven't asked me!") - Steve Onnis
