On Mon, 09 Jun 2003 07:13:43 +0200, you wrote:
>Does anyone have a good regular expression for capturing all http URL in
>a string?
>now i'm using ?(http://[a-z0-9-/_;&=+-\.\?:@]+)\b?mi
>in one of my programs, that is, all strings beginning with http:// and
>contain chars like a-z0-9 ...
>
>does anione have something better?
Hmm. I just glanced at RFC 2396, which offers this regex to validate an URI:
^(([^:/?#]+):)?(//([^/?#]*))?([^?#]*)(\?([^#]*))?(#(.*))?
12 3 4 5 6 7 8 9
the numbers indicate the reference points for each subexpression
1 = http:
2 = http
3 = //www.ics.uci.edu
4 = www.ics.uci.edu
5 = /pub/ietf/uri/
6 = <undefined>
7 = <undefined>
8 = #Related
9 = Related
http://www.ietf.org/rfc/rfc2396.txt
--
PHP General Mailing List (http://www.php.net/)
To unsubscribe, visit: http://www.php.net/unsub.php