Regex question

M. Lewis Wed, 03 Jan 2007 23:59:32 -0800

I'm trying to parse the domain name out of some URLs. In the exampledata, my regex works fine on the first two URLs, but clips off the firsttwo characters of the domain on the third example. My regex probablycould be much better.


#!/usr/bin/perl

use strict;
use warnings;

my $regex = qr'http://\w+?\.?\w+?\.?(\w+\.com)';

while(<DATA>){
  if (/$regex/o){print "$1 \t $_"}
}

__DATA__
http://www.asldkjlkwerj.com/
http://w71r2xk22q1affwp1ewpjeee.alaskjhhawe.com/?
http://qwlkjekwl.com/?IJESRKUFZedFRCVFJYQV4cUFtY

Thanks for any pointers,
Mike

--

It is easier to change the specification to fit the program than viceversa.

  02:50:01 up 20 days, 23:41,  0 users,  load average: 0.21, 0.22, 0.24

 Linux Registered User #241685  http://counter.li.org

--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
http://learn.perl.org/

Regex question

Reply via email to