Re: Regrex Question

Mike Blezien Mon, 25 Nov 2013 11:46:24 -0800

Thank you for your and the other list members suggestions, gives me a good 
place to start.



Mike(mickalo)Blezien
=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=
Thunder Rain Internet Publishing
-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-
  ----- Original Message ----- 
  From: Jim Gibson 
  To: Perl List 
  Sent: Monday, November 25, 2013 1:23 PM
  Subject: Re: Regrex Question



  On Nov 25, 2013, at 10:55 AM, Mike Blezien wrote:

  > Hello,
  >  
  > Regular expression have never been my strong suite so hoping to get a litte 
help with a line in file I need to extract a portion of it.
  >  
  > The text I need to extract from this line is "November 21, 2013" from this 
line in the file, just the date:
  >  
  > Posted by <a href="mailto:someem...@email.com";>Some Name</a> on November 
21, 2013 at 23:21:58:<p>
  >  
  > what would be the regrex to use to extract the date from the line ?

  The usual advice applies: don't use regular expressions to parse HTML. 
However, lots of people do it anyway, myself included. Your success at 
extracting usable data depends upon how rigid the format of the HTML is from 
page to page.

  In your case, if the date always follows a link ('</a>') followed by 'on', 
and the date is always followed by 'at' and a time. you can use this:

    if( $line =~ m{ </a> \s+ on \s+ (\w+ \s \d{1,2} , \s \d{4}) \s at}x ) {
      print "The date is $1\n";
    }else{
      print "No match\n";
    }

  Note I am using the extended regular expression syntax with the x modifier.


  -- 
  To unsubscribe, e-mail: beginners-unsubscr...@perl.org
  For additional commands, e-mail: beginners-h...@perl.org
  http://learn.perl.org/

Re: Regrex Question

Reply via email to