On 24 Jul 2006, at 5:48 PM, Rob Dixon wrote:
- The character wildcard '.' is just a dot within a character
class, so [.\n]
will match only a dot or a newline
Ah, I hadn't realized that characters in [ ] are literals. That
clears up a lot of the problem.
- Regexes aren't the best way of parsing HTML, unless the document
is very
simple and predictable. Take a look at somthing like
HTML::TreeBuilder if you're
doing this a lot on varying or non-trivial documents.
Yeah, I realize that it's not the best way to go about it, but I had
many documents that were all the same, plus I figured this was a good
excuse to learn regexes.
Thanks for the help!
Jonathan
--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
<http://learn.perl.org/> <http://learn.perl.org/first-response>