On 24 Jul 2006, at 5:48 PM, Rob Dixon wrote:

- The character wildcard '.' is just a dot within a character class, so [.\n]
will match only a dot or a newline

Ah, I hadn't realized that characters in [ ] are literals. That clears up a lot of the problem.

- Regexes aren't the best way of parsing HTML, unless the document is very simple and predictable. Take a look at somthing like HTML::TreeBuilder if you're
doing this a lot on varying or non-trivial documents.

Yeah, I realize that it's not the best way to go about it, but I had many documents that were all the same, plus I figured this was a good excuse to learn regexes.

Thanks for the help!

Jonathan

--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
<http://learn.perl.org/> <http://learn.perl.org/first-response>


Reply via email to