Re: regular expression help

Jonathan Weber Mon, 24 Jul 2006 18:15:41 -0700

On 24 Jul 2006, at 5:48 PM, Rob Dixon wrote:

- The character wildcard '.' is just a dot within a characterclass, so [.\n]
will match only a dot or a newline

Ah, I hadn't realized that characters in [ ] are literals. Thatclears up a lot of the problem.

- Regexes aren't the best way of parsing HTML, unless the documentis verysimple and predictable. Take a look at somthing likeHTML::TreeBuilder if you're
doing this a lot on varying or non-trivial documents.

Yeah, I realize that it's not the best way to go about it, but I hadmany documents that were all the same, plus I figured this was a goodexcuse to learn regexes.


Thanks for the help!

Jonathan

--
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]
<http://learn.perl.org/> <http://learn.perl.org/first-response>

Re: regular expression help

Reply via email to