Hi everyone,
I am using Python's re module to extract some data from html. The
following code never returns, and I was wondering if someone can
explain to me why. Is this a problem with my regexp? (I tried really
hard to find one.)
The string contains three records (list items in a html page).
My apologies: Google Groups messed up the formatting. The regexp
should read:
regexp = re.compile(li class=\post\.*?h4 class=\desc\a
href=
\(.*?)\ rel=\nofollow\(.*?)/a.*?/div\s*(?:p class=\notes
\(.*?)/p)?.*?div class=\meta\(?:to ((?:a class=\tag\.*? )
+))*.*?span class=\date\
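
One possible explanation, offered as a sketch rather than a diagnosis: the pattern above puts a repeated group inside another repeated group, with lazy .*? runs in between. When the overall match fails, a backtracking engine such as Python's re may have to try a very large number of ways to split the text among those pieces, which can look like a call that never returns. The example below sidesteps this by matching one record at a time and pulling the tags out in a second pass. The HTML string is a made-up stand-in for the snipped one, and the names html, record_re, tag_re and records are hypothetical.

```python
import re

# Hypothetical three-record sample; the real (snipped) markup may differ.
html = (
    '<li class="post"><h4 class="desc">'
    '<a href="http://example.com/1" rel="nofollow">First</a></h4>'
    '<div class="meta">to <a class="tag" href="/t/x">x</a> '
    '<a class="tag" href="/t/y">y</a> '
    '<span class="date">Jan 1</span></div></li>\n'
    '<li class="post"><h4 class="desc">'
    '<a href="http://example.com/2" rel="nofollow">Second</a></h4>'
    '<p class="notes">a note</p>'
    '<div class="meta"><span class="date">Jan 2</span></div></li>\n'
    '<li class="post"><h4 class="desc">'
    '<a href="http://example.com/3" rel="nofollow">Third</a></h4>'
    '<div class="meta">to <a class="tag" href="/t/z">z</a> '
    '<span class="date">Jan 3</span></div></li>\n'
)

# One record per match. The negated classes [^"]* and [^<]* cannot run
# past a quote or a tag, and no repeated group is nested in another.
record_re = re.compile(
    r'<li class="post">.*?'
    r'<a href="([^"]*)" rel="nofollow">([^<]*)</a>'
    r'(.*?)</li>',
    re.DOTALL,
)

# Second pass: collect the tag links from the remainder of each record.
tag_re = re.compile(r'<a class="tag"[^>]*>([^<]*)</a>')

records = []
for m in record_re.finditer(html):
    url, title, rest = m.groups()
    records.append((url, title, tag_re.findall(rest)))

for record in records:
    print(record)
```

Splitting the work into two simple patterns keeps each one easy to reason about. For anything beyond small, regular snippets, an HTML parser is usually a safer choice than a regexp.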