Re: why does this call to re.findall() loop forever?

2008-11-10 Thread Nick Craig-Wood
[EMAIL PROTECTED] [EMAIL PROTECTED] wrote: My apologies, given that Google Groups messes up the formatting, the regexp should read regexp = re.compile(li class=\post\.*?h4 class=\desc\a href= \(.*?)\ rel=\nofollow\(.*?)/a.*?/div\s*(?:p class=\notes \(.*?)/p)?.*?div class=\meta\(?:to

Re: why does this call to re.findall() loop forever?

2008-11-09 Thread Terry Reedy
[EMAIL PROTECTED] wrote: Hi everyone, I am using Python's re module to extract some data from html. The following code never returns, and I was wondering if someone can explain to me why. Is this a problem with my regexp (I tried really hard to find it?)? [snip] html/xml string regexp =

why does this call to re.findall() loop forever?

2008-11-09 Thread james . kirin40
Hi everyone, I am using Python's re module to extract some data from html. The following code never returns, and I was wondering if someone can explain to me why. Is this a problem with my regexp (I tried really hard to find it?)? The string contains three records (list items in a html page).

Re: why does this call to re.findall() loop forever?

2008-11-09 Thread james . kirin40
My apologies, given that Google Groups messes up the formatting, the regexp should read regexp = re.compile(li class=\post\.*?h4 class=\desc\a href= \(.*?)\ rel=\nofollow\(.*?)/a.*?/div\s*(?:p class=\notes \(.*?)/p)?.*?div class=\meta\(?:to ((?:a class=\tag\.*? ) +))*.*?span class=\date\