Re: Unexpected behaviour with HTMLParser...

2007-10-10 Thread Andrew Durdin
On 10/9/07, Just Another Victim of the Ambient Morality <[EMAIL PROTECTED]> wrote: > > "Diez B. Roggisch" <[EMAIL PROTECTED]> wrote in message > news:[EMAIL PROTECTED] > > > > Without code, that's hard to determine. But you are aware of e.g. > > > > handle_entityref(name) > > handle_charref(ref) >

Re: Unexpected behaviour with HTMLParser...

2007-10-10 Thread Stefan Behnel
Just Another Victim of the Ambient Morality wrote: > HTMLParser is behaving in, what I find to be, strange ways and I would > like to better understand what it is doing and why. In case you also want an HTML library that is easy to use (and powerful and flexible and...), look at lxml.html. h

Re: Unexpected behaviour with HTMLParser...

2007-10-09 Thread Diez B. Roggisch
Just Another Victim of the Ambient Morality schrieb: > "Diez B. Roggisch" <[EMAIL PROTECTED]> wrote in message > news:[EMAIL PROTECTED] >> Just Another Victim of the Ambient Morality schrieb: >>> HTMLParser is behaving in, what I find to be, strange ways and I >>> would like to better underst

Re: Unexpected behaviour with HTMLParser...

2007-10-09 Thread Just Another Victim of the Ambient Morality
"Diez B. Roggisch" <[EMAIL PROTECTED]> wrote in message news:[EMAIL PROTECTED] > Just Another Victim of the Ambient Morality schrieb: >> HTMLParser is behaving in, what I find to be, strange ways and I >> would like to better understand what it is doing and why. >> >> First, it doesn't a

Re: Unexpected behaviour with HTMLParser...

2007-10-09 Thread Diez B. Roggisch
Just Another Victim of the Ambient Morality schrieb: > HTMLParser is behaving in, what I find to be, strange ways and I would > like to better understand what it is doing and why. > > First, it doesn't appear to translate HTML escape characters. I don't > know the actual terminology but

Unexpected behaviour with HTMLParser...

2007-10-09 Thread Just Another Victim of the Ambient Morality
HTMLParser is behaving in, what I find to be, strange ways and I would like to better understand what it is doing and why. First, it doesn't appear to translate HTML escape characters. I don't know the actual terminology but things like & don't get translated into & as one would like.