Re: HTML::Parse: what if I don't want entities decoded?

John Stracke Fri, 06 Apr 2001 13:36:33 -0700

Gisle Aas wrote:

> John Stracke <[EMAIL PROTECTED]> writes:
>
> > I just noticed that, when HTML::Parse encountes &nbsp;, it sends
> > it to me as \240.  Since I want to treat my files as UTF-8, this
> > is a problem.  Is there any way to tell it not to decode
> > entities, or do I need to bite the bullet and implement the UTF-8
> > option the manpage talks about?
>
> Do you mean HTML::Parse or HTML::Parser here?

HTML::Parser, sorry.

> HTML::Parser decode entities with the 'dtext' argspec and leave them
> alone for 'text'.

I'm not specifying dtext, and &nbsp; is getting decoded.

Uh...but I might be using an old form of the interface, with different
defaults.  My subclass's constructor just calls HTML::Parser->new().

--
/==============================================================\
|John Stracke    | http://www.ecal.com |My opinions are my own.|
|Chief Scientist |=============================================|
|eCal Corp.      |What do you mean, *you're* a solipsist?      |
|[EMAIL PROTECTED]|                                             |
\==============================================================/

Re: HTML::Parse: what if I *don't* want entities decoded?

Reply via email to

Re: HTML::Parse: what if I don't want entities decoded?