On Mon, Jan 15, 2007 at 01:31:55PM +0000, Nic James Ferrier wrote:
> Daniel Veillard <[EMAIL PROTECTED]> writes:
> 
> > Nic said:
> >>  *** glibc detected *** double free or corruption (!prev): 0x081b6300 ***
> >>  Aborted
> >> 
> >   But did you update libxslt too and make install for it too ? Please do
> > he fixed the problems in libxslt not in libxml2,
> 
> Ah!
> 
> Yes. It stopped segfaulting. I can't get it to parse the HTML... but
> it has stopped segfaulting.

  okay. that's the important point.

> shows this for every document I get back that parses:
> 
> <?xml version="1.0" encoding="UTF-8" standalone="yes"?>
> <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" 
> "http://www.w3.org/TR/REC-html40/loose.dtd";>
> 
> Here's the relevant bit of the loader again:
> 
>       # The parserContext and resulting document
>       parserContext = libxml2.parserCtxt(_obj=pctx)

 what is pctx ??? i find suspicious the fact you could provide a C parser
context here.

>       doc = None
>       if url == "/one":
>           doc = parserContext.htmlCtxtReadFile("file2.html", "UTF8", 1)
>       else:
>           doc = parserContext.ctxtReadDoc("""<document>

  just use htmlReadFile and forget about trying to address directly the
parser context. With python overhead you won't gain anything to create
a separately accessible object. The less you touch things though Python
the better it will be, really. That said HTML parsing works for me when
using htmlReadFile.

Daniel

-- 
Red Hat Virtualization group http://redhat.com/virtualization/
Daniel Veillard      | virtualization library  http://libvirt.org/
[EMAIL PROTECTED]  | libxml GNOME XML XSLT toolkit  http://xmlsoft.org/
http://veillard.com/ | Rpmfind RPM search engine  http://rpmfind.net/
_______________________________________________
xml mailing list, project page  http://xmlsoft.org/
[email protected]
http://mail.gnome.org/mailman/listinfo/xml

Reply via email to