On Mon, Jan 15, 2007 at 01:31:55PM +0000, Nic James Ferrier wrote: > Daniel Veillard <[EMAIL PROTECTED]> writes: > > > Nic said: > >> *** glibc detected *** double free or corruption (!prev): 0x081b6300 *** > >> Aborted > >> > > But did you update libxslt too and make install for it too ? Please do > > he fixed the problems in libxslt not in libxml2, > > Ah! > > Yes. It stopped segfaulting. I can't get it to parse the HTML... but > it has stopped segfaulting.
okay. that's the important point. > shows this for every document I get back that parses: > > <?xml version="1.0" encoding="UTF-8" standalone="yes"?> > <!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN" > "http://www.w3.org/TR/REC-html40/loose.dtd"> > > Here's the relevant bit of the loader again: > > # The parserContext and resulting document > parserContext = libxml2.parserCtxt(_obj=pctx) what is pctx ??? i find suspicious the fact you could provide a C parser context here. > doc = None > if url == "/one": > doc = parserContext.htmlCtxtReadFile("file2.html", "UTF8", 1) > else: > doc = parserContext.ctxtReadDoc("""<document> just use htmlReadFile and forget about trying to address directly the parser context. With python overhead you won't gain anything to create a separately accessible object. The less you touch things though Python the better it will be, really. That said HTML parsing works for me when using htmlReadFile. Daniel -- Red Hat Virtualization group http://redhat.com/virtualization/ Daniel Veillard | virtualization library http://libvirt.org/ [EMAIL PROTECTED] | libxml GNOME XML XSLT toolkit http://xmlsoft.org/ http://veillard.com/ | Rpmfind RPM search engine http://rpmfind.net/ _______________________________________________ xml mailing list, project page http://xmlsoft.org/ [email protected] http://mail.gnome.org/mailman/listinfo/xml
