On 12/20/06, Marcos Dione <[EMAIL PROTECTED]> wrote:
On Wed, Dec 20, 2006 at 10:59:06AM -0800, yichun wei wrote:
> I am trying to grab some html pages via KHTMLPart.openURL and scrape
> the content I get. However I am not able to read out the HTML document
> sources I have in KHTMLPart.
just call:
domDocu= part.document ()
html= domDocu.toString ().string ()
that's a QString.
> toHTML() seemed to return nothing (None or ""), while toString() gave
> me an exception and my script crashed:
yes, under certain circumstances that happens. I think it's because
the KHTMLPart has no parentWidet or no parent or both. if you setup the
whole apparatus for showing the part, everythings works just fine.
Thanks a lot Marcos. I was using Jim Bublitz's
doc/examples/pyKHTMLPart.py and modified it from there. It appears to
me that parentWidget for the KHTMLPart is not 0:
class pyPartsMW(KParts.MainWindow):
def __init__(self, *args):
...
self.w = KHTMLPart(self, "HTMLart", self)
self.w.openURL (KURL("http://www.kde.org"))
domDocu = self.w.document ()
html = domDocu.toString().string()
...
Then I got the error message:
terminate called after throwing an instance of 'DOM::DOMException'
where .toString() is called. Is there any other reason leading to this
other than the 0 values when initiate the KHTMLPart object?
- yichun
_______________________________________________
PyKDE mailing list PyKDE@mats.imk.fraunhofer.de
http://mats.imk.fraunhofer.de/mailman/listinfo/pykde