I'm trying to do some web page scraping using IHTMLDocument2, which is
working fairly well and I can grab the second paragraph on a web page by
doing something like:
p := iDoc.all.tags('P');
if p.Length = 2 then
result := p.Item(1).InnerText;
Where iDoc is an isnstance of IHTMLDocument2.
scraping using IHTMLDocument2
I'm trying to do some web page scraping using IHTMLDocument2, which is
working fairly well and I can grab the second paragraph on a web page by
doing something like:
p := iDoc.all.tags('P');
if p.Length = 2 then
result := p.Item(1).InnerText;
Where iDoc
...@delphi.org.nz] *On Behalf Of *Alister Christie
*Sent:* Friday, 29 January 2010 12:22 p.m.
*To:* NZ Borland Developers Group - Delphi List
*Subject:* [DUG] web scraping using IHTMLDocument2
I'm trying to do some web page scraping using IHTMLDocument2, which is
working fairly well and I can grab
...@delphi.org.nz]
On Behalf Of Alister Christie
Sent: Friday, 29 January 2010 2:40 p.m.
To: NZ Borland Developers Group - Delphi List
Subject: Re: [DUG] web scraping using IHTMLDocument2
Thanks Cameron,
It does indeed have that header, how do I make this work?
XMLDocument1.FileName := 'c:\temp
Christie
*Sent:* Friday, 29 January 2010 2:40 p.m.
*To:* NZ Borland Developers Group - Delphi List
*Subject:* Re: [DUG] web scraping using IHTMLDocument2
Thanks Cameron,
It does indeed have that header, how do I make this work?
XMLDocument1.FileName := 'c:\temp\test.htm';
XMLDocument1.Active
Developers Group - Delphi List
Subject: Re: [DUG] web scraping using IHTMLDocument2
Thanks, although it looks like the html documents are not xml compliant,
so I'll probably have to either parse the file manually or continue
experimenting with IHTMLDocument2 (and hopefully find some documentation
-Original Message-
From: delphi-boun...@delphi.org.nz [mailto:delphi-boun...@delphi.org.nz]
On Behalf Of Alister Christie
Sent: Friday, 29 January 2010 4:26 p.m.
To: NZ Borland Developers Group - Delphi List
Subject: Re: [DUG] web scraping using IHTMLDocument2
Thanks, although it looks like