I just had to drop in here and add my comments.

HTML can be, and in most cases is, badly misused and poorly structured.
Now, if you can make sure that the HTML is structured correctly, then
I would think that the MSHTML DOM would suffice to create your tree.

However, you may also be able to write a recursive function that walks
the IHTMLDocument2 object model, visiting each element and then its
children. While I have not attempted it, it doesn't look like it would
be too difficult to do.
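
To show the shape of the recursion I have in mind, here is a rough
sketch. It is in Python with the standard html.parser module rather
than Delphi and MSHTML, purely so it can be run anywhere; Node,
TreeBuilder, parse and walk are names I made up for the illustration
and are not part of any library. Against IHTMLDocument2 the same idea
would presumably start at the document's body element and descend
through each element's children collection, but I have not tried that.

```python
from html.parser import HTMLParser

# Tags that never take a closing tag, so they must not stay "open".
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class Node:
    """Hypothetical tree node: a tag name, its attributes, its children."""

    def __init__(self, tag, attrs=None):
        self.tag = tag
        self.attrs = dict(attrs or [])
        self.children = []


class TreeBuilder(HTMLParser):
    """Builds a Node tree from start/end tag events (text is ignored)."""

    def __init__(self):
        super().__init__()
        self.root = Node("#document")
        self.stack = [self.root]  # currently open elements, innermost last

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs)
        self.stack[-1].children.append(node)
        if tag not in VOID_TAGS:
            self.stack.append(node)

    def handle_endtag(self, tag):
        # Close back to the nearest matching open tag; ignore stray closers.
        for i in range(len(self.stack) - 1, 0, -1):
            if self.stack[i].tag == tag:
                del self.stack[i:]
                break


def parse(html):
    """Parse an HTML string and return the root Node."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    return builder.root


def walk(node, depth=0):
    """Recursively yield (depth, tag) for a node and all its descendants."""
    yield depth, node.tag
    for child in node.children:
        yield from walk(child, depth + 1)


if __name__ == "__main__":
    page = "<html><body><p>Hi<br>there</p><div><a href='x'>l</a></div></body></html>"
    for depth, tag in walk(parse(page)):
        print("  " * depth + tag)
```

Note that the sketch only behaves well on tidy markup. An unclosed tag
such as a missing </li> simply stays open, so later siblings end up
nested inside it, which is exactly the sort of trouble badly structured
HTML causes.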


--- Stephen Posey <[EMAIL PROTECTED]> wrote:

> -----Original Message-----
> >From: Chris Stebbing <[EMAIL PROTECTED]>
> >Sent: Apr 23, 2007 9:07 PM
> >To: Delphi List <[email protected]>
> >Subject: HTML Browser / Parser
> >
> >Hi All,
> >
> >this may seem like a strange request, but is there a program out 
> >there that will take a web page and break it down into a tree-like 
> >structure?  I need to do some web scraping and it would assist me 
> >greatly if there was a program which would allow me to traverse the 
> >HTML code in a tree-like manner.
> 
> The HTML rendering engine we created for TurboPower Internet Professional
> created a linked tree type data structure of the parsed HTML prior to
> rendering it into the HTML display component. I recall manipulating that in
> order to construct the HTTP: engine handling for the FTP sites back in the
> day.
> 
> I can't promise you how gracefully it will handle newer stuff (aspx, cold
> fusion, embedded scripts, etc.) as I don't think it's been touched since we
> made it open source, but it's free and there for the taking:
> 
> http://sourceforge.net/projects/tpipro/
> 
> HTH
> 
> Stephen Posey
> [EMAIL PROTECTED]
> 
> 
> _______________________________________________
> Delphi mailing list -> [email protected]
> http://www.elists.org/mailman/listinfo/delphi
> 

