-----Original Message-----
>From: Chris Stebbing <[EMAIL PROTECTED]>
>Sent: Apr 23, 2007 9:07 PM
>To: Delphi List <[email protected]>
>Subject: HTML Browser / Parser
>
>Hi All,
>
>this may seem like a strange request, but is there a program out 
>there that will take a web page and break it down into a tree-like 
>structure?  I need to do some web scraping and it would assist me 
>greatly if there was a program which would allow me to traverse the 
>HTML code in a tree-like manner.

The HTML rendering engine we created for TurboPower Internet Professional 
builds a linked tree data structure from the parsed HTML before rendering 
it into the HTML display component. I recall manipulating that tree in order 
to build the HTTP: engine's handling of FTP sites back in the day.
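
To make the idea concrete, here is a rough sketch of what "parse into a linked 
tree, then walk it" looks like. This is NOT Internet Professional code and does 
not use its API; it is plain Python on top of the standard library's 
html.parser, and the names Node, TreeBuilder, parse_html and walk are made up 
for the illustration. It also makes no attempt to repair badly nested markup 
the way a real browser engine would.

```python
from html.parser import HTMLParser

# Elements that never take a closing tag, so they must not become the
# "current" parent for whatever follows them.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img",
             "input", "link", "meta", "source", "track", "wbr"}


class Node:
    """One element in the tree, linked to its parent and children (hypothetical type)."""

    def __init__(self, tag, attrs=None, parent=None):
        self.tag = tag
        self.attrs = dict(attrs or [])
        self.parent = parent
        self.children = []
        self.text = ""  # text found directly inside this element


class TreeBuilder(HTMLParser):
    """Builds a Node tree from the parser's start/end/data callbacks."""

    def __init__(self):
        super().__init__()
        self.root = Node("#document")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs, self.current)
        self.current.children.append(node)
        if tag not in VOID_TAGS:
            self.current = node  # descend into the new element

    def handle_endtag(self, tag):
        # Climb to the nearest open element with this tag; ignore stray closers.
        node = self.current
        while node is not self.root and node.tag != tag:
            node = node.parent
        if node is not self.root:
            self.current = node.parent

    def handle_data(self, data):
        self.current.text += data


def parse_html(source):
    """Parse an HTML string and return the root Node of the tree."""
    builder = TreeBuilder()
    builder.feed(source)
    builder.close()
    return builder.root


def walk(node, depth=0):
    """Yield (depth, node) pairs in document order."""
    yield depth, node
    for child in node.children:
        yield from walk(child, depth + 1)
```

For scraping, you would call parse_html() on the page source and then loop 
over walk(root), picking out the nodes whose tag or attrs match what you are 
after; each node's parent link lets you climb back up for context.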

I can't promise how gracefully it will handle newer stuff (ASPX, ColdFusion, 
embedded scripts, etc.), as I don't think it's been touched since we made it 
open source, but it's free and there for the taking:

http://sourceforge.net/projects/tpipro/

HTH

Stephen Posey
[EMAIL PROTECTED]


_______________________________________________
Delphi mailing list -> [email protected]
http://www.elists.org/mailman/listinfo/delphi
