-----Original Message-----
>From: Chris Stebbing <[EMAIL PROTECTED]>
>Sent: Apr 23, 2007 9:07 PM
>To: Delphi List <[email protected]>
>Subject: HTML Browser / Parser
>
>Hi All,
>
>this may seem like a strange request, but is there a program out
>there that will take a web page and break it down into a tree-like
>structure? I need to do some web scraping and it would assist me
>greatly if there was a program which would allow me to traverse the
>HTML code in a tree-like manner..

The HTML rendering engine we created for TurboPower Internet
Professional built a linked, tree-type data structure from the parsed
HTML before rendering it into the HTML display component. I recall
manipulating that tree in order to construct the HTTP: engine handling
for the FTP sites back in the day.

I can't promise how gracefully it will handle newer stuff (ASPX,
ColdFusion, embedded scripts, etc.), as I don't think it's been touched
since we made it open source, but it's free and there for the taking:

http://sourceforge.net/projects/tpipro/

HTH

Stephen Posey
[EMAIL PROTECTED]
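
P.S. For anyone who wants to see what "a tree of the parsed HTML" means in practice, here is a rough sketch. It is in Python rather than Delphi, it uses only the standard library's html.parser module, and it is NOT the Internet Professional API: the names Node, TreeBuilder, parse_html, walk and find_all are made up for illustration. It only shows the general idea of turning start/end tag events into parent/child links that you can then traverse, and it makes no attempt to repair badly broken markup.

```python
from html.parser import HTMLParser

# Tags that never have a closing tag, so they must not stay "open".
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class Node:
    """One element in the tree (hypothetical type, for illustration only)."""

    def __init__(self, tag, attrs=None, parent=None):
        self.tag = tag
        self.attrs = dict(attrs or [])
        self.parent = parent
        self.children = []
        self.text = ""


class TreeBuilder(HTMLParser):
    """Builds a Node tree from the parser's start/end/data events."""

    def __init__(self):
        super().__init__()
        self.root = Node("#document")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, attrs, self.current)
        self.current.children.append(node)
        if tag not in VOID_TAGS:
            self.current = node  # descend: later content belongs to this node

    def handle_endtag(self, tag):
        # Walk up to the nearest open element with this tag and close it;
        # stray end tags that match nothing are ignored.
        node = self.current
        while node is not None and node.tag != tag:
            node = node.parent
        if node is not None and node.parent is not None:
            self.current = node.parent

    def handle_data(self, data):
        # Text is attached to the element that is currently open.
        self.current.text += data.strip()


def parse_html(source):
    """Parse an HTML string and return the root of the tree."""
    builder = TreeBuilder()
    builder.feed(source)
    builder.close()
    return builder.root


def walk(node, depth=0):
    """Yield (depth, node) pairs in document order."""
    yield depth, node
    for child in node.children:
        yield from walk(child, depth + 1)


def find_all(node, tag):
    """Return every descendant (or the node itself) with the given tag."""
    return [n for _, n in walk(node) if n.tag == tag]
```

For scraping, something like find_all(parse_html(page_source), "a") would give you every link node, and each node's attrs dictionary holds its href. Whether the tree inside Internet Professional is laid out the same way is something you would have to check in its source.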

