Hi, Brett:

On 12-Nov-00, Brett Handley wrote:

> > Anyone who has ever used a good HTML stripper, such as the excellent HTTX
> > (Amiga), knows how useful they can be, when called by other programs.

> How does such a thing deal with tables and other anti-text HTML format
> elements?

For my needs, HTTX strips tables just fine - all I need is the contents; I
don't care about its final format. The stripped format depends upon the
author's talents (or lack thereof). Sometimes the stripped format clearly
resembles the original table; sometimes its a jumbled mess. So long as the
format of each page is at least somewhat consistent, I can find what I want.
Parsing does all of the hard work.

Normally, non-text elements (HTML format controls) are of no value to me and
are fully eliminated.

-- 

                ---===///||| Donald Dalley |||\\\===---
                     The World of AmiBroker Support
                  http://webhome.idirect.com/~ddalley
                          UIN/ICQ#: 65203020

-- 
To unsubscribe from this list, please send an email to
[EMAIL PROTECTED] with "unsubscribe" in the 
subject, without the quotes.

Reply via email to