Mark E. Shoulson scripsit: > Heh... I've occasionally caught myself almost wishing for this kind of > setup, ridiculous though it be. It would be nice to be able to get just > the *content* of the text without having to bother with all that mucking > about with HTML rendering engines and whatnot.
TSaxon (http://www.ccil.org/~cowan/XML/tagsoup/tsaxon) is the ticket here, with a trivial stylesheet that just specifies text output. Use the -H switch to allow arbitrary HTML input. -- John Cowan [EMAIL PROTECTED] http://www.reutershealth.com "Not to know The Smiths is not to know K.X.U." --K.X.U.

