I ported JTidy to J# in Beta2 days: http://www.thundermain.com/code/JSharpTidy.aspx
I attempted to port it to C# but eventually gave up. Perhaps someone with VJ6 and JUMP might be able to easily port it.

Richard

> -----Original Message-----
> From: Moderated discussion of advanced .NET topics.
> [mailto:[EMAIL PROTECTED]]On Behalf Of Craig Andera
> Sent: 31 May 2002 14:37
> To: [EMAIL PROTECTED]
> Subject: Re: [ADVANCED-DOTNET] Web Page Content Parsing
>
>
> > Are there any better approaches to meet the requirement - maybe
> > something like re-usable HTML parsers or Web page content
> > parsers already available or pre-defined classes/libraries in .Net?
>
> Well, there's nothing built-in, so you have a few choices. This is the
> order I would do them in.
>
> 1) If you can guarantee that the resulting HTML is XHTML compliant, you
> can use an XML parser to process it.
> 2) You can try to find some reusable code that someone else already
> wrote. I believe someone was going to try to do this as part of Genghis:
> http://www.sellsbrothers.com/genghis.
> 3) You can use COM interop to muck around with getting Internet Explorer
> to do the parsing for you.
>
> You can read messages from the Advanced DOTNET archive,
> unsubscribe from Advanced DOTNET, or
> subscribe to other DevelopMentor lists at http://discuss.develop.com.
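
For what it's worth, option 1 in the quoted message (use an XML parser when the page is guaranteed to be XHTML compliant) can be sketched roughly as below. This is only an illustration in Python using the standard xml.etree.ElementTree module, not .NET code; the sample page, the example.com URLs, and the helper names extract_links and page_title are all made up for the example. The same idea should carry over to any XML parser.

```python
import xml.etree.ElementTree as ET

# Namespace that XHTML documents declare on their root element.
XHTML_NS = "http://www.w3.org/1999/xhtml"

# Hypothetical sample page; a real one would come from an HTTP response.
PAGE = """<html xmlns="http://www.w3.org/1999/xhtml">
  <head><title>Sample page</title></head>
  <body>
    <p>See <a href="http://example.com/a">first</a> and
       <a href="http://example.com/b">second</a>.</p>
  </body>
</html>"""


def extract_links(xhtml_text):
    """Return (href, link text) pairs for every <a> element in the page."""
    # fromstring raises ET.ParseError if the markup is not well-formed XML.
    root = ET.fromstring(xhtml_text)
    links = []
    for anchor in root.iter("{%s}a" % XHTML_NS):
        text = "".join(anchor.itertext()).strip()
        links.append((anchor.get("href"), text))
    return links


def page_title(xhtml_text):
    """Return the text of <head><title>, or None if there is no title."""
    root = ET.fromstring(xhtml_text)
    title = root.find("{%s}head/{%s}title" % (XHTML_NS, XHTML_NS))
    return title.text if title is not None else None


if __name__ == "__main__":
    print(page_title(PAGE))
    for href, text in extract_links(PAGE):
        print(href, text)
```

The catch is the "guarantee" part: ordinary tag-soup HTML (an unclosed <p>, for instance) makes the parser raise an error instead of returning a tree, which is where a cleanup tool such as JTidy comes in.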