I ported JTidy to J# in Beta2 days:
http://www.thundermain.com/code/JSharpTidy.aspx

I attempted to port it to C# but eventually gave up. Perhaps someone with
VJ6 and JUMP might be able to easily port it.


Richard

> -----Original Message-----
> From: Moderated discussion of advanced .NET topics.
> [mailto:[EMAIL PROTECTED]]On Behalf Of Craig Andera
> Sent: 31 May 2002 14:37
> To: [EMAIL PROTECTED]
> Subject: Re: [ADVANCED-DOTNET] Web Page Content Parsing
>
>
> > Are there any better approaches to meet the requirement - may
> > be something like re-usable HTML parsers or Web page content
> > parsers already available or pre-defined classes/libraries in .Net?
>
> Well, there's nothing built-in, so you have a few choices. This is the
> order I would do them in.
>
> 1) If you can guarantee that the resulting HTML is XHTML compliant, you
> can use an XML parser to process it.
> 2) You can try to find some reusable code that someone else already
> wrote. I believe someone was going to try to do this as part of Ghengis:
> http://www.sellsbrothers.com/genghis.
> 3) You can use COM interop to muck around with getting Internet Explorer
> to do the parsing for you.
>
> You can read messages from the Advanced DOTNET archive,
> unsubscribe from Advanced DOTNET, or
> subscribe to other DevelopMentor lists at http://discuss.develop.com.

You can read messages from the Advanced DOTNET archive, unsubscribe from Advanced 
DOTNET, or
subscribe to other DevelopMentor lists at http://discuss.develop.com.

Reply via email to