I know there is one open source solution, because one of my old bosses
(Kimbo Mundy) wrote it.  I don't know where it is, it seems to have
disappeared.  Here is another one I found

http://www.printerport.com/klephacks/markdown.html

Cheers

-----Original Message-----
From: Scott Stirling [mailto:[EMAIL PROTECTED]]
Sent: Monday, February 05, 2001 5:35 PM
To: JRun-Talk
Subject: RE: Strip HTML


This is the kind of use case PERL is perfect for, but Java is not.

If you're getting arbitrary HTML of any length, this could be real tough.
Do you have any way of predicting what you'll be getting?  Will it be
well-formed HTML?  How long will the HTML be?

Maybe the regexp or ORO packages for Java at http://jakarta.apache.org have
something that could help.

Scott Stirling
Allaire Corporation
http://www.allaire.com/developer/jrunreferencedesk/

> -----Original Message-----
> From: Ralph Fiol [mailto:[EMAIL PROTECTED]]
> Sent: Monday, February 05, 2001 4:43 PM
> To: JRun-Talk
> Subject: Strip HTML
> 
> 
> Does anyone know how I can strip all HTML from a string?  
> That is, I want to
> hit a URL using HttpURLConnection (etc.), then remove any 
> HTML return by
> getInputStream().  Any ideas?
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
Structure your ColdFusion code with Fusebox. Get the official book at 
http://www.fusionauthority.com/bkinfo.cfm
Archives: http://www.mail-archive.com/[email protected]/
Unsubscribe: http://www.houseoffusion.com/index.cfm?sidebar=lists

Reply via email to