On Mon, Dec 14, 2009 at 6:43 PM, Ashley Sheridan
a...@ashleysheridan.co.uk wrote:
I'm looking for a way to strip HTML tags out of some text content
(sourced from a web page) to leave just the text which I'll be running
some basic analysis on. The thing is, I want to preserve text that is in
I've had quite some luck using the html2text class by Jon Abernathy
http://www.chuggnutt.com/html2text.php
It's targetted to php 4, and rather old code - but it does the job for me.
Where the 'job for me' is converting html to text for when I'm sending out
emails in HTML format and want to
On Tue, Dec 15, 2009 at 6:44 AM, Wouter van Vliet / Interpotential
pub...@interpotential.com wrote:
And if that doesn't suit your needs - you might want to take a look at this:
http://sourceforge.net/projects/simplehtmldom/
+1
I've never used the html2text library, but simplehtmldom is very
I'm looking for a way to strip HTML tags out of some text content
(sourced from a web page) to leave just the text which I'll be running
some basic analysis on. The thing is, I want to preserve text that is in
alt and title attributes. I can't use any DOM functions, as I can't
guarantee that the
4 matches
Mail list logo