Well you need to be able to identify something in the HTML markup that
clearly and consistently defines where the text you want starts and stops.

First, you get the markup of the entire page. Then you extract the portion
you want from between the delimiters you define. This might be done using a
regular _expression_ match.

I wrote a custom tag to make this easier some years ago. Some info here:
http://www.electricsheep.co.nz/toolbox/scrape.cfm although the example
doesn't work any more.

-----Original Message-----
From: brobborb [mailto:[EMAIL PROTECTED]
Sent: Friday, 20 February 2004 9:49 a.m.
To: CF-Talk
Subject: Re: Extracting Text From web page

Just the text.  no HTML stuff :)
  ----- Original Message -----
  From: Matthew Walker
  To: CF-Talk
  Sent: Thursday, February 19, 2004 3:42 PM
  Subject: RE: Extracting Text From web page

  When you say text do you mean all the HTML markup of the page or do you
mean
  only the content, without the navigation elements etc.?

  -----Original Message-----
  From: brobborb [mailto:[EMAIL PROTECTED]
  Sent: Friday, 20 February 2004 9:36 a.m.
  To: CF-Talk
  Subject: Extracting Text From web page

  Hey guys I'm kinda a newbie at CFHTTP n stuff.  How do I extract the text
  from a web page and put it into a variable? (using a URL)  like let's say
I
  want to get the text from www.example.com/hehe/index.html

  Thanks!

    _____

  _____
[Todays Threads] [This Message] [Subscription] [Fast Unsubscribe] [User Settings]

Reply via email to