I always code by hand and my editor of choice is NoteTab Pro
(www.notetab.com). As well as Find and Replace it has a neat function which
strips (X)HTML out of markup leaving just text and script excerpts etc.
Having redesigned a couple of sites in this way, I have yet to trip it up.
Bryan Davis
Lea de Groot wrote:
What are people's preferred techniques for 'screen scraping' existing
sites to get the text from a tag-soup table layout?
When a page has copious links and such, simply copying the text from
the browser doesn't always give enough content to be a useful quick
method.
I do the
Hi Lea,
What are people's preferred techniques for 'screen scraping' existing
sites to get the text from a tag-soup table layout?
$ lynx --dump url
works wonders if you have easy access to lynx (and the site that you're
scraping doesn't have too horrible a structure :).
Cheers,
Andrew
On Thu, 22 Jul 2004 09:18:35 +1000, Lachlan Hardy wrote:
I do the same as Patrick. The regular expressions in DWMX 2004 are
really powerful - once you work out what you're doing.
Yes, once someone said 'regexp' i went Doh! and got on with the job :)
What can I say, except I have a headcold?
document of 500 lines to about 50 or fewer with no change
in the rendered content!
Cheers
Mike Kear
Windsor, NSW, Australia
AFP Webworks
http://afpwebworks.com
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On
Behalf Of Lea de Groot
Subject: Re: [WSG] technique