Re: [WSG] technique of converting to tablefree layout

2004-07-22 Thread Bryan Davis
I always code by hand and my editor of choice is NoteTab Pro
(www.notetab.com). As well as Find and Replace it has a neat function which
strips (X)HTML out of markup leaving just text and script excerpts etc.
Having redesigned a couple of sites in this way, I have yet to trip it up.

Bryan Davis

Micheal Kear wrote:
 I've mostly used the good old MkI delete key - the most-used key on my
 keyboard.  When I started renovating web sites, and using word docs and
 FrontPage sites, I tried using automated methods - search and replace and
 the like - and found there was always something left.  A single b or a
i
 somewhere that affected half the remaining page.  Or a font tag that
 didn't exactly match the search criteria so it would be left.  Or a table
 that I really did want kept would be deleted.These fixes I found would
 often take just as long as going through the page and deleting stuff in
the
 first place.


*
The discussion list for http://webstandardsgroup.org/
See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list  getting help
* 



Re: [WSG] technique of converting to tablefree layout

2004-07-21 Thread Lachlan Hardy
Lea de Groot wrote:
What are people's preferred techniques for 'screen scraping' existing 
sites to get the text from a tag-soup table layout?
When a page has copious links and such, simply copying the text from 
the browser doesn't always give enough content to be a useful quick 
method.
I do the same as Patrick. The regular expressions in DWMX 2004 are 
really powerful - once you work out what you're doing. The Find  
Replace is the sole reason I use DW over any other text editor - given 
that I handcode only

Cheers,
Lachlan
*
The discussion list for http://webstandardsgroup.org/
See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list  getting help
* 



Re: [WSG] technique of converting to tablefree layout

2004-07-21 Thread Andrew Sione Taumoefolau
Hi Lea,

 What are people's preferred techniques for 'screen scraping' existing 
 sites to get the text from a tag-soup table layout?

$ lynx --dump url

works wonders if you have easy access to lynx (and the site that you're
scraping doesn't have too horrible a structure :).

Cheers,

Andrew

*
The discussion list for http://webstandardsgroup.org/
See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list  getting help
* 



Re: [WSG] technique of converting to tablefree layout

2004-07-21 Thread Lea de Groot
On Thu, 22 Jul 2004 09:18:35 +1000, Lachlan Hardy wrote:
 I do the same as Patrick. The regular expressions in DWMX 2004 are 
 really powerful - once you work out what you're doing. 

Yes, once someone said 'regexp' i went Doh! and got on with the job :)
What can I say, except I have a headcold? 
:)

Thanks, all
Lea
-- 
Lea de Groot
Elysian Systems - I Understand the Internet http://elysiansystems.com/
Web Design, Usability, Information Architecture, Search Engine 
Optimisation
Brisbane, Australia
*
The discussion list for http://webstandardsgroup.org/
See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list  getting help
* 



RE: [WSG] technique of converting to tablefree layout

2004-07-21 Thread Michael Kear
I've mostly used the good old MkI delete key - the most-used key on my
keyboard.  When I started renovating web sites, and using word docs and
FrontPage sites, I tried using automated methods - search and replace and
the like - and found there was always something left.  A single b or a i
somewhere that affected half the remaining page.  Or a font tag that
didn't exactly match the search criteria so it would be left.  Or a table
that I really did want kept would be deleted.These fixes I found would
often take just as long as going through the page and deleting stuff in the
first place.

Another way is to select/copy from the rendered page in a browser.  That way
you only pick up the content if you get the select right, and you don't pick
up all the associated table structure.

Dreamweaver has powerful search and replace functions.  For example, you can
have it delete all font tags, regardless of the attributes, or all span
tags.   And with a single click you can convert the file to XHTML.  It'll go
through the file closing off tags, fixing case, adding quotes to attributes
etc.   

And Dreamweaver's Word Clean-up function is magic.   Watch it reduce a
simple word html document of 500 lines to about 50 or fewer with no change
in the rendered content! 

Cheers
Mike Kear
Windsor, NSW, Australia
AFP Webworks
http://afpwebworks.com


-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] On
Behalf Of Lea de Groot
Subject: Re: [WSG] technique of converting to tablefree layout
[snip]
Yes, once someone said 'regexp' i went Doh! and got on with the job :)
What can I say, except I have a headcold? 
:)

Thanks, all
Lea
-- 
Lea de Groot


*
The discussion list for http://webstandardsgroup.org/
See http://webstandardsgroup.org/mail/guidelines.cfm
for some hints on posting to the list  getting help
*