> Date: Wed, 08 Jun 2016 12:19:02 +0100
> From: [email protected] (akhiezer)
> Subject: Re: [website] r1350 - in html/trunk: . common
>
> > From: Bruce Dubbs <[email protected]>
> > Date: Tue, 7 Jun 2016 14:47:20 -0500
> > Subject: Re: [website] r1350 - in html/trunk: . common
> >
> > akhiezer wrote:
> > >> From: [email protected]
> > >> Date: Tue, 07 Jun 2016 17:43:23 -0000
> > >> Subject: [website] r1350 - in html/trunk: . common
> > >>
> > >   .
> > >   .
> > >> Modified: html/trunk/common/lfsftp.html
> > >   .
> > >   .
> > >> +      <li><a 
> > >> href="http://mirror.jaleco.com/lfs/pub/";>http://mirror.jaleco.com/lfs/pub/</a>
> > >> +          (Washington, DC, USA, 1 Gbps)</li>
> > >   .
> > >   .
> > >>
> > >> Modified: html/trunk/mirrors.html
> > >   .
> > >   .
> > >> +        <li><a href="http://lfs.mirror.jaleco.com/";>Washinton, DC, USA 
> > >> [1 Gbps]</a></li>
> > >
> > >
> > > Different format on different pages, but for essentially same-format
> > > info; any partic reason?
> >
> > Inertia
> >
>
>
>  - a suggested patch, per below.
        .
        .
> cat ./lfs-website.old/html/trunk/mirrors.html | \
>  tr '\n' '\0' | \
>  sed -rue 's@\x00([[:blank:]]*<(h1>Select A Mirror</h1|div 
> id="footer")>)@\n\1@g' | \
>  sed -rue '\|^[[:blank:]]*<h1>Select A Mirror</h1>|{
>   s|([[:blank:]]*)(<li><a 
> href=")([^"]*)(">)([^<]*)(</a>)(</li>)|\1\2\3\4\3\6\n\1    (\5)\7|g ;
>  }' | \


 - meant to add that the sed regexes can of course handle whitespace
more-rigourously (or indeed to flag warnings/errors) by adding some
'[[:blank:]]*'s (without the quotes); mainly re the html '<' & '>',
but also at the 'div id', 'Select A Mirror', &c; &usw.


>  tr '\0' '\n' \
>  > ./lfs-website.new/html/trunk/mirrors.html \
>  2> ./lfs-website.new_html_trunk_mirrors.html.e ;
        .
        .
> diff -Naur ./lfs-website.old//html/trunk/mirrors.html 
> ./lfs-website.new//html/trunk/mirrors.html

 
Also as a further cross-check, one can (also of course) diff text-dump
output from links/lynx.



rgds,
akh





--
-- 
http://lists.linuxfromscratch.org/listinfo/website
FAQ: http://www.linuxfromscratch.org/blfs/faq.html
Unsubscribe: See the above information page

Reply via email to