On Fri, Apr 24, 2009 at 11:35:57AM -0600, Clay Baenziger wrote:
Hi,

>       I think a quick and dirty way to scrape is:
> webRoot="http://mail.opensolaris.org/pipermail/caiman-discuss/2009-April"; \
> wget -O - ${webRoot}/thread.html 2>/dev/null| \
> grep Flag.Day| \
> sed "s#HREF=\"#HREF=\"${webRoot}/#"

Ouch! Don't do that. As we know, once such a bad hack is in place,
nobody will touch it again for a long time. And this is one reason why
people can't find the information they need: often these links point to
Sun-internal sites (and foster the feeling that the project is still a
Sun-internal thing) or point to the wrong place. E.g. have a look at
http://opensolaris.org/os/community/on/flag-days/111-115/ : almost every
link to a PSARC case results in a "Resource Not Found". I'm not sure how
open an open source project is when you can't find the
meat/specs/proposals or whatever the PSARCs are meant to be.

Anyway, if you want to keep people disappointed and away from *Solaris
and keep the "community" as small as possible, go ahead ...

Regards,
jel.
-- 
Otto-von-Guericke University     http://www.cs.uni-magdeburg.de/
Department of Computer Science   Geb. 29 R 027, Universitaetsplatz 2
39106 Magdeburg, Germany         Tel: +49 391 67 12768
