On Fri, Apr 24, 2009 at 11:35:57AM -0600, Clay Baenziger wrote:

Hi,

> I think a quick and dirty way to scrape is:
> webRoot="http://mail.opensolaris.org/pipermail/caiman-discuss/2009-April"; \
> wget -O - ${webRoot}/thread.html 2>/dev/null| \
> grep Flag.Day| \
> sed "s#HREF=\"#HREF=\"${webRoot}#"

Ouch! Don't do that. As we know, once such a bad hack is in place, nobody
will touch it again for a long time. And this is one reason why people
can't find the information they need: often these links point to
Sun-internal sites (reinforcing the feeling that the project is still a
Sun-internal thing) or point to the wrong place. E.g. have a look at
http://opensolaris.org/os/community/on/flag-days/111-115/ : almost every
link to a PSARC results in a "Resource Not Found".

I'm not sure how open an open source project is when you can't find the
meat/specs/proposals, or whatever PSARCs are meant to be.

Anyway, if you want to keep people disappointed and away from *Solaris
and keep the "community" as small as possible, go ahead ...

Regards,
jel.
-- 
Otto-von-Guericke University     http://www.cs.uni-magdeburg.de/
Department of Computer Science   Geb. 29 R 027, Universitaetsplatz 2
39106 Magdeburg, Germany         Tel: +49 391 67 12768