On Fri, Oct 04, 2002 at 07:49:35AM +1000, Matthew Fallshaw wrote: > G'day All, > > I'm trying to pluck Slashdot, but to keep the size of the pluck down, I > don't want to pick up the comments pages. > The Slashdot standard pages match the pattern: > http://slashdot\.org/palm/articles/.* > Comments match the pattern: > http://slashdot\.org/palm/articles/.*comments.shtml > How do I get Plucker to pluck the articles but not the comments?
I wrote a tool called SlashPluck that might do what you want, if you're comfortable with a little bit of tweaking. SlashPluck is a perl script that preprocesses slashdot and formats it in a friendly way for plucker. There is a global variable near the top of the script called $threshold that sets the minimum comment threshold to fetch. Set this value to 6 or higher and you won't get any comments. You can download it here: http://jasonday.home.att.net/code/slashpluck/slashpluck.html Jason -- Jason Day jasonday at http://jasonday.home.att.net worldnet dot att dot net "Of course I'm paranoid, everyone is trying to kill me." -- Weyoun-6, Star Trek: Deep Space 9 _______________________________________________ plucker-list mailing list [EMAIL PROTECTED] http://lists.rubberchicken.org/mailman/listinfo/plucker-list

