On Fri, Oct 04, 2002 at 07:49:35AM +1000, Matthew Fallshaw wrote:
> G'day All,
> 
> I'm trying to pluck Slashdot, but to keep the size of the pluck down, I
> don't want to pick up the comments pages.
> The Slashdot standard pages match the pattern:
> http://slashdot\.org/palm/articles/.*
> Comments match the pattern:
> http://slashdot\.org/palm/articles/.*comments.shtml
> How do I get Plucker to pluck the articles but not the comments?

I wrote a tool called SlashPluck that might do what you want, if you're
comfortable with a little bit of tweaking.  SlashPluck is a perl script that
preprocesses slashdot and formats it in a friendly way for plucker.  There
is a global variable near the top of the script called $threshold that
sets the minimum comment threshold to fetch.  Set this value to 6 or higher
and you won't get any comments.

You can download it here:
  http://jasonday.home.att.net/code/slashpluck/slashpluck.html

Jason
-- 
Jason Day                                       jasonday at
http://jasonday.home.att.net                    worldnet dot att dot net
 
"Of course I'm paranoid, everyone is trying to kill me."
    -- Weyoun-6, Star Trek: Deep Space 9
_______________________________________________
plucker-list mailing list
[EMAIL PROTECTED]
http://lists.rubberchicken.org/mailman/listinfo/plucker-list

Reply via email to