Just joined this list and I'm nost sure if this has been
addressed. The site file for Slashdot in the current
sitescooper doesn't work properly. Here's an updated
"slashdot.site" which does work. Basically the
URL for the "hof" link on the page seems to have changed.
---8<---
# Slashdot.site -- now including comments scored 3 or higher.
# TODO: strip out the so-called "funny" comments ;)
#
# Kornelis Sietsma <korny /at/ sietsma.com>: comments support
# jm: fixed again to use light mode throughout
URL: http://slashdot.org/index.pl?light=1&noboxes=1&noicons=1
Name: SlashDot
Levels: 2
ContentsStart: <A HREF="//slashdot.org/hof.shtml">hof</A>
ContentsEnd: <P><P>\[ <FONT size=2><B>
StoryURL: http://slashdot.org/article.*
StoryStart: <A HREF="//slashdot.org/hof.shtml">hof</A>
StoryEnd: <P>\[ <FONT size=2><B>
# strip out the "login" and "related links" tables, they're irrelevant offline!
# added Feb 2 2000 jm
#
StoryHTMLPreProcess: {
s,<H3>Slashdot Login</H3>.*?<B>The Fine Print:</B>,</FONT></TD></TR></TABLE>,s;
}
# Because slashdot has so many links allowing views of stories with different
# comment levels, formats, etc., we need a way to fix or block them here.
# Unfortunately it's a bit tricky so we need to use perl code. We could just
# ignore the comments, but I guess that's missing the point of slashdot ;)
# added May 18 2000 jm
#
URLProcess: {
# fix the URL; trim out all comment settings and use our own.
s{^(http://slashdot.org/article.pl\?sid=\d+/\d+/\d+/\d+).*}
{$1\&light=1\&noboxes=1\&noicons=1\&mode=nested\&threshold=3}g;
if (!m,^http://slashdot.org/index.pl.light=1\&noboxes=1\&noicons=1,
&& !/mode=nested\&threshold=3/)
{
undef $_; # has to include these two; block it if it does not
}
}
# skip URLs that have been archived
StorySkipURL: http://slashdot.org/interviews/\d+/\d+/\d+/\d+.shtml
StoryHeadline: <TITLE>Slashdot \| (.*?)</TITLE>
---8<---
--
Caleb Epstein | bklyn . org | unfair competition, n.:
cae at | Brooklyn Dust | Selling cheaper than we do.
bklyn dot org | Bunny Mfg. |
_______________________________________________
Sitescooper-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/sitescooper-talk