On Sun, 28 Oct 2001, Tim Kynerd wrote:

> Hello,
>
> I'm new to the list.  I just downloaded sitescooper and started using it
> today, and there are a couple of issues I wanted to raise.
>
> - Neither the default site (www.sitescooper.org) nor the European mirror
> site would allow me to access the /doc subdirectory ("Forbidden - You don't
> have access to /doc/ on this server.").  The USA mirror did, though.  (I did
> get the docs with the distribution, but I was at work earlier today and
> wanted to get the info on how to write site files from the Web site and
> print it out.)
> - I've written my own site file for the Washington Post, which I'm trying to
> test.  Having made some changes to it, I wanted to test the changes, so I
> ran sitescooper with the -fullrefresh option.  But it reads the initial page
> (this is a 3-level site) and stops, saying, "Washington Post: no new
> stories, ignoring."  WHY, when I use the -fullrefresh option, would it do
> this?  ARRRRGH!
>
> Any help would be appreciated.

Sorry to reply to myself, but I wanted to provide an update.

The first problem, with the Web site, remains AFAIK; I haven't been on the
Web site since Saturday, so I can't say for sure.

The second problem turned out to be initially a problem with my site file --
my regexps weren't getting matched (duh).  But even after I fixed them, I
wasn't picking up as much content as I would like.  I *think* this is
because the links I want scooped are in a table.  Will setting
"ContentsUseTableSmarts" to 0 solve this?  For the time being, I solved it
by copying the HTML page to my hard drive and editing it, then scooping that
file; this works beautifully.  But I was planning to contribute this site
file once I got it working, and I suppose I can't do so as long as it's
dependent on another file for the scooping to work -- or?

-- Tim Kynerd

Sunrise in Stockholm today:  8:00
Sunset in Stockholm today:  17:03
My rail transit photos at http://www.kynerd.nu


_______________________________________________
Sitescooper-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/sitescooper-talk

Reply via email to