> I've been trying to create a site file for the Trekweb site (Headlines
> section). I can get the main page fine, but it doesn't seem to want to
> follow the links to the actual articles. I've been playing with the
> StoryURL parameters to limit the scoop to follow links for just the
> articles, but it just gets the front page.
I didn't have any that matched trekweb or any other Star Trek
enthusiast sites, but these might help along your interests:
<PODS>
<SITE TITLE="SCIFI.COM delivered directly to your handheld
device! Includes news, schedules, and other SCI FI facts and
information." DOMAIN="scifi.com" PROTECTED="no"
ROOTPAGE="index.html" PROTOCOL="http">
</SITE>
<SERVER>
<ADDRESS>208.243.116.240</ADDRESS>
<TYPE>apache</TYPE>
<VERSION>1.3.6</VERSION>
</SERVER>
<URI>
http://www.scifi.com/handheld/
</URI>
</PODS>
<PODS>
<SITE TITLE="Star Trek Episode Guide" DOMAIN="tiler.com"
PROTECTED="no" ROOTPAGE="default.htm" PROTOCOL="http">
</SITE>
<SERVER>
<ADDRESS>209.61.156.43</ADDRESS>
<TYPE>IIS</TYPE>
<VERSION>5.0</VERSION>
</SERVER>
<URI>
http://tiler.com/StarTrek/pda/default.htm
</URI>
</PODS>
Yes, as you can see, I'm trying to XML'ize the PODS content. I'm
not quite done with the schema yet. Still trying to figure out how to best
layout the elements vs. attributes.
startrek.com has a wireless section, which I'm trying to get my
way into now, just have to figure out their pagename.
http://www.startrek.com/wireless/
I'm still trying to pick up sitescooper's syntax, to figure out
how best to nail down these .site files. Once that's done, I can probably
give this a whack and help you out.
> URL: http://trekweb.com/Headlines/ <http://trekweb.com/Headlines/>
> Name: TrekWeb
> Levels: 2
> ContentsStart: <p><b>H</b>
> ContentsEnd: <!---RIGHT
> StoryURL: http://talk.trekweb.com/articles/\d+/\d+/\d+/\d+.html
> <http://talk.trekweb.com/articles/\d+/\d+/\d+/\d+.html>
> StoryCacheable: 0
> ContentsDiff: 1
/d
_______________________________________________
Sitescooper-talk mailing list
[EMAIL PROTECTED]
http://lists.sourceforge.net/lists/listinfo/sitescooper-talk