erskine, michael said:

> I have a better location for UK TV listings (http://www.ananova.com/) which
> includes program descriptions and some perl to extract the data (using
> HTML::TableExtract) and invoke iSiloWeb to convert it. I have been using
> this successfully for a while without knowledge of sitescooper so I was
> unaware of the duplication of effort. I intend to create a .site file for
> sitescooper but this may take some time as I haven't had chance to read the
> docs yet. In the meantime, UK users may find this useful. I am sending the
> script as an attachment but in the case that it is lost in the mailing list
> mechanics, the source is available for viewing from
> http://unformat.port5.com/afaik/tvlistings.html and for downloading from
> http://unformat.port5.com/afaik/tvlistings.pl .

Hi Michael,

That's pretty cool.  I agree btw, the Ananova listings are the most
useful UK/Ireland listings on the web, as far as I can see.

> I hope that this is of use to some of you -- probably those of you in the
> UK!

And Ireland ;)

BTW, if I ever get hold of one of the TiVo-style PCS SnapStream video
decoder cards, imho a web-scraper of www.ananova.com will be the first
thing I'll write, so I can hook in listings.

This also ties in with a cool web-scraping site I encountered recently:
http://www.scrml.org/ . Well worth a look.  These guys are basically
writing scraping code a la sitescooper, but it also descends into the page
to scrape out specific elements (e.g. "time", "date", "channel", "program
name", "program description") and marks 'em up as XML, for export to any
app: to a handheld, to the web, to a TV-recording app, etc. etc.  A *very*
interesting idea.
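
As a rough illustration of that idea (this is not SCRML's actual format or
code, just a minimal sketch), here is how a listings table could be scraped
into named elements and marked up as XML. The sample HTML, the FIELDS names,
the RowCollector class and the listings_to_xml function are all made up for
the example; the real Ananova pages will certainly be laid out differently.

```python
from html.parser import HTMLParser
import xml.etree.ElementTree as ET

# Hypothetical listings markup, invented for this sketch.
SAMPLE_HTML = """
<table>
<tr><th>Time</th><th>Channel</th><th>Programme</th><th>Description</th></tr>
<tr><td>19:00</td><td>BBC1</td><td>Example News</td><td>Made-up headline round-up.</td></tr>
<tr><td>19:30</td><td>ITV</td><td>Example Quiz</td><td>Made-up quiz show.</td></tr>
</table>
"""

# Assumed column order of the table above; these names are illustrative.
FIELDS = ["time", "channel", "name", "description"]


class RowCollector(HTMLParser):
    """Collect the text of every <td> cell, grouped by <tr> row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None   # cells of the row being read, or None
        self._cell = None  # text chunks of the cell being read, or None

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag == "td" and self._row is not None:
            self._cell = []

    def handle_data(self, data):
        if self._cell is not None:
            self._cell.append(data)

    def handle_endtag(self, tag):
        if tag == "td" and self._cell is not None:
            self._row.append("".join(self._cell).strip())
            self._cell = None
        elif tag == "tr" and self._row is not None:
            # Header rows contain only <th> cells, so they stay empty
            # and are skipped here.
            if self._row:
                self.rows.append(self._row)
            self._row = None


def listings_to_xml(html):
    """Return a <listings> element with one <program> child per table row."""
    parser = RowCollector()
    parser.feed(html)
    parser.close()
    root = ET.Element("listings")
    for row in parser.rows:
        program = ET.SubElement(root, "program")
        for field, value in zip(FIELDS, row):
            ET.SubElement(program, field).text = value
    return root


if __name__ == "__main__":
    print(ET.tostring(listings_to_xml(SAMPLE_HTML), encoding="unicode"))
```

Once the data is in that shape, any consumer (a handheld converter, a web
page, a recording scheduler) only needs to read the XML and never has to
touch the original HTML.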

I've been chatting to them, and it could be a useful future direction for
sitescooper to support...

--j.

_______________________________________________
Sitescooper-talk mailing list
[EMAIL PROTECTED]
http://lists.sourceforge.net/lists/listinfo/sitescooper-talk
