I wonder if Yahoo might be a better target for scraping.
http://tv.yahoo.com/listings

On 7/6/07, Lan Barnes <[EMAIL PROTECTED]> wrote:
OK, Tcl'ers, who wants to have some FUN! There is a MythTV project crying
out for Tcl.

BACKGROUND

MythTV is the OSS Linux-based home brew Tivo. It rocks. Historically Myth
got its TV listings for the data base by screen scraping the zap2it site
(http://www.zap2it.com/). zap2it got pissed because of all the hits by
bots and played cat-n-mouse, changing their interface almost weekly.
Finally zap2it caved and created zap2it labs that allowed Myth people to
register, fill out a survey, and download XML listings for their
region/service. It has been free.

Now zap2it has announced that they're pulling that service in September.
No offer (yet) to make it a paid service, just bye bye. The Myth users
mailing list, which is exceptionally vapid at best, has been an avalanche
of pointless hand-wringing over this. "Can't we DO something?!"

THE PROJECT

Yes, we can! We can write a more flexible screen scraper for zap2it, a
better cat to play with their mouse.

I'm picturing something done with Expect and the Tcl HTML add-in. Config
files for the user's home data (zip code, channel selection, cable or
dish, etc). And the output should be the same XML that zap2it labs has
been providing.

And flexible flexible flexible.

Anyone wanna play (and, yes, Mr. Penix, I'm talkin' to _you_)?

--
Lan Barnes

SCM Analyst              Linux Guy
Tcl/Tk Enthusiast        Biodiesel Brewer





--
[email protected]
http://www.kernel-panic.org/cgi-bin/mailman/listinfo/kplug-lpsg


--
[email protected]
http://www.kernel-panic.org/cgi-bin/mailman/listinfo/kplug-lpsg

Reply via email to