I have a hunch that could be what NPR is doing, unfortunately.  I like
to listen to Morning Edition later in the day sometimes, and there's no
podcast for that.

I was wondering how hard it would be to scrape (<UGH!>) the download
links on the Rundown page?  From a scraping perspective, it looks pretty
easy, but putting the logic into a plugin might take some more work.

http://www.npr.org/templates/rundowns/rundown.php?prgId=3

The Rundown page already has the MP3 direct link, it has the date
navigation to go back and forth in time (a few months, it looks like). 
Since they're using templates (yeah!), a generic scraper could be
created.  I'm up for writing the scraping logic, but I'm so out of touch
on plugins, I wouldn't know where to start.

.... brain spinning .... 

Actually, it looks like you have that already covered with your Parsers
directory.  The source file is pretty well blocked out - 
you look for the 2nd "<!-- END CLASS="STORYCONTENT" -->"
Then look for the next <input> field and parse the value ("<input
type="hidden" id="title152653944" value="Presbyterians Have Varied Views
On Gay Marriage"></input>"
then you look for the next download link, and grab the URL ("<li><a
class="download"
href="http://pd.npr.org/anon.npr-mp3/npr/me/2012/05/20120514_me_01.mp3?dl=1";><span>Download</span></a></li>")

Any tips on how I could try and implement this?

Thanks!
-Dan


------------------------------------------------------------------------
plympton's Profile: http://forums.slimdevices.com/member.php?userid=12955
View this thread: http://forums.slimdevices.com/showthread.php?t=95118

_______________________________________________
plugins mailing list
[email protected]
http://lists.slimdevices.com/mailman/listinfo/plugins

Reply via email to