Re: get_iplayer repair update #1

2014-11-04 Thread Alan Foster
Thank you for your hard work on get_iplayer.

Alan.

On 1 November 2014 00:45, dinkypumpkin dinkypump...@gmail.com wrote:
 get_iplayer has been more or less repaired, but there are still some wounds.
 I'm going to release what I have on Sunday.  I'm on the road next week, so
 I've run out of time to do more for the time being. Consider it a stopgap
 until progress can be made on other fronts. This is where things are:

 1. I've disabled code related to the discontinued feeds, so you shouldn't
 get any more bogus values in your metadata tags.  You should also see
 thumbnails again in files  7 days old downloaded via PID.

 2. The new release will support entry of multiple PIDs.

 3. I've more or less restored the 7 day cache for TV and radio.  There are
 still some holes in it:

 a. It is not possible to search for audiodescribed versions of programmes.
 I haven't been able to source that information.  If anyone has any clues on
 the subject, chime in - but not if your suggestion is to scrape the iPlayer
 site.  That isn't on the table right just yet.

 You can still download audiodescribed versions, but you'll have to look for
 them on the iPlayer site.  Signed versions should still be flagged in the
 get_iplayer cache, but some may be missing.  Again, check the iPlayer site
 if in doubt.

 I've changed get_iplayer to always scrape the related episode page to look
 for audiodescribed/signed versions when requested, so hopefully more
 downloads will be successful.  I found a number of cases where the playlist
 data for recent programmes didn't contain identifiers for audiodescribed
 versions even though they existed on the iPlayer site.

 b. It is not possible to search radio programmes by category. TV programmes
 still have category information. There is a source for radio category
 information, but it uniformly foundered on Radio 4 and Radio 4 Extra, which
 is where the categories are most meaningful.  I know that is going to break
 some PVR searches, but the alternative is a support headache I can't absorb.

 c. I can't vouch that every programme from the previous 7 days will show up
 in the cache. As always, you can use the PID for any programme not in the
 cache. By the same token, I can't vouch that every programme in the cache
 will be downloadable.  The new feeds contain noticeably more programmes,
 some due to the inclusion of web-only stuff. With the heavier load, cache
 refreshes are noticeably slower than with the old feeds, ca. 90 seconds for
 me for tv+radio.

 2. The more-or-less restored cache depends on some old data feeds lingering
 at the BBC.  Recent events have taught us that they could disappear without
 warning, so I've implemented a fallback mechanism. There will be a new
 option that will switch the cache to refresh from the channel schedule pages
 instead of the old data feeds.  However, this fallback is also limited:

 a. It is not possible to search for audiodescribed or signed versions of
 programmes.  That information isn't in the schedule pages.

 b. It is not possible to search TV or radio programmes by category. Again,
 that information isn't in the schedule pages.

 c. Cache refresh is slow, ca. 4+ minutes for a full TV and radio refresh for
 me.  The time could be cut by about 1/3 by removing regional TV channel
 variations, but it cuts out 50+ programmes, so I've left them in for the
 present.

 d. It appears that fewer programmes from the previous 7 days get cached
 compared to the feeds.  Part of that is because the schedule pages don't
 show most web-only programmes.  Part of it may also be because I'm checking
 availability info in the schedule pages more strictly than whatever produces
 the data feeds.  Again, you can use the PID for anything not in the cache.

 e. The only plus to using the schedule pages to populate the cache is that
 it becomes possible to expand your cache out to 30 days.  It seems to work
 OK, if you have 10-15 minutes to refresh your cache.  There will be an
 option for this.

 f. I've given you enough rope to hang yourself, but don't put this fallback
 option into regular use unless it becomes necessary - seriously.  It's only
 there to avoid weeks like this one.  I won't be interested in hearing how
 slow it is or how it doesn't locate some particular programme.  And for
 pete's sake *don't* use it with the Web PVR.  If you insist on playing
 around with it, you'll probably want to bump up --expiry to some gigantic
 number and refresh your cache manually as needed.

 3. Looking further ahead

 Some things that have been floated here in the past few days:

 a. Programme data services: If somebody implements something along these
 lines, I'm sure get_iplayer could be integrated with it.  It's clear that
 get_iplayer would never be able to access Nitro if and when it's ever opened
 up.  But, if somebody can repackage Nitro data for wider use, that would be
 pretty useful.

 b. iPlayer site scraping: This could also be the foundation of a 

Re: get_iplayer repair update #1

2014-11-01 Thread Guy Rolls

On 01/11/2014 00:45, dinkypumpkin wrote:
get_iplayer has been more or less repaired, but there are still some 
wounds...
A huge THANK YOU for your previous, and continuing, efforts to keep 
get_iPlayer alive.  It is much appreciated by many people.



___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


Re: get_iplayer repair update #1

2014-11-01 Thread peter
Can I also say a massive thank you to Dinky and others who have worked 
in double quick time to get a fix. I have used get_iplayer for several 
years but only joined the mailing list when disaster seemed to strike 
late last week. The response has been a fantastic demonstration of open 
source community work, and I hope a more permanent solution will be 
possible in the coming weeks (or months). Thank you also to Jon for 
maintaining the ppa for ubuntu users (like me -- I also use get_iplayer 
on the raspberry pi). Reading the posts has made me want to start 
learning perl or python, but I am probably too old (50) and am stuck 
at present learning scratch with my younger son (aged 6).


On Sat, 1 Nov, 2014 at 9:52 PM, Peter S Kirk peter.k...@isauk.biz 
wrote:
On 1 Nov 2014 at 1:52, Jeremy Nicoll - ml get_iplayer Jeremy Nicoll - 
ml 
get_iplayer jn.ml.gti...@wingsandbeaks.org.uk wrote:



 dinkypumpkin dinkypump...@gmail.com wrote:
 
  get_iplayer has been more or less repaired, but there are still 
some

  wounds.  I'm going to release what I have on Sunday
 
 This is excellent news, and I have to say I'm impressed by the 
amount you've

 managed to do in such a short time.
 
 (I realise that we may have been lucky with the time of year; I 
should think
 that it's a bad time for any sort of pumpkin to be out of the ... 
house? 
 shed?  field?  ... wherever a pumpkin normally feels at home.  I've 
seen
 some dreadful sights in the last 12 hours - pumpkins being carried 
around by

 witches etc...)


Dinky,

As Jeremy says, a huge thank you for the effort you and others are 
putting 
into work arounds for the BBC changes. Very much appreciated.


I too am pleased you survived another Halloween uncut :)

Jeremy, well done - very amusing and made me laugh :)

Cheers,

Peter

PS Ignore previous copy.com referal link for bonus - it no longer 
works as 
copy have implemented a max five referals policy :( 


I have other referal links which will give the 5GB bonus - PM me.

Sorry if you have joined and not received bonus 5GB,  with details 
and I 
will make a complaint as I was not informed of this change.


___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer



___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


get_iplayer repair update #1

2014-10-31 Thread dinkypumpkin
get_iplayer has been more or less repaired, but there are still some 
wounds.  I'm going to release what I have on Sunday.  I'm on the road 
next week, so I've run out of time to do more for the time being. 
Consider it a stopgap until progress can be made on other fronts. This 
is where things are:


1. I've disabled code related to the discontinued feeds, so you 
shouldn't get any more bogus values in your metadata tags.  You should 
also see thumbnails again in files  7 days old downloaded via PID.


2. The new release will support entry of multiple PIDs.

3. I've more or less restored the 7 day cache for TV and radio.  There 
are still some holes in it:


a. It is not possible to search for audiodescribed versions of 
programmes.  I haven't been able to source that information.  If anyone 
has any clues on the subject, chime in - but not if your suggestion is 
to scrape the iPlayer site.  That isn't on the table right just yet.


You can still download audiodescribed versions, but you'll have to look 
for them on the iPlayer site.  Signed versions should still be flagged 
in the get_iplayer cache, but some may be missing.  Again, check the 
iPlayer site if in doubt.


I've changed get_iplayer to always scrape the related episode page to 
look for audiodescribed/signed versions when requested, so hopefully 
more downloads will be successful.  I found a number of cases where the 
playlist data for recent programmes didn't contain identifiers for 
audiodescribed versions even though they existed on the iPlayer site.


b. It is not possible to search radio programmes by category. TV 
programmes still have category information. There is a source for radio 
category information, but it uniformly foundered on Radio 4 and Radio 4 
Extra, which is where the categories are most meaningful.  I know that 
is going to break some PVR searches, but the alternative is a support 
headache I can't absorb.


c. I can't vouch that every programme from the previous 7 days will show 
up in the cache. As always, you can use the PID for any programme not in 
the cache. By the same token, I can't vouch that every programme in the 
cache will be downloadable.  The new feeds contain noticeably more 
programmes, some due to the inclusion of web-only stuff. With the 
heavier load, cache refreshes are noticeably slower than with the old 
feeds, ca. 90 seconds for me for tv+radio.


2. The more-or-less restored cache depends on some old data feeds 
lingering at the BBC.  Recent events have taught us that they could 
disappear without warning, so I've implemented a fallback mechanism. 
There will be a new option that will switch the cache to refresh from 
the channel schedule pages instead of the old data feeds.  However, this 
fallback is also limited:


a. It is not possible to search for audiodescribed or signed versions of 
programmes.  That information isn't in the schedule pages.


b. It is not possible to search TV or radio programmes by category. 
Again, that information isn't in the schedule pages.


c. Cache refresh is slow, ca. 4+ minutes for a full TV and radio refresh 
for me.  The time could be cut by about 1/3 by removing regional TV 
channel variations, but it cuts out 50+ programmes, so I've left them in 
for the present.


d. It appears that fewer programmes from the previous 7 days get cached 
compared to the feeds.  Part of that is because the schedule pages don't 
show most web-only programmes.  Part of it may also be because I'm 
checking availability info in the schedule pages more strictly than 
whatever produces the data feeds.  Again, you can use the PID for 
anything not in the cache.


e. The only plus to using the schedule pages to populate the cache is 
that it becomes possible to expand your cache out to 30 days.  It seems 
to work OK, if you have 10-15 minutes to refresh your cache.  There will 
be an option for this.


f. I've given you enough rope to hang yourself, but don't put this 
fallback option into regular use unless it becomes necessary - 
seriously.  It's only there to avoid weeks like this one.  I won't be 
interested in hearing how slow it is or how it doesn't locate some 
particular programme.  And for pete's sake *don't* use it with the Web 
PVR.  If you insist on playing around with it, you'll probably want to 
bump up --expiry to some gigantic number and refresh your cache manually 
as needed.


3. Looking further ahead

Some things that have been floated here in the past few days:

a. Programme data services: If somebody implements something along these 
lines, I'm sure get_iplayer could be integrated with it.  It's clear 
that get_iplayer would never be able to access Nitro if and when it's 
ever opened up.  But, if somebody can repackage Nitro data for wider 
use, that would be pretty useful.


b. iPlayer site scraping: This could also be the foundation of a 
programme data service instead of Nitro.  It is also the only real hope 
for get_iplayer to regain a full-featured desktop 

RE: get_iplayer repair update #1

2014-10-31 Thread David Lake (dlake)
Wireshark MAY tell you that they're using the Nitro API with a defined key

I couldn't possibly comment though.

-Original Message-
From: get_iplayer [mailto:get_iplayer-boun...@lists.infradead.org] On Behalf Of 
Steven Maude
Sent: 31 October 2014 18:08
To: get_iplayer
Subject: Re: get_iplayer repair update #1

On 01/11/2014 00:45, dinkypumpkin wrote:

 Some things that have been floated here in the past few days:
 snip lots

Yet another one, which I'll mention again here as it was lost in the deluge. 
Does anyone know how the BBC's mobile apps work? Are they simply web-based 
frontends or do they use some private API? If the latter, then it might be 
possible to use that to retrieve metadata, with the advantages that it's 
unlikely to have support dropped overnight or be as fragile as scraping.

This post gives some guidance on how to investigate: 
http://timrogers.uk/2014/07/12/discovering-private-apis-with-charles-app/

Anyway, great work; look forward to the update and it's definitely appreciated.

Hope you get chance to relax a bit over the weekend!

___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer

___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


Re: get_iplayer repair update #1

2014-10-31 Thread Jeremy Nicoll - ml get_iplayer
dinkypumpkin dinkypump...@gmail.com wrote:

 get_iplayer has been more or less repaired, but there are still some
 wounds.  I'm going to release what I have on Sunday

This is excellent news, and I have to say I'm impressed by the amount you've
managed to do in such a short time.

(I realise that we may have been lucky with the time of year; I should think
that it's a bad time for any sort of pumpkin to be out of the ... house? 
shed?  field?  ... wherever a pumpkin normally feels at home.  I've seen
some dreadful sights in the last 12 hours - pumpkins being carried around by
witches etc...)


 c. External search  ...  I'll have my pony now, thanks.

Whoosh?  That's certainly over my head.  Unless it's an insight into a
pumpkin's dietary preferences?


Regarding cacheing from feeds vv scraping... cacheing seems to me to have
been good for capturing a snapshot of all the available programmes at a
point in time.  But for anyone who's a regular user of get_iplayer there's a
lot of overlap in the cache's contents from one version/refresh to the next.
 I'm not so sure that scraping the schedules to see what the BBC plan to
transmit in just the following day, say, would be so bad, especially if it
was limited by user selection to just the channels/stations that a user was
most interested in.  It could mean that it'd take a few days for such a user
to build up a list of programmes that should be available, but then actual
availability checks per programme could wait until someone tried to download
one.

-- 
Jeremy Nicoll - my opinions are my own.

___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer