Re: Curating "In Our Time" (IOT) downloads.

2022-07-06 Thread Ralph Corderoy
Hi Budge,

> > Some of the ‘Unsorted’ ones have a PID and ‘./get_iplayer -i --pid
> > b075t5mn’ shows
> > 
> >  categories:  Factual,History,Discussion & Talk
> >  category:Factual
...
> where did you get the categories line above?

It's in the output of running the get_iplayer program with the arguments
I showed above.  But another reply has pointed out the same detail is
probably in each of the MP3 files, which is easier to access.

-- 
Cheers, Ralph.

___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


Re: Curating "In Our Time" (IOT) downloads.

2022-07-05 Thread David Cantrell

On 05/07/2022 21:04, Budge wrote:

On 05/07/2022 19:00, David Cantrell wrote:

$ AtomicParsley In_Our_Time_-_John_Bull_m0018nsd_other.m4a --textdata
...
Atom "©grp" contains: Factual,History,Discussion & Talk
...
Atom "©gen" contains: Factual


Not now at the machine where my GiP history resides but I have meanwhile 
been confused further by the above reference to "Factual."  I have not 
seen any of my existing files which have been entered into a "Factual" 
subdirectory.  I only have the five directories Culture, History, 
Philosophy, Religion and Science.  Is there another category "Factual?"


Note that there are two fields that contain "Factual".

Back when I worked on the iPlayer back-end, categories were, if I 
remember correctly, a multi-layered beast. I assume that they still are, 
and that "Factual" is the top level, which contains a "History" 
sub-category, which contains a "Discussion & Talk" sub-category.


Of course, that doesn't mean that they still organise things that way. 
But nevertheless, one of the categories you were interested in was 
"History", and that appears in the "©grp" atom for that particular episode.


--
David Cantrell

___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


Re: Curating "In Our Time" (IOT) downloads.

2022-07-05 Thread Budge

On 05/07/2022 19:00, David Cantrell wrote:

On 05/07/2022 09:42, Budge wrote:
I have been listening to IOT for years and have these downloads saved 
for use locally.
Through time the BBC have delivered these programmes in slightly 
different formats and I believe they are now also available from an 
archive as podcasts, but I already have my own archive, albeit in 
various formats.


My problem is that in the beginning the downloads were filtered, I 
think by BBC but possibly by my filters long ago, into five categories 
according to subject.  The categories were Culture, History, 
Philosophy, Religion and Science ...
Most media files contain metadata tags, including those downloaded from 
the BBC. For mp3 files use `id3info` to see them. For m4a files use the 
idiotically-named `AtomicParsley`. For example:


$ AtomicParsley In_Our_Time_-_John_Bull_m0018nsd_other.m4a --textdata

Atom "stik" contains: Normal
Atom "cprt" contains: 2022 British Broadcasting Corporation, all rights
   reserved
Atom "©nam" contains: John Bull
Atom "©ART" contains: BBC Radio 4
Atom "aART" contains: BBC Radio
Atom "©alb" contains: In Our Time
Atom "©grp" contains: Factual,History,Discussion & Talk
Atom "©wrt" contains: BBC Sounds
Atom "©gen" contains: Factual
Atom "©cmt" contains: Melvyn Bragg and guests discuss the origins and
   evolution of the satirical everyman figure
Atom "©day" contains: 2022-06-30T09:00:00+01:00
Atom "©lyr" contains: Melvyn Bragg and guests discuss the origin of this
   personification of the English everyman and his development as both
   British and Britain in the following centuries. He first appeared
   blahblahblah ...



Hi David,
Not now at the machine where my GiP history resides but I have meanwhile 
been confused further by the above reference to "Factual."  I have not 
seen any of my existing files which have been entered into a "Factual" 
subdirectory.  I only have the five directories Culture, History, 
Philosophy, Religion and Science.  Is there another category "Factual?"


I note the line Atom "©gen" contains: Factual.  Is this genre?  I wonder 
if when I rebuilt my system I didn't get the pvr instruction correct and 
if I ran it again this might do the sorting for me as the metadata will 
still be available.  Any ideas?



___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


Re: Curating "In Our Time" (IOT) downloads.

2022-07-05 Thread David Cantrell

On 05/07/2022 09:42, Budge wrote:
I have been listening to IOT for years and have these downloads saved 
for use locally.
Through time the BBC have delivered these programmes in slightly 
different formats and I believe they are now also available from an 
archive as podcasts, but I already have my own archive, albeit in 
various formats.


My problem is that in the beginning the downloads were filtered, I think 
by BBC but possibly by my filters long ago, into five categories 
according to subject.  The categories were Culture, History, Philosophy, 
Religion and Science ...
Most media files contain metadata tags, including those downloaded from 
the BBC. For mp3 files use `id3info` to see them. For m4a files use the 
idiotically-named `AtomicParsley`. For example:


$ AtomicParsley In_Our_Time_-_John_Bull_m0018nsd_other.m4a --textdata

Atom "stik" contains: Normal
Atom "cprt" contains: 2022 British Broadcasting Corporation, all rights
  reserved
Atom "©nam" contains: John Bull
Atom "©ART" contains: BBC Radio 4
Atom "aART" contains: BBC Radio
Atom "©alb" contains: In Our Time
Atom "©grp" contains: Factual,History,Discussion & Talk
Atom "©wrt" contains: BBC Sounds
Atom "©gen" contains: Factual
Atom "©cmt" contains: Melvyn Bragg and guests discuss the origins and
  evolution of the satirical everyman figure
Atom "©day" contains: 2022-06-30T09:00:00+01:00
Atom "©lyr" contains: Melvyn Bragg and guests discuss the origin of this
  personification of the English everyman and his development as both
  British and Britain in the following centuries. He first appeared
  blahblahblah ...

--
David Cantrell

___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


Re: Curating "In Our Time" (IOT) downloads.

2022-07-05 Thread Chris Walker
On Tue, 5 Jul 2022 09:42:06 +0100
Budge  wrote:

> I have been listening to IOT for years and have these downloads saved
> for use locally. Through time the BBC have delivered these programmes

[snip]

> I am reluctant to download them all again so am posting a request for
> help please to curate these files and sort the unsorted programmes
> into the original categories.  Is this possible and if so how

This may not be the best approach but it's where I would start.
Assuming your download_history file has captured all of your downloaded
IOT files, and again, assuming you're on linux as I have no idea how to
do this on Windows, I would do this :-
grep 'In Our Time' ~/.get_iplayer/Download_history to produce a list of
programmes for your chosen search criteria. The first 8 characters
comprise the pid. If I had better programming skills! I could exclude
everything except the pid. Then do a for loop around that list
'get_iplayer --info '$pid and then extract the category or whatever
else it is you want.

I trust somebody else will now come along and polish the concoction that
I've produced ;-)

-- 
 __ __| |_ __ __  ..
/ _/ _` \ V  V /  |  mailto:cdw_pcm...@the-walker-household.co.uk  |
\__\__,_|\_/\_/   ||

___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


Re: Curating "In Our Time" (IOT) downloads.

2022-07-05 Thread Budge

On 05/07/2022 14:12, Ralph Corderoy wrote:

Hi Budge,


file:///home/alastair/NFS_Multimedia_NFS/AV_multimedia/Music/Radio_Programme/In_Our_Time_Science/153
 
In_Our_Time_Archive_Science_-_IOT_The_Royal_Society_and_British_Science_Episode_4_iots_20100107-0900a.mp3
file:///home/alastair/NFS_Multimedia_NFS/AV_multimedia/Music/Radio_Programme/In_Our_Time_Unsorted/In_Our_Time_-_716._The_Sikh_Empire_b075t5mn_default.m4a
file:///home/alastair/NFS_Multimedia_NFS/AV_multimedia/Music/Radio_Programme/In_Our_Time_Unsorted/In_Our_Time_-_A_Midsummer_Nights_Dream__podcast.m4a
file:///home/alastair/NFS_Multimedia_NFS/AV_multimedia/Music/Radio_Programme/In_Our_Time_Unsorted/In_Our_Time_With_Melvyn_Bragg_-_IOT_Zen_04_Dec_14_iot_20141204-1140a.mp3


Some of the ‘Unsorted’ ones have a PID and ‘./get_iplayer -i --pid
b075t5mn’ shows

 categories:  Factual,History,Discussion & Talk
 category:Factual

Is ‘Factual’ the kind of thing which would replace ‘Unsorted’?


Hi Ralph,
As far as I am aware there is no Factual category, only the five I mentioned 
earlier.  The directory Unsorted was created by me as a holding place until I 
sorted this out, which has taken longer than I intended!
where did you get the categories line above?

 


___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


Re: Curating "In Our Time" (IOT) downloads.

2022-07-05 Thread Ralph Corderoy
Hi Budge,

> file:///home/alastair/NFS_Multimedia_NFS/AV_multimedia/Music/Radio_Programme/In_Our_Time_Science/153
>  
> In_Our_Time_Archive_Science_-_IOT_The_Royal_Society_and_British_Science_Episode_4_iots_20100107-0900a.mp3
> file:///home/alastair/NFS_Multimedia_NFS/AV_multimedia/Music/Radio_Programme/In_Our_Time_Unsorted/In_Our_Time_-_716._The_Sikh_Empire_b075t5mn_default.m4a
> file:///home/alastair/NFS_Multimedia_NFS/AV_multimedia/Music/Radio_Programme/In_Our_Time_Unsorted/In_Our_Time_-_A_Midsummer_Nights_Dream_m00046rp_podcast.m4a
> file:///home/alastair/NFS_Multimedia_NFS/AV_multimedia/Music/Radio_Programme/In_Our_Time_Unsorted/In_Our_Time_With_Melvyn_Bragg_-_IOT_Zen_04_Dec_14_iot_20141204-1140a.mp3

Some of the ‘Unsorted’ ones have a PID and ‘./get_iplayer -i --pid
b075t5mn’ shows

categories:  Factual,History,Discussion & Talk
category:Factual

Is ‘Factual’ the kind of thing which would replace ‘Unsorted’?

-- 
Cheers, Ralph.

___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


Re: Curating "In Our Time" (IOT) downloads.

2022-07-05 Thread Budge

On 05/07/2022 12:01, Jonathan Bryden wrote:

I am not sure if I am understanding the problem correctly so please forgive me 
if my answer is over simplistic but the BBC lists all episodes by genre so 
wouldn't the quickest way be to go through each of the lists in turn dropping 
the relevant episodes from your unsorted folder into the correct genre folder. 
I doubt it would take long.

For example the science grouping is here 
https://www.bbc.co.uk/programmes/p01gyd7j?page=1

If it helps I have a numbered list of episodes since 2012 in the correct order.

Hope that helps a bit

Jonathan

-Original Message-
From: get_iplayer  On Behalf Of Budge
Sent: 05 July 2022 09:42
To: get_iplayer 
Subject: Curating "In Our Time" (IOT) downloads.

I have been listening to IOT for years and have these downloads saved for use 
locally.
Through time the BBC have delivered these programmes in slightly different 
formats and I believe they are now also available from an archive as podcasts, 
but I already have my own archive, albeit in various formats.

My problem is that in the beginning the downloads were filtered, I think by BBC but 
possibly by my filters long ago, into five categories according to subject.  The 
categories were Culture, History, Philosophy, Religion and Science.  This seems to have 
stopped around 2012, possible due to BBC format changes and since then they have all been 
saved in my system as "Unsorted" and for a while these were also numbered but 
are no longer, possibly due to changes in my own GiP setup over the years.

I am reluctant to download them all again so am posting a request for help 
please to curate these files and sort the unsorted programmes into the original 
categories.  Is this possible and if so how please?  All help much appreciated 
as it has been a while since I worked on GiP.
Budge

___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer

Hi Jonathan,
Seems like a plan.  I had no idea there already was a list by genre available.  
Will start there.

___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


Re: Curating "In Our Time" (IOT) downloads.

2022-07-05 Thread Budge

On 05/07/2022 12:26, Ralph Corderoy wrote:

Hi Budge,


The categories were Culture, History, Philosophy, Religion and
Science.  This seems to have stopped around 2012, possible due to BBC
format changes and since then they have all been saved in my system as
"Unsorted" and for a while these were also numbered but are no longer,
possibly due to changes in my own GiP setup over the years.


Please show the list some example filenames, both those old ones which
are in their correct category and some new ‘unsorted’ ones.  This will
tell us what information can be gleaned from them, e.g. an iPlayer PID.


Hi Ralph,
Many thanks btw to all for the prompt replies.
Here is the last file in the Science list:-

file:///home/alastair/NFS_Multimedia_NFS/AV_multimedia/Music/Radio_Programme/In_Our_Time_Science/153
 
In_Our_Time_Archive_Science_-_IOT_The_Royal_Society_and_British_Science_Episode_4_iots_20100107-0900a.mp3

I have no idea how the numbering was achieved and looking at the metadata the 
file is showing as a podcast.

From the unsorted directory in my system they fall into two types, those 
already numbered although again I have no recollection of how they were 
numbered, and those without numbers.

Here is the first of the list:-
file:///home/alastair/NFS_Multimedia_NFS/AV_multimedia/Music/Radio_Programme/In_Our_Time_Unsorted/In_Our_Time_-_716._The_Sikh_Empire_b075t5mn_default.m4a.
  The numbering stopped at 815.  The files then have a slightly different look. 
 Here is the first of the un-numbered files:-

file:///home/alastair/NFS_Multimedia_NFS/AV_multimedia/Music/Radio_Programme/In_Our_Time_Unsorted/In_Our_Time_-_A_Midsummer_Nights_Dream_m00046rp_podcast.m4a

Here is the last on my database:-

file:///home/alastair/NFS_Multimedia_NFS/AV_multimedia/Music/Radio_Programme/In_Our_Time_Unsorted/In_Our_Time_With_Melvyn_Bragg_-_IOT_Zen_04_Dec_14_iot_20141204-1140a.mp3

Will these help?
Meanwhile I shall take a look at the wiki and also on BBC archive.
Regards,
Budge





 


___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


Re: Curating "In Our Time" (IOT) downloads.

2022-07-05 Thread Ralph Corderoy
Hi Budge,

> The categories were Culture, History, Philosophy, Religion and
> Science.  This seems to have stopped around 2012, possible due to BBC
> format changes and since then they have all been saved in my system as
> "Unsorted" and for a while these were also numbered but are no longer,
> possibly due to changes in my own GiP setup over the years.

Please show the list some example filenames, both those old ones which
are in their correct category and some new ‘unsorted’ ones.  This will
tell us what information can be gleaned from them, e.g. an iPlayer PID.

-- 
Cheers, Ralph.

___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


Re: Curating "In Our Time" (IOT) downloads.

2022-07-05 Thread fred.d
Couldn't find the current episodes on iplayer so not sure if they have 
these categories available, but that will be limited anyway.


Tried IMDB - they have an episode list but no info other than title

Tried the tvdb, but unsurprisingly not there.

Wikipedia provides a list however with a lot of info. 
https://en.wikipedia.org/wiki/List_of_In_Our_Time_programmes


Not your categories tho, so it would be a manual exercise to import the 
list into a spreadsheet and add a category column, wouldn't think it 
would take more than an hour with this info as you know how you want to 
categorise them.
Then use the titles and categories from that list with some awk/sed 
wizardry (or other renaming program if you have one)  to effect the 
changes to the filenames.


I'm assuming that you aren't Windows but if you are, have a look at 
"advanced renamer", there may be an extension to extract info from the 
relevant audio files and if you're lucky then the category is in there 
but hidden. Takes a couple of hours to wrap your head around the 
advanced options and test, but I have used this for my photography files 
for years as I prefer the primary info to be in the file name. I still 
use other tools to organise but...


Once you've done it, consider uploading to the Wiki or IMDB.

Good luck.


On 05/07/2022 09:42, Budge wrote:
I have been listening to IOT for years and have these downloads saved 
for use locally.
Through time the BBC have delivered these programmes in slightly 
different formats and I believe they are now also available from an 
archive as podcasts, but I already have my own archive, albeit in 
various formats.


My problem is that in the beginning the downloads were filtered, I 
think by BBC but possibly by my filters long ago, into five categories 
according to subject.  The categories were Culture, History, 
Philosophy, Religion and Science.  This seems to have stopped around 
2012, possible due to BBC format changes and since then they have all 
been saved in my system as "Unsorted" and for a while these were also 
numbered but are no longer, possibly due to changes in my own GiP 
setup over the years.


I am reluctant to download them all again so am posting a request for 
help please to curate these files and sort the unsorted programmes 
into the original categories.  Is this possible and if so how please?  
All help much appreciated as it has been a while since I worked on GiP.

Budge

___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


Re: Curating "In Our Time" (IOT) downloads.

2022-07-05 Thread Jeremy Nicoll - ml gip

Oops, re-sending, as my first reply wasn't to the list.

On 2022-07-05 09:42, Budge wrote:


I am reluctant to download them all again so am posting a request for
help please to curate these files and sort the unsorted programmes
into the original categories.  Is this possible and if so how please?
All help much appreciated as it has been a while since I worked on
GiP.


If for every episode you've downloaded you renamed the files to include
some or all of the subject details, then you might be able to look for
significant keywords in those subjects.  Eg find all the unsorted files
whose subject includes "church", or "god" or "bishop" ... then place
them in your "religion" folder.  Or perhaps place them in a temporary
"probably religion" folder and cast an eye over their subjects so that
you aren't misled by subjects like "waiting for godot".

If your files don't have subject detail in them, is there not a way
to use g_ip to list available files without downloading them - that
might give you long lists of prior and recent podcasts/programme
details.  They might already be listed in your programme title caches
and those you did download in the download history files.

--
Jeremy Nicoll - my opinions are my own

___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer


Curating "In Our Time" (IOT) downloads.

2022-07-05 Thread Budge

I have been listening to IOT for years and have these downloads saved for use 
locally.
Through time the BBC have delivered these programmes in slightly different 
formats and I believe they are now also available from an archive as podcasts, 
but I already have my own archive, albeit in various formats.

My problem is that in the beginning the downloads were filtered, I think by BBC but 
possibly by my filters long ago, into five categories according to subject.  The 
categories were Culture, History, Philosophy, Religion and Science.  This seems to have 
stopped around 2012, possible due to BBC format changes and since then they have all been 
saved in my system as "Unsorted" and for a while these were also numbered but 
are no longer, possibly due to changes in my own GiP setup over the years.

I am reluctant to download them all again so am posting a request for help 
please to curate these files and sort the unsorted programmes into the original 
categories.  Is this possible and if so how please?  All help much appreciated 
as it has been a while since I worked on GiP.
Budge

___
get_iplayer mailing list
get_iplayer@lists.infradead.org
http://lists.infradead.org/mailman/listinfo/get_iplayer