On 11/01/2013 10:52, Kapitano wrote:
I'm actually trying to use BBC subtitles as a text corpus for linguisic
analysis, and was hoping to avoid downloading duplicates. But if the
metadata doesn't indicate subtitle presence, there's no (easy) way for
GiP to keep track of which srt files have already been downloaded. I'll
just use duplicate file removal software.

Do you need the download date in the file name? If you only use invariant substitution parameters (<name>, <pid>, etc.) in your file name format, get_iplayer won't create duplicates or overwrite previous downloads, as long as you're writing to the same directory. You can always get the download date from the file attributes, so you could concoct a simple script to archive your subtitle files and append the download date to the filename in the process.


_______________________________________________
get_iplayer mailing list
[email protected]
http://lists.infradead.org/mailman/listinfo/get_iplayer

Reply via email to