On 11/01/2013 10:52, Kapitano wrote:
I'm actually trying to use BBC subtitles as a text corpus for linguisic analysis, and was hoping to avoid downloading duplicates. But if the metadata doesn't indicate subtitle presence, there's no (easy) way for GiP to keep track of which srt files have already been downloaded. I'll just use duplicate file removal software.
Do you need the download date in the file name? If you only use invariant substitution parameters (<name>, <pid>, etc.) in your file name format, get_iplayer won't create duplicates or overwrite previous downloads, as long as you're writing to the same directory. You can always get the download date from the file attributes, so you could concoct a simple script to archive your subtitle files and append the download date to the filename in the process.
_______________________________________________ get_iplayer mailing list [email protected] http://lists.infradead.org/mailman/listinfo/get_iplayer

