rafidaslam added a comment.

Thanks for the ideas @eflyjason @zhuyifei1999

I agree with the -keep_old parameter.

BTW, I'd like to ask: is latest actually just a pointer to a dated revision (e.g. 20171220)? I've found that every file in the latest revision has a corresponding -rss.xml file containing metadata that links the file to a dated revision. I think we can use that.

So, using that, I think the implementation will be:
A file downloaded from the latest revision is renamed to its "real" dated revision, e.g. idwiki-latest-abstract.xml -> idwiki-20171103-abstract.xml, based on its -rss.xml file.
Then check whether a file with that name already exists: if it does, the script does not download the dump again; if it doesn't, the script downloads it.
Then, if -keep_old is False, delete all previous dumps with the same name (i.e. the same dump file from older dated revisions).
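
To make the steps above concrete, here is a rough Python sketch. It is only an illustration, not the actual script: the helper names (dated_name, needs_download, remove_old_dumps) are hypothetical, and it assumes that the first 8-digit number found in the -rss.xml text is the dated revision, which I haven't verified against the real feed format.

```python
import os
import re


def dated_name(latest_name, rss_text):
    """Return the dated filename for a 'latest' dump, or None if no date is found."""
    # Assumption: the first standalone 8-digit run in the RSS text
    # (e.g. 20171103) is the dated revision that 'latest' points to.
    match = re.search(r'(?<!\d)(\d{8})(?!\d)', rss_text)
    if match is None:
        return None
    return latest_name.replace('-latest-', '-{}-'.format(match.group(1)), 1)


def needs_download(directory, target_name):
    """Return True if the dated dump is not in the directory yet."""
    return not os.path.exists(os.path.join(directory, target_name))


def remove_old_dumps(directory, target_name):
    """Delete dumps that differ from target_name only by date.

    Returns the sorted list of removed filenames.
    """
    parts = re.match(r'(.+?)-\d{8}-(.+)$', target_name)
    if parts is None:
        return []
    # Same wiki prefix and same dump suffix, any 8-digit date in between.
    pattern = re.compile(r'{}-\d{{8}}-{}$'.format(
        re.escape(parts.group(1)), re.escape(parts.group(2))))
    removed = []
    for name in sorted(os.listdir(directory)):
        if name != target_name and pattern.match(name):
            os.remove(os.path.join(directory, name))
            removed.append(name)
    return removed
```

With something like this, the script would call dated_name() first, skip the download when needs_download() returns False, and call remove_old_dumps() only when -keep_old is False.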


TASK DETAIL
https://phabricator.wikimedia.org/T183667
