aude created this task. aude added a subscriber: aude. aude added projects: Wikibase-Quality-External-Validation, Wikidata. Herald added a subscriber: Aklapper.
TASK DESCRIPTION Currently, the scripts are painful and slow to use to download and convert the gnd data. The script downloads and processed 3 different files, in sequential order. If the third download fails or I have to cancel because I want to stop it and continue later (because it takes so long), then the only choice is to restart the entire script (e.g. re-download and process the first and second thing). It would be nice if it I could have it not re-download stuff and be able to continue with the third step, without repeating 1 and 2. It would also be nice if the script could be made to run faster. Finally, in my last attempt just now, i encounted a DownloadError timeout for the third dump file, and then the script dies. (so i have to restart the whole process and somehow increase the timeout) PS - also nice if these were not maintained on github (https://github.com/WikidataQuality/DumpConverter) I consider this a blocker for deployment, as I struggling a bit to be able to produce any csvs, which we would need for Wikidata. TASK DETAIL https://phabricator.wikimedia.org/T113036 EMAIL PREFERENCES https://phabricator.wikimedia.org/settings/panel/emailpreferences/ To: aude Cc: aude, Aklapper, Wikidata-bugs _______________________________________________ Wikidata-bugs mailing list [email protected] https://lists.wikimedia.org/mailman/listinfo/wikidata-bugs
